obsidian-kb

A skill that scans any project and builds a custom knowledge base: tiered context packs for LLM consumption + an Obsidian wiki for human browsing.

Works with Claude Code, Cursor, Windsurf, GitHub Copilot, Cline, Continue, and Zed.

Inspired by Karpathy's LLM Knowledge Base workflow.

Install

One command. Run inside any project:

curl -sL https://raw.githubusercontent.com/chiKeka/obsidian-kb/main/install.sh | bash

It auto-detects your IDE. Or specify one:

# Pick your IDE
bash install.sh --claude
bash install.sh --cursor
bash install.sh --windsurf
bash install.sh --copilot
bash install.sh --cline
bash install.sh --continue
bash install.sh --zed

# Install for all IDEs at once
bash install.sh --all

# Claude Code global (available in every local project)
bash install.sh --claude --global

Where It Goes

IDE	Installed To
Claude Code	`.claude/skills/obsidian-kb/SKILL.md`
Cursor	`.cursor/rules/obsidian-kb.md`
Windsurf	`.windsurf/rules/obsidian-kb.md`
GitHub Copilot	`.github/instructions/obsidian-kb.instructions.md`
Cline	`.clinerules/obsidian-kb.md`
Continue	`.continue/rules/obsidian-kb.md`
Zed	`.rules` (appended)

What It Does

You ask your AI assistant to run obsidian-kb init. It scans your project, figures out what data you have, and proposes a knowledge base architecture. You approve it, run obsidian-kb build, and it generates:

A three-tier context pack system - markdown files optimized for LLM token efficiency
A Python compiler - tailored to your project's specific data schema
An Obsidian wiki - interlinked pages with graph view and color-coded sections
Query and lint commands - search, health checks, structural integrity scoring

The Three Tiers

Tier 0 - Index (~1200 tokens)
  Always loaded. Full inventory of everything in the KB.
  Purpose: routing - "what exists here?"

Tier 1 - Group Context (~4000 tokens each)
  One file per domain/category/module.
  Purpose: working context for a specific area.

Tier 2 - Entity Deep Context (~1500 tokens each)
  One file per entity with all its connections.
  Purpose: deep work on a specific thing.

An LLM can load your entire KB index in ~1200 tokens, drill into a domain in ~4000, and get full entity context in ~1500. No wasted context window.

Usage

In Claude Code:

/obsidian-kb init       # Scan project, propose architecture
/obsidian-kb build      # Generate everything, run initial compile
/obsidian-kb rebuild    # Re-analyze when project structure changes

In other IDEs, prompt your assistant:

Run obsidian-kb init to analyze this project and propose a knowledge base architecture.

After build, you also get:

/kb                     # Stats and health
/kb compile             # Re-run the compiler
/kb search [query]      # Search across compiled pages
/kb gaps                # Find orphans and missing relationships

/lint-kb                # Health check with scoring
/lint-kb --fix          # Auto-fix mechanical issues

Works With Any Project

The skill adapts to whatever it finds:

Project Type	What It Extracts
Data files (JSONL, JSON, YAML, CSV)	Entities, relationships, groupings
Software (any language)	Components, APIs, modules, configs, dependencies
Research / academic	Papers, findings, methods
Documentation	Pages, sections, cross-references
Consulting / portfolio	Deliverables, risks, briefs

For software projects without structured data, it extracts knowledge atoms from the code structure itself.

How It Works

The skill doesn't use templates. It instructs the AI to:

Scan your project and sample actual data records
Classify what each record represents (knowledge atom types)
Map relationships between atom types
Design a tier structure based on what it finds
Write a Python compiler from scratch for your schema
Generate an Obsidian vault with wikilinks and graph coloring

The architecture spec (data/kb-architecture.yaml) is saved as a contract between analysis and generation. You can edit it before building.

The generated compiler is self-contained Python (stdlib only), idempotent, and fast (< 5 seconds). Run it with python3 scripts/compile-kb.py for context packs or python3 scripts/compile-kb.py --human to also generate the wiki.

What Gets Generated

your-project/
  data/
    kb-architecture.yaml          # Architecture spec (editable)
    compiled/
      index.md                    # Tier 0: full inventory
      [group]/                    # Tier 1: group context packs
        [group-name].md
      [type]/                     # Tier 2: entity deep context
        [entity-name].md
      freshness.json              # Compilation timestamp
  scripts/
    compile-kb.py                 # Project-specific Python compiler
  wiki/                           # Obsidian vault
    .obsidian/                    # Vault config with graph coloring
    [section]/                    # One section per atom type
      [entity].md                 # Interlinked pages
    index.md                      # Map of Content

Credits

Inspired by Andrej Karpathy. The three-tier compiled context pack system was developed as an LLM-optimized adaptation of that workflow.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
skills/obsidian-kb		skills/obsidian-kb
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
install.sh		install.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

obsidian-kb

Install

Where It Goes

What It Does

The Three Tiers

Usage

Works With Any Project

How It Works

What Gets Generated

Credits

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

obsidian-kb

Install

Where It Goes

What It Does

The Three Tiers

Usage

Works With Any Project

How It Works

What Gets Generated

Credits

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages