lsearch

English | 中文

Local RAG (Retrieval-Augmented Generation) knowledge base for Claude Code. Search your project documentation using hybrid semantic + keyword search.

Features

🔍 Hybrid Search - Combines semantic vector search (Chroma) with BM25 keyword search
🔗 Link Graph - Obsidian-style bidirectional link support with automatic expansion
🌐 Web Fetching - Download and index API documentation, Swagger specs
🤖 Local Models - Uses small embedding models (70-300MB), no cloud dependencies
⚡ Fast - Pure local operation with hot-loaded indices
📝 Smart Context - Token-aware context building with interactive selection
🎯 Multiple Triggers - Use lsearch:, @kb, or slash commands

Installation

Option 1: Claude Code Plugin (Recommended)

Install as a Claude Code plugin with automatic MCP configuration:

# Clone to Claude plugins directory
git clone https://github.com/moringchen/lsearch.git ~/.claude/plugins/lsearch

# Run install script (IMPORTANT: Required for slash commands to work)
cd ~/.claude/plugins/lsearch && python install.py

⚠️ You must restart Claude Code after installation for slash commands (/lsearch, /lsearch-index, etc.) to appear.

Option 2: npx (推荐)

使用 npx 快速安装，自动配置 Claude Code：

npx @moringchen/lsearch

或者全局安装：

npm install -g @moringchen/lsearch
lsearch-install

Option 3: PyPI

pip install lsearch

然后手动添加到 ~/.claude/settings.json：

{
  "mcpServers": {
    "lsearch": {
      "command": "python",
      "args": ["-m", "lsearch.server"]
    }
  }
}

Option 4: Development Install

git clone https://github.com/moringchen/lsearch.git
cd lsearch
pip install -e ".[dev]"

Quick Start

1. Initialize Knowledge Base (In Claude Code)

⚠️ IMPORTANT: You must initialize lsearch before using it!

In Claude Code, run:

/lsearch-init

This displays an interactive form where you can:

Name - Enter a knowledge base name
Paths - Select directories to index (checkbox selection)
Model - Choose an embedding model (dropdown selection)
Custom Paths - Add additional paths if needed
Force Reinitialize - Overwrite existing configuration (shown when already initialized)

If already initialized:

Current configuration is displayed (name, paths, model)
Choose to keep current settings or modify via the form
Check "Force Reinitialize" to update configuration

This creates .lsearch/config.yaml in your project directory.

Or use CLI in terminal:

lsearch init --name my-project --path ./docs --model bge-small-zh

2. Index Your Documents

In Claude Code:

/lsearch-index

Auto-generated names:

If run in /home/user/projects/my-app, name becomes: my-app
If my-app already exists, name becomes: projects-my-app

This creates .lsearch/config.yaml:

name: my-project
paths:
  - path: ./docs
    session_only: false
  - path: ./README.md
    session_only: false
embedding_model: all-MiniLM-L6-v2
token_limit: 4000
auto_expand_links: true

3. Use in Claude Code

Once initialized (via lsearch init in terminal), use these triggers in Claude Code:

Trigger	Description	Example
`/lsearch-init`	Initialize knowledge base (MUST run first)	`/lsearch-init`
`lsearch: <query>`	Automatic RAG search	`lsearch: How does auth work?`
`@kb <query>`	Force knowledge base search	`@kb deployment process`
`/lsearch <query>`	Search via slash command	`/lsearch API documentation`
`/lsearch-index`	Manually trigger indexing	`/lsearch-index`
`/lsearch-fetch <url>`	Fetch and index web page	`/lsearch-fetch https://docs.example.com`
`/lsearch-add <path>`	Add temporary path for session	`/lsearch-add ~/notes`
`/lsearch-stats`	Show knowledge base statistics	`/lsearch-stats`

Keyword-Based Auto-Trigger

When the user asks questions containing specific keywords, automatically search the knowledge base:

Trigger Keywords:

"knowledge base" / "知识库"
"auto search" / "自动搜索"

Examples of auto-trigger:

"Search knowledge base for auth" → Auto-search
"自动搜索部署文档" → Auto-search
"Use knowledge base" → Auto-search
"请自动搜索相关文档" → Auto-search

How it works:

Detect keywords in user query
Automatically call mcp__lsearch__search_with_context
Include results in response

How It Works

Markdown Files → Chunks → Vector Index (Chroma) + BM25 Index + Link Graph
                                              ↓
                                    Hybrid Search (RRF Fusion)
                                              ↓
                                     Context Builder → Claude

Indexing Process

Document Processing - Markdown files are parsed, frontmatter extracted, wiki-links identified
Chunking - Documents split into overlapping chunks (default 500 words)
Embedding - Each chunk embedded using local models (all-MiniLM-L6-v2 or bge-small-zh)
Indexing - Stored in Chroma (vector) + Whoosh (BM25) + NetworkX (link graph)

Search Process

Vector Search - Semantic similarity using cosine distance
BM25 Search - Keyword matching with term frequency
RRF Fusion - Reciprocal Rank Fusion combines both results
Link Expansion - Include linked notes if enabled
Context Building - Assemble results respecting token limits

Configuration

Config File (`.lsearch/config.yaml`)

name: my-project                          # Knowledge base name
paths:                                    # Paths to index
  - path: ./docs
    session_only: false
  - path: ./README.md
    session_only: false
exclude:                                  # Patterns to exclude
  - node_modules/**
  - .git/**
  - "*.tmp"
embedding_model: bge-small-zh             # Embedding model (default: Chinese-optimized)
token_limit: 4000                         # Max tokens per query
auto_expand_links: true                   # Include linked notes
chunk_size: 500                           # Words per chunk
chunk_overlap: 50                         # Overlap between chunks

Embedding Models

Model	Size	Best For	Language	Default
`bge-small-zh`	300MB	Optimized for Chinese	Chinese	✅ Default
`all-MiniLM-L6-v2`	70MB	General purpose	English
`bge-small-en`	130MB	Optimized for English	English

CLI Commands

# Initialize a knowledge base
lsearch init --name my-project --path ./docs

# Add paths to existing knowledge base
lsearch add-path ./more-docs

# Check status and statistics
lsearch status

# List available embedding models
lsearch models

# Run MCP server (for Claude Code integration)
lsearch server

Usage Examples

Example 1: Project Documentation Setup

# Navigate to your project
cd ~/projects/my-web-app

# Step 1: Initialize lsearch (in Claude Code)
/lsearch-init
# Follow the interactive form to configure name, paths, and model

# Create docs directory and add files
mkdir -p docs
echo "# API Documentation" > docs/api.md
echo "# Deployment Guide" > docs/deployment.md

# Step 2: Index the documents (in Claude Code)
/lsearch-index

# Step 3: Search your documentation
lsearch: How do I deploy this project?

Example 2: Multi-language Project

In Claude Code:

/lsearch-init
# Select paths: ./backend/docs, ./frontend/docs, ./README.md
# Select model: bge-small-zh

Or use CLI:

lsearch init --name fullstack \
  --path ./backend/docs \
  --path ./frontend/docs \
  --path ./README.md \
  --model bge-small-zh

Example 3: Web Documentation

# Fetch and index external API docs
/lsearch-fetch https://docs.example.com/api

# Search the fetched documentation
@kb example API authentication

Example 4: Temporary Knowledge Base

# Add a temporary path for this session only
/lsearch-add ~/personal-notes/project-ideas.md

# Search includes both project docs and temporary notes
lsearch: What are the project requirements?

Example 5: Multi-Project Knowledge

# .lsearch/config.yaml
name: work-projects
paths:
  - path: ~/work/project-a/docs
    session_only: false
  - path: ~/work/project-b/docs
    session_only: false
  - path: ~/work/shared-guides
    session_only: false
exclude:
  - "**/node_modules/**"
  - "**/.git/**"
embedding_model: bge-small-zh
token_limit: 4000

Web Fetching

Fetch and index web documentation:

/lsearch-fetch https://docs.python.org/3/library/asyncio.html

Supports:

HTML → Markdown conversion
Swagger/OpenAPI JSON → Markdown
Auto title extraction

Fetched documents are stored in ~/.lsearch/fetched/ and indexed.

Project Structure

lsearch/
├── src/lsearch/
│   ├── server.py              # MCP Server (main entry)
│   ├── cli.py                 # CLI commands
│   ├── config.py              # Configuration management
│   ├── embedding.py           # Embedding models
│   ├── document_processor.py  # Markdown processing
│   ├── fetcher.py             # URL fetching
│   ├── indexers/
│   │   ├── chroma_indexer.py  # Vector database (Chroma)
│   │   ├── bm25_indexer.py    # Keyword index (Whoosh)
│   │   └── link_graph.py      # Note relationships (NetworkX)
│   └── search/
│       ├── hybrid_search.py   # RRF fusion
│       └── context_builder.py # Token management
├── .claude/
│   ├── commands/              # Slash commands
│   └── skills/                # Skill definition
├── skill/
│   └── SKILL.md               # Claude Code skill definition
├── install.py                 # Installation script
├── README.md                  # This file
├── README.zh.md               # Chinese documentation
└── pyproject.toml             # Package configuration

Development

# Setup
pip install -e ".[dev]"

# Format code
black src/ tests/
ruff check src/ tests/

# Run tests
pytest

# Type checking
mypy src/

Troubleshooting

Issue: MCP server not starting

Solution: Check if lsearch is installed:

python -m lsearch.server --version

If not installed, run:

pip install lsearch

Issue: No search results

Solution: Ensure documents are indexed:

/lsearch-index

Or check index status:

/lsearch-stats

Issue: Model download fails

Solution: Embedding models are downloaded on first use. Ensure you have:

Internet connection (first time only)
~300MB disk space for Chinese model, ~70MB for English

License

MIT License - see LICENSE

Contributing

Contributions welcome! Please open an issue or PR.

Acknowledgments

Chroma - Vector database
Whoosh - BM25 indexing
sentence-transformers - Embeddings
MCP - Model Context Protocol

Contact

GitHub: @moringchen
Email: 843115404@qq.com

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
.claude-plugin		.claude-plugin
.claude		.claude
.doc		.doc
.github/workflows		.github/workflows
.lsearch		.lsearch
example-marketplace		example-marketplace
skills-submission		skills-submission
src/lsearch		src/lsearch
test		test
tests		tests
.gitignore		.gitignore
.npmignore		.npmignore
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
PYEOF		PYEOF
README.md		README.md
README.zh.md		README.zh.md
install.js		install.js
install.py		install.py
package.json		package.json
pyproject.toml		pyproject.toml

Folders and files

Latest commit

History

Repository files navigation

lsearch

Features

Installation

Option 1: Claude Code Plugin (Recommended)

Option 2: npx (推荐)

Option 3: PyPI

Option 4: Development Install

Quick Start

1. Initialize Knowledge Base (In Claude Code)

2. Index Your Documents

3. Use in Claude Code

Keyword-Based Auto-Trigger

How It Works

Indexing Process

Search Process

Configuration

Config File (.lsearch/config.yaml)

Embedding Models

CLI Commands

Usage Examples

Example 1: Project Documentation Setup

Example 2: Multi-language Project

Example 3: Web Documentation

Example 4: Temporary Knowledge Base

Example 5: Multi-Project Knowledge

Web Fetching

Project Structure

Development

Troubleshooting

Issue: MCP server not starting

Issue: No search results

Issue: Model download fails

License

Contributing

Acknowledgments

Contact

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Config File (`.lsearch/config.yaml`)

Packages