Salem Docs Assistant API

AI-powered documentation chatbot API for OpenCoven. Salem helps people navigate OpenCoven documentation through natural conversation.

Overview

This API serves Salem, the OpenCoven docs and pathfinding assistant. It uses RAG (Retrieval-Augmented Generation) to:

Index OpenCoven documentation into a vector store
Retrieve relevant docs based on user questions
Stream AI-generated answers grounded in the documentation

The OpenCoven documentation source is https://docs.opencoven.ai/llms-full.txt. Authorized deployments can also index private OpenCoven research from server-only private sources without exposing those papers through public docs.

Stack

Framework: Next.js 16 with Edge Runtime
Runtime: Bun
Deployment: Vercel Edge Functions
Vector Store: Upstash Vector
Rate Limiting / BM25 Index: Upstash Redis
AI: OpenAI for chat completions and primary embeddings, Gemini as the fallback embeddings provider, optional Cohere reranking
Language: TypeScript

API Endpoints

Endpoint	Method	Description
`/api/chat`	POST	Send a question, get a streaming response
`/api/health`	GET	Health check
`/api/webhook`	POST	GitHub docs webhook for re-indexing
`/api/cron/reindex`	POST	Protected scheduled re-index safety net

POST /api/chat

{
  "message": "How do I get started with OpenCoven?"
}

Returns a streaming text/plain response with an AI-generated answer grounded in OpenCoven documentation.
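
A minimal client sketch for this endpoint. It assumes the body is sent as JSON with a Content-Type: application/json header (the README shows the body but not the request headers). The names createStreamDecoder and askSalem and the baseUrl argument are illustrative, not part of the API.

```typescript
// Wraps a TextDecoder in streaming mode so a multi-byte UTF-8 character
// that is split across two network chunks is still decoded correctly.
function createStreamDecoder() {
  const decoder = new TextDecoder();
  return {
    push: (chunk: Uint8Array): string => decoder.decode(chunk, { stream: true }),
    end: (): string => decoder.decode(), // flush any buffered bytes
  };
}

// Hypothetical helper: POST a question and hand each text fragment to onText.
async function askSalem(
  baseUrl: string,
  message: string,
  onText: (text: string) => void,
): Promise<void> {
  const res = await fetch(`${baseUrl}/api/chat`, {
    method: "POST",
    headers: { "Content-Type": "application/json" }, // assumed header
    body: JSON.stringify({ message }),
  });
  if (!res.ok || !res.body) {
    throw new Error(`Chat request failed with status ${res.status}`);
  }
  const reader = res.body.getReader();
  const decoder = createStreamDecoder();
  for (;;) {
    const result = await reader.read();
    if (result.done) break;
    onText(decoder.push(result.value));
  }
  const tail = decoder.end();
  if (tail) onText(tail);
}
```

Against a local dev server this could be called as askSalem("http://localhost:3000", "How do I get started with OpenCoven?", (t) => process.stdout.write(t)).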

Rate Limit Headers:

X-RateLimit-Limit - Maximum requests allowed
X-RateLimit-Remaining - Requests remaining in window
X-RateLimit-Reset - Timestamp when the limit resets
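
A client can read these headers to decide when to back off. The sketch below assumes all three values are numeric. The unit of X-RateLimit-Reset is not documented here, so the raw number is returned unchanged. parseRateLimit and RateLimitInfo are hypothetical names.

```typescript
interface RateLimitInfo {
  limit: number;
  remaining: number;
  reset: number; // raw header value; seconds vs. milliseconds is not documented
}

// Returns null when any of the three headers is missing or not numeric.
function parseRateLimit(headers: Headers): RateLimitInfo | null {
  const read = (name: string): number => {
    const raw = headers.get(name);
    return raw === null || raw.trim() === "" ? NaN : Number(raw);
  };
  const info: RateLimitInfo = {
    limit: read("X-RateLimit-Limit"),
    remaining: read("X-RateLimit-Remaining"),
    reset: read("X-RateLimit-Reset"),
  };
  if (Number.isNaN(info.limit) || Number.isNaN(info.remaining) || Number.isNaN(info.reset)) {
    return null;
  }
  return info;
}
```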

Debug Headers:

X-Query-Id
X-Best-Score
X-Low-Confidence
X-Result-Count
X-Strategy
X-Intent
X-Retrieval-Ms
X-Rerank-Ms
X-Relevance-Rank

No persistent query analytics or feedback endpoint is included.

Setup

Install dependencies:

bun install

Copy .env.example to .env and fill in your credentials:

cp .env.example .env

Environment Variables

Variable	Required	Description
`OPENAI_API_KEY`	Yes	OpenAI key for streaming chat completions and primary embeddings
`GEMINI_API_KEY`	No	Gemini key for embeddings when OpenAI is unavailable
`EMBEDDINGS_PROVIDER`	No	Force `openai` or `gemini`; defaults to OpenAI when available
`UPSTASH_VECTOR_REST_URL`	Yes	Upstash Vector endpoint
`UPSTASH_VECTOR_REST_TOKEN`	Yes	Upstash Vector auth token
`UPSTASH_REDIS_REST_URL`	Yes	Upstash Redis endpoint for rate limits and BM25
`UPSTASH_REDIS_REST_TOKEN`	Yes	Upstash Redis auth token
`COHERE_API_KEY`	No	Cohere key for reranking
`GITHUB_WEBHOOK_SECRET`	No	Secret for GitHub webhook
`REINDEX_SECRET`	No	Secret for scheduled re-index endpoint
`SALEM_ADMIN_PASSWORD`	No	Server-only password required for follow-up conversations after the first website question
`SALEM_PRIVATE_RESEARCH_DOCS_BASE64`	No	Base64-encoded private research markdown to include in Salem's index
`SALEM_PRIVATE_RESEARCH_REPO`	No	Private GitHub repo for research sources, for example `OpenCoven/coven-research`
`SALEM_PRIVATE_RESEARCH_REF`	No	Git ref for private research sources, defaults to `main`
`SALEM_PRIVATE_RESEARCH_PATHS`	No	Comma-separated private research markdown paths
`SALEM_PRIVATE_RESEARCH_GITHUB_TOKEN`	No	Server-only token for private GitHub research fetches
`ALLOWED_ORIGINS`	No	Comma-separated CORS allowlist
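
The table can be checked at startup with a few small helpers. This is a sketch under assumptions, not the project's actual configuration code: the function names are hypothetical, and the fallback order in resolveEmbeddingsProvider is one reading of "defaults to OpenAI when available".

```typescript
type Env = Record<string, string | undefined>;

// Variables marked "Yes" in the table above.
const REQUIRED_VARS = [
  "OPENAI_API_KEY",
  "UPSTASH_VECTOR_REST_URL",
  "UPSTASH_VECTOR_REST_TOKEN",
  "UPSTASH_REDIS_REST_URL",
  "UPSTASH_REDIS_REST_TOKEN",
] as const;

// Lists required variables that are unset or blank.
function missingRequiredVars(env: Env): string[] {
  return REQUIRED_VARS.filter((name) => !env[name]?.trim());
}

type EmbeddingsProvider = "openai" | "gemini";

// Honors EMBEDDINGS_PROVIDER when it is valid, otherwise prefers OpenAI,
// then Gemini (assumed fallback order).
function resolveEmbeddingsProvider(env: Env): EmbeddingsProvider {
  const forced = env["EMBEDDINGS_PROVIDER"]?.trim().toLowerCase();
  if (forced === "openai" || forced === "gemini") return forced;
  if (env["OPENAI_API_KEY"]) return "openai";
  if (env["GEMINI_API_KEY"]) return "gemini";
  throw new Error("No embeddings provider key is configured");
}

// Splits the comma-separated ALLOWED_ORIGINS value into a clean list.
function parseAllowedOrigins(env: Env): string[] {
  return (env["ALLOWED_ORIGINS"] ?? "")
    .split(",")
    .map((origin) => origin.trim())
    .filter((origin) => origin.length > 0);
}
```

In a Bun or Node process these can be called with process.env, for example missingRequiredVars(process.env).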

SALEM_ADMIN_PASSWORD is intentionally not exposed through any PUBLIC_ or NEXT_PUBLIC_ variable. Follow-up requests fail closed when this env var is missing; there is no fallback password.

Private research variables are also server-only. If SALEM_PRIVATE_RESEARCH_DOCS_BASE64 is set, Salem indexes that markdown directly. If SALEM_PRIVATE_RESEARCH_REPO and SALEM_PRIVATE_RESEARCH_PATHS are set, Salem fetches those private Markdown files through the GitHub Contents API using SALEM_PRIVATE_RESEARCH_GITHUB_TOKEN.
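
The two private source modes could be resolved roughly as follows. This is a sketch, assuming the GitHub REST "get repository content" route and its raw media type. decodeBase64Utf8, buildContentsRequests, and the ResearchEnv shape are hypothetical names, and how Salem combines the two sources when both are set is not specified here.

```typescript
interface ResearchEnv {
  SALEM_PRIVATE_RESEARCH_DOCS_BASE64?: string;
  SALEM_PRIVATE_RESEARCH_REPO?: string;
  SALEM_PRIVATE_RESEARCH_REF?: string;
  SALEM_PRIVATE_RESEARCH_PATHS?: string;
  SALEM_PRIVATE_RESEARCH_GITHUB_TOKEN?: string;
}

interface ContentsRequest {
  url: string;
  headers: Record<string, string>;
}

// Decodes base64 into UTF-8 text using only Web APIs (atob, TextDecoder).
function decodeBase64Utf8(b64: string): string {
  const bytes = Uint8Array.from(atob(b64), (ch) => ch.charCodeAt(0));
  return new TextDecoder().decode(bytes);
}

// Builds one GitHub Contents API request per configured path.
function buildContentsRequests(env: ResearchEnv): ContentsRequest[] {
  const repo = env.SALEM_PRIVATE_RESEARCH_REPO?.trim();
  const paths = (env.SALEM_PRIVATE_RESEARCH_PATHS ?? "")
    .split(",")
    .map((p) => p.trim())
    .filter((p) => p.length > 0);
  if (!repo || paths.length === 0) return [];

  const ref = env.SALEM_PRIVATE_RESEARCH_REF?.trim() || "main";
  const token = env.SALEM_PRIVATE_RESEARCH_GITHUB_TOKEN;

  return paths.map((path) => {
    // Encode each path segment but keep the "/" separators.
    const encodedPath = path.split("/").map((s) => encodeURIComponent(s)).join("/");
    const headers: Record<string, string> = {
      Accept: "application/vnd.github.raw+json", // ask for raw file contents
    };
    if (token) headers["Authorization"] = `Bearer ${token}`;
    return {
      url: `https://api.github.com/repos/${repo}/contents/${encodedPath}?ref=${encodeURIComponent(ref)}`,
      headers,
    };
  });
}
```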

Build the vector index:

bun run build:index

Development

bun run dev

Runs locally at http://localhost:3000.

Scripts

Script	Description
`bun run dev`	Start development server
`bun run build`	Build for production
`bun run start`	Start production server
`bun run test`	Validate OpenCoven/Salem port wiring
`bun run build:index`	Index documentation into vector store
`bun run deploy`	Deploy to Vercel

Automatic Documentation Updates

The API supports automatic re-indexing when documentation changes are pushed to the docs repository's main branch, plus a protected scheduled safety net for missed webhooks or docs deploy timing races.

1. A push is made to the main branch of the docs repository.
2. GitHub sends a webhook payload to /api/webhook.
3. The API verifies the signature, fetches https://docs.opencoven.ai/llms-full.txt plus any configured private research sources, hashes the combined source text, and skips re-indexing when the content is unchanged.
4. When the hash has changed, Salem chunks the content, generates embeddings, replaces the vector store, rebuilds BM25, and stores the new source hash in Upstash Redis.
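
The signature check and hash-based skip in the steps above can be sketched as follows. GitHub signs the raw body with HMAC-SHA256 and sends it in the X-Hub-Signature-256 header. The use of SHA-256 for the source hash, the newline join, and all function names are assumptions. The sketch uses Node's crypto module for brevity; an Edge Runtime handler would need the Web Crypto equivalent (crypto.subtle).

```typescript
import { createHash, createHmac, timingSafeEqual } from "node:crypto";

// Verifies a GitHub webhook signature of the form "sha256=<hex digest>".
function verifyGitHubSignature(
  rawBody: string,
  signatureHeader: string | null,
  secret: string,
): boolean {
  if (!signatureHeader) return false;
  const expected =
    "sha256=" + createHmac("sha256", secret).update(rawBody, "utf8").digest("hex");
  const encoder = new TextEncoder();
  const a = encoder.encode(signatureHeader);
  const b = encoder.encode(expected);
  // timingSafeEqual throws on length mismatch, so compare lengths first.
  return a.length === b.length && timingSafeEqual(a, b);
}

// Hashes the combined source text (public docs plus private research).
function hashSources(sources: string[]): string {
  return createHash("sha256").update(sources.join("\n"), "utf8").digest("hex");
}

// Compares the fresh hash with the one stored after the last re-index.
function needsReindex(
  sources: string[],
  storedHash: string | null,
): { changed: boolean; hash: string } {
  const hash = hashSources(sources);
  return { changed: hash !== storedHash, hash };
}
```

A handler built on this would return early when verifyGitHubSignature is false or needsReindex reports changed: false, and would store the returned hash only after the index has been rebuilt.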

Scheduled Re-index

Configure QStash or another scheduler to call the protected endpoint periodically:

curl -X POST "https://salem.opencoven.ai/api/cron/reindex" \
  -H "Authorization: Bearer $REINDEX_SECRET"

Use ?force=1 only for manual recovery, when you need to rebuild the index even though the source hash is unchanged:

curl -X POST "https://salem.opencoven.ai/api/cron/reindex?force=1" \
  -H "Authorization: Bearer $REINDEX_SECRET"
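
On the server side, the bearer check and force flag used in the commands above might be handled like this. It is a minimal sketch: checkReindexRequest is a hypothetical name, and failing closed when REINDEX_SECRET is unset is an assumption rather than documented behavior.

```typescript
interface ReindexDecision {
  authorized: boolean;
  force: boolean;
}

// Decides whether a scheduled re-index request may run and whether it
// should bypass the unchanged-hash skip.
function checkReindexRequest(
  url: string,
  authorization: string | null,
  secret: string | undefined,
): ReindexDecision {
  const wantsForce = new URL(url).searchParams.get("force") === "1";
  // Assumption: with no configured secret, every request is rejected.
  // A production handler may prefer a constant-time comparison here.
  const authorized = Boolean(secret) && authorization === `Bearer ${secret}`;
  return { authorized, force: authorized && wantsForce };
}
```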

The scheduler should run after docs publishing has had time to update https://docs.opencoven.ai/llms-full.txt. A daily schedule is usually enough; every few hours is reasonable while docs are changing quickly.

License

MIT
