Name	Name	Last commit message	Last commit date
parent directory ..
README.md	README.md

Memory & Knowledge

Postgres pgvector semantic memory, project knowledge graph, RAG retrieval.

Overview

Claude Code has no memory between sessions by default. Forge adds persistent memory: the system captures issue content, agent session outputs, decisions, and resolved errors; embeds them and stores the vectors in Postgres via pgvector (same connection as the rest of the data); and surfaces relevant context to agents at the start of each session.

Data Flow

  Sources of memory:
    - Issue title + description
    - Comment bodies
    - Job outputs (stdout, tool results)
    - sessionContext (decisions, resolved errors, files modified)
    - User-added memory notes
          │
          ▼ lifecycle hooks
  ┌────────────────────┐
  │ Embed + normalize  │ (via embeddings service)
  └────────┬───────────┘
           │
           ▼
  ┌────────────────────┐
  │ pgvector upsert    │ row in `memories` with metadata cols + vector
  └────────────────────┘

  Retrieval:
  ┌────────────────────┐
  │ Query embedding    │ ← agent session start, or user query
  └────────┬───────────┘
           │
           ▼
  ┌────────────────────┐
  │ Multi-strategy      │
  │ search (semantic +  │ ← project-scoped by default
  │  keyword + graph)   │
  └────────┬───────────┘
           │
           ▼
  Relevant context snippets returned to agent's system prompt

Input Sources

Data	Source	Indexed when
Issue description	`issues-pipeline` lifecycle `issue:created` / `issue:updated`	On save
Comment body	`comments` lifecycle	On save
Job result + sessionContext	`agents-jobs` lifecycle `job:completed`	On terminal
User memory note	User explicitly added via UI	On save
Project knowledge snapshot	`.forge/knowledge.json` from device	On project sync

ID Resolution

Input	Transform	Stored as
Issue/comment/job text	Embedding (model per project config)	`vector` column in `memories` table
Source type	Column	`source: 'issue' \| 'comment' \| 'job' \| 'note' \| 'knowledge'`
Project scope	Column	`project_id: <uuid>` (indexed)

Core Entities

`Memory` (DB record — canonical form before embedding)

Field	Description
`documentId`	Canonical ID
`project`	Belongs to one project
`source`	`issue` \| `comment` \| `job` \| `note` \| `knowledge`
`sourceRef`	Reference to the source record
`text`	The content embedded
`metadata`	Additional tags (priority, status, tools used, etc.)
`embeddedAt`	Timestamp of vector upsert

`memories` table layout (Postgres + pgvector)

Single table for all projects, partitioned by project_id filter on every query (project scope enforced in the policy layer)
vector vector(N) column — N matches the embedding model dimension (default 1536)
Index: HNSW on vector (USING hnsw (vector vector_cosine_ops)) per ADR 0011
Indexed columns: (project_id, source), (project_id, source_ref)
Payload columns: source, source_ref, project_id, metadata jsonb, embedded_at

Key Business Flows

Indexing on issue create

User creates issue → issue:created hook fires
Embeddings service normalizes text (strip markdown, canonicalize whitespace)
POST to embedding provider (LiteLLM)
INSERT/UPDATE into memories (vector + metadata) in one statement
embeddedAt set; broadcast memory:indexed over ws to subscribed clients

Retrieval at session start

Agent session starts on device
System prompt builder calls forge_memory.search(query, projectId) via MCP
Server runs SELECT ... FROM memories WHERE project_id = $1 ORDER BY vector <=> $2 LIMIT K (cosine distance via HNSW index)
Top-K results returned, sorted by relevance
Returned as context snippets in system prompt
Session runs with context

Project knowledge indexing (manual trigger)

User clicks "Reindex codebase" on project settings
Device runs index-codebase skill: scans filesystem, runs grep + semantic search
Generates .forge/knowledge.json with: architecture notes, key files, conventions
Uploads to server
Server embeds and stores as source: 'knowledge' memory

API Endpoints

Method	Endpoint	Principal	Description
`GET`	`/api/projects/:id/memory/search?q=`	user / device	Query memory semantically
`POST`	`/api/projects/:id/memory`	user	Add manual memory note
`DELETE`	`/api/memory/:id`	user	Remove a memory entry
`POST`	`/api/projects/:id/memory/reindex`	user	Trigger full reindex

MCP tool:

forge_memory — exposes the same search to agents

Cross-Module Touchpoints

Direction	Module	What	When
Receives from	issues-pipeline	Issue / comment embeddings	On lifecycle save
Receives from	agents-jobs	Job result + sessionContext	On job completion
Read by	agents-jobs	Relevant context via `forge_memory` MCP tool	At session start
Read by	chat	Same retrieval surface for chat conversations	On each turn

Commands / Jobs

Command/Job	Description
`memory-reindexer` (manual trigger)	Rebuild all embeddings for a project (e.g., after model change)
`knowledge-sync` (device → server)	Upload `.forge/knowledge.json` changes

Future (v0.2+)

Knowledge graph edges (explicit entity relations, not just embeddings)
Semantic search UI (currently agents-only via MCP)
Memory decay / forgetting policies
Per-user memory (separate from project memory)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Memory & Knowledge

Overview

Data Flow

Input Sources

ID Resolution

Core Entities

`Memory` (DB record — canonical form before embedding)

`memories` table layout (Postgres + pgvector)

Key Business Flows

Indexing on issue create

Retrieval at session start

Project knowledge indexing (manual trigger)

API Endpoints

Cross-Module Touchpoints

Commands / Jobs

Future (v0.2+)

FilesExpand file tree

memory-knowledge

Directory actions

More options

Directory actions

More options

Latest commit

History

memory-knowledge

Folders and files

parent directory

README.md

Memory & Knowledge

Overview

Data Flow

Input Sources

ID Resolution

Core Entities

Memory (DB record — canonical form before embedding)

memories table layout (Postgres + pgvector)

Key Business Flows

Indexing on issue create

Retrieval at session start

Project knowledge indexing (manual trigger)

API Endpoints

Cross-Module Touchpoints

Commands / Jobs

Future (v0.2+)

`Memory` (DB record — canonical form before embedding)

`memories` table layout (Postgres + pgvector)