propstore

Compiler and reasoning backend for structured scientific claims.

propstore is the engine layer in a larger workflow. The paper-facing frontend lives in the sibling research-papers-plugin repo, which handles paper retrieval, reading, annotation, and claim extraction. propstore takes the resulting structured concepts, claims, contexts, and stances; validates them; compiles them into a sidecar SQLite store; and supports conflict analysis, argumentation queries, counterfactual overlays, and ATMS-style inspection over the compiled corpus.

Architecture

Layer 6: Agent Workflow    — extract-claims, reconcile, relate, adjudicate
Layer 5: Render            — resolution strategies, world queries, worldlines, hypotheticals
Layer 4: Argumentation     — Dung AF, ASPIC+, PrAF, bipolar, ATMS
Layer 3: Heuristic         — embeddings, LLM stance classification (proposals only)
Layer 2: Theory / Typing   — forms, dimensions, CEL type-checking, Z3 conditions
Layer 1: Source Storage    — claims, concepts, contexts, stances, provenance (immutable)
        └── propstore/repo/ — git-backed storage, branch isolation, semantic merge

Dependencies flow downward only. Layers 1-2 are the formal core. Layer 3 produces proposals, never mutations. Layer 4 operates over assumption-labeled data. Layer 5 applies policy at query time. Layer 6 orchestrates multi-step agent workflows.

Where it fits

PDFs / paper directories
  -> research-papers-plugin
  -> claims.yaml + concept registry + stance proposals
  -> propstore
  -> sidecar SQLite + query engine + reasoning backends

If you want to work with raw papers, start with research-papers-plugin.

If you want to work on the compiler, validator, sidecar builder, or reasoning engine, start here.

If you want a reproducible showcase corpus, use a dedicated demo repo or local demo corpus rather than assuming this source tree is itself a polished demo dataset.

Semantic Merge

propstore now exposes a formal repository-merge layer instead of treating merge classification as a claim-bucket side path. Branches still isolate paper processing, agent runs, and hypotheses, but merges are represented as explicit partial argumentation frameworks over branch-local argument summaries.

  • Branches isolate epistemic states and preserve source-local history (Darwiche & Pearl 1997)
  • Formal merge framework records arguments plus attack / ignorance / non-attack over the merged universe
  • Exact merge operators — Sum, Max, Leximax — select merged completions over that framework
  • Completion queries answer skeptical and credulous acceptance before any storage merge is written
  • Storage merge commits are two-parent git commits produced from the formal merge object, not the merge semantics themselves

See Semantic Merge for the full architecture and Git Backend for branch operations.

What It Does Today

  • Validates forms, concepts, contexts, and claims against LinkML schemas and CEL type-checking
  • Compiles a knowledge repo into a queryable SQLite sidecar
  • Detects condition-sensitive conflicts with Z3, including regime splits vs real disagreement
  • Projects claim relations into argumentation frameworks via multiple reasoning backends (see below)
  • Computes grounded, complete, preferred, and stable extensions
  • Runs hypothetical add/remove overlays without mutating the base world
  • Derives values through parameterization chains and computes symbolic sensitivity
  • Compares algorithm claims for equivalence across differently written implementations
  • Exposes ATMS-style support, stability, relevance, and intervention queries over the active belief space
  • Materializes worldlines: traced, reproducible paths through the knowledge space with full provenance
  • Versions knowledge repos with git-backed storage (Dulwich), enabling historical builds and branch-based isolation
  • Inspects and commits branch merges through a formal partial-argumentation layer before writing two-parent git commits
  • Validates dimensional consistency of claims via Pint and bridgman, with SI normalization at build time
  • Provides a Python API for programmatic access (WorldModel, BoundWorld, HypotheticalWorld)
  • Represents uncertainty honestly via subjective logic opinions with calibrated evidence mapping
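As a point of reference for the extension computation listed above, the grounded extension is the least fixpoint of Dung's characteristic function. A minimal generic sketch (hypothetical args/attacks encoding, not propstore's implementation, which operates over compiled claim rows):

```python
def grounded_extension(args, attacks):
    """Least-fixpoint computation of the grounded extension of a Dung AF.

    args:    a set of argument identifiers
    attacks: a set of (attacker, target) pairs
    """
    extension = set()
    while True:
        # An argument is acceptable w.r.t. the current extension if every
        # one of its attackers is counter-attacked by the extension.
        acceptable = {
            a for a in args
            if all(any((c, b) in attacks for c in extension)
                   for b in args if (b, a) in attacks)
        }
        if acceptable == extension:
            return extension
        extension = acceptable

# In the chain a -> b -> c, "a" is unattacked and reinstates "c",
# so the grounded extension equals {"a", "c"}.
result = grounded_extension({"a", "b", "c"}, {("a", "b"), ("b", "c")})
```

A mutual attack with no outside support (a <-> b) yields the empty grounded extension, which is why the preferred and stable semantics are also exposed.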

Reasoning Backends

propstore ships multiple reasoning backends at different abstraction levels, selectable via --reasoning-backend:

  • claim_graph — Dung AF over active claim rows with heuristic metadata preferences. Default backend. Key files: argumentation.py, dung.py, dung_z3.py
  • aspic — Full ASPIC+ argument construction: recursive PremiseArg/StrictArg/DefeasibleArg, three-type attack (Def 8), last-link/weakest-link preference defeat (Defs 19-21). structured_projection remains a legacy alias for parsed inputs, not a public CLI backend name. Key files: aspic_bridge.py, aspic.py
  • praf — Probabilistic argumentation: MC sampling with Agresti-Coull stopping (Li et al. 2012), DF-QuAD gradual semantics (Freedman et al. 2025), optional COH enforcement (Hunter & Thimm 2017). Key files: praf.py, praf_dfquad.py
  • atms — ATMS label propagation and bounded replay over the active belief space (de Kleer 1986). Does not expose Dung extensions; use ATMS-native commands instead. Key file: world/atms.py
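The DF-QuAD gradual semantics used by the praf backend has a short closed form: each side's child strengths are aggregated by probabilistic sum, then the base score is moved toward 0 or 1 by the imbalance. A sketch of the published semantics, not of praf_dfquad.py itself:

```python
from math import prod

def dfquad_strength(base, attacker_strengths, supporter_strengths):
    """One DF-QuAD update: aggregate children, then adjust the base score."""
    # Probabilistic-sum aggregation; an empty side aggregates to 0.0
    va = 1 - prod(1 - s for s in attacker_strengths)
    vs = 1 - prod(1 - s for s in supporter_strengths)
    if va >= vs:
        return base - base * (va - vs)        # net attack pulls toward 0
    return base + (1 - base) * (vs - va)      # net support pushes toward 1

print(dfquad_strength(0.5, [0.5], []))   # 0.25
print(dfquad_strength(0.5, [], [0.5]))   # 0.75
```

Evaluating leaves first and folding upward with this update gives the gradual acceptance degrees that MC sampling is then compared against.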

Additional argumentation infrastructure:

  • Bipolar argumentation (bipolar.py) — Cayrol 2005 derived defeats with fixpoint, d/s/c-admissibility, stable extensions
  • Subjective logic (opinion.py) — Jøsang 2001 Opinion = (b,d,u,a) with consensus fusion, discounting, negation, conjunction, disjunction, ordering, uncertainty maximization
  • Calibration (calibrate.py) — Temperature scaling (Guo et al. 2017), corpus CDF calibration, evidence-to-opinion mapping (Sensoy et al. 2018), ECE
  • Decision criteria — Pignistic (default), Hurwicz, lower/upper bound per Denoeux 2019, selectable via --decision-criterion
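For orientation, the consensus fusion listed for opinion.py can be sketched for two binomial opinions (b, d, u) under a shared base rate. This is a minimal illustration of Jøsang's operator, not the repo's implementation, which covers the full algebra:

```python
def consensus_fuse(op1, op2):
    """Jøsang consensus (cumulative) fusion of two binomial opinions.

    Each opinion is a (belief, disbelief, uncertainty) triple summing to 1.
    """
    b1, d1, u1 = op1
    b2, d2, u2 = op2
    k = u1 + u2 - u1 * u2
    if k == 0:  # both opinions dogmatic (u = 0): fall back to averaging
        return ((b1 + b2) / 2, (d1 + d2) / 2, 0.0)
    return ((b1 * u2 + b2 * u1) / k,
            (d1 * u2 + d2 * u1) / k,
            (u1 * u2) / k)

# Fusing a vacuous opinion with an informative one returns the latter:
print(consensus_fuse((0.0, 0.0, 1.0), (0.8, 0.1, 0.1)))  # (0.8, 0.1, 0.1)
```

The vacuous-opinion identity above is the property that makes fusion safe to apply incrementally as new sources arrive.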

What It Does Not Claim

  • It does not read PDFs or extract claims from raw papers by itself
  • It is not the main end-user paper workflow; that lives in research-papers-plugin
  • It is not a general scientific truth machine
  • ASPIC+ rationality postulates (Thms 12-15) are achieved by construction via transposition closure and c-consistency, not runtime-verified
  • The ATMS backend is not itself full AGM revision or a full de Kleer runtime manager
  • This repo includes engine code plus evolving local knowledge fixtures; it should not be presented as a turnkey corpus demo

What This Can Actually Do

Compile a knowledge repo into a sidecar:

uv run pks -C knowledge build

Inspect conflicts under bindings:

uv run pks -C knowledge claim conflicts
uv run pks -C knowledge world check-consistency domain=argumentation
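The regime-split versus real-disagreement distinction comes down to whether the two claims' conditions are jointly satisfiable. propstore decides this with Z3 over typed CEL conditions; a toy stand-in using interval conditions shows the shape of the test:

```python
def classify_conflict(conds_a, conds_b):
    """conds_*: mapping of variable name -> (lo, hi) interval.

    If any shared variable's intervals are disjoint, the two condition
    sets can never hold together: the claims occupy different regimes.
    (A toy illustration, not propstore's Z3-based check.)
    """
    for var in conds_a.keys() & conds_b.keys():
        lo = max(conds_a[var][0], conds_b[var][0])
        hi = min(conds_a[var][1], conds_b[var][1])
        if lo > hi:  # jointly unsatisfiable on this variable
            return "regime_split"
    return "real_conflict"

# Same quantity, non-overlapping temperature ranges: a clean regime split.
print(classify_conflict({"temperature": (0, 100)},
                        {"temperature": (200, 300)}))  # regime_split
```

With overlapping intervals the same call returns "real_conflict", which is the case that then feeds the argumentation backends.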

Ask which claims survive the current argumentation projection:

uv run pks -C knowledge world extensions --semantics grounded
uv run pks -C knowledge world extensions --semantics preferred --set-comparison democratic

Run counterfactual overlays:

uv run pks -C knowledge world hypothetical --remove claim42
uv run pks -C knowledge world hypothetical --add '{"id":"synth1","concept_id":"concept1","value":150,"conditions":[]}'

Promote heuristic proposals to source-of-truth:

uv run pks -C knowledge promote

Inspect or execute a formal branch merge:

uv run pks -C knowledge merge inspect agent/paper-a agent/paper-b --semantics grounded
uv run pks -C knowledge merge commit agent/paper-a agent/paper-b --target-branch master

Compare algorithms across papers:

uv run pks -C knowledge claim compare claim50 claim51 -b T0=0.008

Rank claims by fragility (what to learn next):

uv run pks -C knowledge world fragility --top-k 10
uv run pks -C knowledge world fragility --concept concept5 --sort-by roi

Export graphs and inspect sensitivity:

uv run pks -C knowledge world export-graph --format json
uv run pks -C knowledge world sensitivity concept5 task=speech
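Chain sensitivity is standard symbolic calculus. A sketch with SymPy (a dependency listed below) of the elasticity of a hypothetical derived quantity; the derivation chain here is invented for illustration, not taken from the repo:

```python
import sympy as sp

# Hypothetical parameterization: a derived frequency f0 = sqrt(T) / (2 L)
T, L = sp.symbols("T L", positive=True)
f0 = sp.sqrt(T) / (2 * L)

# Elasticity (T / f0) * df0/dT: percent change in f0 per percent change in T
elasticity = sp.simplify((T / f0) * sp.diff(f0, T))
print(elasticity)  # 1/2
```

A constant elasticity of 1/2 means a 1% perturbation in T moves f0 by about 0.5% everywhere in the regime, which is the kind of answer the sensitivity command reports per binding.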

Inspect ATMS support and intervention plans:

uv run pks -C knowledge world atms-status domain=argumentation
uv run pks -C knowledge world atms-interventions claim_id domain=argumentation

Inspect one-shot and iterated revision over derived belief state:

uv run pks -C knowledge world revision-base
uv run pks -C knowledge world revise --atom '{"kind":"claim","id":"synthetic_freq","value":123.0}' --conflict freq_claim1
uv run pks -C knowledge world iterated-revise --atom '{"kind":"claim","id":"synthetic_freq","value":123.0}' --conflict freq_claim1 --operator lexicographic

Create and run worldlines:

# Create a worldline targeting specific concepts with a reasoning backend
uv run pks -C knowledge worldline create my_query --target speech.pitch --bind domain=speech --reasoning-backend praf --praf-strategy mc

# Materialize results
uv run pks -C knowledge worldline run my_query

# Create and run a revision-aware worldline
uv run pks -C knowledge worldline create revised_query --target speech.pitch --revision-operation revise --revision-atom '{"kind":"claim","id":"synthetic_pitch","value":110.0}' --revision-conflict claim:synthetic_pitch=pitch_claim
uv run pks -C knowledge worldline run revised_query

# Inspect results, compare worldlines, refresh with new data
uv run pks -C knowledge worldline show my_query
uv run pks -C knowledge worldline diff query_a query_b
uv run pks -C knowledge worldline refresh my_query

Typical Workflow

  1. Use research-papers-plugin to retrieve papers, read them, and produce structured source artifacts.
  2. Initialize a propstore knowledge repo.
  3. Create a source branch and write source-local concepts, claims, justifications, and stances through pks source.
  4. Build the sidecar.
  5. Query, compare, export, and run reasoning passes over the compiled result.

# 1. Create a fresh knowledge repo
uv run pks init knowledge

# 2. Create and populate a source branch
uv run pks -C knowledge source init Demo_2026 --kind academic_paper --origin-type file --origin-value Demo_2026.pdf
uv run pks -C knowledge source add-concepts Demo_2026 --batch ../your-paper-project/papers/Demo_2026/concepts.yaml
uv run pks -C knowledge source add-claim Demo_2026 --batch ../your-paper-project/papers/Demo_2026/claims.yaml
uv run pks -C knowledge source add-justification Demo_2026 --batch ../your-paper-project/papers/Demo_2026/justifications.yaml
uv run pks -C knowledge source add-stance Demo_2026 --batch ../your-paper-project/papers/Demo_2026/stances.yaml
uv run pks -C knowledge source finalize Demo_2026
uv run pks -C knowledge source promote Demo_2026

# 3. Validate and compile
uv run pks -C knowledge build

# 4. Query the compiled world
uv run pks -C knowledge world status
uv run pks -C knowledge world query fundamental_frequency
uv run pks -C knowledge world bind task=speech speaker_sex=male fundamental_frequency

Why This Is Interesting

Most claim stores stop at ingestion or retrieval. propstore tries to push further into compilation and reasoning:

  • Claims are scoped by typed CEL conditions, so disagreement can be classified as real conflict, partial overlap, or clean regime split
  • Stances and conflict records can be projected into multiple reasoning backends — from flat Dung AFs through structured ASPIC+ arguments to probabilistic acceptance
  • Uncertainty is represented honestly via subjective logic opinions with calibrated evidence mapping, not collapsed to point estimates
  • Hypothetical overlays let you ask "what changes if I remove or add this claim?"
  • Revision operators let you ask "what should give way if I admit this new information?" without mutating source storage
  • Parameterization chains and symbolic derivatives let you inspect how derived quantities move
  • ATMS-style queries expose support quality, relevance, stability, and bounded intervention plans instead of only returning a winner
  • Worldlines capture reproducible, traced paths through the knowledge space with full provenance, staleness detection, and optional revision episodes

That is the real value proposition: not "we solved science", but "we can compile a structured claims corpus into something you can interrogate like a reasoning system".

Installation

Requires Python 3.11+. Uses uv for dependency management.

uv sync
uv run pks --help

The current development setup expects a sibling ../ast-equiv checkout via uv.lock.

Optional for embeddings and LLM-assisted stance classification:

uv pip install "propstore[embeddings]"

How It Works

pks build runs a compiler pipeline over a knowledge repo:

  1. Form validation
  2. Concept validation
  3. Context validation
  4. Claim validation
  5. Sidecar build
  6. Conflict detection

The result is a SQLite sidecar plus a query layer over active claims, relations, parameterizations, and conflicts.
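Because the sidecar is a plain SQLite file, anything that speaks SQLite can inspect the compiled corpus. A toy illustration with a hypothetical claims table (the real schema is produced by pks build and will differ):

```python
import sqlite3

# Stand-in for the compiled sidecar: the real layout is defined by the
# sidecar builder, not by this sketch.
con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE claims (id TEXT, concept_id TEXT, value REAL, active INTEGER)")
con.executemany("INSERT INTO claims VALUES (?, ?, ?, ?)", [
    ("claim1", "fundamental_frequency", 120.0, 1),
    ("claim2", "fundamental_frequency", 210.0, 1),
    ("claim3", "fundamental_frequency", 95.0, 0),   # inactive / superseded row
])
rows = con.execute(
    "SELECT id, value FROM claims WHERE concept_id = ? AND active = 1 ORDER BY id",
    ("fundamental_frequency",),
).fetchall()
print(rows)  # [('claim1', 120.0), ('claim2', 210.0)]
```

The query layer adds the active/inactive filtering, relation joins, and conflict lookups on top of exactly this kind of access.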

Data Model

The core entities are:

  • concepts
  • forms
  • claims
  • conditions
  • stances
  • justifications
  • contexts

Claim types currently include:

  • parameter
  • equation
  • measurement
  • observation
  • model
  • algorithm
  • mechanism
  • comparison
  • limitation

See docs/data-model.md for concrete YAML examples.

Documentation

  • Data Model — concepts, forms, claim types, conditions, stances, contexts
  • Argumentation — reasoning backends, conflict detection, ATMS overview
  • Structured Argumentation (ASPIC+) — recursive argument construction, three-type attack, preference defeat
  • Probabilistic Argumentation (PrAF) — MC sampling, Agresti-Coull stopping, DF-QuAD gradual semantics
  • Subjective Logic and Calibration — Opinion algebra, temperature scaling, evidence-to-opinion mapping, decision criteria
  • ATMS — assumption-based truth maintenance, label propagation, bounded replay, interventions
  • Revision — operator-based one-shot and iterated AGM-style revision over derived belief state
  • Bipolar Argumentation — Cayrol 2005, derived defeats, three admissibility variants
  • Conflict Detection — Z3 condition reasoning, regime splits, six conflict classes
  • Parameterization and Sensitivity — derivation chains, chain queries, elasticity analysis
  • Algorithm Comparison — ast-equiv four-tier equivalence ladder
  • Fragility Analysis — parametric, epistemic, and conflict fragility with ROI ranking
  • Worldlines — materialized query artifacts, provenance tracking, staleness detection
  • CLI Reference — command reference for the current pks surface, including merge inspection and merge commit commands
  • Semantic Merge — formal repository merge frameworks, completion queries, exact AF merge operators, branch reasoning
  • Git Backend — Dulwich-backed versioning, KnowledgeRepo, TreeReader, branch primitives
  • Units and Forms — dimensional type system, SI normalization, form algebra
  • Python API — WorldModel, BoundWorld, HypotheticalWorld, result types
  • Integration — how this fits with research-papers-plugin

Future Work

  • Epistemic operating system roadmap — The next program is no longer just “more backends.” It is formal merge completion, policy/governance objects, snapshots/journals/diffs, and investigation/process-manager surfaces. See proposals/epistemic-operating-system-roadmap.md.
  • Structured merge deepening — The formal merge backbone is implemented; the remaining work is richer branch-local structured projection, stronger structured correspondence results, and more source-preference integration. See docs/semantic-merge.md.
  • AF-level revision — Claim/context revision is implemented; remaining work is argument-level and warrant-level revision over AF / ASPIC+ consumers.
  • Extended Jøsang operators — Deduction, comultiplication, abduction (Jøsang & McAnally 2004; Jøsang 2008). Requires retrieving source papers.
  • Interval dominance — Denoeux 2019 interval dominance criterion for decision-making under uncertainty.

Dependencies

Package     Role
Z3          SMT solving for condition reasoning, conflict analysis, and extension computation
SymPy       Symbolic math for parameterization and sensitivity
LinkML      Schema definition and JSON Schema generation
click       CLI framework
bridgman    Dimensional analysis for unit compatibility
pint        Unit parsing and conversion
graphviz    Graph export (DOT format)
ast-equiv   Algorithm canonicalization and equivalence comparison
litellm     Optional embeddings and stance classification
sqlite-vec  Optional vector search in SQLite

About

Propositional knowledge store — compiler, argumentation engine, and query system for structured claims extracted from scientific papers
