You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Implement v0.8 as Agent-rated Evidence Readiness and Release Eligibility.
This is not a claim that MABW can automatically prove truth or authorize publication. The v0.8 boundary is:
Semantic reviewer agents rate evidence and release risks using structured rubrics. Deterministic Python validators, policy packs, and control artifacts convert those ratings into warnings, blockers, allowed-use labels, and human-review requirements.
Why this matters
v0.7.x can already produce workflow traceability: material claims are linked to Claim Ledger entries and sources, and finalize produces reader-facing delivery artifacts. But recent real-use review exposed a gap:
a run can be workflow/audit pass while still not being ready for formal research or external release;
weak-but-present sources can pass traceability checks;
mixed metric scopes can look cited but remain non-comparable;
legal/policy/company-event claims can rely on media without official text;
institutional branding/confidential labels can create release risk even when facts are sourced.
v0.8 should make these risks explicit, machine-readable, and blocking where the selected mode requires it.
Summary
Implement v0.8 as Agent-rated Evidence Readiness and Release Eligibility.
This is not a claim that MABW can automatically prove truth or authorize publication. The v0.8 boundary is:
Why this matters
v0.7.x can already produce workflow traceability: material claims are linked to Claim Ledger entries and sources, and finalize produces reader-facing delivery artifacts. But recent real-use review exposed a gap:
workflow/audit passwhile still not being ready for formal research or external release;v0.8 should make these risks explicit, machine-readable, and blocking where the selected mode requires it.
Target control surfaces
Add four run-scoped artifacts:
semantic_review_report.json
Agent-written. Records rubric-based semantic judgments: source authority, source-to-claim support, metric scope, official-source coverage, and branding/authorization risk.
evidence_report.json
Python-written. Converts semantic findings and Claim Ledger metadata into mode-aware evidence status, warnings, blockers, and verification paths.
release_readiness_report.json
Python-written. Aggregates workflow, audit, quality gates, evidence report, finalize report, approval record, branding/use-boundary, and package hygiene into release status.
human_approval_ledger.json
Python-written from explicit human/reviewer commands. Records required reviewer status for
formal_release_candidatemode.Sub-issues / execution sequence
release_modeplumbing.evidence_reportcontrol surface and policy-pack blocker mapping.release_readiness_report, branding gate, approval gate, and package hygiene checks.Expected end-state example
A traceable but not release-ready run should be able to produce:
Acceptance criteria
release_modeis a first-class run concept.semantic_review_report.jsonexists and validates.evidence_report.jsonexists and validates.release_readiness_report.jsonexists and validates.human_approval_ledger.jsonexists and validates where required.workflow_status: passcan coexist withrelease_status: blocked.Non-goals