Structured I/O format for LLM coding agents. ~200 tokens vs ~2000 of prose.
CACP replaces free-form agent responses with typed fields that orchestrators parse programmatically — no LLM needed to understand the result.
The dispatch prompt:

```
TASK: Implement JWT auth middleware
CONTEXT: Go backend, chi router
ACCEPTANCE: 1. Middleware validates Bearer tokens 2. Tests pass
SCOPE: src/middleware/
VERIFY: go test ./...
DONE: return STATUS format
```

The agent's response:

```
STATUS:ok
FILES_CREATED:src/middleware/jwt.go,src/middleware/jwt_test.go
FILES_MODIFIED:go.mod
TESTS:pass:12
BUILD:pass
LEARNED:JWT tokens need 24h expiry for mobile clients
```
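An orchestrator can turn a response block like the one above into a plain map with no LLM in the loop. A minimal sketch in Go (function names and package layout are illustrative, not part of the spec):

```go
package main

import (
	"fmt"
	"strings"
)

// parseResponse splits a CACP response into field/value pairs.
// Field names are upper-cased; comma-separated list fields
// (FILES_CREATED, FILES_MODIFIED) can be split by the caller.
func parseResponse(resp string) map[string]string {
	fields := map[string]string{}
	for _, line := range strings.Split(resp, "\n") {
		name, value, found := strings.Cut(line, ":")
		if !found {
			continue
		}
		fields[strings.ToUpper(name)] = strings.TrimLeft(value, " \t")
	}
	return fields
}

func main() {
	resp := "STATUS:ok\nFILES_CREATED:src/middleware/jwt.go,src/middleware/jwt_test.go\nTESTS:pass:12"
	fields := parseResponse(resp)
	fmt.Println(fields["STATUS"])                               // ok
	fmt.Println(strings.Split(fields["FILES_CREATED"], ",")[1]) // src/middleware/jwt_test.go
}
```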
Without CACP, agents return ~2,000 tokens of prose explaining what they did. With CACP, the same information fits in ~200 tokens that the orchestrator parses programmatically.
| Without CACP | With CACP |
|---|---|
| "I've completed the task. I created two new files..." (~2,000 tokens) | `STATUS:ok`<br>`FILES_CREATED:jwt.go,jwt_test.go` (~200 tokens) |
| Need LLM to parse response | Regex/JSON parse |
| Ambiguous success/failure | Explicit STATUS field |
| No structured file tracking | FILES_CREATED/MODIFIED lists |
The canonical wire format is FIELD:value with no whitespace between the colon and the value, for token compactness (the "C" in CACP). This is what emitters SHOULD produce.
Parsers MUST tolerate whitespace variants — single space, multiple spaces, and tab — between the colon and the value. All four of these MUST parse identically:
```
STATUS:ok        (no whitespace — canonical)
STATUS: ok       (single space)
STATUS:  ok      (multiple spaces)
STATUS:<TAB>ok   (tab)
```
Parsers MUST treat field names as case-insensitive: `STATUS:`, `status:`, `Status:`, and `STATus:` all denote the same field.
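A minimal line parser implementing the whitespace- and case-tolerance rules above, sketched in Go (the function and regex names are illustrative, not part of the spec):

```go
package main

import (
	"fmt"
	"regexp"
	"strings"
)

// fieldLine captures the field name and value; `[ \t]*` tolerates no
// whitespace, single or multiple spaces, or a tab after the colon.
var fieldLine = regexp.MustCompile(`^([A-Za-z_]+):[ \t]*(.*)$`)

// parseLine returns the canonical upper-cased field name and the raw
// value, or ok=false when the line is not a CACP field line.
func parseLine(line string) (field, value string, ok bool) {
	m := fieldLine.FindStringSubmatch(strings.TrimRight(line, "\r\n"))
	if m == nil {
		return "", "", false
	}
	return strings.ToUpper(m[1]), m[2], true
}

func main() {
	for _, l := range []string{"STATUS:ok", "STATUS: ok", "status:\tok"} {
		f, v, _ := parseLine(l)
		fmt.Printf("%s=%s\n", f, v) // each prints STATUS=ok
	}
}
```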
Parsers MUST accept the following `STATUS` values:

`ok`, `fail`, `partial`, `needs_decision`, `no_changes`, `decomposed`, `rejected`, `retry`, `fixture_gap`

Parsers MUST accept the following `TESTS` and `BUILD` values:

`pass`, `fail`, `skip` — with an optional `:N` count for `TESTS` (e.g. `TESTS:pass:42`).
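A sketch of validating a `TESTS` value and extracting the optional count, in Go (the helper name and the `-1` no-count sentinel are illustrative choices, not part of the spec):

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// parseTests splits a TESTS value such as "pass:42" into the result
// keyword and an optional count; count is -1 when no count is present.
func parseTests(value string) (result string, count int, err error) {
	parts := strings.SplitN(value, ":", 2)
	result = strings.ToLower(strings.TrimSpace(parts[0]))
	switch result {
	case "pass", "fail", "skip":
	default:
		return "", 0, fmt.Errorf("unknown result %q", result)
	}
	if len(parts) == 1 {
		return result, -1, nil // bare "pass" / "fail" / "skip"
	}
	count, err = strconv.Atoi(parts[1])
	if err != nil {
		return "", 0, fmt.Errorf("bad count %q: %w", parts[1], err)
	}
	return result, count, nil
}

func main() {
	r, n, _ := parseTests("pass:42")
	fmt.Println(r, n) // pass 42
}
```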
The standard's whole value proposition is interop: multiple agents and dispatchers must agree on the wire format. Without tolerance rules, every implementer makes their own choice and the standard fragments. One real production cascade traced back to a single missing `\s*` in a parser regex: the system prompt taught the agent to emit `STATUS: ok` (with a space) while the parser enforced `STATUS:ok` (without). Five agent-side fixes were shipped before the root cause was identified as a parser bug.
Any conformant parser MUST pass these vectors:

| Input line | Parsed (field, value) |
|---|---|
| `STATUS:ok` | `("STATUS", "ok")` |
| `STATUS: ok` | `("STATUS", "ok")` |
| `STATUS:  ok` | `("STATUS", "ok")` |
| `STATUS:\tok` | `("STATUS", "ok")` |
| `status: ok` | `("STATUS", "ok")` |
| `Status:OK` | `("STATUS", "ok")` |
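A table-driven check over these vectors, sketched in Go. This harness lower-cases the parsed value because `STATUS` values are compared case-insensitively (per the `Status:OK` vector); a full parser would normalize only enumerated fields, not path-valued ones like `FILES_CREATED`:

```go
package main

import (
	"fmt"
	"regexp"
	"strings"
)

var fieldLine = regexp.MustCompile(`^([A-Za-z_]+):[ \t]*(.*)$`)

// parseLine upper-cases the field name and, for this STATUS-only
// harness, lower-cases the value.
func parseLine(line string) (field, value string, ok bool) {
	m := fieldLine.FindStringSubmatch(line)
	if m == nil {
		return "", "", false
	}
	return strings.ToUpper(m[1]), strings.ToLower(m[2]), true
}

func main() {
	vectors := []string{
		"STATUS:ok", "STATUS: ok", "STATUS:  ok",
		"STATUS:\tok", "status: ok", "Status:OK",
	}
	for _, in := range vectors {
		f, v, ok := parseLine(in)
		if !ok || f != "STATUS" || v != "ok" {
			fmt.Printf("FAIL: %q -> (%q, %q)\n", in, f, v)
			return
		}
	}
	fmt.Println("all vectors pass")
}
```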
Implementations SHOULD also run a prompt round-trip test: feed the literal system-prompt example block through the parser and assert it accepts everything the prompt teaches the agent to emit.
Spec in development. Used in production for multi-agent AI coding dispatch.
See benchmark compliance rates on ServingCard.
CACP is part of the standra.ai open standards stack:
- Axiom — Rule Definition Language; CACP is one of two normative serializations (alongside TOON). Axiom §17 standardizes the orchestration shape vocabulary, the complexity tier vocabulary, the `fixture_gap` status, and the `verification_runs[]` and `artifact_quality` schemas that CACP responses can carry.
- Pawbench — reference benchmark that scores LLMs against CACP-formatted prompts and responses, including the orchestration × complexity matrix.
- ServingCard — model serving config standard.
CACP responses can carry these additional fields for runners that report multi-dimensional dispatch results. Vocabularies are normative in Axiom §17:
- `complexity_tier` — `display`/`crud`/`transactional`/`cross_cutting`. Stratifies aggregate scores so display-tier passes don't mask transactional-tier cliffs.
- `verification_runs[]` — N-run AC re-verification, one record per run with `verdict`, `prompt_hash`, and `elapsed_ms`. Surfaces verifier flake.
- `artifact_quality` — static-analysis score over the changed files only; orthogonal to AC pass.
- `fixture_gap` (terminal status) — AC un-evaluable due to missing setup (seed data, env, services). Not counted against the agent; counts against the scenario author.
Pawbench is the reference implementation of these fields end-to-end.
MIT