Features

Authorization context analyzer. Describes code, systems, and behaviors by what they do versus what they assume the right to do.

Features • Installation • Usage • Input format • Severity • Corpus • Scope

cortex parses a structured markdown description of any artifact into three sections (SKELETON, VIOLATIONS, CONTEXT) and emits JSON, markdown, and HTML reports with severity scoring. The thesis: operations are neutral. Context is what makes behavior harmful. Code cannot express intent or consent. cortex formats the external metadata into a reviewable, citable analysis.

The gap between what something does (SKELETON) and what it assumes the right to do (VIOLATIONS) is the load-bearing metric. A positive gap signals social-engineering or cognitive-layer attacks. A negative gap signals living-off-the-land, supply-chain, or boundary-test samples. A generic LLM verdict says "this looks malicious." cortex says "5 operations, 8 violations, critical, see the bullets and the cited primary source." A human reviewer can read the markdown and check every claim.

The repo ships a 14-sample reference corpus spanning historical worms, APTs, ICS safety-system attacks, social engineering, LLM-layer prompt injection, and null-violation boundary tests (Mimikatz, mgeeky red-teaming).

Warning

Do not use cortex for this loop:

Draft a cortex MD for a malicious script or prompt (with LLM help)
Run the analyzer to see the explicit violations and context gaps
Edit the original artifact to hide or reduce those gaps
Repeat until a generic LLM no longer flags it

Features

Three-section input format: SKELETON, VIOLATIONS, CONTEXT (REFERENCES optional)
Severity scoring keyed to violation count plus weighted context, with a violations <= 1 cap that holds boundary-test samples at informational
Side-by-side compare across all three sections, plus a summary row covering severity, counts, and the skeleton-to-violation gap
validate subcommand returns exit 0 when all three sections parse non-empty
JSON, markdown, and optional HTML output (dark theme)
14-sample reference corpus across four delivery media, citable side-by-side
Standard library only. Python 3.10 or later. No pip install required.

Installation

git clone https://github.com/nuclide-research/cortex-framework
cd cortex-framework

Usage

python3 analyzer.py analyze examples/iloveyou.md
python3 analyzer.py analyze examples/xz_utils_backdoor.md --html
python3 analyzer.py validate examples/stuxnet.md
python3 analyzer.py compare examples/iloveyou.md examples/conficker.md

Outputs land in output/reports/ by default:

<name>.json - structured data
<name>_report.md - human-readable markdown
<name>_report.html - optional HTML (dark theme)

Full help (analyze flags)

Flag	Default	Effect
`file`	required	input markdown file
`--json PATH`	`output/reports/<stem>.json`	JSON output path
`--report PATH`	`output/reports/<stem>_report.md`	markdown report path
`--html [PATH]`	off	also emit HTML (dark theme), path optional
`--output-dir DIR`	`output/reports`	base output directory
`--force`	off	write output even when validation fails
`-q, --quiet`	off	suppress the startup banner
`--banner`	off	force the banner even when stderr is not a TTY
`--version`		print version and exit

Using cortex with a coding agent

The fastest way to produce a structured analysis of an unknown sample:

Clone this repo next to your sample:

git clone https://github.com/nuclide-research/cortex-framework

Paste a prompt like this into your coding agent:

Read framework.md and a few examples/*.md to learn the cortex format. Then analyze <path/to/sample> and write examples/<short-name>.md with SKELETON, VIOLATIONS, CONTEXT, and an optional REFERENCES section. Then run python3 analyzer.py analyze examples/<short-name>.md --html and show me the output.

The agent reads the methodology plus sample analyses, drafts a three-section cortex markdown, and runs the analyzer. The output:

A reviewable analysis where every authorization claim is a falsifiable bullet, not an opaque verdict
Deterministic JSON, markdown, and HTML reports from the analyzer
Side-by-side comparability against the 14-sample reference corpus
A citable artifact that holds up in a bug report, disclosure writeup, or incident review

The structure is enforced by the framework. The drafting uses the model. The authorization claims stay falsifiable by any human reviewer who reads the markdown.

See framework.md for the full methodology, violation-class taxonomy, and reference analyses.

Input format

# Subject - Authorization Context Analysis

## SKELETON
- functional operation 1
- functional operation 2

## VIOLATIONS
- Assumes right to X without authorization
- Zero checks before Y

## CONTEXT THAT MAKES IT BAD
- Impact on owner or system
- Deception involved

## REFERENCES
- Primary source 1

Section headers are case-insensitive. Aliases: AUTHORIZATION VIOLATIONS for VIOLATIONS, CONTEXT or BAD CONTEXT for the context section. REFERENCES is optional.

Severity scoring

if violations <= 1:  severity = informational
else:
    score = violations + (context_notes * 0.5)
    score >= 10  ->  critical
    score >=  6  ->  high
    score >=  3  ->  medium
    otherwise    ->  low

The violations <= 1 cap prevents boundary-test samples (Mimikatz, mgeeky tooling) from inheriting inflated severity from heavy CONTEXT sections that are framework commentary rather than authorization harm.

The report also prints the skeleton-to-violation gap (violations - operations). A positive gap is the tell for social-engineering or cognitive-layer attacks (the attacker claims more rights than operations they visibly perform). A deeply negative gap is the tell for LOTL, supply-chain, or boundary-test samples (many operations, fewer explicit rights claims).

Output shape (JSON)

{
  "analysis_name": "ILOVEYOU Worm",
  "source_file": "examples/iloveyou.md",
  "analyzed_at": "2026-06-03T12:00:00Z",
  "skeleton": ["Reads address book from Outlook", "Sends itself as email attachment"],
  "violations": ["Assumes right to read all Outlook contacts without consent", "..."],
  "context": ["Caused $10B in damage globally", "..."],
  "analysis_summary": {
    "total_operations": 5,
    "total_violations": 8,
    "total_context_notes": 6,
    "skeleton_violation_gap": 3,
    "severity": "critical"
  },
  "parse_warnings": []
}

Reference corpus

14 samples in examples/:

Category	Samples
Historical worms and APTs	ILOVEYOU, Conficker, Stuxnet, WannaCry, NotPetya, SUNBURST, Volt Typhoon, xz-utils, 3CX cascade
ICS safety-system attacks	Triton, TRISIS
Social engineering	Phishing, 419 advance-fee scams
LLM-layer attacks	Prompt injection, jailbreak chains
Boundary tests (null-violation samples)	gentilkiwi/mimikatz, mgeeky/Penetration-Testing-Tools

See framework.md § Reference Analyses for the full metrics table (ops, violations, context, gap, severity) across all 14.

Repository layout

cortex-framework/
├── analyzer.py        CLI: analyze / validate / compare subcommands
├── framework.md       Methodology, violation-class taxonomy, reference metrics table
├── genesis.md         Motivating examples and origin
├── examples/          14 cortex markdown analyses
├── output/reports/    Generated JSON, markdown, HTML artifacts
└── assets/            Banner SVG

Scope

cortex is not a detector. It does not classify samples automatically. It formats human-authored descriptions into a structured, comparable, citable report. The drafting step requires a person (or a coding agent reading framework.md) to write the SKELETON, VIOLATIONS, and CONTEXT sections. The analyzer parses and scores what you write. It does not infer intent from code.

Our other projects

aimap - vulnerability scanner for AI and ML infrastructure
warrant - book-grounded coding agent
VisorLog - finding ledger and ingest pipeline
VisorCorpus - adversarial corpus for LLM-adjacent surfaces
AI-LLM-Infrastructure-OSINT - population-scale exposed-AI-infrastructure surveys

License

MIT. Part of the NuClide toolchain. Contact: nuclide-research.com

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
assets		assets
examples		examples
output/reports		output/reports
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
analyzer.py		analyzer.py
framework.md		framework.md
genesis.md		genesis.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Authorization context analyzer. Describes code, systems, and behaviors by what they do versus what they assume the right to do.

Features

Installation

Usage

Using cortex with a coding agent

Input format

Severity scoring

Output shape (JSON)

Reference corpus

Repository layout

Scope

Our other projects

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Authorization context analyzer. Describes code, systems, and behaviors by what they do versus what they assume the right to do.

Features

Installation

Usage

Using cortex with a coding agent

Input format

Severity scoring

Output shape (JSON)

Reference corpus

Repository layout

Scope

Our other projects

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages