Problem
Native /codex:review consistently returns at most 2–3 findings per run, regardless of diff size. On a ~1000-line refactor touching 13 files, the reviewer found real P1/P2 bugs — but only 2–3 per iteration. It took 8 sequential review→fix cycles to surface ~20 legitimate issues that were all present in the original diff.
This forces a tedious manual loop:
/codex:review → get 3 findings
- Fix them
/codex:review → get 3 more
- Fix them
- Repeat 6+ more times
Each cycle costs tokens and wall-clock time. The reviewer clearly can find more issues — it just stops early.
Expected behavior
The reviewer should return all material findings it discovers, not an arbitrary subset. At minimum, provide a way to configure the cap:
/codex:review --max-findings 10
or a setting in /codex:setup:
{ "reviewMaxFindings": 10 }
Context
Workaround
Currently using iterative review→fix loops, which works but is slow and expensive.
This issue was co-authored with Claude Code (Opus 4.6) during a real review session where the 3-finding cap was the bottleneck.
Problem
Native
/codex:reviewconsistently returns at most 2–3 findings per run, regardless of diff size. On a ~1000-line refactor touching 13 files, the reviewer found real P1/P2 bugs — but only 2–3 per iteration. It took 8 sequential review→fix cycles to surface ~20 legitimate issues that were all present in the original diff.This forces a tedious manual loop:
/codex:review→ get 3 findings/codex:review→ get 3 moreEach cycle costs tokens and wall-clock time. The reviewer clearly can find more issues — it just stops early.
Expected behavior
The reviewer should return all material findings it discovers, not an arbitrary subset. At minimum, provide a way to configure the cap:
or a setting in
/codex:setup:{ "reviewMaxFindings": 10 }Context
/codex:adversarial-reviewuses a local prompt (prompts/adversarial-review.md) without a hard cap — but native review goes through thereview/startserver API where the limit appears to be baked in.Workaround
Currently using iterative review→fix loops, which works but is slow and expensive.
This issue was co-authored with Claude Code (Opus 4.6) during a real review session where the 3-finding cap was the bottleneck.