Self-evolving skill systems for AI coding agents (Claude Code + Codex).
A curated collection of operational skills, comparison guides, and source-level analysis for getting the most out of AI coding agents -- particularly Claude Code and OpenAI Codex CLI. Focused on the "command center" use case: one human orchestrating multiple AI agents.
docs/codex-skills.md -- Complete 12-section guide covering:
- Three usage modes: MCP Tool, tmux interactive, non-interactive exec
- MCP Tool invocation (recommended): structured, 30-second results
- Prompt four elements: Goal, Context, Constraints, Done When
- Codex vs Claude Code capability comparison: when to use which
- Common stuck states and fixes: auth issues, sandbox failures, rate limits
- Command center best practices: MCP Tool, tmux session, AGENTS.md configuration
- Self-evolution loop: review -> fix -> test -> record skills -> update rules
- Real-world practice records: 16 rounds of A-grade code audit output
- MCO multi-CLI orchestration: parallel multi-model review
- Codex Plugin deep dive: setup, commands, review gate, internal skills
- Plugin internals: architecture, design principles, when to use what
docs/codex-source-study.md -- Analysis of the Codex CLI Rust codebase:
- Architecture: 80+ crate workspace, three runtime modes (TUI, Exec, MCP Server)
- Sandbox system: Why bwrap fails on certain servers, three-layer sandbox (bubblewrap + seccomp + Landlock), troubleshooting guide
- MCP Server integration: As client (connecting to external servers) and as server (exposing codex/codex-reply tools)
- Approval system: Four policies, granular approvals, guardian subagent
- Network proxy: Built-in HTTP + SOCKS5 proxy, domain allowlist/blocklist
- config.toml reference: Complete configuration with profiles, model providers, permissions
- Session management: Resume, fork, ephemeral modes
- Operator reference: Quick-reference tables for exec parameters, review usage, sandbox troubleshooting
- Give to Codex: Code review, independent bug fixes, test writing, large-scale refactoring, parallel tasks
- Give to Claude Code: Multi-step coordinated tasks, browser automation, MCP tool chains, video production
- Both together: Claude Code plans + Codex executes; Claude Code reviews Codex output
Round N:
1. Codex reviews code -> finds issues
2. Codex exec fixes issues
3. Run tests to verify
4. Record Skills (what worked / what failed)
5. Update AGENTS.md rules
6. -> Round N+1
Goal: What to change, why
Context: Related files, error messages, screenshots
Constraints: Standards, architecture requirements, security red lines
Done When: Completion criteria (tests pass / behavior changes / bug doesn't reproduce)
| Symptom | Cause | Fix |
|---|---|---|
Can't mount proc |
Container lacks proc mount permission | Codex auto-fallbacks, or add --no-proc |
bwrap not found |
bubblewrap not in PATH | apt install bubblewrap (vendored fallback exists) |
Operation not permitted on namespace |
Kernel blocks unprivileged user namespaces | sysctl kernel.unprivileged_userns_clone=1 |
| All sandbox fails in container | Docker restrictions | Use --sandbox danger-full-access |
sleep2agi — 专注 AI Agents 编排、群体智能、视频生成,帮助普通人跨越到 AGI。
MIT