feat(loop): automated watch-CI skill + loop-log observability (delivery-loop hardening #4+#5) by helebest · Pull Request #143 · OpenDIKW/dikw-web

helebest · 2026-06-29T14:53:22Z

What

Two coupled items from the delivery-loop hardening effort (docs/adr/0005-delivery-loop-hardening.md), combined since #5 exists to feed #4:

#4 — dikw-web-watch-ci skill. Turns delivery-loop step 8 ("monitor CI and PR comments; resolve then merge") from prose into a bounded, executable loop:

gh pr checks <N> --watch → classify a failure → route a real one to the fresh-context fixer → re-push → re-watch
Brakes: MAX_ROUNDS = 3, a same-failure circuit breaker, and a one-rerun budget for the single known flake (graph.spec.ts Pixi)
Always pulls the review prose (reviews / pulls/{N}/comments / issues/{N}/comments) before merge, and merges explicitly (--squash --delete-branch, never --auto, which outraces the review)

#5 — scripts/loop-log.mjs. Appends one structured JSON line per transition ({ts, event, detail}) to a gitignored .loop-log.jsonl, so an autonomous/background run is diagnosable after the fact (which of the four loop deaths happened — runaway / stuck / flake / silent death). The watch-ci skill logs iter_start / flake_rerun / fixer / ci_fail / merged.

Why

Implements Delba Oliveira's bundled "watch CI and fix failures as they come in" plus h100envy's loop-engineering brakes (iteration cap, circuit breaker, structured log). Fixes the real timing bug where an independent review is outraced by --auto. This is the last pair of the 5-item plan (item #1 reward-hacking gate and item #3 measured perf/a11y already merged; #2 was resolved by #140).

Test / verification

scripts/loop-log.test.mjs — 4 unit tests (TDD): single-line JSON, multi-line detail escaping, default detail, JSONL append accumulation.
Full suite green locally: 941 tests, lint + format:check clean.
npm run check:gate passes with no label (touches no verification machinery — a clean demonstration of the normal path).
Deliberately not an on-disk state protocol / container isolation / token accounting — see ADR 0005 "Excluded" (over-engineering for an interactive loop).

🤖 Generated with Claude Code

…ry-loop hardening #4+#5) Turn delivery-loop step 8 into a bounded, self-logging loop. New dikw-web-watch-ci skill: gh pr checks --watch → classify → route real failures to the fresh-context fixer → re-push, gated by MAX_ROUNDS=3 + a same-failure circuit breaker + a one-rerun budget for the known graph.spec.ts Pixi flake; always pulls review prose before merge; merges explicitly (never --auto, which outraces the review). scripts/loop-log.mjs appends one structured JSON line per transition to a gitignored .loop-log.jsonl so an autonomous run is diagnosable. Wires into dikw-web-delivery-workflow step 9 + CLAUDE.md; ADR 0005 items 4+5.

coderabbitai · 2026-06-29T14:53:38Z

Warning

Review limit reached

@helebest, you've reached your PR review limit, so we couldn't start this review.

Next review available in: 29 minutes

Enable usage-based reviews in Billing to review now. Otherwise, wait until the next included review is available.
You're only billed for reviews past your plan's rate limits ($0.25/file).

How can I continue?

After more reviews become available, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

To avoid repeated limits, reduce automatic review volume by pausing incremental auto-reviews earlier, using label-based review opt-in, excluding WIP or generated PR titles, or requesting reviews manually when the PR is ready. If your team needs uninterrupted high-volume reviews, an organization admin can enable usage-based reviews.

How do review limits work?

CodeRabbit enforces per-developer PR review limits for each organization. Most developers receive the normal plan review availability.

For paid Pro and Pro+ PR reviews, CodeRabbit uses adaptive limits for sustained high-volume activity. When a developer's recent PR review activity reaches the 95th percentile or higher among CodeRabbit users, additional reviews become available more gradually as earlier reviews age out of the rolling window.

Please refer docs for additional details.

Review details

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro Plus

Run ID: dbab8467-4a14-4e71-9bfd-37e7590440e3

📥 Commits

Reviewing files that changed from the base of the PR and between e5eb85f and 28d6184.

📒 Files selected for processing (9)

.claude/skills/dikw-web-delivery-workflow/SKILL.md
.claude/skills/dikw-web-watch-ci/SKILL.md
.gitignore
CHANGELOG.md
CLAUDE.md
docs/adr/0005-delivery-loop-hardening.md
package.json
scripts/loop-log.mjs
scripts/loop-log.test.mjs

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch ci/watch-ci-automation

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands.}

helebest merged commit b2df975 into main Jun 29, 2026
10 checks passed

helebest deleted the ci/watch-ci-automation branch June 29, 2026 14:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(loop): automated watch-CI skill + loop-log observability (delivery-loop hardening #4+#5)#143

feat(loop): automated watch-CI skill + loop-log observability (delivery-loop hardening #4+#5)#143
helebest merged 1 commit into
mainfrom
ci/watch-ci-automation

helebest commented Jun 29, 2026

Uh oh!

coderabbitai Bot commented Jun 29, 2026

Review limit reached

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

helebest commented Jun 29, 2026

What

Why

Test / verification

Uh oh!

coderabbitai Bot commented Jun 29, 2026

Review limit reached

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant