feat: multi-dimensional quality scoring for structured outputs by sungdark · Pull Request #5 · Mint-Claw/content-split

sungdark · 2026-02-26T07:37:28Z

Summary

Implements bounty #1: scoring structured submissions (JSON/markdown/code/text) with a weighted 0-1 quality score and per-dimension feedback.

Included

- format auto-detection
- 5 rubric dimensions: completeness, format compliance, coverage, clarity, validity
- weighted output schema:
- schema validation
- multi-format scoring coverage
- benchmark check for 100 submissions <10s

Validation

Run:

All tests pass.

… helper

sungdark · 2026-02-26T09:16:07Z

Follow-up update from my side:

I pushed an improvement commit to make the scorer easier to tune and review:

Added configurable weights (auto-normalized)
Improved per-dimension feedback wording
Added evaluate_against_ground_truth() helper for dataset-level error reporting
Expanded tests to include schema, format coverage, benchmark, and ground-truth alignment checks

Happy to tighten/adjust thresholds if you share your preferred calibration dataset.

sungdark added 2 commits February 26, 2026 07:37

feat: add multi-dimensional quality scorer with tests

e463da6

refactor: improve scoring config, feedback quality, and gt evaluation…

de47dc3

… helper

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: multi-dimensional quality scoring for structured outputs#5

feat: multi-dimensional quality scoring for structured outputs#5
sungdark wants to merge 2 commits intoMint-Claw:mainfrom
sungdark:bounty-1-quality-scorer

sungdark commented Feb 26, 2026

weighted output schema:

Uh oh!

sungdark commented Feb 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

sungdark commented Feb 26, 2026

Summary

Included

weighted output schema:

Validation

Uh oh!

sungdark commented Feb 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant