fix: prevent TypeError in metrics collect_metrics when reward is None by JasonOA888 · Pull Request #243 · benchflow-ai/benchflow

JasonOA888 · 2026-05-07T03:16:48Z

collect_metrics picks the best result per task across retries by comparing reward values. When a verifier returns rewards={"reward": None, "rubric": [...]} (a truthy dict with a None reward), the comparison float > None raises TypeError.

This can happen when a rubric verifier returns partial results — individual items pass but the overall reward is unresolved.

Fix: add _safe_reward() helper that normalizes None/missing reward values to 0.0 before comparison.

Repro:

# First attempt: rewards with None reward (rubric verifier partial result)
# Second attempt: rewards with 0.5 reward
# Comparison: 0.5 > None → TypeError!
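
The repro code did not survive on this page, only its comments. A self-contained sketch of the failing comparison, using made-up attempt dicts rather than the real collect_metrics inputs, could look like this:

```python
# Hypothetical attempt results; the real structures in benchflow may differ.
first_attempt = {"reward": None, "rubric": [{"item": "a", "passed": True}]}
second_attempt = {"reward": 0.5}


def compare_raw() -> str:
    """Compare the raw reward values the way a naive best-of-retries would."""
    try:
        _ = second_attempt["reward"] > first_attempt["reward"]
    except TypeError as exc:
        # Python 3 refuses to order a float against None.
        return type(exc).__name__
    return "no error"


outcome = compare_raw()
```

Running this leaves outcome set to "TypeError", which is the failure the description reports.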


devin-ai-integration

✅ Devin Review: No Issues Found

Devin Review analyzed this PR and found no potential bugs to report.


devin-ai-integration Bot reviewed May 7, 2026

JasonOA888 wants to merge 1 commit into benchflow-ai:main from JasonOA888:fix/metrics-reward-comparison-typeerror
