Skip to content

Conversation

@abrichr
Copy link
Member

@abrichr abrichr commented Jan 17, 2026

Summary

This PR adds qualifiers to claims in the README to ensure intellectual honesty, per the publication roadmap.

Changes:

  • Change "Core Innovation" to "Core Approach" (more accurate positioning)
  • Change "key differentiator" to "explores" (less marketing language)
  • Correct accuracy figure (46.7% -> 100%, not 33% -> 100%)
  • Add context that all 45 tasks in the benchmark share the same navigation entry point
  • Link to publication roadmap for methodology and limitations
  • Change "No technical expertise needed" to "Reduced prompt engineering" (more accurate)

Rationale

The goal is accuracy over marketing appeal. Per the publication roadmap:

  • The demo-conditioning result is valid but has specific scope
  • Claims should be defensible to a skeptical reviewer
  • Better to be accurate than to oversell

Test plan

  • README renders correctly
  • Link to publication roadmap is valid (relative link)
  • Claims align with documented evidence

Generated with Claude Code

- Change "Core Innovation" to "Core Approach" (more accurate)
- Change "key differentiator" to "explores" (less marketing)
- Correct accuracy figure (46.7% -> 100%, not 33% -> 100%)
- Add context that all 45 tasks share same navigation entry point
- Link to publication roadmap for methodology and limitations
- Change "No technical expertise needed" to "Reduced prompt engineering"

The goal is accuracy over marketing appeal.

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants