Skip to content

docs(scope): 18th c.+ cursive Hebrew script; Yiddish in scope#36

Merged
shaypal5 merged 1 commit into
mainfrom
scope/handwriting-era-and-yiddish
May 24, 2026
Merged

docs(scope): 18th c.+ cursive Hebrew script; Yiddish in scope#36
shaypal5 merged 1 commit into
mainfrom
scope/handwriting-era-and-yiddish

Conversation

@shaypal5

Copy link
Copy Markdown
Contributor

What

Tightens the corpus scope based on explicit project requirements:

  • Minimum date 18th century (~1700) — medieval scribal hands are out of scope
  • Script target is everyday cursive כתב יד (not דפוס/printed/typeset/lithographed)
  • Yiddish in Hebrew script is in scope — same round letter-shapes as Hebrew cursive
  • Judeo-Arabic (Arabic script) is explicitly out of scope
  • README opening paragraph updated to match

Why

The dataset is intended for Hebrew handwriting recognition. Medieval scribal hands and printed material use fundamentally different letter-forms and aren't useful training data for modern Hebrew handwriting models.

🤖 Generated with Claude Code

…add Yiddish

- Minimum date is now 18th century (~1700); medieval scribal hands are out
- Explicit target: everyday cursive כתב יד (not דפוס/printed)
- Yiddish in Hebrew script is explicitly in scope (same letter-shapes)
- Judeo-Arabic (Arabic script) is explicitly out of scope
- README opening paragraph updated to match

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@shaypal5 shaypal5 merged commit ce50475 into main May 24, 2026
1 check passed
@shaypal5 shaypal5 deleted the scope/handwriting-era-and-yiddish branch May 24, 2026 20:58
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant