feat: improve language detection with multi-sampling #7
Open
Zonrotan wants to merge 2 commits into
Description
The new auto-detect works great, but it has a hidden flaw: when language=None, Whisper natively limits its language detection to the first 30 seconds of the audio. If a video starts with silent logos, a long musical intro, or background noise, Whisper often hallucinates a random language (e.g., guessing Thai for a Swedish video). Because Whisper locks in that first guess for the entire file, one bad detection ruins the transcription and triggers unnecessary translation logic.
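For context, the stock detection path looks roughly like this (a sketch against the openai-whisper API; the file name and model size are placeholders). pad_or_trim clips the waveform to exactly 30 seconds, so nothing past the intro ever reaches detect_language:

```python
import whisper

model = whisper.load_model("base")
audio = whisper.load_audio("video.mp4")   # placeholder input file

clip = whisper.pad_or_trim(audio)         # hard 30-second window
mel = whisper.log_mel_spectrogram(clip).to(model.device)
_, probs = model.detect_language(mel)
print(max(probs, key=probs.get))          # e.g. "th", even for Swedish speech
```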
Solution
This PR bypasses Whisper's 30-second limitation with a lightweight detect_robust_language helper function. Instead of trusting just the first 30 seconds, it:

- samples three snippets from the file,
- runs model.transcribe() on them, and
- keeps the language the snippets agree on (see the sketch below).

Why this approach?
Since the audio is already loaded as a numpy array, slicing it and running inference on these snippets takes only a fraction of a second and requires no external dependencies (like ffmpeg).
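Roughly, the helper looks like this (a simplified sketch: the evenly spaced snippet positions, 30-second snippet length, and Counter-based majority vote are illustrative details and may differ from the actual diff):

```python
from collections import Counter

import numpy as np

SAMPLE_RATE = 16_000    # Whisper's fixed input sample rate
SNIPPET_SECONDS = 30    # assumed snippet length (matches Whisper's window)

def detect_robust_language(model, audio: np.ndarray, num_samples: int = 3) -> str:
    """Detect the language from several snippets spread across the file."""
    snippet_len = SNIPPET_SECONDS * SAMPLE_RATE
    # Spread snippet start points evenly from the beginning to the end.
    starts = np.linspace(0, max(len(audio) - snippet_len, 0), num_samples)

    votes = []
    for start in starts.astype(int):
        snippet = audio[start:start + snippet_len]
        # transcribe() with no language set runs Whisper's own detection
        # on this snippet and reports it in result["language"].
        result = model.transcribe(snippet)
        votes.append(result["language"])

    # Majority vote; ties fall back to the earliest snippet's guess.
    return Counter(votes).most_common(1)[0][0]
```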
Changes Made

- Added a detect_robust_language helper in whisper.py.
- Its result is passed to the model.transcribe call (when no metadata is present).

Sounds good, no? 😃
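P.S. For anyone skimming, the call-site change boils down to something like this (function and parameter names here are illustrative, not the exact diff):

```python
def transcribe_with_robust_detect(model, audio, language=None, metadata_language=None):
    """Fall back to multi-sample detection only when no language is known."""
    if language is None and metadata_language is None:
        language = detect_robust_language(model, audio)  # helper from above
    return model.transcribe(audio, language=language)
```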