Record: Order-Adaptive BackoffMixer (mean val_bpb=0.5440) #825

Open
hypery11 wants to merge 1 commit into openai:main from hypery11:submission/2026-03-26_final_champion

@hypery11

Results

| Seed | val_bpb | Eval time |
|------|---------|-----------|
| 42   | 0.5437  | ~391s     |
| 1337 | 0.5450  | ~391s     |
| 2024 | 0.5434  | ~391s     |
| Mean | 0.5440  |           |
| Std  | 0.0008  |           |
  • Artifact: ~16.0 MB
  • Train: 600s on 8xH100 SXM
  • Eval: ~391s (well under 600s)
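
For anyone re-checking the aggregate numbers, here is a minimal sketch that recomputes the mean and spread from the three per-seed values listed above. The variable names are illustrative. The PR does not say whether the reported std is the sample or the population estimate, so both are computed; because the per-seed values are already rounded to four decimals, the last digit of a recomputed std may not match the reported figure.

```python
import statistics

# Per-seed val_bpb values as reported in the Results table.
seed_bpb = {42: 0.5437, 1337: 0.5450, 2024: 0.5434}
values = list(seed_bpb.values())

mean_bpb = statistics.mean(values)
sample_std = statistics.stdev(values)       # n - 1 denominator
population_std = statistics.pstdev(values)  # n denominator

print(f"mean val_bpb   = {mean_bpb:.4f}")
print(f"sample std     = {sample_std:.6f}")
print(f"population std = {population_std:.6f}")
```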

Method

11-layer transformer (512d, 8/8 full MHA, XSA-all, LeakyReLU(0.5)^2, 3.5x MLP). Order-adaptive entropy-gated BackoffNgramMixer with per-order entropy thresholds. Score-first, backward-looking, deterministic.
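
To make the mixer description concrete, here is a rough sketch of one way an entropy-gated backoff n-gram mixer could work. It is not the code in this PR. The class name, the per-order thresholds, the fixed `mix_weight`, and the linear interpolation rule are all assumptions chosen for illustration. What it tries to show is the stated behaviour: each token is scored before it is added to the counts (score-first), counts come only from already-scored tokens (backward-looking), there is no randomness (deterministic), and an n-gram order is consulted only when the model's predictive entropy reaches that order's threshold.

```python
import math
from collections import defaultdict


class BackoffNgramMixerSketch:
    """Illustrative only: names, thresholds and mixing rule are assumptions."""

    def __init__(self, entropy_thresholds, mix_weight=0.5):
        # Maps n-gram order -> minimum model entropy (in nats) at which
        # that order may contribute. Values are hypothetical.
        self.entropy_thresholds = dict(entropy_thresholds)
        self.mix_weight = mix_weight
        # counts[n][context][token] = times token followed context so far.
        self.counts = {n: defaultdict(lambda: defaultdict(int))
                       for n in self.entropy_thresholds}
        self.history = []

    def _context(self, n):
        # Last n - 1 scored tokens, or None if the history is too short.
        k = n - 1
        if len(self.history) < k:
            return None
        return tuple(self.history[len(self.history) - k:])

    def _ngram_dist(self, model_entropy):
        # Back off from the highest order to the lowest; skip any order
        # whose entropy gate is closed or whose context is unseen.
        for n in sorted(self.entropy_thresholds, reverse=True):
            if model_entropy < self.entropy_thresholds[n]:
                continue
            ctx = self._context(n)
            if ctx is None:
                continue
            followers = self.counts[n].get(ctx)
            if followers:
                total = sum(followers.values())
                return {tok: c / total for tok, c in followers.items()}
        return None

    def score(self, model_probs, target):
        # 1) Score first, using only counts from previously scored tokens.
        entropy = -sum(p * math.log(p) for p in model_probs.values() if p > 0)
        ngram = self._ngram_dist(entropy)
        p = model_probs.get(target, 0.0)
        if ngram is not None:
            p = (1 - self.mix_weight) * p + self.mix_weight * ngram.get(target, 0.0)
        loss_bits = -math.log2(max(p, 1e-12))
        # 2) Only then update the counts with the token just scored.
        for n in self.entropy_thresholds:
            ctx = self._context(n)
            if ctx is not None:
                self.counts[n][ctx][target] += 1
        self.history.append(target)
        return loss_bits


# Toy usage: a uniform "model" over four symbols on a repetitive stream.
uniform = {t: 0.25 for t in "abcd"}
mixer = BackoffNgramMixerSketch({1: 0.5, 2: 1.0, 3: 1.0})
losses = [mixer.score(uniform, t) for t in "abababab"]
print(losses)
```

In this toy run the per-token loss drops once the repeated pattern has been seen, while a confident model prediction (entropy below every threshold) bypasses the n-gram tables entirely.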

Acknowledgments

Huge thanks to the incredible community that made this possible.

This competition has been an amazing collaborative experience. Every improvement here builds on ideas shared openly.

  • 8xH100 SXM, train <=600s
  • Eval <=600s (391s)
  • Artifact <=16MB
  • 3-seed validation (std 0.0008)

Seeds: 0.5437 / 0.5450 / 0.5434 (std 0.0008).
Order-adaptive entropy gating + BackoffNgramMixer.
~16MB artifact. Train 600s, eval 391s.