MERA-Evaluation / MERA Public

Notifications You must be signed in to change notification settings
Fork 7
Star 46

Code
Issues 7
Pull requests 9
Actions
Projects
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security and quality
Insights

Pull requests: MERA-Evaluation/MERA

Labels 18 Milestones 0

New pull request New

9 Open 7 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Luzitania init

#25 opened Apr 14, 2026 by 077136

Loading…

add riddles task

#24 opened Apr 1, 2026 by Alex-ast7

Loading…

Add MMReD: Dense Context Reasoning Benchmark

#23 opened Mar 25, 2026 by Fr0do

Loading…

add new_reason task

#21 opened Mar 10, 2026 by Alex-ast7

Loading…

add enantiosemy task

#20 opened Mar 6, 2026 by Alex-ast7

Loading…

add characters task

#19 opened Mar 6, 2026 by Alex-ast7

Loading…

add sage task code: TO_CHECK

Проверить корректность реализации задачи в LMEH. Запустить прогон.

dataset: TO_CHECK

Проверить формат и содержание сета на HF(PUBLIC)/ZIP или OBS(PRIVATE).

docs: TO_CHECK

Проверить корректность документации и метаинформации по сету.

new_dataset

The dataset for the new release

PRIVATE

Приватный датасет. Вопросы загружены на HF и доступны пользователям, ответы недоступны.

#18 opened Feb 17, 2026 by Alex-ast7

Loading…

adapted MERA text for common lm-eval fork

#16 opened Jan 16, 2026 by ZenMan123

Loading…

ruAIME dataset code: OK

Задача корректно реализована, прогон запускается и выдает метрики.

dataset: OK

Формат и содержание сета корректны

docs: OK

Документация и метаинформация по сету написаны корректно.

new_dataset

The dataset for the new release

PUBLIC

Публичный датасет. Вопросы и ответы загружены на HF и доступны пользователям.

#15 opened Sep 23, 2025 by antoshkaxxr

Loading…

ProTip! What’s not been updated in a month: updated:<2026-03-20.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!