Skip to content

Conversation

@Lingxiao-code
Copy link

Submitting DeBERTaV3-ChatGLM-Detector predictions and metadata for RAID benchmark evaluation

@github-actions
Copy link

Eval run succeeded! Link to run: link

Here are the results of the submission(s):

DeBERTaV3-ChatGLM-Detector

Release date: 2026-01-23

I've committed detailed results of this detector's performance on the test set to this PR.

On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 68.87 and a TPR of 35.75% at FPR=5% and 26.48% at FPR=1%.
Without adversarial attacks, it achieved AUROC of 79.72 and a TPR of 50.81% at FPR=5% and 39.65% at FPR=1%.

If all looks well, a maintainer will come by soon to merge this PR and your entry/entries will appear on the leaderboard. If you need to make any changes, feel free to push new commits to this PR. Thanks for submitting to RAID!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants