Turn-Gate/.github

TurnGate Project: Detecting Malicious Intent

Official organization for the research paper "One Turn Too Late: Response-Aware Defense Against Hidden Malicious Intent in Multi-Turn Dialogue". The paper introduces a novel turn-level monitor that identifies the earliest turn at which a multi-turn interaction becomes sufficient for harm, providing a robust defense against state-of-the-art adaptive attackers such as the CKA-Agent.
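
To make the idea of a turn-level monitor concrete, here is a minimal toy sketch in Python. It is not the paper's method or code: the names `earliest_sufficient_turn` and `toy_scorer`, the placeholder terms, and the threshold are all assumptions made for illustration. The only idea taken from the description above is the shape of the problem: score the dialogue so far, including the assistant's responses, and report the first turn at which the accumulated content crosses a threshold.

```python
from typing import Callable, List, Optional, Tuple

# One turn = (user message, assistant response). Responses are included
# because the monitor described above is response-aware.
Turn = Tuple[str, str]


def earliest_sufficient_turn(
    dialogue: List[Turn],
    score_prefix: Callable[[List[Turn]], float],
    threshold: float,
) -> Optional[int]:
    """Return the 1-based index of the first turn whose prefix score
    reaches the threshold, or None if no prefix does."""
    for t in range(1, len(dialogue) + 1):
        if score_prefix(dialogue[:t]) >= threshold:
            return t
    return None


def toy_scorer(prefix: List[Turn]) -> float:
    """Hypothetical stand-in for a learned scorer: the fraction of
    placeholder 'pieces' that have appeared anywhere in the prefix."""
    pieces = ("piece-a", "piece-b", "piece-c")
    text = " ".join(user + " " + reply for user, reply in prefix).lower()
    return sum(1 for p in pieces if p in text) / len(pieces)


dialogue = [
    ("q1", "here is piece-a"),
    ("q2", "here is piece-b"),
    ("q3", "here is piece-c"),
    ("q4", "thanks"),
]

# No single turn contains all pieces; the prefix first does so at turn 3.
print(earliest_sufficient_turn(dialogue, toy_scorer, threshold=1.0))  # → 3
```

The point of the toy is that each turn looks harmless in isolation, while the prefix ending at turn 3 is the first that contains every piece. A real monitor would replace `toy_scorer` with a trained model.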
