x-zheng16

Highlights

  • Pro

Organizations

@CongGroup @Tsinghua-Space-Robot-Learning-Group

Pinned

  1. JustAsk Public

    JustAsk: Curious Code Agents Reveal System Prompts in Frontier LLMs

    Python 8

  2. OpenRedRL Public

    [FCS] OpenRedRL: A Light-Weight Benchmark for RL-Based Red Teaming

    Python 5 1

  3. CIM Public

    [IJCAI 2024] Constrained Intrinsic Motivation for RL

    Python 4

  4. IMAP Public

    [DSN 2024] Toward Evaluating Robustness of Reinforcement Learning with Adversarial Policy

    Python 8

  5. CALM Public

    [AAAI 25] CALM: Curiosity-Driven Auditing for LLMs

    Python 5 2

  6. BlueSuffix Public

    Forked from Vinsonzyh/BlueSuffix

    [ICLR 2025] Reinforced Blue Teaming for VLMs Against Jailbreak Attacks

    Python 1