x-zheng16

Follow

Xiang Zheng x-zheng16

Follow

Research Assistant Professor@HKAI-Sci

8 followers · 40 following

City University of Hong Kong
Hong Kong
https://x-zheng16.github.io
https://orcid.org/0000-0002-2990-2169

Achievements

Achievements

Highlights

Pro

Organizations

Pinned Loading

JustAsk JustAsk Public

JustAsk: Curious Code Agents Reveal System Prompts in Frontier LLMs

Python 8
OpenRedRL OpenRedRL Public

[FCS] OpenRedRL: A Light-Weight Benchmark for RL-Based Red Teaming

Python 5 1
CIM CIM Public

[IJCAI 2024] Constrained Intrinsic Motivation for RL

Python 4
IMAP IMAP Public

[DSN 2024] Toward Evaluating Robustness of Reinforcement Learning with Adversarial Policy

Python 8
CALM CALM Public

[AAAI 25] CALM: Curiosity-Driven Auditing for LLMs

Python 5 2
BlueSuffix BlueSuffix Public

Forked from Vinsonzyh/BlueSuffix

[ICLR 2025] Reinforced Blue Teaming for VLMs Against Jailbreak Attacks

Python 1