影响力指数

-/100

发表论文3 篇

平均评分5.2

年均产出1.5 篇/年

Kai Yuanqing Xiao

PhD student@Massachusetts Institute of Technology·OpenReview

研究方向

deep learning · robust machine learning · adversarial examples · verification · recreational computer science · algorithms

The Attacker Moves Second: Stronger Adaptive Attacks Bypass Defenses Against LLM Jailbreaks and Prompt Injections

ICLR 2026Rejected

The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions

ICLR 2025Rejected

Diverse and Effective Red Teaming with Auto-generated Rewards and Multi-step Reinforcement Learning

ICLR 2025Rejected

合作者 (18)

Johannes Heidecke

Abhradeep Guha Thakurta

Chawin Sitawarin

Florian Tramèr

Harsh Chaudhari