Paper Hub
Kai Yuanqing Xiao
~Kai_Yuanqing_Xiao1
Total papers: 2
Submissions per year: 2.0
Average rating: 4.8
Accepted: 0 / 2
Venue distribution: ICLR 2
Papers (2)
2025 (2 papers)
4.3
4
Diverse and Effective Red Teaming with Auto-generated Rewards and Multi-step Reinforcement Learning
ICLR 2025
Rejected
5.3
7
The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions
ICLR 2025
Rejected
Co-authors (5)
Alex Beutel: 2 papers
Johannes Heidecke: 2 papers
Lilian Weng: 2 papers
Eric Wallace: 1 paper
Reimar Heinrich Leike: 1 paper