Paper
Hub
搜索
Toggle language
Stephen Casper
~Stephen_Casper1
2
论文总数
1.0
年均投稿
5.0
平均评分
接收情况
0
/
2
会议分布
ICLR
2
发表论文 (2 篇)
2025
1 篇
4.8
4
Latent Adversarial Training Improves Robustness to Persistent Harmful Behaviors in LLMs
ICLR 2025
Rejected
2024
1 篇
5.3
4
Explore, Establish, Exploit: Red Teaming Language Models from Scratch
ICLR 2024
Rejected
合作者 (13)
DH
Dylan Hadfield-Menell
2 篇
AS
Abhay Sheshadri
1 篇
AL
Aengus Lynch
1 篇
AE
Aidan Ewart
1 篇
AS
Asa Cooper Stickland
1 篇
CW
Cindy Wu
1 篇
EP
Ethan Perez
1 篇
HS
Henry Sleight
1 篇
查看全部 13 位合作者