Paper Hub
Thomas Kwa
~Thomas_Kwa1
Total papers: 3
Average submissions per year: 1.5
Average rating: 6.8
Accepted: 3 / 3
Venue distribution
NeurIPS: 3
Published papers (3)
2025 (1 paper)
7.8
4
Measuring AI Ability to Complete Long Software Tasks
NeurIPS 2025
Poster
2024 (2 papers)
6.3
4
Catastrophic Goodhart: regularizing RLHF with KL divergence does not mitigate heavy-tailed reward misspecification
NeurIPS 2024
Poster
6.3
4
Compact Proofs of Model Performance via Mechanistic Interpretability
NeurIPS 2024
Poster
Collaborators (20 total, 8 shown)
Lawrence Chan: 2 papers
Alex Gibson: 1 paper
Chun Hei Yip: 1 paper
Euan Ong: 1 paper
Jason Gross: 1 paper
Rajashree Agrawal: 1 paper
Soufiane Noubir: 1 paper
Amy Deng: 1 paper