Paper
Hub
搜索
Toggle language
Shivam Singhal
~Shivam_Singhal1
3
论文总数
1.5
年均投稿
5.4
平均评分
接收情况
1
/
3
会议分布
ICLR
3
发表论文 (3 篇)
2025
2 篇
4.0
4
Reliability-Aware Preference Learning for LLM Reward Models
ICLR 2025
withdrawn
7.2
5
Correlated Proxies: A New Definition and Improved Mitigation for Reward Hacking
ICLR 2025
Spotlight
2024
1 篇
5.0
5
Preventing Reward Hacking with Occupancy Measure Regularization
ICLR 2024
Rejected
合作者 (2)
AD
Anca Dragan
3 篇
CL
Cassidy Laidlaw
3 篇