Cassidy Laidlaw
~Cassidy_Laidlaw1
7
论文总数
3.5
年均投稿
平均评分
接收情况5/7
Venue distribution
ICLR: 6
ICML: 1
Published papers (7)
2025: 4 papers
3
AssistanceZero: Scalably Solving Assistance Games
ICML 2025 Poster
5
Correlated Proxies: A New Definition and Improved Mitigation for Reward Hacking
ICLR 2025 Spotlight
4
Iterative Label Refinement Matters More than Preference Optimization under Weak Supervision
ICLR 2025 Spotlight
4
Reliability-Aware Preference Learning for LLM Reward Models
ICLR 2025 Withdrawn