Jonah Brown-Cohen
~Jonah_Brown-Cohen1
Total papers: 5
Submissions per year: 2.5
Average rating
Accepted: 2/5
Venue distribution
ICLR: 4
NeurIPS: 1
Papers (5)
2024: 4 papers
4
Scalable AI Safety via Doubly-Efficient Debate
ICLR 2024, Rejected
4
Learning Differentially Private Rewards from Human Feedback
ICLR 2024, Rejected
4
On scalable oversight with weak LLMs judging strong LLMs
NeurIPS 2024, Poster
3
SKILL-MIX: a Flexible and Expandable Family of Evaluations for AI Models
ICLR 2024, Poster