Paper
Hub
搜索
Toggle language
Oliver Jaffe
~Oliver_Jaffe2
5
论文总数
2.5
年均投稿
5.5
平均评分
接收情况
3
/
5
会议分布
ICLR
3
ICML
1
NeurIPS
1
发表论文 (5 篇)
2025
3 篇
5.5
3
PaperBench: Evaluating AI’s Ability to Replicate AI Research
ICML 2025
Poster
5.0
3
AI Sandbagging: Language Models can Strategically Underperform on Evaluations
ICLR 2025
Poster
8.0
4
MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering
ICLR 2025
Oral
2024
2 篇
5.5
4
AI Sandbagging: Language Models can Strategically Underperform on Evaluations
NeurIPS 2024
Rejected
3.7
3
Tall Tales at Different Scales: Evaluating Scaling Trends For Deception in Language Models
ICLR 2024
Rejected
合作者 (20)
FH
Felix Hofstätter
3 篇
FW
Francis Rhys Ward
3 篇
SB
Samuel F. Brown
3 篇
DS
Dane Sherburn
2 篇
EM
Evan Mays
2 篇
GS
Giulio Starace
2 篇
JA
James Aung
2 篇
JC
Jun Shern Chan
2 篇
查看全部 20 位合作者