Jonathan Berant
~Jonathan_Berant1
Total papers: 9
Average submissions per year: 4.5
Average rating:
Accepted: 7/9
Venue distribution
ICLR: 4
ICML: 2
COLM: 2
NeurIPS: 1
Published papers (9)
2025 (5 papers)
4
InfAlign: Inference-aware language model alignment
ICML 2025, Poster
4
From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty
ICLR 2025, Rejected
4
Theoretical guarantees on the best-of-n alignment policy
ICML 2025, Poster
4
Don’t lie to your friends: Learning what you know from collaborative self-play
COLM 2025, Poster
7
Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
ICLR 2025, Spotlight
2024 (4 papers)
4
Never Train from Scratch: Fair Comparison of Long-Sequence Models Requires Data-Driven Priors
ICLR 2024, Oral
4
Making Retrieval-Augmented Language Models Robust to Irrelevant Context
ICLR 2024, Poster
3
Robust Preference Optimization through Reward Model Distillation
NeurIPS 2024, Rejected
4
Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking
COLM 2024, Poster