影响力指数

19.19/100

前 43.4%

全站排名 #27,960

发表论文6 篇

平均评分5.5

年均产出3.0 篇/年

Johan Ferret

Researcher@Google·OpenReview

研究方向

reinforcement learning · deep learning · credit assignment · inductive biases · scaling · interpretability

BOND: Aligning LLMs with Best-of-N Distillation

ICLR 2025Poster

Diversity-Rewarded CFG Distillation

ICLR 2025Poster

WARP: On the Benefits of Weight Averaged Rewarded Policies

ICLR 2025Rejected

On Teacher Hacking in Language Model Distillation

ICML 2025Poster

RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

ICLR 2024Rejected

Direct Language Model Alignment from Online AI Feedback

NeurIPS 2024Rejected

合作者 (20)

Geoffrey Cideron

Leonard Hussenot

Pier Giuseppe Sessa