Paper Hub
Johan Ferret
~Johan_Ferret1
Total papers: 6
Submissions per year: 3.0
Average rating: 5.5
Accepted: 3 / 6
Venue distribution
ICLR: 4
ICML: 1
NeurIPS: 1
Papers (6)
2025
4 papers
5.5
4
WARP: On the Benefits of Weight Averaged Rewarded Policies
ICLR 2025
Rejected
5.5
3
On Teacher Hacking in Language Model Distillation
ICML 2025
Poster
6.0
4
Diversity-Rewarded CFG Distillation
ICLR 2025
Poster
6.4
5
BOND: Aligning LLMs with Best-of-N Distillation
ICLR 2025
Poster
2024
2 papers
5.8
4
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
ICLR 2024
Rejected
4.0
4
Direct Language Model Alignment from Online AI Feedback
NeurIPS 2024
Rejected
Collaborators (20)
Alexandre Rame: 5 papers
Nino Vieillard: 3 papers
Olivier Bachem: 3 papers
Sarah Perrin: 3 papers
Sertan Girgin: 3 papers
Geoffrey Cideron: 2 papers
Leonard Hussenot: 2 papers
Pier Giuseppe Sessa: 2 papers