Paper
Hub
搜索
Toggle language
Olivier Bachem
~Olivier_Bachem1
5
论文总数
2.5
年均投稿
6.0
平均评分
接收情况
4
/
5
会议分布
ICLR
4
NeurIPS
1
发表论文 (5 篇)
2025
3 篇
6.0
4
Diversity-Rewarded CFG Distillation
ICLR 2025
Poster
5.5
4
WARP: On the Benefits of Weight Averaged Rewarded Policies
ICLR 2025
Rejected
6.4
5
BOND: Aligning LLMs with Best-of-N Distillation
ICLR 2025
Poster
2024
2 篇
6.5
4
On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes
ICLR 2024
Poster
5.4
5
Imitating Language via Scalable Inverse Reinforcement Learning
NeurIPS 2024
Poster
合作者 (20)
NV
Nino Vieillard
4 篇
AR
Alexandre Rame
3 篇
JF
Johan Ferret
3 篇
SG
Sertan Girgin
3 篇
GC
Geoffrey Cideron
2 篇
LH
Leonard Hussenot
2 篇
NM
Nikola Momchev
2 篇
PS
Pier Giuseppe Sessa
2 篇
查看全部 20 位合作者