Paper
Hub
搜索
Toggle language
Lewis Tunstall
~Lewis_Tunstall1
5
论文总数
2.5
年均投稿
6.7
平均评分
接收情况
5
/
5
会议分布
COLM
4
ICML
1
发表论文 (5 篇)
2025
3 篇
7.0
3
Optimizing Test-Time Compute via Meta Reinforcement Finetuning
ICML 2025
Poster
7.3
4
SmolLM2: When Smol Goes Big — Data-Centric Training of a Fully Open Small Language Model
COLM 2025
Poster
6.8
4
SmolVLM: Redefining small and efficient multimodal models
COLM 2025
Poster
2024
2 篇
6.3
4
Zephyr: Direct Distillation of LM Alignment
COLM 2024
Poster
6.4
5
The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization
COLM 2024
Poster
合作者 (20)
LW
Leandro Von Werra
3 篇
TW
Thomas Wolf
3 篇
AM
Andrés Marafioti
2 篇
AL
Anton Lozhkov
2 篇
CF
Clémentine Fourrier
2 篇
CZ
Cyril Zakka
2 篇
EB
Elie Bakouch
2 篇
HL
Hugo Larcher
2 篇
查看全部 20 位合作者