Paper
Hub
搜索
Toggle language
Hynek Kydlíček
~Hynek_Kydlíček1
2
论文总数
2.0
年均投稿
7.8
平均评分
接收情况
2
/
2
会议分布
COLM
2
发表论文 (2 篇)
2025
2 篇
8.3
4
FineWeb2: One Pipeline to Scale Them All — Adapting Pre-Training Data Processing to Every Language
COLM 2025
Poster
7.3
4
SmolLM2: When Smol Goes Big — Data-Centric Training of a Fully Open Small Language Model
COLM 2025
Poster
合作者 (20)
CR
Colin Raffel
2 篇
GP
Guilherme Penedo
2 篇
LW
Leandro Von Werra
2 篇
TW
Thomas Wolf
2 篇
AL
Agustín Piqueres Lajarín
1 篇
AM
Andrés Marafioti
1 篇
AL
Anton Lozhkov
1 篇
BB
Ben Burtenshaw
1 篇
查看全部 20 位合作者