Paper Hub
Atli Kosson
~Atli_Kosson1
Total papers: 3
Average submissions per year: 3.0
Average rating: 6.0
Accepted: 2 / 3
Venue distribution
NeurIPS: 2
ICLR: 1
Publications (3)
2024 (3 papers)
6.3
4
Analyzing & Reducing the Need for Learning Rate Warmup in GPT Training
NeurIPS 2024
Poster
4.5
4
Rotational Equilibrium: How Weight Decay Balances Learning Across Neural Networks
ICLR 2024
Rejected
7.3
3
Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations
NeurIPS 2024
Spotlight
Collaborators (6)
Martin Jaggi: 3 papers
Bettina Messmer: 2 papers
Alexander Hägele: 1 paper
Elie Bakouch: 1 paper
Leandro Von Werra: 1 paper
Loubna Ben allal: 1 paper