Paper
Hub
搜索
Toggle language
Gavia Gray
~Gavia_Gray1
3
论文总数
1.5
年均投稿
6.2
平均评分
接收情况
3
/
3
会议分布
NeurIPS
2
ICLR
1
发表论文 (3 篇)
2025
2 篇
6.8
5
Power Lines: Scaling laws for weight decay and batch size in LLM pre-training
NeurIPS 2025
Poster
6.3
3
Straight to Zero: Why Linearly Decaying the Learning Rate to Zero Works Best for LLMs
ICLR 2025
Poster
2024
1 篇
5.5
4
Normalization Layer Per-Example Gradients are Sufficient to Predict Gradient Noise Scale in Transformers
NeurIPS 2024
Poster
合作者 (6)
JH
Joel Hestness
3 篇
SB
Shane Bergsma
3 篇
DS
Daria Soboleva
2 篇
GG
Gurpreet Gosal
2 篇
ND
Nolan Simran Dey
2 篇
AT
Aman Tiwari
1 篇