Gurpreet Gosal
~Gurpreet_Gosal2
3
Total papers
3.0
Average submissions per year
Average rating
Accepted: 3/3
Venue distribution
NeurIPS
1
ICLR
1
COLM
1
Published papers (3)
2025 (3 papers)
5
Power Lines: Scaling laws for weight decay and batch size in LLM pre-training
NeurIPS 2025 Poster
3
Straight to Zero: Why Linearly Decaying the Learning Rate to Zero Works Best for LLMs
ICLR 2025 Poster
4
Sherkala-Chat: Building a State-of-the-Art LLM for Kazakh in a Moderately Resourced Setting
COLM 2025 Poster