Nikhil Vyas
~Nikhil_Vyas1
Total papers: 6
Submissions per year: 3.0
Average rating
Accepted: 5/6
Venue distribution
ICLR: 6
Published papers (6)
2025: 5 papers
4
SOAP: Improving and Stabilizing Shampoo using Adam for Language Modeling
ICLR 2025 Poster
5
How Does Critical Batch Size Scale in Pre-training?
ICLR 2025 Poster
4
A New Perspective on Shampoo's Preconditioner
ICLR 2025 Poster
4
Deconstructing What Makes a Good Optimizer for Autoregressive Language Models
ICLR 2025 Poster
4
Mixture of Parrots: Experts improve memorization more than reasoning
ICLR 2025 Poster