Yiming Dong
~Yiming_Dong1
4
论文总数
4.0
年均投稿
平均评分
接收情况4/4
会议分布
NeurIPS
4
发表论文 (4 篇)
20254 篇
4
On the $O(\frac{\sqrt{d}}{K^{1/4}})$ Convergence Rate of AdamW Measured by $\ell_1$ Norm
NeurIPS 2025Poster
4
Stepsize anything: A unified learning rate schedule for budgeted-iteration training
NeurIPS 2025Poster
4
Improving Model Representation and Reducing KV Cache via Skip Connections with First Value Heads
NeurIPS 2025Poster
4
AdaMSS: Adaptive Multi-Subspace Approach for Parameter-Efficient Fine-Tuning
NeurIPS 2025Poster