zhou Xun
~zhou_Xun2
9
Total papers
4.5
Submissions per year
Average rating
Accepted: 8/9
Venue distribution
NeurIPS
4
ICML
3
ICLR
2
Published papers (9)
2025 (7 papers)
4
Stepsize anything: A unified learning rate schedule for budgeted-iteration training
NeurIPS 2025 Poster
4
MARS: Unleashing the Power of Variance Reduction for Training Large Models
ICML 2025 Poster
3
Investigating the Overlooked Hessian Structure: From CNNs to LLMs
ICML 2025 Poster
4
Ultra-Sparse Memory Network
ICLR 2025 Poster
4
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling
ICML 2025 Poster
4
HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
NeurIPS 2025 Poster
3
Model Merging in Pre-training of Large Language Models
NeurIPS 2025 Poster