Lei Wu
~Lei_Wu1
7
论文总数
3.5
年均投稿
平均评分
接收情况4/7
会议分布
ICLR
3
NeurIPS
3
ICML
1
发表论文 (7 篇)
20253 篇
5
How Transformers Implement Induction Heads: Approximation and Optimization Analysis
ICLR 2025Rejected
4
Functional Scaling Laws in Kernel Regression: Loss Dynamics and Learning Rate Schedules
NeurIPS 2025Spotlight
4
The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training
ICML 2025Poster
20244 篇
4
The Noise Geometry of Stochastic Gradient Descent: A Quantitative and Analytical Characterization
ICLR 2024withdrawn
3
Achieving Margin Maximization Exponentially Fast via Progressive Norm Rescaling
ICLR 2024withdrawn
3
Parameter Symmetry and Noise Equilibrium of Stochastic Gradient Descent
NeurIPS 2024Poster
3
Improving Generalization and Convergence by Enhancing Implicit Regularization
NeurIPS 2024Poster