Tun Lu
~Tun_Lu1
5
Total papers
2.5
Submissions per year
Average rating
Acceptance: 3/5
Venue distribution
ICLR
3
NeurIPS
1
ICML
1
Published papers (5)
2025 (3 papers)
5
Efficiently pre-training language models with mixtures of cluster-oriented, trainability-aware experts
ICLR 2025, withdrawn
4
Large Learning Rates without the Agonizing Pain: Dispelling the Curse of Singularities in Deep Neural Networks
ICLR 2025, withdrawn
4
Oracle-MoE: Locality-preserving Routing in the Oracle Space for Memory-constrained Large Language Model Inference
ICML 2025, Poster