Dan Busbridge
~Dan_Busbridge1
7
Total papers
3.5
Submissions per year
Average rating
Acceptance: 6/7
Venue distribution
ICML
3
ICLR
3
NeurIPS
1
Published papers (7)
2025: 5 papers
3
Distillation Scaling Laws
ICML 2025 Poster
4
Scaling Laws for Forgetting during Finetuning with Pretraining Data Injection
ICML 2025 Poster
4
Scaling Laws for Optimal Data Mixtures
NeurIPS 2025 Poster
4
Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models
ICML 2025 Poster
3
Theory, Analysis, and Best Practices for Sigmoid Self-Attention
ICLR 2025 Poster