Sho Takase
~Sho_Takase2
4
论文总数
4.0
年均投稿
平均评分
接收情况3/4
会议分布
COLM
2
ICLR
1
ICML
1
发表论文 (4 篇)
20254 篇
3
Spike No More: Stabilizing the Pre-training of Large Language Models
ICLR 2025Rejected
3
Spike No More: Stabilizing the Pre-training of Large Language Models
COLM 2025Poster
4
Efficient Construction of Model Family through Progressive Training Using Model Expansion
COLM 2025Poster
3
Scaling Laws for Upcycling Mixture-of-Experts Language Models
ICML 2025Poster