暂无评分数据
ICLR 2025
BLIMEY: Towards Better Routing Methods in Sparse Mixture of Experts
摘要
关键词
mixture-of-expertsneural networkslarge-scale machine learningrouting algorithmsinterpretabilityscaling lawscomputational efficiencynlp
评审与讨论
作者撤稿通知
I have read and agree with the venue's withdrawal policy on behalf of myself and my co-authors.