PaperHub

暂无评分数据

ICLR 2025

BLIMEY: Towards Better Routing Methods in Sparse Mixture of Experts

OpenReviewPDF
提交: 2024-09-24更新: 2024-10-03

摘要

关键词
mixture-of-expertsneural networkslarge-scale machine learningrouting algorithmsinterpretabilityscaling lawscomputational efficiencynlp

评审与讨论

撤稿通知

I have read and agree with the venue's withdrawal policy on behalf of myself and my co-authors.