Xin Lu
~Xin_Lu8
3
Total papers
1.5
Submissions per year
Average rating
Accepted: 2/3
Venue distribution
NeurIPS
2
ICLR
1
Published papers (3)
2024: 2 papers
4
How does Architecture Influence the Base Capabilities of Pre-trained Language Models? A Case Study Based on FFN-Wider and MoE Transformers
NeurIPS 2024 Poster
-
How does Architecture Influence the Base Capabilities of Pre-trained Language Models? A Case Study Based on Wider Transformer Models
ICLR 2024 Withdrawn