Paper
Hub
搜索
Toggle language
暂无评分数据
ICLR 2024
How does Architecture Influence the Base Capabilities of Pre-trained Language Models? A Case Study Based on Wider Transformer Models
Xin Lu
,
Yanyan Zhao
,
Bing Qin
OpenReview
PDF
提交: 2023-09-24
更新: 2024-03-26
摘要
关键词
Pre-trained Language Models
Base Capabilities
评审与讨论
暂无评审记录