PaperHub

No rating data available

ICLR 2024

How does Architecture Influence the Base Capabilities of Pre-trained Language Models? A Case Study Based on Wider Transformer Models

Submitted: 2023-09-24 | Updated: 2024-03-26

Abstract

Keywords
Pre-trained Language Models, Base Capabilities

Reviews and Discussion

No review records available