Robert P. Dick
~Robert_P._Dick1
8
论文总数
4.0
年均投稿
平均评分
接收情况4/8
会议分布
ICLR
7
NeurIPS
1
发表论文 (8 篇)
20253 篇
4
A Percolation Model of Emergence: Analyzing Transformers Trained on a Formal Language
ICLR 2025Poster
4
Large Learning Rates without the Agonizing Pain: Dispelling the Curse of Singularities in Deep Neural Networks
ICLR 2025withdrawn
5
Efficiently pre-training language models with mixtures of cluster-oriented, trainability-aware experts
ICLR 2025withdrawn
20245 篇
4
In-Context Learning Dynamics with Random Binary Sequences
ICLR 2024Poster
4
How Capable Can a Transformer Become? A Study on Synthetic, Interpretable Tasks
ICLR 2024Rejected
3
Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks
ICLR 2024Poster
4
Toward a Mechanistic Understanding of Stepwise Inference in Transformers: A Synthetic Graph Navigation Model
ICLR 2024Rejected
5
Once Read is Enough: Domain-specific Pretraining-free Language Models with Cluster-guided Sparse Experts for Long-tail Domain Knowledge
NeurIPS 2024Poster