Robert P. Dick

~Robert_P._Dick1

8

论文总数

4.0

年均投稿

5.6

平均评分

接收情况4/8

会议分布

ICLR

7

NeurIPS

1

发表论文 (8 篇)

20253 篇

A Percolation Model of Emergence: Analyzing Transformers Trained on a Formal Language

ICLR 2025Poster

Large Learning Rates without the Agonizing Pain: Dispelling the Curse of Singularities in Deep Neural Networks

ICLR 2025withdrawn

Efficiently pre-training language models with mixtures of cluster-oriented, trainability-aware experts

ICLR 2025withdrawn

20245 篇

In-Context Learning Dynamics with Random Binary Sequences

ICLR 2024Poster

How Capable Can a Transformer Become? A Study on Synthetic, Interpretable Tasks

ICLR 2024Rejected

Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks

ICLR 2024Poster

Toward a Mechanistic Understanding of Stepwise Inference in Transformers: A Synthetic Graph Navigation Model

ICLR 2024Rejected

Once Read is Enough: Domain-specific Pretraining-free Language Models with Cluster-guided Sparse Experts for Long-tail Domain Knowledge

NeurIPS 2024Poster

合作者 (20)

Ekdeep Singh Lubana5 篇

Hidenori Tanaka5 篇

Dongsheng Li3 篇

Jixian Zhou3 篇

Mengyi Chen3 篇