Jonathan Ragan-Kelley
~Jonathan_Ragan-Kelley1
Total papers: 7
Average submissions per year: 3.5
Average rating
Acceptance: 5/7
Venue distribution
ICLR: 3
ICML: 2
NeurIPS: 1
COLM: 1
Published papers (7)
2025 (3 papers)
4
Ladder Residual: Redefining Tensor Parallelism in Transformers for Accelerated Inference
ICLR 2025, Rejected
3
Ladder-Residual: Parallelism-Aware Architecture for Accelerating Large Model Inference with Communication Overlapping
ICML 2025, Poster
4
Learning to Keep a Promise: Scaling Language Model Decoding Parallelism with Learned Asynchronous Decoding
ICML 2025, Poster
2024 (4 papers)
4
How to Guess a Gradient
ICLR 2024, Withdrawn
3
Reducing Transformer Key-Value Cache Size with Cross-Layer Attention
NeurIPS 2024, Poster
3
Hydra: Sequentially-Dependent Draft Heads for Medusa Decoding
COLM 2024, Poster
4
The Cost of Scaling Down Large Language Models: Reducing Model Size Affects Memory before In-context Learning
ICLR 2024, Poster