Tri Dao
~Tri_Dao1
10
论文总数
5.0
年均投稿
平均评分
接收情况8/10
会议分布
NeurIPS
4
ICLR
3
COLM
2
ICML
1
发表论文 (10 篇)
20253 篇
4
Hardware-Efficient Attention for Fast Decoding
COLM 2025Poster
4
Ladder Residual: Redefining Tensor Parallelism in Transformers for Accelerated Inference
ICLR 2025Rejected
3
Ladder-Residual: Parallelism-Aware Architecture for Accelerating Large Model Inference with Communication Overlapping
ICML 2025Poster
20247 篇
4
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
ICLR 2024Poster
4
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
ICLR 2024Rejected
4
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
COLM 2024Poster
4
Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers
NeurIPS 2024Poster
3
The Mamba in the Llama: Distilling and Accelerating Hybrid Models
NeurIPS 2024Poster
4
BitDelta: Your Fine-Tune May Only Be Worth One Bit
NeurIPS 2024Poster
4
FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision
NeurIPS 2024Spotlight