Paper
Hub
搜索
Toggle language
Amir Gholami
~Amir_Gholami2
5
论文总数
2.5
年均投稿
6.1
平均评分
接收情况
4
/
5
会议分布
NeurIPS
2
ICML
2
ICLR
1
发表论文 (5 篇)
2025
3 篇
6.8
4
Multipole Attention for Efficient Long Context Reasoning
NeurIPS 2025
Poster
5.5
4
Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks
ICML 2025
Poster
6.1
4
QuantSpec: Self-Speculative Decoding with Hierarchical Quantized KV Cache
ICML 2025
Poster
2024
2 篇
5.8
4
SqueezeLLM: Dense and Sparse Quantization
ICLR 2024
Rejected
6.3
4
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
NeurIPS 2024
Poster
合作者 (20)
KK
Kurt Keutzer
5 篇
SK
Sehoon Kim
5 篇
CH
Coleman Richard Charles Hooper
4 篇
MM
Michael W. Mahoney
4 篇
SS
Sophia Shao
2 篇
HM
Hiva Mohammadzadeh
1 篇
LM
Luca Manolache
1 篇
SZ
Sebastian Zhao
1 篇
查看全部 20 位合作者