Paper
Hub
搜索
Toggle language
Avner May
~Avner_May1
5
论文总数
2.5
年均投稿
6.2
平均评分
接收情况
5
/
5
会议分布
NeurIPS
3
ICML
1
ICLR
1
发表论文 (5 篇)
2025
2 篇
7.0
3
Cost-efficient Collaboration between On-device and Cloud Language Models
ICML 2025
Poster
6.8
4
MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding
ICLR 2025
Poster
2024
3 篇
6.5
4
Sequoia: Scalable and Robust Speculative Decoding
NeurIPS 2024
Spotlight
5.8
4
SpecExec: Massively Parallel Speculative Decoding For Interactive LLM Inference on Consumer Devices
NeurIPS 2024
Poster
5.0
3
The Mamba in the Llama: Distilling and Accelerating Hybrid Models
NeurIPS 2024
Poster
合作者 (20)
BC
Beidi Chen
3 篇
ZC
Zhuoming Chen
3 篇
MR
Max Ryabinin
2 篇
RS
Ruslan Svirschevski
2 篇
ZJ
Zhihao Jia
2 篇
IY
Ian En-Hsu Yen
1 篇
JC
Jian Chen
1 篇
JS
Jinyuan Shi
1 篇
查看全部 20 位合作者