Paper
Hub
搜索
Toggle language
Size Zheng
~Size_Zheng1
5
论文总数
2.5
年均投稿
5.9
平均评分
接收情况
3
/
5
会议分布
ICLR
2
ICML
2
NeurIPS
1
发表论文 (5 篇)
2025
3 篇
4.9
4
MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design
ICML 2025
Poster
6.6
4
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference
ICML 2025
Spotlight
6.8
4
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference
ICLR 2025
Rejected
2024
2 篇
4.5
4
MoteS: Memory Optimization via Fine-grained Scheduling for DNNs on Tiny Devices
ICLR 2024
withdrawn
6.5
4
ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction
NeurIPS 2024
Poster
合作者 (20)
ML
Meng Li
2 篇
RC
Renze Chen
2 篇
XL
Xiuhong Li
2 篇
YL
Yun Liang
2 篇
BC
Beidi Chen
2 篇
HS
Hanshi Sun
2 篇
HD
Harry Dong
2 篇
LC
Li-Wen Chang
2 篇
查看全部 20 位合作者