Yiwu Yao
~Yiwu_Yao1
6
Total papers
3.0
Submissions per year
Average rating
Acceptance: 5/6
Venue distribution
ICLR
4
NeurIPS
1
ICML
1
Published papers (6)
2025: 5 papers
4
DartQuant: Efficient Rotational Distribution Calibration for LLM Quantization
NeurIPS 2025 Poster
3
KVTuner: Sensitivity-Aware Layer-Wise Mixed-Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference
ICML 2025 Poster
5
Dynamic Low-Rank Sparse Adaptation for Large Language Models
ICLR 2025 Poster
7
RazorAttention: Efficient KV Cache Compression Through Retrieval Heads
ICLR 2025 Poster
4
Extreme composite compression of large language models through joint optimization
ICLR 2025 Rejected