Zhefeng Wang
~Zhefeng_Wang1
8
论文总数
4.0
年均投稿
平均评分
接收情况3/8
会议分布
ICLR
7
ICML
1
发表论文 (8 篇)
20257 篇
5
FASP: Fast and Accurate Structured Pruning of Large Language Models
ICLR 2025withdrawn
4
FISTAPruner: Layer-wise Post-training Pruning for Large Language Models
ICLR 2025Rejected
5
Adapprox: Memory Efficient Optimization via Adaptive Randomized Low-Rank Approximation
ICLR 2025Rejected
4
CASD: Enhancing Generation Accuracy via Context-Aware Speculative Decoding
ICLR 2025withdrawn
4
Beware of Calibration Data for Pruning Large Language Models
ICLR 2025Poster
5
SinkQ: Accurate 2-bit KV Cache Quantization with Dynamic Sink Tracking
ICLR 2025withdrawn
3
Efficiently Serving Large Multimodal Models Using EPD Disaggregation
ICML 2025Poster