Dawei Yang
~Dawei_Yang3
8
论文总数
8.0
年均投稿
平均评分
接收情况5/8
会议分布
ICLR
5
ICML
2
NeurIPS
1
发表论文 (8 篇)
20258 篇
5
I-LLM: Efficient Integer-Only Inference for Fully-Quantized Low-Bit Large Language Models
ICLR 2025Rejected
5
OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting
ICLR 2025Poster
4
MQuant: Unleashing the Inference Potential of Multimodal Large Language Models via Full Static Quantization
ICLR 2025Rejected
4
MoEQuant: Enhancing Quantization for Mixture-of-Experts Large Language Models via Expert-Balanced Sampling and Affinity Guidance
ICML 2025Poster
4
ASVD: Activation-aware Singular Value Decomposition for Compressing Large Language Models
ICLR 2025Rejected
3
RSAVQ: Riemannian Sensitivity-Aware Vector Quantization for Large Language Models
NeurIPS 2025Poster
4
MambaQuant: Quantizing the Mamba Family with Variance Aligned Rotation Methods
ICLR 2025Poster
4
RWKVQuant: Quantizing the RWKV Family with Proxy Guided Hybrid of Scalar and Vector Quantization
ICML 2025Poster