Xing Hu
~Xing_Hu6
7
论文总数
7.0
年均投稿
平均评分
接收情况5/7
会议分布
ICLR
4
ICML
2
NeurIPS
1
发表论文 (7 篇)
20257 篇
5
I-LLM: Efficient Integer-Only Inference for Fully-Quantized Low-Bit Large Language Models
ICLR 2025Rejected
5
OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting
ICLR 2025Poster
4
MoEQuant: Enhancing Quantization for Mixture-of-Experts Large Language Models via Expert-Balanced Sampling and Affinity Guidance
ICML 2025Poster
3
RSAVQ: Riemannian Sensitivity-Aware Vector Quantization for Large Language Models
NeurIPS 2025Poster
4
MambaQuant: Quantizing the Mamba Family with Variance Aligned Rotation Methods
ICLR 2025Poster
4
RWKVQuant: Quantizing the RWKV Family with Proxy Guided Hybrid of Scalar and Vector Quantization
ICML 2025Poster
4
MQuant: Unleashing the Inference Potential of Multimodal Large Language Models via Full Static Quantization
ICLR 2025Rejected