Sifan Zhou
~Sifan_Zhou2
8
论文总数
4.0
年均投稿
平均评分
接收情况6/8
会议分布
ICLR
5
ICML
2
NeurIPS
1
发表论文 (8 篇)
20257 篇
4
MQuant: Unleashing the Inference Potential of Multimodal Large Language Models via Full Static Quantization
ICLR 2025Rejected
4
Point4Bit: Post Training 4-bit Quantization for Point Cloud 3D Detection
NeurIPS 2025Poster
4
MoEQuant: Enhancing Quantization for Mixture-of-Experts Large Language Models via Expert-Balanced Sampling and Affinity Guidance
ICML 2025Poster
4
RWKVQuant: Quantizing the RWKV Family with Proxy Guided Hybrid of Scalar and Vector Quantization
ICML 2025Poster
5
OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting
ICLR 2025Poster
4
MambaQuant: Quantizing the Mamba Family with Variance Aligned Rotation Methods
ICLR 2025Poster
5
I-LLM: Efficient Integer-Only Inference for Fully-Quantized Low-Bit Large Language Models
ICLR 2025Rejected