Xu Han
~Xu_Han2
17
论文总数
8.5
年均投稿
平均评分
接收情况13/17
会议分布
ICLR
7
COLM
5
NeurIPS
4
ICML
1
发表论文 (17 篇)
20258 篇
4
Sparsing Law: Towards Large Language Models with Greater Activation Sparsity
ICLR 2025Rejected
4
A*-Thought: Efficient Reasoning via Bidirectional Compression for Low-Resource Settings
NeurIPS 2025Poster
3
Sparsing Law: Towards Large Language Models with Greater Activation Sparsity
ICML 2025Poster
5
Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads
ICLR 2025Rejected
3
BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity
COLM 2025Poster
4
Stuffed Mamba: Oversized States Lead to the Inability to Forget
COLM 2025Poster
5
Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling
ICLR 2025Rejected
4
VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents
ICLR 2025Poster
20249 篇
4
OneBit: Towards Extremely Low-bit Large Language Models
NeurIPS 2024Poster
4
Unified View of Grokking, Double Descent and Emergent Abilities: A Comprehensive Study on Algorithm Task
COLM 2024Poster
4
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies
COLM 2024Poster
4
InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory
NeurIPS 2024Poster
4
Predicting Emergent Abilities with Infinite Resolution Evaluation
ICLR 2024Poster
5
BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences
ICLR 2024Rejected
3
CA-LoRA: Adapting Existing LoRA for Compressed LLMs to Enable Efficient Multi-Tasking on Personal Devices
COLM 2024Poster
4
Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models
NeurIPS 2024Poster
4
Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages
ICLR 2024Spotlight