Song Han (~Song_Han5)

Total papers: 26
Avg. submissions per year: 13.0
Avg. rating:
Accepted: 20/26
Venue distribution: ICLR 16, NeurIPS 6, ICML 4

Published papers (26)
2025 (20 papers)
Avg. rating | Title | Venue | Decision
4 | XAttention: Block Sparse Attention with Antidiagonal Scoring | ICML 2025 | Poster
4 | Jet-Nemotron: Efficient Language Model with Post Neural Architecture Search | NeurIPS 2025 | Poster
4 | COAT: Compressing Optimizer states and Activations for Memory-Efficient FP8 Training | ICLR 2025 | Poster
5 | Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models | ICLR 2025 | Poster
5 | Twilight: Adaptive Attention Sparsity with Hierarchical Top-$p$ Pruning | NeurIPS 2025 | Spotlight
4 | SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity | ICML 2025 | Poster
6 | DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads | ICLR 2025 | Poster
4 | X-VILA: Cross-Modality Alignment for Large Language Models | ICLR 2025 | Withdrawn
4 | VILA^2: VLM Augmented VLM with Self-Improvement | ICLR 2025 | Withdrawn
5 | HART: Efficient Visual Generation with Hybrid Autoregressive Transformer | ICLR 2025 | Poster
6 | SVDQuant: Absorbing Outliers by Low-Rank Component for 4-Bit Diffusion Models | ICLR 2025 | Spotlight
4 | SANA: Efficient High-Resolution Text-to-Image Synthesis with Linear Diffusion Transformers | ICLR 2025 | Oral
4 | VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation | ICLR 2025 | Poster
4 | Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation | NeurIPS 2025 | Spotlight
4 | SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer | ICML 2025 | Poster
4 | Sparse Video-Gen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity | ICML 2025 | Poster
4 | Radial Attention: $\mathcal O(n \log n)$ Sparse Attention for Long Video Generation | NeurIPS 2025 | Poster
4 | Scaling RL to Long Videos | NeurIPS 2025 | Poster
3 | LongVILA: Scaling Long-Context Visual Language Models for Long Videos | ICLR 2025 | Poster
4 | Wolf: Accurate Video Captioning with a World Summarization Framework | ICLR 2025 | Withdrawn
2024 (6 papers)
Avg. rating | Title | Venue | Decision
4 | ZEST: ZEROSHOT SPARSE FINE-TUNING | ICLR 2024 | Rejected
4 | Efficient Streaming Language Models with Attention Sinks | ICLR 2024 | Poster
4 | LIFT: Efficient Layer-wise Fine-tuning for Large Model Models | ICLR 2024 | Rejected
4 | BitDelta: Your Fine-Tune May Only Be Worth One Bit | NeurIPS 2024 | Poster
4 | LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models | ICLR 2024 | Oral
4 | Sparse Refinement for Efficient High-Resolution Semantic Segmentation | ICLR 2024 | Rejected