PaperHub

Jianfei Chen

~Jianfei_Chen1

26
论文总数
13.0
年均投稿
6.0
平均评分
接收情况18/26
会议分布
ICLR
14
NeurIPS
6
ICML
6

发表论文 (26 篇)

202520

4.0
4

SparseDM: Toward Sparse Efficient Diffusion Models

ICLR 2025withdrawn
6.2
5

Diffusion Bridge Implicit Models

ICLR 2025Poster
6.0
4

Elucidating the Preconditioning in Consistency Distillation

ICLR 2025Poster
6.6
5

ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing

ICLR 2025Poster
4.6
5

When Bigger is Better: Revisiting Large-Batch Optimization in Language Model Pretraining

NeurIPS 2025Rejected
6.6
4

Visual Generation Without Guidance

ICML 2025Poster
6.3
3

Oscillation-Reduced MXFP4 Training for Vision Transformers

ICML 2025Poster
7.0
4

SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration

ICLR 2025Poster
5.0
3

Zero-shot Quantization for Object Detection

ICLR 2025Rejected
4.0
4

FrameBridge: Improving Image-to-Video Generation with Bridge Models

ICLR 2025Rejected
5.5
4

1-Bit FQT: Pushing the Limit of Fully Quantized Training to 1-bit

ICLR 2025Rejected
6.6
4

FrameBridge: Improving Image-to-Video Generation with Bridge Models

ICML 2025Poster
7.8
5

SageAttention2: Efficient Attention with Thorough Outlier Smoothing and Per-thread INT4 Quantization

ICML 2025Poster
6.0
4

COAT: Compressing Optimizer states and Activations for Memory-Efficient FP8 Training

ICLR 2025Poster
7.3
3

On the Optimization and Generalization of Two-layer Transformers with Sign Gradient Descent

ICLR 2025Spotlight
6.6
4

SpargeAttention: Accurate and Training-free Sparse Attention Accelerating Any Model Inference

ICML 2025Poster
5.8
5

Beyond 2:4: Exploring V:N:M Sparsity for Efficient Transformer Inference on GPUs

ICLR 2025Rejected
6.8
4

SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training

NeurIPS 2025Spotlight
7.3
4

Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation

NeurIPS 2025Spotlight
6.1
4

Sparse Video-Gen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity

ICML 2025Poster