Jun Zhu
~Jun_Zhu2
55
Total papers
27.5
Average submissions per year
Average rating
Accepted: 38/55
Venue distribution
ICLR
31
NeurIPS
16
ICML
8
Published papers (55)
2025: 29 papers
5
ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing
ICLR 2025 Poster
5
When Bigger is Better: Revisiting Large-Batch Optimization in Language Model Pretraining
NeurIPS 2025 Rejected
3
Oscillation-Reduced MXFP4 Training for Vision Transformers
ICML 2025 Poster
3
Zero-shot Quantization for Object Detection
ICLR 2025 Rejected
4
FrameBridge: Improving Image-to-Video Generation with Bridge Models
ICLR 2025 Rejected
4
Audio Super-Resolution with Latent Bridge Models
NeurIPS 2025 Poster
4
SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration
ICLR 2025 Poster
4
LUNCH: Adaptive Balancing of Continual Learning via Hyperparameter Uncertainty
ICLR 2025 Withdrawn
6
Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment
ICLR 2025 Oral
3
PivotMesh: Generic 3D Mesh Generation via Pivot Vertices Guidance
ICLR 2025 Poster
4
ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and Understanding
NeurIPS 2025 Spotlight
4
Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling
ICLR 2025 Poster
4
Elucidating the Preconditioning in Consistency Distillation
ICLR 2025 Poster
4
SparseDM: Toward Sparse Efficient Diffusion Models
ICLR 2025 Withdrawn
5
SageAttention2: Efficient Attention with Thorough Outlier Smoothing and Per-thread INT4 Quantization
ICML 2025 Poster
4
FrameBridge: Improving Image-to-Video Generation with Bridge Models
ICML 2025 Poster
5
Diffusion Bridge Implicit Models
ICLR 2025 Poster
4
A Regularized Newton Method for Nonconvex Optimization with Global and Local Complexity Guarantees
NeurIPS 2025 Poster
4
SpargeAttention: Accurate and Training-free Sparse Attention Accelerating Any Model Inference
ICML 2025 Poster
4
RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers
ICML 2025 Poster
4
Visual Generation Without Guidance
ICML 2025 Poster
4
Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator
ICML 2025 Spotlight
3
On the Optimization and Generalization of Two-layer Transformers with Sign Gradient Descent
ICLR 2025 Spotlight
4
Scaling Diffusion Transformers Efficiently via $\mu$P
NeurIPS 2025 Poster
4
SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training
NeurIPS 2025 Spotlight
4
ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation
ICLR 2025 Rejected
4
Robust Representation Consistency Model via Contrastive Denoising
ICLR 2025 Poster
4
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
ICLR 2025 Poster
4
STAIR: Improving Safety Alignment with Introspective Reasoning
ICML 2025 Oral
2024: 26 papers
4
S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training
NeurIPS 2024 Poster
4
Efficient Backpropagation with Variance Controlled Adaptive Sampling
ICLR 2024 Poster
4
Converging and Stabilizing Generative Adversarial Imitation Learning
ICLR 2024 Rejected
4
Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control
NeurIPS 2024 Poster
5
C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory
NeurIPS 2024 Poster
4
Score Regularized Policy Optimization through Diffusion Behavior
ICLR 2024 Poster
4
Consistency Diffusion Bridge Models
NeurIPS 2024 Poster
4
On Mesa-Optimization in Autoregressively Trained Transformers: Emergence and Capability
NeurIPS 2024 Poster
4
Bridge-TTS: Text-to-Speech Synthesis with Schrodinger Bridge
ICLR 2024 Withdrawn
4
Investigating Uncertainty Calibration of Aligned Language Models under the Multiple-Choice Setting
ICLR 2024 Rejected
4
ControlVideo: Conditional Control for Text-driven Video Editing and Beyond
ICLR 2024 Withdrawn
4
Rethinking Model Ensemble in Transfer-based Adversarial Attacks
ICLR 2024 Poster
4
Embodied Active Defense: Leveraging Recurrent Feedback to Counter Adversarial Patches
ICLR 2024 Poster
4
BSPA: Exploring Black-box Stealthy Prompt Attacks against Image Generators
ICLR 2024 Withdrawn
4
Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian Surfels
NeurIPS 2024 Poster
5
Noise Contrastive Alignment of Language Models with Explicit Rewards
NeurIPS 2024 Poster
4
Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model
NeurIPS 2024 Poster
4
Neural Eigenfunctions Are Structured Representation Learners
ICLR 2024 Rejected
4
Preconditioning for Physics-Informed Neural Networks
ICLR 2024 Rejected
3
PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning
NeurIPS 2024 Poster
3
Robust Classification via a Single Diffusion Model
ICLR 2024 Rejected
4
Diffusion Models are Certifiably Robust Classifiers
NeurIPS 2024 Poster
3
InstructPix2NeRF: Instructed 3D Portrait Editing from a Single Image
ICLR 2024 Poster
3
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
ICLR 2024 Rejected
4
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
ICLR 2024 Rejected
4
PINNacle: A Comprehensive Benchmark of Physics-Informed Neural Networks for Solving PDEs
ICLR 2024 Rejected