影响力指数

94.31/100

前 0.3%

全站排名 #197

发表论文35 篇

平均评分5.7

年均产出11.7 篇/年

Hanwang Zhang

Full Professor@Nanyang Technological University·新加坡·OpenReview

研究方向

causal inference · scene graph generation · vision-language

Benchmarking Open-Set Recognition Beyond Vision-Language Pre-training

ICLR 2026Rejected

Streaming Drag-Oriented Interactive Video Manipulation: Drag Anything, Anytime!

ICLR 2026Poster

Reducing Class-Wise Performance Disparity via Margin Regularization

ICLR 2026Poster

Real-Time Motion-Controllable Autoregressive Video Diffusion

ICLR 2026Poster

Look Carefully: Adaptive Visual Reinforcements in Multimodal Large Language Models for Hallucination Mitigation

ICLR 2026Poster

Generative Distribution Distillation

ICLR 2026Withdrawn

On Path to Multimodal Generalist: General-Level and General-Bench

Enhancing CLIP Robustness via Cross-Modality Alignment

NeurIPS 2025Spotlight

Selftok-Zero: Reinforcement Learning for Visual Generation via Discrete and Autoregressive Visual Tokens

NeurIPS 2025Poster

$\mathcal{V}ista\mathcal{DPO}$: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models

ICML 2025Poster

Co-Reinforcement Learning for Unified Multimodal Understanding and Generation

NeurIPS 2025Spotlight

Vinci: Deep Thinking in Text-to-Image Generation using Unified Model with Reinforcement Learning

NeurIPS 2025Poster

Towards Semantic Equivalence of Tokenization in Multimodal LLM

ICLR 2025Poster

VR-Sampling: Accelerating Flow Generative Model Training with Variance Reduction Sampling

ICLR 2025Withdrawn

Learning to Animate Images from A Few Videos to Portray Delicate Human Actions

ICLR 2025Withdrawn

3D Question Answering via only 2D Vision-Language Models

ICML 2025Poster

Geo-3DGS: Multi-view Geometry Consistency for 3D Gaussian Splatting and Surface Reconstruction

ICLR 2025Rejected

Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing

ICML 2025Poster

Object Fusion via Diffusion Time-step for Customized Image Editing with Single Example

ICLR 2025Withdrawn

Towards Debiased Source-Free Domain Adaptation

ICLR 2025Withdrawn

A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training

ICLR 2025Withdrawn

合作者 (20)