影响力指数

98.08/100

前 0.1%

全站排名 #50

发表论文64 篇

平均评分5.6

年均产出21.3 篇/年

Ziwei Liu

Associate Professor@Nanyang Technological University·新加坡·OpenReview

研究方向

Computer Vision · Machine Learning · Computer Graphics

EgoTwin: Dreaming Body and View in First Person

ICLR 2026Poster

IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction

ICLR 2026Poster

JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization

ICLR 2026Poster

From Spatial to Actions: Grounding Vision-Language-Action Model in Spatial Foundation Priors

ICLR 2026Poster

Light-X: Generative 4D Video Rendering with Camera and Illumination Control

ICLR 2026Poster

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

ICLR 2026Poster

EchoBench: Benchmarking Sycophancy in Medical Large Vision-Language Models

ICLR 2026Rejected

VISA: Preserving Fine-Grained Perception in MLLMs via Visual Semantic Anchoring

ICLR 2026Desk Rejected

JavisDiT++: Unified Modeling and Optimization for Joint Audio-Video Generation

ICLR 2026Poster

The Quest for Generalizable Motion Generation: Data, Model, and Evaluation

ICLR 2026Poster

Visual Jigsaw Post-Training Improves MLLMs

ICLR 2026Poster

4DNeX: Feed-Forward 4D Generative Modeling Made Easy

ICLR 2026Withdrawn

Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning

ICLR 2026Rejected

ArtHOI: Articulated Human-Object Interaction Synthesis via Dynamics Distillation

ICLR 2026Withdrawn

RealDPO: Real or Not Real, that is the Preference

ICLR 2026Withdrawn

Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition

ICLR 2026Withdrawn

RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark

ICLR 2026Withdrawn

HSImul3R: Reconstructing Simulation-Ready Human-Scene-Interaction from Sparse Views

ICLR 2026Withdrawn

SafeRBench: A Comprehensive Benchmark for Safety Assessment of Large Reasoning Models

ICLR 2026Withdrawn

DiverseAR: Boosting Diversity in Bitwise Autoregressive Image Generation

ICLR 2026Withdrawn

PhysX-3D: Physical-Grounded 3D Asset Generation

NeurIPS 2025Spotlight

Video World Models with Long-term Spatial Memory

NeurIPS 2025Poster

DynamicCity: Large-Scale 4D Occupancy Generation from Dynamic Scenes

ICLR 2025Spotlight

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

NeurIPS 2025Poster

Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos

NeurIPS 2025Poster

Imagine360: Immersive 360 Video Generation from Perspective Anchor

NeurIPS 2025Poster

VideoLucy: Deep Memory Backtracking for Long Video Understanding

NeurIPS 2025Poster

GOOD: Training-Free Guided Diffusion Sampling for Out-of-Distribution Detection

NeurIPS 2025Poster

GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior

NeurIPS 2025Poster

GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data

NeurIPS 2025Poster

Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

ICLR 2025Poster

Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution

ICLR 2025Poster

Long Context Transfer from Language to Vision

ICLR 2025Rejected

FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model

ICLR 2025Rejected

Unsolvable Problem Detection: Evaluating Trustworthiness of Large Multimodal Models

ICLR 2025Rejected

InterDance: Reactive 3D Dance Generation with Realistic Duet Interactions

ICLR 2025Rejected

GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting

ICLR 2025Rejected

AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation

ICLR 2025Poster

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

ICLR 2025Poster

FreeTraj: Tuning-Free Trajectory Control via Noise Guided Video Diffusion

ICLR 2025Rejected

EgoLM: Multi-Modal Language Model of Egocentric Motions

ICLR 2025Withdrawn

VEnhancer: Generative Space-Time Enhancement for Video Generation

ICLR 2025Rejected

3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion

ICLR 2025Withdrawn

Video Instruction Tuning with Synthetic Data

ICLR 2025Withdrawn

X-PlugVid: Versatile Adaptation of Image Plugins for Controllable Video Generation

ICLR 2025Withdrawn

Streamline Without Sacrifice - Squeeze out Computation Redundancy in LMM

ICML 2025Poster

MVPaint: 3D Texture Generation with Multi-View Consistency

ICLR 2025Withdrawn

合作者 (20)