Ziwei Liu
~Ziwei_Liu1
44
论文总数
22.0
年均投稿
平均评分
接收情况25/44
会议分布
ICLR
31
NeurIPS
12
ICML
1
发表论文 (44 篇)
202527 篇
4
Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution
ICLR 2025Poster
3
Streamline Without Sacrifice - Squeeze out Computation Redundancy in LMM
ICML 2025Poster
4
GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting
ICLR 2025Rejected
4
PhysX-3D: Physical-Grounded 3D Asset Generation
NeurIPS 2025Spotlight
4
FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model
ICLR 2025Rejected
5
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion
ICLR 2025Poster
4
AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation
ICLR 2025Poster
4
Video World Models with Long-term Spatial Memory
NeurIPS 2025Poster
4
Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos
NeurIPS 2025Poster
4
FreeTraj: Tuning-Free Trajectory Control via Noise Guided Video Diffusion
ICLR 2025Rejected
3
GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior
NeurIPS 2025Poster
4
Imagine360: Immersive 360 Video Generation from Perspective Anchor
NeurIPS 2025Poster
4
Video Instruction Tuning with Synthetic Data
ICLR 2025withdrawn
4
EgoLM: Multi-Modal Language Model of Egocentric Motions
ICLR 2025withdrawn
4
FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality
ICLR 2025Poster
3
X-PlugVid: Versatile Adaptation of Image Plugins for Controllable Video Generation
ICLR 2025withdrawn
4
DynamicCity: Large-Scale 4D Occupancy Generation from Dynamic Scenes
ICLR 2025Spotlight
4
GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data
NeurIPS 2025Poster
5
InterDance: Reactive 3D Dance Generation with Realistic Duet Interactions
ICLR 2025Rejected
4
VEnhancer: Generative Space-Time Enhancement for Video Generation
ICLR 2025Rejected
4
VideoLucy: Deep Memory Backtracking for Long Video Understanding
NeurIPS 2025Poster
3
Unsolvable Problem Detection: Evaluating Trustworthiness of Large Multimodal Models
ICLR 2025Rejected
4
GOOD: Training-Free Guided Diffusion Sampling for Out-of-Distribution Detection
NeurIPS 2025Poster
5
Long Context Transfer from Language to Vision
ICLR 2025Rejected
-
MVPaint: 3D Texture Generation with Multi-View Consistency
ICLR 2025withdrawn
4
3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion
ICLR 2025withdrawn
4
ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models
NeurIPS 2025Poster
202417 篇
4
AID: Attention Interpolation of Text-to-Image Diffusion
NeurIPS 2024Poster
4
Revisiting Class-Incremental Learning with Pre-Trained Models: Generalizability and Adaptivity are All You Need
ICLR 2024withdrawn
4
MoveAnything: Controllable Scene Generation with Text-to-Image Diffusion Models
ICLR 2024withdrawn
4
DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation
ICLR 2024Oral
3
Large-Vocabulary 3D Diffusion Model with Transformer
ICLR 2024Poster
4
Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment
ICLR 2024Poster
5
Make-it-Real: Unleashing Large Multimodal Model for Painting 3D Objects with Realistic Materials
NeurIPS 2024Poster
3
Learning without Forgetting for Vision-Language Models
ICLR 2024Rejected
4
LAVITA: Latent Video Diffusion Models with Spatio-temporal Transformers
ICLR 2024withdrawn
4
FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling
ICLR 2024Poster
4
L4GM: Large 4D Gaussian Reconstruction Model
NeurIPS 2024Poster
4
HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion
ICLR 2024Poster
4
MMBench: Is Your Multi-modal Model an All-around Player?
ICLR 2024Rejected
4
SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction
ICLR 2024Poster
4
Learning Embodied Vision-Language Programming From Instruction, Exploration, and Environmental Feedback
ICLR 2024Rejected
4
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation
ICLR 2024Spotlight
4
LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models
ICLR 2024Rejected