PaperHub

Ziwei Liu

~Ziwei_Liu1

44
论文总数
22.0
年均投稿
5.9
平均评分
接收情况25/44
会议分布
ICLR
31
NeurIPS
12
ICML
1

发表论文 (44 篇)

202527

6.0
4

Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution

ICLR 2025Poster
4.0
3

Streamline Without Sacrifice - Squeeze out Computation Redundancy in LMM

ICML 2025Poster
5.5
4

GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting

ICLR 2025Rejected
8.2
4

PhysX-3D: Physical-Grounded 3D Asset Generation

NeurIPS 2025Spotlight
5.8
4

FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model

ICLR 2025Rejected
6.2
5

Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

ICLR 2025Poster
5.5
4

AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation

ICLR 2025Poster
8.2
4

Video World Models with Long-term Spatial Memory

NeurIPS 2025Poster
6.8
4

Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos

NeurIPS 2025Poster
5.5
4

FreeTraj: Tuning-Free Trajectory Control via Noise Guided Video Diffusion

ICLR 2025Rejected
6.4
3

GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior

NeurIPS 2025Poster
6.8
4

Imagine360: Immersive 360 Video Generation from Perspective Anchor

NeurIPS 2025Poster
4.5
4

Video Instruction Tuning with Synthetic Data

ICLR 2025withdrawn
5.0
4

EgoLM: Multi-Modal Language Model of Egocentric Motions

ICLR 2025withdrawn
5.5
4

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

ICLR 2025Poster
4.3
3

X-PlugVid: Versatile Adaptation of Image Plugins for Controllable Video Generation

ICLR 2025withdrawn
7.5
4

DynamicCity: Large-Scale 4D Occupancy Generation from Dynamic Scenes

ICLR 2025Spotlight
6.4
4

GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data

NeurIPS 2025Poster
5.6
5

InterDance: Reactive 3D Dance Generation with Realistic Duet Interactions

ICLR 2025Rejected
5.0
4

VEnhancer: Generative Space-Time Enhancement for Video Generation

ICLR 2025Rejected
6.8
4

VideoLucy: Deep Memory Backtracking for Long Video Understanding

NeurIPS 2025Poster
5.7
3

Unsolvable Problem Detection: Evaluating Trustworthiness of Large Multimodal Models

ICLR 2025Rejected
6.8
4

GOOD: Training-Free Guided Diffusion Sampling for Out-of-Distribution Detection

NeurIPS 2025Poster
5.8
5

Long Context Transfer from Language to Vision

ICLR 2025Rejected
-

MVPaint: 3D Texture Generation with Multi-View Consistency

ICLR 2025withdrawn
4.8
4

3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion

ICLR 2025withdrawn
7.3
4

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

NeurIPS 2025Poster

202417

4.8
4

AID: Attention Interpolation of Text-to-Image Diffusion

NeurIPS 2024Poster
4.8
4

Revisiting Class-Incremental Learning with Pre-Trained Models: Generalizability and Adaptivity are All You Need

ICLR 2024withdrawn
4.5
4

MoveAnything: Controllable Scene Generation with Text-to-Image Diffusion Models

ICLR 2024withdrawn
8.5
4

DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation

ICLR 2024Oral
6.0
3

Large-Vocabulary 3D Diffusion Model with Transformer

ICLR 2024Poster
6.3
4

Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment

ICLR 2024Poster
5.4
5

Make-it-Real: Unleashing Large Multimodal Model for Painting 3D Objects with Realistic Materials

NeurIPS 2024Poster
5.7
3

Learning without Forgetting for Vision-Language Models

ICLR 2024Rejected
4.0
4

LAVITA: Latent Video Diffusion Models with Spatio-temporal Transformers

ICLR 2024withdrawn
5.8
4

FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling

ICLR 2024Poster
5.8
4

L4GM: Large 4D Gaussian Reconstruction Model

NeurIPS 2024Poster
7.5
4

HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion

ICLR 2024Poster
5.3
4

MMBench: Is Your Multi-modal Model an All-around Player?

ICLR 2024Rejected
5.5
4

SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

ICLR 2024Poster
4.0
4

Learning Embodied Vision-Language Programming From Instruction, Exploration, and Environmental Feedback

ICLR 2024Rejected
7.0
4

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation

ICLR 2024Spotlight
5.5
4

LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models

ICLR 2024Rejected