Zuxuan Wu
~Zuxuan_Wu1
17
论文总数
8.5
年均投稿
平均评分
接收情况11/17
会议分布
NeurIPS
9
ICLR
7
COLM
1
发表论文 (17 篇)
202511 篇
5
GeoGS3D: Single-view 3D Reconstruction via Geometric-aware Diffusion Model and Gaussian Splatting
ICLR 2025withdrawn
4
Adaptive Retention & Correction: Test-Time Training for Continual Learning
ICLR 2025Poster
4
BlockDance: Reuse Structurally Similar Spatio-Temporal Features to Accelerate Diffusion Transformers
ICLR 2025withdrawn
3
TinyMem: Condensing Multimodal Memory for Long-form Video Action Detection
ICLR 2025withdrawn
4
Seg2Any: Open-set Segmentation-Mask-to-Image Generation with Precise Shape and Semantic Control
NeurIPS 2025Poster
4
ForgerySleuth: Empowering Multimodal Large Language Models for Image Manipulation Detection
NeurIPS 2025Poster
4
OmniGen-AR: AutoRegressive Any-to-Image Generation
NeurIPS 2025Poster
4
UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation
NeurIPS 2025Poster
4
Hydra-MDP++: Advancing End-to-End Driving via Hydra-Distillation with Expert-Guided Decision Analysis
ICLR 2025withdrawn
4
INST-IT: Boosting Instance Understanding via Explicit Visual Prompt Instruction Tuning
NeurIPS 2025Poster
4
AgentGym: Evaluating and Evolving Large Language Model-based Agents across Diverse Envronments
ICLR 2025Rejected
20246 篇
5
GenRec: Unifying Video Generation and Recognition with Diffusion Models
NeurIPS 2024Poster
4
Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms
NeurIPS 2024Poster
4
OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation
NeurIPS 2024Poster
3
DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs
NeurIPS 2024Poster
4
Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation
ICLR 2024withdrawn
4
Poly-Visual-Expert Vision-Language Models
COLM 2024Poster