Ming-Hsuan Yang
~Ming-Hsuan_Yang1
41
论文总数
20.5
年均投稿
平均评分
接收情况26/41
会议分布
ICLR
28
NeurIPS
11
COLM
1
ICML
1
发表论文 (41 篇)
202531 篇
4
Ranking-aware adapter for text-driven image ordering with CLIP
ICLR 2025Poster
4
Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models
ICLR 2025Rejected
4
Text Speaks Louder than Vision: ASCII Art Reveals Textual Biases in Vision-Language Models
COLM 2025Poster
4
Layout-your-3D: Controllable and Precise 3D Generation with 2D Blueprint
ICLR 2025Poster
4
Restage4D: Reanimating Deformable 3D Reconstruction from a Single Video
NeurIPS 2025Poster
4
Customized Procedure Planning in Instructional Videos
ICLR 2025Rejected
4
Gaga: Group Any Gaussians via 3D-aware Memory Bank
ICLR 2025withdrawn
4
IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation
NeurIPS 2025Poster
4
InstaInpaint: Instant 3D-Scene Inpainting with Masked Large Reconstruction Model
NeurIPS 2025Poster
4
Learning Spatial-Semantic Features for Robust Video Object Segmentation
ICLR 2025Poster
3
EA3D: Online Open-World 3D Object Extraction from Streaming Videos
NeurIPS 2025Poster
4
Three-Dimensional Trajectory Prediction with 3DMoTraj Dataset
ICML 2025Poster
3
No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images
ICLR 2025Oral
4
PredFormer: Transformers Are Effective Spatial-Temporal Predictive Learners
ICLR 2025withdrawn
4
Dynamic Pre-training: Towards Efficient and Scalable All-in-One Image Restoration
ICLR 2025withdrawn
4
Hierarchical Information Flow for Generalized Efficient Image Restoration
ICLR 2025withdrawn
5
A Simple Approach to Unifying Diffusion-based Conditional Generation
ICLR 2025Poster
4
HQGS: High-Quality Novel View Synthesis with Gaussian Splatting in Degraded Scenes
ICLR 2025Poster
4
RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird's Eye View for 3D Object Detection
ICLR 2025Poster
4
VideoAlchemy: Open-set Personalization in Video Generation
ICLR 2025withdrawn
5
MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion
ICLR 2025Spotlight
4
RelationBooth: Towards Relation-Aware Customized Object Generation
ICLR 2025withdrawn
4
HoliGS: Holistic Gaussian Splatting for Embodied View Synthesis
NeurIPS 2025Poster
3
PrML: Progressive Multi-Task Learning for Monocular 3D Human Pose Estimation
ICLR 2025Rejected
4
RAPID Hand: Robust, Affordable, Perception-Integrated, Dexterous Manipulation Platfrom for Embodied Intelligence
NeurIPS 2025Poster
4
DGS-LRM: Real-Time Deformable 3D Gaussian Reconstruction From Monocular Videos
NeurIPS 2025Poster
4
OmnixR: Evaluating Omni-modality Language Models on Reasoning across Modalities
ICLR 2025Poster
4
Kitten: A Knowledge-Intensive Evaluation of Image Generation on Visual Entities
ICLR 2025withdrawn
3
HALO: Human-Aligned End-to-end Image Retargeting with Layered Transformations
ICLR 2025Rejected
4
RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything
ICLR 2025Oral
4
4KAgent: Agentic Any Image to 4K Super-Resolution
NeurIPS 2025Poster
202410 篇
4
Video Generation Beyond a Single Clip
ICLR 2024withdrawn
4
Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding
ICLR 2024Poster
3
Towards 4D Human Video Stylization
ICLR 2024Rejected
5
Dual Associated Encoder for Face Restoration
ICLR 2024Poster
3
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow
NeurIPS 2024Poster
4
Text-driven Editing of 3D Scenes without Retraining
ICLR 2024withdrawn
4
Extending Video Masked Autoencoders to 128 frames
NeurIPS 2024Poster
4
Sharing Key Semantics in Transformer Makes Efficient Image Restoration
NeurIPS 2024Poster
3
Language Model Beats Diffusion - Tokenizer is key to visual generation
ICLR 2024Poster
4
VideoGLUE: Video General Understanding Evaluation of Foundation Models
ICLR 2024Rejected