Yuhang Zang
~Yuhang_Zang1
Total papers: 16
Average submissions per year: 8.0
Average rating
Accepted: 10/16
Venue distribution
ICLR: 9
NeurIPS: 5
ICML: 2
Papers (16)
2025 (12 papers)
4
MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models
ICLR 2025 Poster
4
VideoRoPE: What Makes for Good Video Rotary Position Embedding?
ICML 2025 Oral
4
RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition
ICLR 2025 Rejected
4
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning
NeurIPS 2025 Poster
4
Bootstrap3D: Improving Multi-view Diffusion Model with Synthetic Data
ICLR 2025 Withdrawn
4
Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate
ICLR 2025 Withdrawn
4
SAM2Long: Enhancing SAM2 for Long Video Segmentation with a Training-Free Memory Tree
ICLR 2025 Withdrawn
4
MotionClone: Training-Free Motion Cloning for Controllable Video Generation
ICLR 2025 Poster
4
PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction
ICLR 2025 Withdrawn
5
BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way
ICLR 2025 Withdrawn
3
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation
ICML 2025 Poster
5
HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance
NeurIPS 2025 Poster
2024 (4 papers)
4
Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalization
ICLR 2024 Poster
3
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
NeurIPS 2024 Poster
4
Streaming Long Video Understanding with Large Language Models
NeurIPS 2024 Poster
4
Are We on the Right Way for Evaluating Large Vision-Language Models?
NeurIPS 2024 Poster