Xiaoyi Dong
~Xiaoyi_Dong1
16
论文总数
8.0
年均投稿
平均评分
接收情况8/16
会议分布
ICLR
10
NeurIPS
4
ICML
2
发表论文 (16 篇)
202513 篇
4
Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate
ICLR 2025withdrawn
4
PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction
ICLR 2025withdrawn
4
SAM2Long: Enhancing SAM2 for Long Video Segmentation with a Training-Free Memory Tree
ICLR 2025withdrawn
3
DualFocus: Integrating Macro and Micro Perspectives in Multi-modal Large Language Models
ICLR 2025withdrawn
5
SongComposer: A Large Language Model for Lyric and Melody Composition in Song Generation
ICLR 2025Rejected
4
MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models
ICLR 2025Poster
4
VideoRoPE: What Makes for Good Video Rotary Position Embedding?
ICML 2025Oral
4
MotionClone: Training-Free Motion Cloning for Controllable Video Generation
ICLR 2025Poster
3
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation
ICML 2025Poster
5
BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way
ICLR 2025withdrawn
4
Bootstrap3D: Improving Multi-view Diffusion Model with Synthetic Data
ICLR 2025withdrawn
5
HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance
NeurIPS 2025Poster
4
RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition
ICLR 2025Rejected