Yunhang Shen
~Yunhang_Shen1
9
论文总数
4.5
年均投稿
平均评分
接收情况8/9
会议分布
NeurIPS
3
ICML
3
ICLR
3
发表论文 (9 篇)
20258 篇
4
VITA-Audio: Fast Interleaved Audio-Text Token Generation for Efficient Large Speech-Language Model
NeurIPS 2025Poster
4
DS-VLM: Diffusion Supervision Vision Language Model
ICML 2025Poster
4
Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification
ICLR 2025Poster
5
FlexiReID: Adaptive Mixture of Expert for Multi-Modal Person Re-Identification
ICML 2025Poster
4
Learning Interleaved Image-Text Comprehension in Vision-Language Large Models
ICLR 2025Poster
3
Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
ICML 2025Poster
4
VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
NeurIPS 2025Spotlight
4
Zooming from Context to Cue: Hierarchical Preference Optimization for Multi-Image MLLMs
NeurIPS 2025Poster