Chaoyou Fu
~Chaoyou_Fu1
Total papers: 10
Average submissions per year: 10.0
Average rating
Accepted: 9/10
Venue distribution
NeurIPS: 4
ICML: 3
ICLR: 3
Publications (10 papers)
2025: 10 papers
4
VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
NeurIPS 2025 Spotlight
3
Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
ICML 2025 Poster
4
VITA-Audio: Fast Interleaved Audio-Text Token Generation for Efficient Large Speech-Language Model
NeurIPS 2025 Poster
4
Learning Interleaved Image-Text Comprehension in Vision-Language Large Models
ICLR 2025 Poster
5
MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?
ICLR 2025 Poster
4
MM-RLHF: The Next Step Forward in Multimodal LLM Alignment
ICML 2025 Poster
4
Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension
NeurIPS 2025 Poster
4
Zooming from Context to Cue: Hierarchical Preference Optimization for Multi-Image MLLMs
NeurIPS 2025 Poster
4
MME-FINANCE: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning
ICLR 2025 Withdrawn
4
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency
ICML 2025 Poster