Chunyuan Li
~Chunyuan_Li1
15
Total papers
7.5
Average submissions per year
Average rating
Accepted: 6/15
Venue distribution
ICLR
14
NeurIPS
1
Papers (15)
2025 (7 papers)
5
Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning
ICLR 2025 Poster
4
Video Instruction Tuning with Synthetic Data
ICLR 2025 Withdrawn
3
LLaVA-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models
ICLR 2025 Spotlight
4
LLaVA-Critic: Learning to Evaluate Multimodal Models
ICLR 2025 Withdrawn
5
Long Context Transfer from Language to Vision
ICLR 2025 Rejected
4
MMSearch: Unveiling the Potential of Large Models as Multi-modal Search Engines
ICLR 2025 Poster
5
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
ICLR 2025 Poster
2024 (8 papers)
4
Understanding Multimodal Instruction Format for In-context Learning
ICLR 2024 Rejected
-
Knowledge-Augmented Large Vision-and-Language Assistant
ICLR 2024 Withdrawn
4
Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment
NeurIPS 2024 Poster
3
MedJourney: Counterfactual Medical Image Generation by Instruction-Learning from Multimodal Patient Journeys
ICLR 2024 Rejected
4
Aligning Large Multimodal Models with Factually Augmented RLHF
ICLR 2024 Rejected
4
MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts
ICLR 2024 Oral
3
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
ICLR 2024 Rejected
4
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
ICLR 2024 Rejected