Haotian Zhang
~Haotian_Zhang3
11
论文总数
5.5
年均投稿
平均评分
接收情况7/11
会议分布
ICLR
9
COLM
1
ICML
1
发表论文 (11 篇)
20257 篇
4
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning
ICLR 2025Poster
4
MMEgo: Towards Building Egocentric Multimodal LLMs for Video QA
ICLR 2025Poster
4
Contrastive Localized Language-Image Pre-Training
ICML 2025Poster
4
Contrastive Localized Language-Image Pre-Training
ICLR 2025Rejected
3
Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms
ICLR 2025Poster
4
Improve Vision Language Model Chain-of-thought Reasoning
ICLR 2025withdrawn
4
Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models
ICLR 2025Poster
20244 篇
3
Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models
COLM 2024Poster
4
Data Curation for Large Scale Detection Pretraining
ICLR 2024withdrawn
3
Ferret: Refer and Ground Anything Anywhere at Any Granularity
ICLR 2024Spotlight
3
From Scarcity to Efficiency: Improving CLIP Training via Visual-enriched Captions
ICLR 2024withdrawn