Hao Fei
~Hao_Fei1
21
论文总数
10.5
年均投稿
平均评分
接收情况18/21
会议分布
NeurIPS
12
ICLR
5
ICML
3
COLM
1
发表论文 (21 篇)
202511 篇
5
On Path to Multimodal Generalist: General-Level and General-Bench
ICML 2025Oral
4
MuSLR: Multimodal Symbolic Logical Reasoning
NeurIPS 2025Poster
5
Towards Semantic Equivalence of Tokenization in Multimodal LLM
ICLR 2025Poster
4
Watch Out Your Album! On the Inadvertent Privacy Memorization in Multi-Modal Large Language Models
ICML 2025Poster
3
CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs
ICLR 2025Poster
4
Grounding is All You Need? Dual Temporal Grounding for Video Dialog
ICLR 2025withdrawn
4
Visual Thoughts: A Unified Perspective of Understanding Multimodal Chain-of-Thought
NeurIPS 2025Poster
4
Probing then Editing Response Personality of Large Language Models
COLM 2025Poster
4
$\mathcal{V}ista\mathcal{DPO}$: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models
ICML 2025Poster
4
VimoRAG: Video-based Retrieval-augmented 3D Motion Generation for Motion Language Models
NeurIPS 2025Poster
4
JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation
NeurIPS 2025Spotlight
202410 篇
4
Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
NeurIPS 2024Poster
4
NExT-GPT: Any-to-Any Multimodal LLM
ICLR 2024Rejected
4
Synergistic Dual Spatial-aware Generation of Image-to-text and Text-to-image
NeurIPS 2024Poster
4
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding
NeurIPS 2024Poster
4
What Factors Affect Multi-Modal In-Context Learning? An In-Depth Exploration
NeurIPS 2024Poster
4
Towards Complex-query Referring Image Segmentation: A Novel Benchmark
ICLR 2024withdrawn
4
Towards Unified Multimodal Editing with Enhanced Knowledge Collaboration
NeurIPS 2024Spotlight
3
Unified Generative and Discriminative Training for Multi-modal Large Language Models
NeurIPS 2024Poster
4
ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models
NeurIPS 2024Poster
4
RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation
NeurIPS 2024Oral