影响力指数

51.24/100

前 8%

全站排名 #5,124

发表论文16 篇

平均评分5.3

年均产出5.3 篇/年

Jianhua Han

Researcher@Huawei Technologies Ltd.·中国·OpenReview

研究方向

Vision-Language Learning · Multi-modal Learning · Object Detection

SemHiTok: A Unified Image Tokenizer via Semantic-Guided Hierarchical Codebook for Multimodal Understanding and Generation

ICLR 2026Poster

Does Your 3D Encoder Really Work? A simple yet effective pathway to real 3D scene understanding

ICLR 2026Rejected

C2-Evo: Co-Evolving Multimodal Data and Model for Self-Improving Reasoning

ICLR 2026Rejected

Towards Unified Multimodal Interleaved Generation via Group Relative Policy Optimization

NeurIPS 2025Poster

ACT-IN-LLM: Adaptively Compression Vision Tokens in LLM for High-Resolution Multimodal Large Language Models

ICLR 2025Rejected

G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model

ICLR 2025Poster

EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions

ICLR 2025Withdrawn

HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models

ICLR 2025Withdrawn

合作者 (20)