Jianhua Han
~Jianhua_Han1
Total papers: 13
Submissions per year: 6.5
Average rating
Accepted: 7/13
Venue distribution
ICLR: 9
NeurIPS: 4
Papers (13)
2025 (5 papers)
4
Towards Unified Multimodal Interleaved Generation via Group Relative Policy Optimization
NeurIPS 2025 (Poster)
4
ACT-IN-LLM: Adaptively Compression Vision Tokens in LLM for High-Resolution Multimodal Large Language Models
ICLR 2025 (Rejected)
4
HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models
ICLR 2025 (Withdrawn)
4
G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model
ICLR 2025 (Poster)
4
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
ICLR 2025 (Withdrawn)
2024 (8 papers)
3
LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model
ICLR 2024 (Withdrawn)
4
Ins-DetCLIP: Aligning Detection Model to Follow Human-Language Instruction
ICLR 2024 (Poster)
-
RealignDiff: Boosting text-to-image diffusion model with coarse-to-fine semantic re-alignment
ICLR 2024 (Rejected)
5
VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation
NeurIPS 2024 (Poster)
4
Geom-Erasing: Geometry-Driven Removal of Implicit Concept in Diffusion Models
ICLR 2024 (Withdrawn)
4
Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis
ICLR 2024 (Poster)
4
SlowFocus: Enhancing Fine-grained Temporal Understanding in Video LLM
NeurIPS 2024 (Poster)
3
UNIT: Unifying Image and Text Recognition in One Vision Encoder
NeurIPS 2024 (Poster)