Impact Index

77.88/100

Top 1.6%

Site-wide rank: #1,012

Papers published: 26

Average rating: 5.3

Average output: 8.7 papers/year

Hang Xu

Researcher @ Huawei Noah's Ark Lab · Hong Kong, China · OpenReview

Research Areas

Machine Learning · Computer Vision · Object Detection

6.0

UniRestorer: Universal Image Restoration via Adaptively Estimating Image Degradation at Proper Granularity

ICLR 2026 Poster

5.5

UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding

ICLR 2026 Poster

5.0

SemHiTok: A Unified Image Tokenizer via Semantic-Guided Hierarchical Codebook for Multimodal Understanding and Generation

ICLR 2026 Poster

4.5

Does Your 3D Encoder Really Work? A simple yet effective pathway to real 3D scene understanding

ICLR 2026 Rejected

4.5

C2-Evo: Co-Evolving Multimodal Data and Model for Self-Improving Reasoning

ICLR 2026 Rejected

4.0

GLaVE-Cap: Global-Local Aligned Video Captioning with Vision Expert Integration

ICLR 2026 Rejected

6.4

Towards Unified Multimodal Interleaved Generation via Group Relative Policy Optimization

NeurIPS 2025 Poster

6.0

FreqPrior: Improving Video Diffusion Models with Frequency Filtering Gaussian Noise

ICLR 2025 Poster

6.0

UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting

ICLR 2025 Poster

5.5

ACT-IN-LLM: Adaptively Compression Vision Tokens in LLM for High-Resolution Multimodal Large Language Models

ICLR 2025 Rejected

5.5

INST-IT: Boosting Instance Understanding via Explicit Visual Prompt Instruction Tuning

NeurIPS 2025 Poster

5.5

G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model

ICLR 2025 Poster

5.3

EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions

ICLR 2025 Withdrawn

Corresponding author

5.0

4D-VLA: Spatiotemporal Vision-Language-Action Pretraining with Cross-Scene Calibration

NeurIPS 2025 Poster

4.8

HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models

Collaborators (20)