PaperHub

Hengshuang Zhao

~Hengshuang_Zhao2

37
Total papers
18.5
Submissions per year
5.7
Average rating
Accepted: 23/37
Venue distribution
ICLR
16
NeurIPS
15
ICML
6

Published papers (37)

2025 (23 papers)

6.4
4

LiteReality: Graphic-Ready 3D Scene Reconstruction from RGB-D Scans

NeurIPS 2025 Poster
4.5
4

LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence

ICLR 2025 Withdrawn
5.5
4

Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models

ICML 2025 Poster
4.4
4

LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence

ICML 2025 Poster
5.5
4

BOOD: Boundary-based Out-Of-Distribution Data Generation

ICLR 2025 Rejected
4.7
3

Effective LLM Knowledge Learning Requires Rethinking Generalization

ICLR 2025 Rejected
4.9
4

TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization

ICML 2025 Poster
7.2
4

BOOD: Boundary-based Out-Of-Distribution Data Generation

ICML 2025 Poster
7.5
5

PlayerOne: Egocentric World Simulator

NeurIPS 2025 Oral
4.8
4

HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models

ICLR 2025 Withdrawn
6.8
4

Seg-VAR: Image Segmentation with Visual Autoregressive Modeling

NeurIPS 2025 Poster
6.6
4

HaploVL: A Single-Transformer Baseline for Multi-Modal Understanding

ICML 2025 Poster
8.7
4

Orient Anything V2: Unifying Orientation and Rotation Understanding

NeurIPS 2025 Spotlight
6.8
4

Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

NeurIPS 2025 Poster
7.1
5

VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

NeurIPS 2025 Poster
6.0
4

MiCo: Multi-image Contrast for Reinforcement Visual Reasoning

NeurIPS 2025 Poster
3.5
4

VIRT: Vision Instructed Transformer for Robotic Manipulation

ICLR 2025 Withdrawn
6.1
4

VIP: Vision Instructed Pre-training for Robotic Manipulation

ICML 2025 Poster
6.8
4

ROSE: Remove Objects with Side Effects in Videos

NeurIPS 2025 Poster
6.3
4

OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces

ICLR 2025 Poster
7.3
4

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

NeurIPS 2025 Poster
4.8
4

Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images

ICLR 2025 Withdrawn
5.3
4

EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions

ICLR 2025 Withdrawn

2024 (14 papers)