影响力指数

81.28/100

前 1.3%

全站排名 #813

发表论文27 篇

平均评分5.5

年均产出9.0 篇/年

Jianwei Yang

Researcher@Microsoft·美国·OpenReview

研究方向

Vision and Language · Machine Learning · Computer vision

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

ICLR 2026Rejected

Agent Learning via Early Experience

ICLR 2026Rejected

Self-supervised Sparse Vision Concepts for Image Understanding and Reconstruction

ICLR 2026Rejected

Lemon: A Unified and Scalable 3D Multimodal Model for Universal Spatial Understanding

ICLR 2026Desk Rejected

TemporalBench: Evaluating Fine-Grained Temporal Dynamics Understanding for Multimodal Models

ICLR 2026Withdrawn

GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

NeurIPS 2025Poster

TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies

ICLR 2025Poster

Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillation

NeurIPS 2025Poster

Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs

NeurIPS 2025Poster

MindJourney: Test-Time Scaling with World Models for Spatial Reasoning

NeurIPS 2025Poster

Simplifying DINO via Coding Rate Regularization

ICML 2025Poster

Matryoshka Multimodal Models

ICLR 2025Poster

Latent Action Pretraining from Videos

ICLR 2025Poster

ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding

ICML 2025Poster

OmniParser for Pure Vision Based GUI Agent

ICLR 2025Rejected

TemporalBench: Towards Fine-grained Temporal Understanding for Multimodal Video Models

ICLR 2025Withdrawn

Evaluating Graphical Perception of Large Multimodal Models

ICLR 2025Withdrawn

合作者 (20)