影响力指数

78.38/100

前 1.5%

全站排名 #984

发表论文31 篇

平均评分5.4

年均产出10.3 篇/年

Peng Gao

Researcher@Alibaba Group·中国·OpenReview

研究方向

Multimodal Learning

Decoupled DMD: CFG Augmentation as the Spear, Distribution Matching as the Shield

ICLR 2026Poster

Lumina-OmniLV: A Unified Multimodal Framework for General Low-Level Vision

ICLR 2026Rejected

Contrastive Self-Rewarding MLLM

ICLR 2026Withdrawn

Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning

ICLR 2026Withdrawn

EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation

NeurIPS 2025Poster

Lumina-T2X: Scalable Flow-based Large Diffusion Transformer for Flexible Resolution Generation

ICLR 2025Spotlight

MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data Engine

ICLR 2025Poster

MMSearch: Unveiling the Potential of Large Models as Multi-modal Search Engines

ICLR 2025Poster

Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining

ICLR 2025Rejected

PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions

ICLR 2025Poster

Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want

ICLR 2025Poster

MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency

ICML 2025Poster

LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models

ICLR 2025Rejected

I-Max: Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers with Projected Flow

ICLR 2025Rejected

VEnhancer: Generative Space-Time Enhancement for Video Generation

ICLR 2025Rejected

AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction

ICLR 2025Withdrawn

Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling

ICLR 2025Rejected

Exploring the Design Space of Autoregressive Models for Efficient and Scalable Image Generation

ICLR 2025Withdrawn

TerDiT: Ternary Diffusion Models with Transformers

ICLR 2025Withdrawn

EfficientQAT: Efficient Quantization-Aware Training for Large Language Models

ICLR 2025Rejected

合作者 (20)

博士导师17 篇