PaperHub

Renrui Zhang

~Renrui_Zhang1

27
论文总数
13.5
年均投稿
6.3
平均评分
接收情况21/27
会议分布
ICLR
14
NeurIPS
12
ICML
1

发表论文 (27 篇)

202518

6.5
4

MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data Engine

ICLR 2025Poster
6.4
4

MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning

NeurIPS 2025Poster
6.5
4

MMSearch: Unveiling the Potential of Large Models as Multi-modal Search Engines

ICLR 2025Poster
5.5
4

MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency

ICML 2025Poster
7.3
3

LLaVA-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models

ICLR 2025Spotlight
7.3
4

T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT

NeurIPS 2025Poster
7.8
4

Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO

NeurIPS 2025Poster
7.0
3

UniCTokens: Boosting Personalized Understanding and Generation via Unified Concept Tokens

NeurIPS 2025Poster
6.0
4

PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions

ICLR 2025Poster
6.8
4

What We Miss Matters: Learning from the Overlooked in Point Cloud Transformers

NeurIPS 2025Poster
8.2
4

Adaptive Classifier-Free Guidance via Dynamic Low-Confidence Masking

NeurIPS 2025Poster
4.8
4

PointACL: Point Cloud Understanding via Attention-Driven Contrastive Learning

ICLR 2025withdrawn
7.3
4

Fast-in-Slow: A Dual-System VLA Model Unifying Fast Manipulation within Slow Reasoning

NeurIPS 2025Poster
6.0
4

HybridVLA: Collaborative Autoregression and Diffusion in a Unified Vision-Language-Action Model

NeurIPS 2025Rejected
6.4
3

Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos

NeurIPS 2025Poster
4.0
4

TerDiT: Ternary Diffusion Models with Transformers

ICLR 2025withdrawn
7.2
5

Lumina-T2X: Scalable Flow-based Large Diffusion Transformer for Flexible Resolution Generation

ICLR 2025Spotlight
6.4
4

AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation

NeurIPS 2025Poster

20249