Xiang Bai
~Xiang_Bai1
Total papers: 18
Submissions per year: 9.0
Average rating
Accepted: 11/18
Venue distribution
NeurIPS: 9
ICLR: 8
ICML: 1
Published papers (18)
2025: 13 papers
4
MSTAR: Box-free Multi-query Scene Text Retrieval with Attention Recycling
NeurIPS 2025 Poster
5
PlayerOne: Egocentric World Simulator
NeurIPS 2025 Oral
4
More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models
NeurIPS 2025 Poster
4
Mini-Monkey: Alleviating the Semantic Sawtooth Effect for Lightweight MLLMs via Complementary Image Pyramid
ICLR 2025 Poster
4
MiCo: Multi-image Contrast for Reinforcement Visual Reasoning
NeurIPS 2025 Poster
4
NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding
NeurIPS 2025 Poster
4
VIP: Vision Instructed Pre-training for Robotic Manipulation
ICML 2025 Poster
4
VIRT: Vision Instructed Transformer for Robotic Manipulation
ICLR 2025 Withdrawn
4
MCTBench: Multimodal Cognition towards Text-Rich Visual Scenes Benchmark
ICLR 2025 Rejected
4
A Framework of Distilling Multimodal Large Language Models
ICLR 2025 Rejected
3
R-CoT: Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models
ICLR 2025 Withdrawn
5
MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering
ICLR 2025 Withdrawn
4
TextSquare: Scaling up Text-Centric Visual Instruction Tuning
ICLR 2025 Withdrawn
2024: 5 papers
3
SingleInsert: Inserting New Concepts from a Single Image into Text-to-Image Models for Flexible Editing
ICLR 2024 Rejected
4
A Unified Framework for 3D Scene Understanding
NeurIPS 2024 Poster
4
MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks
NeurIPS 2024 Poster
4
LION: Linear Group RNN for 3D Object Detection in Point Clouds
NeurIPS 2024 Poster
4
PointMamba: A Simple State Space Model for Point Cloud Analysis
NeurIPS 2024 Poster