Xiaojian Ma
~Xiaojian_Ma1
14
论文总数
7.0
年均投稿
平均评分
接收情况9/14
会议分布
ICLR
10
NeurIPS
3
ICML
1
发表论文 (14 篇)
20258 篇
4
LongViTU: Instruction Tuning for Long-Form Video Understanding
ICLR 2025withdrawn
4
NEP: Autoregressive Image Editing via Next Editing Token Prediction
NeurIPS 2025Poster
4
Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage
ICLR 2025Spotlight
4
Falcon: Fast Visuomotor Policies via Partial Denoising
ICML 2025Poster
4
GROOT-2: Weakly Supervised Multimodal Instruction Following Agents
ICLR 2025Poster
4
Iterative Tool Usage Exploration for Multimodal Agents via Step-wise Preference Tuning
NeurIPS 2025Poster
4
Task-oriented Sequential Grounding in 3D Scenes
ICLR 2025Rejected
4
Optimizing Inference-Time Reasoning in LLMs via Retrieval-Augmented Reflection
ICLR 2025Rejected
20246 篇
4
Bongard-OpenWorld: Few-Shot Reasoning for Free-form Visual Concepts in the Real World
ICLR 2024Poster
5
MindAgent: Emergent Gaming Interaction
ICLR 2024withdrawn
4
An Embodied Generalist Agent in 3D World
ICLR 2024Rejected
4
GROOT: Learning to Follow Instructions by Watching Gameplay Videos
ICLR 2024Spotlight
5
MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning
ICLR 2024Poster
4
OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents
NeurIPS 2024Poster