Linjie Li
~Linjie_Li1
21
Total papers
10.5
Submissions per year
Average rating
Accepted: 18/21
Venue distribution
ICLR
11
NeurIPS
7
ICML
2
COLM
1
Published papers (21)
2025 (13 papers)
3
CertainlyUncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness
ICLR 2025 Poster
4
Point-RFT: Improving Multimodal Reasoning with Visually Grounded Reinforcement Finetuning
NeurIPS 2025 Poster
4
OmniContrast: Vision-Language-Interleaved Contrast from Pixels All at once
ICLR 2025 Rejected
4
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback
ICML 2025 Poster
4
Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark
ICML 2025 Oral
4
VAGEN: Reinforcing World Model Reasoning for Multi-Turn VLM Agents
NeurIPS 2025 Poster
4
SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement
NeurIPS 2025 Spotlight
4
GenXD: Generating Any 3D and 4D Scenes
ICLR 2025 Poster
3
EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing
ICLR 2025 Poster
4
SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation
ICLR 2025 Spotlight
5
MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos
ICLR 2025 Poster
4
ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs
NeurIPS 2025 Poster
3
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
ICLR 2025 Oral
2024 (8 papers)
4
DisCo: Disentangled Control for Realistic Human Dance Generation
ICLR 2024 Withdrawn
4
Interfacing Foundation Models' Embeddings
NeurIPS 2024 Poster
4
Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning
NeurIPS 2024 Poster
4
Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
ICLR 2024 Poster
4
OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation
ICLR 2024 Withdrawn
3
Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation
NeurIPS 2024 Poster
4
The Generative AI Paradox: “What It Can Create, It May Not Understand”
ICLR 2024 Poster
6
List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs
COLM 2024 Poster