Lijuan Wang
~Lijuan_Wang1
19
论文总数
9.5
年均投稿
平均评分
接收情况17/19
会议分布
ICLR
10
NeurIPS
7
ICML
1
COLM
1
发表论文 (19 篇)
202512 篇
4
Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark
ICML 2025Oral
4
Tuning Timestep-Distilled Diffusion Model Using Pairwise Sample Optimization
ICLR 2025Poster
3
CertainlyUncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness
ICLR 2025Poster
4
Point-RFT: Improving Multimodal Reasoning with Visually Grounded Reinforcement Finetuning
NeurIPS 2025Poster
4
GenXD: Generating Any 3D and 4D Scenes
ICLR 2025Poster
4
SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement
NeurIPS 2025Spotlight
3
EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing
ICLR 2025Poster
4
VAGEN: Reinforcing World Model Reasoning for Multi-Turn VLM Agents
NeurIPS 2025Poster
3
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
ICLR 2025Oral
4
SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation
ICLR 2025Spotlight
5
MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos
ICLR 2025Poster
4
ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs
NeurIPS 2025Poster
20247 篇
4
Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning
NeurIPS 2024Poster
4
Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
ICLR 2024Poster
4
OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation
ICLR 2024withdrawn
3
Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation
NeurIPS 2024Poster
4
DisCo: Disentangled Control for Realistic Human Dance Generation
ICLR 2024withdrawn
6
List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs
COLM 2024Poster
4
Interfacing Foundation Models' Embeddings
NeurIPS 2024Poster