PaperHub

Xiaodan Liang

~Xiaodan_Liang2

29
论文总数
14.5
年均投稿
5.7
平均评分
接收情况17/29
会议分布
ICLR
22
NeurIPS
6
ICML
1

发表论文 (29 篇)

202517

7.1
5

SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning

NeurIPS 2025Poster
6.4
5

GDrag:Towards General-Purpose Interactive Editing with Anti-ambiguity Point Diffusion

ICLR 2025Poster
5.7
3

Sitcom-Crafter: A Plot-Driven Human Motion Generation System in 3D Scenes

ICLR 2025Poster
4.0
4

Memory-Driven Multimodal Chain of Thought for Embodied Long-Horizon Task Planning

ICLR 2025withdrawn
6.4
5

PT-T2I/V: An Efficient Proxy-Tokenized Diffusion Transformer for Text-to-Image/Video-Task

ICLR 2025Poster
7.0
3

MMTryon: Multi-Modal Multi-Reference Control for High-Quality Fashion Generation

ICLR 2025Rejected
4.7
3

StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration

ICLR 2025Rejected
4.8
4

UncertaintyRAG: Span Uncertainty Enhanced Long-Context Modeling for Retrieval-Augmented Generation

ICLR 2025Rejected
3.0
4

ActionFiller: Fill-In-The-Blank Prompting for OS Agent

ICLR 2025Rejected
6.0
5

UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting

ICLR 2025Poster
6.8
4

WISA: World simulator assistant for physics-aware text-to-video generation

NeurIPS 2025Spotlight
4.8
4

Continual LLaVA: Continual Instruction Tuning in Large Vision-Language Models

ICLR 2025withdrawn
6.3
4

CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models

ICLR 2025Poster
6.7
3

OptiBench Meets ReSocratic: Measure and Improve LLMs for Optimization Modeling

ICLR 2025Poster
4.8
4

HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models

ICLR 2025withdrawn
5.5
4

S2-Track: A Simple yet Strong Approach for End-to-End 3D Multi-Object Tracking

ICML 2025Poster
5.3
4

EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions

ICLR 2025withdrawn

202412