Xiaodan Liang
~Xiaodan_Liang2
29
论文总数
14.5
年均投稿
平均评分
接收情况17/29
会议分布
ICLR
22
NeurIPS
6
ICML
1
发表论文 (29 篇)
202517 篇
5
SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning
NeurIPS 2025Poster
5
GDrag:Towards General-Purpose Interactive Editing with Anti-ambiguity Point Diffusion
ICLR 2025Poster
3
Sitcom-Crafter: A Plot-Driven Human Motion Generation System in 3D Scenes
ICLR 2025Poster
4
Memory-Driven Multimodal Chain of Thought for Embodied Long-Horizon Task Planning
ICLR 2025withdrawn
5
PT-T2I/V: An Efficient Proxy-Tokenized Diffusion Transformer for Text-to-Image/Video-Task
ICLR 2025Poster
3
MMTryon: Multi-Modal Multi-Reference Control for High-Quality Fashion Generation
ICLR 2025Rejected
3
StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration
ICLR 2025Rejected
4
UncertaintyRAG: Span Uncertainty Enhanced Long-Context Modeling for Retrieval-Augmented Generation
ICLR 2025Rejected
4
ActionFiller: Fill-In-The-Blank Prompting for OS Agent
ICLR 2025Rejected
5
UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting
ICLR 2025Poster
4
WISA: World simulator assistant for physics-aware text-to-video generation
NeurIPS 2025Spotlight
4
Continual LLaVA: Continual Instruction Tuning in Large Vision-Language Models
ICLR 2025withdrawn
4
CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models
ICLR 2025Poster
3
OptiBench Meets ReSocratic: Measure and Improve LLMs for Optimization Modeling
ICLR 2025Poster
4
HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models
ICLR 2025withdrawn
4
S2-Track: A Simple yet Strong Approach for End-to-End 3D Multi-Object Tracking
ICML 2025Poster
4
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
ICLR 2025withdrawn
202412 篇
4
Learning with Counterfactual Explanations for Radiology Report Generation
ICLR 2024withdrawn
4
Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars
NeurIPS 2024Poster
4
Ins-DetCLIP: Aligning Detection Model to Follow Human-Language Instruction
ICLR 2024Poster
3
LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model
ICLR 2024withdrawn
-
RealignDiff: Boosting text-to-image diffusion model with coarse-to-fine semantic re-alignment
ICLR 2024Rejected
3
Rep-Adapter: Parameter-free Automatic Adaptation of Pre-trained ConvNets via Re-parameterization
ICLR 2024Rejected
4
PIVOT-R: Primitive-Driven Waypoint-Aware World Model for Robotic Manipulation
NeurIPS 2024Poster
5
VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation
NeurIPS 2024Poster
3
MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data
ICLR 2024Spotlight
4
Proving Theorems Recursively
NeurIPS 2024Poster
4
LEGO-Prover: Neural Theorem Proving with Growing Libraries
ICLR 2024Oral
4
DQ-LoRe: Dual Queries with Low Rank Approximation Re-ranking for In-Context Learning
ICLR 2024Poster