影响力指数

98/100

前 0.1%

全站排名 #52

发表论文64 篇

平均评分5.5

年均产出21.3 篇/年

Mengdi Wang

Full Professor@Princeton University·美国·OpenReview

研究方向

optimization · reinforcement learning · large language models · AI for science

FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction

ICLR 2026Poster

ARMOR: High-Performance Semi-Structured Pruning via Adaptive Matrix Factorization

ICLR 2026Poster

GeneBreaker: Jailbreak Attacks against DNA Language Models with Pathogenicity Guidance

ICLR 2026Poster

Alita-G: Self-Evolving Generative Agent for Agent Generation

ICLR 2026Desk Rejected

MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation

ICLR 2026Poster

FMIP: Joint Continuous-Integer Flow For Mixed-Integer Linear Programming

ICLR 2026Poster

CubeBench: Diagnosing Interactive, Long-Horizon Physical Intelligence under Partial Observations

ICLR 2026Poster

Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models

ICLR 2026Poster

PoseX: AI Defeats Physics-based Methods on Protein Ligand Cross-Docking

ICLR 2026Poster

Teaching Language Model to Act Efficiently

ICLR 2026Rejected

AutoTool: Dynamic Tool Selection and Integration for Agentic Reasoning

ICLR 2026Rejected

Detecting Post-generation Edits to Watermarked LLM Outputs via Combinatorial Watermarking

ICLR 2026Rejected

SafeProtein: Red-Teaming Framework and Benchmark for Protein Foundation Models

ICLR 2026Rejected

On the Role of Preference Variance in Preference Optimization

ICLR 2026Rejected

Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution

ICLR 2026Desk Rejected

Demystifying Reinforcement Learning in Agentic Reasoning

ICLR 2026Rejected

On Path to Multimodal Historical Reasoning: HistBench and HistAgent

ICLR 2026Rejected

SafeThink: A Key to Safety in Multi-Modal Large Reasoning Models

ICLR 2026Withdrawn

AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes

ICLR 2026Rejected

CURE: Co-Evolving Coders and Unit Testers via Reinforcement Learning

NeurIPS 2025Spotlight

Does Thinking More Always Help? Mirage of Test-Time Scaling in Reasoning Models

NeurIPS 2025Poster

ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs

NeurIPS 2025Poster

Training-Free Guidance Beyond Differentiability: Scalable Path Steering with Tree Search in Diffusion and Flow Models

NeurIPS 2025Poster

DISC: Dynamic Decomposition Improves LLM Inference Scaling

NeurIPS 2025Poster

MMaDA: Multimodal Large Diffusion Language Models

NeurIPS 2025Poster

IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation

ICLR 2025Poster

Securing the Language of Life: Inheritable Watermarks from DNA Language Models to Proteins

NeurIPS 2025Poster

Diffusion Transformer Captures Spatial-Temporal Dependencies: A Theory for Gaussian Process Data

ICLR 2025Poster

NoWag: A Unified Framework for Shape Preserving Com- pression of Large Language Models

COLM 2025Poster

SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths

COLM 2025Poster

Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models

ICML 2025Poster

A First-order Generative Bilevel Optimization Framework for Diffusion Models

ICML 2025Poster

Collab: Controlled Decoding using Mixture of Agents for LLM Alignment

ICLR 2025Poster

MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations

ICML 2025Poster

A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement

ICLR 2025Poster

Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias

ICLR 2025Poster

Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow

ICLR 2025Poster

SAIL: Self-improving Efficient Online Alignment of Large Language Models

ICLR 2025Rejected

SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths

ICLR 2025Rejected

Online Detection for Black-Box Large Language Models with Adaptive Prompt Selection

ICLR 2025Rejected

Regularized DeepIV with Model Selection

ICLR 2025Rejected

LIAR: Leveraging Inverse Alignment to Jailbreak LLMs in Seconds

ICLR 2025Rejected

Relative-Translation Invariant Wasserstein Distance

ICLR 2025Rejected

AIME: AI System Optimization via Multiple LLM Evaluators

ICLR 2025Withdrawn

GuideCO: Training Objective-Guided Diffusion Solver with Imperfect Data for Combinatorial Optimization

ICLR 2025Withdrawn

合作者 (20)

Souradip Chakraborty