影响力指数

96.07/100

前 0.2%

全站排名 #137

发表论文34 篇

平均评分6.2

年均产出11.3 篇/年

Wen Sun

Assistant Professor@Cornell University·OpenReview

研究方向

Imitation Learning · Reinforcement Learning

All Roads Lead to Likelihood: The Value of Reinforcement Learning in Fine-Tuning

ICLR 2026Poster

Prompt Curriculum Learning for Efficient LLM Post-Training

ICLR 2026Poster

Expressive Value Learning for Scalable Offline Reinforcement Learning

ICLR 2026Rejected

Value-as-Return: A Two-Stage Framework to Align on the Optimal Score Function

ICLR 2026Rejected

Controllable Diffusion via Optimal Classifier Guidance

ICLR 2026Rejected

Test Time Scaling of Diffusion Model via Flow Matching Corrector

ICLR 2026Rejected

Computationally Efficient RL under Linear Bellman Completeness for Deterministic Dynamics

$Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training

ICML 2025Rejected

Accelerating RL for LLM Reasoning with Optimal Advantage Regression

NeurIPS 2025Poster

$Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training

NeurIPS 2025Poster

Value-Guided Search for Efficient Chain-of-Thought Reasoning

NeurIPS 2025Poster

Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds

ICLR 2025Poster

On Speeding Up Language Model Evaluation

ICLR 2025Poster

Avoiding exp(R) scaling in RLHF through Preference-based Exploration

NeurIPS 2025Poster

Diffusing States and Matching Scores: A New Framework for Imitation Learning

ICLR 2025Poster

Scaling Offline RL via Efficient and Expressive Shortcut Models

NeurIPS 2025Poster

Efficient Imitation under Misspecification

ICLR 2025Poster

Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF

ICLR 2025Poster

Correcting the Mythos of KL-Regularization: Direct Alignment without Overoptimization via Chi-Squared Preference Optimization

ICLR 2025Spotlight

A Reductions Approach to Risk-Sensitive Reinforcement Learning with Optimized Certainty Equivalents

ICML 2025Poster

Convergence Of Consistency Model With Multistep Sampling Under General Data Assumptions

ICLR 2025Rejected

On Orchestrating Personalized LLMs

ICLR 2025Rejected

Convergence of Consistency Model with Multistep Sampling under General Data Assumptions

ICML 2025Poster

合作者 (20)

Kianté Brantley

Jonathan Daniel Chang