Yi Wu
~Yi_Wu1
13
论文总数
6.5
年均投稿
平均评分
接收情况7/13
会议分布
ICLR
7
NeurIPS
5
ICML
1
发表论文 (13 篇)
20259 篇
4
Reasoning Is Not a Race: When Stopping Early Beats Going Deeper
NeurIPS 2025Poster
4
Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps
NeurIPS 2025Rejected
4
Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization
ICML 2025Poster
4
FlightBench: Benchmarking Learning-based Methods for Ego-vision-based Quadrotors Navigation
ICLR 2025withdrawn
4
What Can RL Bring to VLA Generalization? An Empirical Study
NeurIPS 2025Poster
4
Few-shot In-context Preference Learning using Large Language Models
ICLR 2025Rejected
4
How Far Are We from Optimal Reasoning Efficiency?
NeurIPS 2025Poster
6
On Designing Effective RL Reward at Training Time for LLM Reasoning
ICLR 2025Rejected
4
AREAL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning
NeurIPS 2025Poster
20244 篇
5
Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game
ICLR 2024Rejected
4
HiLoRL: A Hierarchical Logical Model for Learning Composite Tasks
ICLR 2024Rejected
4
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores
ICLR 2024Poster
4
Stylized Offline Reinforcement Learning: Extracting Diverse High-Quality Behaviors from Heterogeneous Datasets
ICLR 2024Poster