Impact index

74.14/100

Top 2%

Site-wide rank: #1,296

Papers published: 14

Average rating: 6.0

Average output: 4.7 papers/year

Lifan Yuan

PhD student @ University of Illinois at Urbana-Champaign · USA · OpenReview

How Far Can Unsupervised RLVR Scale LLM Training?

ICLR 2026 Poster

From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones

ICLR 2026 Poster

Process Reinforcement through Implicit Rewards

ICLR 2026 Rejected

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

ICLR 2026 Withdrawn

The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning

NeurIPS 2025 Poster

Reinforcement Learning Finetunes Small Subnetworks in Large Language Models

NeurIPS 2025 Poster

TTRL: Test-Time Reinforcement Learning

NeurIPS 2025 Poster

Advancing LLM Reasoning Generalists with Preference Trees

ICLR 2025 Poster

Free Process Rewards without Process Labels

ICML 2025 Poster

Improving Zero-Shot Generalization of Instruction Tuning by Data Arrangement

ICLR 2025 Withdrawn

Collaborators (20)