Lifan Yuan
~Lifan_Yuan1
10
论文总数
5.0
年均投稿
平均评分
接收情况8/10
会议分布
ICLR
5
NeurIPS
4
ICML
1
发表论文 (10 篇)
20256 篇
5
Free Process Rewards without Process Labels
ICML 2025Poster
4
Advancing LLM Reasoning Generalists with Preference Trees
ICLR 2025Poster
4
Reinforcement Learning Finetunes Small Subnetworks in Large Language Models
NeurIPS 2025Poster
4
The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning
NeurIPS 2025Poster
5
Improving Zero-Shot Generalization of Instruction Tuning by Data Arrangement
ICLR 2025withdrawn
4
TTRL: Test-Time Reinforcement Learning
NeurIPS 2025Poster
20244 篇
3
CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets
ICLR 2024Poster
4
UltraFeedback: Boosting Language Models with High-quality Feedback
ICLR 2024Rejected
5
Noise Contrastive Alignment of Language Models with Explicit Rewards
NeurIPS 2024Poster
4
MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback
ICLR 2024Poster