Nicolas Le Roux
~Nicolas_Le_Roux2
5
论文总数
2.5
年均投稿
平均评分
接收情况3/5
会议分布
NeurIPS
2
ICLR
2
ICML
1
发表论文 (5 篇)
20254 篇
4
Tapered Off-Policy REINFORCE - Stable and efficient reinforcement learning for large language models
NeurIPS 2025Poster
4
fPLSA: Learning Semantic Structures in Document Collections Using Foundation Models
ICLR 2025Rejected
4
VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment
ICLR 2025Rejected
4
VinePPO: Refining Credit Assignment in RL Training of LLMs
ICML 2025Poster