影响力指数

39.23/100

前 15%

全站排名 #9,636

发表论文5 篇

平均评分5.4

年均产出2.5 篇/年

Nicolas Le Roux

Researcher@Microsoft·OpenReview

研究方向

deep learning · large scale learning · convex optimization · reinforcement learning

Tapered Off-Policy REINFORCE - Stable and efficient reinforcement learning for large language models

NeurIPS 2025Poster

VinePPO: Refining Credit Assignment in RL Training of LLMs

ICML 2025Poster

VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment

ICLR 2025Rejected

fPLSA: Learning Semantic Structures in Document Collections Using Foundation Models

ICLR 2025Rejected

Improving Context-Aware Preference Modeling for Language Models

NeurIPS 2024Poster

合作者 (19)

Alessandro Sordoni

Aaron Courville

Amirhossein Kazemnejad

Milad Aghajohari