Remi Munos
~Remi_Munos1
7
论文总数
3.5
年均投稿
平均评分
接收情况7/7
会议分布
NeurIPS
5
ICML
2
发表论文 (7 篇)
20254 篇
4
Beyond Verifiable Rewards: Scaling Reinforcement Learning in Language Models to Unverifiable Data
NeurIPS 2025Poster
4
Temporal Difference Flows
ICML 2025Oral
6
Optimizing Language Models for Inference Time Objectives using Reinforcement Learning
ICML 2025Poster
5
Asymmetric REINFORCE for off-Policy Reinforcement Learning: Balancing positive and negative rewards
NeurIPS 2025Poster