Impact Index

52.05/100

Top 7.6%

Site-wide rank: #4,865

Papers published: 7

Average rating: 5.5

Average annual output: 3.5 papers/year

Yunhao Tang

Research Scientist @ Google · OpenReview

Research Areas

Reinforcement Learning

Beyond Verifiable Rewards: Scaling Reinforcement Learning in Language Models to Unverifiable Data

NeurIPS 2025 Poster

Asymmetric REINFORCE for off-Policy Reinforcement Learning: Balancing positive and negative rewards

NeurIPS 2025 Poster

Categorical Distributional Reinforcement Learning with Kullback-Leibler Divergence: Convergence and Asymptotics

ICML 2025 Poster

Optimizing Language Models for Inference Time Objectives using Reinforcement Learning

ICML 2025 Poster

On scalable oversight with weak LLMs judging strong LLMs

NeurIPS 2024 Poster

A Distributional Analogue to the Successor Representation

ICLR 2024 Rejected

Near-Minimax-Optimal Distributional Reinforcement Learning with a Generative Model

NeurIPS 2024 Poster

Collaborators (20)

Amir-massoud Farahmand

Murat A Erdogdu

Li Kevin Wenliang