Yunhao Tang
~Yunhao_Tang1
7
Total papers
3.5
Average submissions per year
Average rating
Acceptance: 6/7
Venue distribution
NeurIPS
4
ICML
2
ICLR
1
Published papers (7)
2025: 4 papers
6
Optimizing Language Models for Inference Time Objectives using Reinforcement Learning
ICML 2025 Poster
4
Beyond Verifiable Rewards: Scaling Reinforcement Learning in Language Models to Unverifiable Data
NeurIPS 2025 Poster
4
Categorical Distributional Reinforcement Learning with Kullback-Leibler Divergence: Convergence and Asymptotics
ICML 2025 Poster
5
Asymmetric REINFORCE for off-Policy Reinforcement Learning: Balancing positive and negative rewards
NeurIPS 2025 Poster