Paper
Hub
搜索
Toggle language
Jincheng Mei
~Jincheng_Mei1
3
论文总数
1.5
年均投稿
6.0
平均评分
接收情况
3
/
3
会议分布
NeurIPS
2
ICLR
1
发表论文 (3 篇)
2025
2 篇
5.5
4
Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF
ICLR 2025
Poster
6.8
4
REINFORCE Converges to Optimal Policies with Any Learning Rate
NeurIPS 2025
Poster
2024
1 篇
5.8
5
Small steps no more: Global convergence of stochastic gradient bandits for arbitrary learning rates
NeurIPS 2024
Poster
合作者 (14)
BD
Bo Dai
3 篇
DS
Dale Schuurmans
3 篇
CS
Csaba Szepesvari
2 篇
AA
Alekh Agarwal
1 篇
AR
Anant Raj
1 篇
SV
Sharan Vaswani
1 篇
HD
Hanjun Dai
1 篇
KG
Katayoon Goshvadi
1 篇
查看全部 14 位合作者