Paper Hub
Rong Bao
~Rong_Bao1
Total papers: 3
Submissions per year: 1.5
Average rating: 6.0
Accepted: 3 / 3
Venue distribution:
NeurIPS: 2
ICLR: 1
Published papers (3)
2025 (2 papers)
6.4
5
Implicit Reward as the Bridge: A Unified View of SFT and DPO Connections
NeurIPS 2025
Poster
6.0
3
RMB: Comprehensively benchmarking reward models in LLM alignment
ICLR 2025
Poster
2024 (1 paper)
5.5
4
InfoRM: Mitigating Reward Hacking in RLHF via Information-Theoretic Reward Modeling
NeurIPS 2024
Poster
Collaborators (20)
Dacheng Tao: 1 paper
Lefei Zhang: 1 paper
Liang Ding: 1 paper
Sen Zhang: 1 paper
Yuchun Miao: 1 paper
Binghai Wang: 1 paper
Enyu Zhou: 1 paper
Guodong Zheng: 1 paper