Paper
Hub
搜索
Toggle language
Alexander Bukharin
~Alexander_Bukharin1
6
论文总数
3.0
年均投稿
5.7
平均评分
接收情况
5
/
6
会议分布
NeurIPS
2
ICLR
2
ICML
1
COLM
1
发表论文 (6 篇)
2025
3 篇
6.6
4
Deep Reinforcement Learning from Hierarchical Preference Design
ICML 2025
Poster
6.5
4
Adversarial Training of Reward Models
COLM 2025
Poster
5.8
5
HelpSteer2-Preference: Complementing Ratings with Preferences
ICLR 2025
Poster
2024
3 篇
5.5
4
Robust Reinforcement Learning from Corrupted Human Feedback
NeurIPS 2024
Poster
4.0
4
Deep Reinforcement Learning from Weak Hierarchical Preference Feedback
ICLR 2024
Rejected
5.5
4
Adaptive Preference Scaling for Reinforcement Learning with Human Feedback
NeurIPS 2024
Poster
合作者 (20)
TZ
Tuo Zhao
4 篇
YL
Yixiao Li
3 篇
HJ
Haoming Jiang
2 篇
IH
Ilgee Hong
2 篇
ZL
Zichong Li
2 篇
OK
Oleksii Kuchaiev
2 篇
OD
Olivier Delalleau
2 篇
ZW
Zhilin Wang
2 篇
查看全部 20 位合作者