Paper Hub
Viraj Mehta
~Viraj_Mehta1
Total papers: 3
Submissions per year: 1.5
Average rating: 5.6
Accepted: 2 / 3
Venue distribution
ICLR: 1
COLM: 1
NeurIPS: 1
Papers (3)
2025 (1 paper)
6.3
3
Sample Efficient Preference Alignment in LLMs via Active Exploration
COLM 2025
Poster
2024 (2 papers)
4.8
5
Sample Efficient Reinforcement Learning from Human Feedback via Active Exploration
ICLR 2024
Rejected
5.6
5
Group Robust Preference Optimization in Reward-free RLHF
NeurIPS 2024
Poster
Co-authors (14)
Ilija Bogunovic: 3 papers
Jeff Schneider: 2 papers
Ojash Neopane: 2 papers
Vikramjeet Das: 2 papers
Willie Neiswanger: 2 papers
Yijia Dai: 2 papers
Haitham Bou Ammar: 1 paper
Iason Chaimalas: 1 paper