Rishabh Joshi
~Rishabh_Joshi1
6
论文总数
3.0
年均投稿
平均评分
接收情况5/6
会议分布
ICLR
5
ICML
1
发表论文 (6 篇)
20255 篇
4
Reward-Guided Prompt Evolving in Reinforcement Learning for LLMs
ICML 2025Poster
3
Evolving Alignment via Asymmetric Self-Play
ICLR 2025Rejected
4
RRM: Robust Reward Model Training Mitigates Reward Hacking
ICLR 2025Poster
4
Learning from negative feedback, or positive feedback or both
ICLR 2025Spotlight
4
Building Math Agents with Multi-Turn Iterative Preference Learning
ICLR 2025Poster