Rafael Rafailov
~Rafael_Rafailov1
16
论文总数
8.0
年均投稿
平均评分
接收情况10/16
会议分布
ICLR
9
COLM
3
NeurIPS
3
ICML
1
发表论文 (16 篇)
20257 篇
3
MALT: Improving Reasoning with Multi-Agent LLM Training
COLM 2025Poster
4
Generative Reward Models
ICLR 2025Rejected
4
Collapse or Thrive? Perils and Promises of Synthetic Data in a Self-Generating World
ICLR 2025Rejected
4
Collapse or Thrive: Perils and Promises of Synthetic Data in a Self-Generating World
ICML 2025Poster
5
Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents
ICLR 2025Rejected
4
MJ-Video: Benchmarking and Rewarding Video Generation with Fine-Grained Video Preference
NeurIPS 2025Spotlight
4
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?
ICLR 2025Rejected
20249 篇
3
From $r$ to $Q^*$: Your Language Model is Secretly a Q-Function
COLM 2024Poster
4
Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms
NeurIPS 2024Poster
4
D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning
ICLR 2024Rejected
4
Contrastive Preference Learning: Learning from Human Feedback without Reinforcement Learning
ICLR 2024Poster
4
An Emulator for Fine-tuning Large Language Models using Small Language Models
ICLR 2024Poster
4
Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference Labels
NeurIPS 2024Poster
3
Language Model Detectors Are Easily Optimized Against
ICLR 2024Poster
4
Offline RL for Online RL: Decoupled Policy Learning for Mitigating Exploration Bias
ICLR 2024Rejected
4
Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data
COLM 2024Poster