Impact Index

72.76/100

Top 2.2%

Site-wide rank #1,418

Papers published: 19

Average rating: 5.7

Average output: 6.3 papers/year

Rafael Rafailov

PhD student @ Stanford University · United States · OpenReview

Research Areas

Reinforcement Learning · Offline Reinforcement Learning · Model-Based Reinforcement Learning · Imitation Learning

Just Enough Thinking: Efficient Reasoning with Adaptive Length Penalties Reinforcement Learning

ICLR 2026 · Rejected

Scaling Laws for Generative Reward Models

ICLR 2026 · Rejected

Self-Guided Thinking: Enabling LLMs to Decide When to Think

ICLR 2026 · Withdrawn

MJ-Video: Benchmarking and Rewarding Video Generation with Fine-Grained Video Preference

NeurIPS 2025 · Spotlight

MALT: Improving Reasoning with Multi-Agent LLM Training

COLM 2025 · Poster

MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?

ICLR 2025 · Rejected

Collapse or Thrive? Perils and Promises of Synthetic Data in a Self-Generating World

ICLR 2025 · Rejected

Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents

ICLR 2025 · Rejected

Generative Reward Models

ICLR 2025 · Rejected

Collapse or Thrive: Perils and Promises of Synthetic Data in a Self-Generating World

ICML 2025 · Poster

Collaborators (20)

PhD advisor · 12 papers

David L. Donoho

Matthias Gerstgrasser

Rylan Schaeffer