Impact Index

83.7/100

Top 1.1%

Site-wide rank #679

Papers published: 25

Average rating: 5.3

Average output: 8.3 papers/year

Anca Dragan

Associate Professor @ University of California Berkeley · OpenReview

Training LLM Agents to Empower Humans

ICLR 2026 · Rejected

CTRL-Rec: Controlling Recommender Systems With Natural Language

ICLR 2026 · Rejected

Benchmarking Anomaly Detection for Large Language Model Alignment

ICLR 2026 · Rejected

Temporal Representation Alignment: Successor Features Enable Emergent Compositionality in Robot Instruction Following

NeurIPS 2025 · Poster

Correlated Proxies: A New Definition and Improved Mitigation for Reward Hacking

ICLR 2025 · Spotlight

Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning

ICLR 2025 · Poster

Context Steering: Controllable Personalization at Inference Time

ICLR 2025 · Poster

On Targeted Manipulation and Deception when Optimizing LLMs for User Feedback

ICLR 2025 · Poster

AssistanceZero: Scalably Solving Assistance Games

ICML 2025 · Poster

Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL

NeurIPS 2025 · Poster

Adversaries Can Misuse Combinations of Safe Models

ICML 2025 · Poster

Interactive Dialogue Agents via Reinforcement Learning with Hindsight Regenerations

ICLR 2025 · Rejected

Successor Representations Enable Emergent Compositional Instruction Following

ICLR 2025 · Rejected

Adversaries Can Misuse Combinations of Safe Models

ICLR 2025 · Rejected

Reliability-Aware Preference Learning for LLM Reward Models

ICLR 2025 · Withdrawn

Defining Deception in Decision Making

ICLR 2025 · Rejected

Zero-Shot Goal Dialogue via Reinforcement Learning on Imagined Conversations

ICLR 2025 · Rejected

Collaborators (20)

Cassidy Laidlaw