Impact Index

38.56/100

Top 15.5%

Site-wide rank #9,960

Papers published: 5

Average rating: 5.8

Average annual output: 2.5 papers/year

Akbir Khan

PhD student @ University College London, University of London · OpenReview

Research Areas

Multi-agent Reinforcement Learning · AI Safety · Large Language Models

Programming by Backprop: An Instruction is Worth 100 Examples When Finetuning LLMs

ICLR 2026 Poster

Adaptive Deployment of Untrusted LLMs Reduces Distributed Threats

ICLR 2025 Poster

Language Models Learn to Mislead Humans via RLHF

ICLR 2025 Poster

BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games

ICLR 2025 Poster

Shell Games: Control Protocols for Adversarial AI Agents

ICLR 2025 Withdrawn

Collaborators (20)

Jakob Nicolaus Foerster

Tim Rocktäschel

PhD advisor · 2 papers