影响力指数

98.65/100

前 0.1%

全站排名 #30

发表论文55 篇

平均评分5.7

年均产出18.3 篇/年

Jakob Nicolaus Foerster

Associate Professor@University of Oxford, University of Oxford·OpenReview

研究方向

open-ended learning · unsupervised environment design · Multi-agent RL · Deep RL

StochasTok: Improving Fine-Grained Subword Understanding in LLMs

ICLR 2026Poster

High Accuracy, Less Talk (HALT): Reliable LLMs through Capability-Aligned Finetuning

ICLR 2026Poster

The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind

ICLR 2026Rejected

Programming by Backprop: An Instruction is Worth 100 Examples When Finetuning LLMs

ICLR 2026Poster

DéjàQ: Open-Ended Evolution of Diverse, Learnable and Verifiable Problems

ICLR 2026Rejected

HyperVLA: Efficient Inference in Vision-Language-Action Models via Hypernetworks

ICLR 2026Rejected

SOReL and TOReL: Two Methods for Fully Offline Reinforcement Learning

ICLR 2026Rejected

Learning When to Plan: Efficiently Allocating Test-Time Compute for LLM Agents

ICLR 2026Rejected

A Clean Slate for Offline Reinforcement Learning

NeurIPS 2025Oral

Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks

AgentBreeder: Mitigating the AI Safety Risks of Multi-Agent Scaffolds via Self-Improvement

NeurIPS 2025Spotlight

Improving Regret Approximation for Unsupervised Dynamic Environment Generation

NeurIPS 2025Poster

Simplifying Deep Temporal Difference Learning

ICLR 2025Spotlight

AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-bench

NeurIPS 2025Spotlight

Meta-Learning Objectives for Preference Optimization

NeurIPS 2025Poster

LILO: Learning to Reason at the Frontier of Learnability

NeurIPS 2025Poster

Imagined Autocurricula

NeurIPS 2025Poster

Expected Return Symmetries

ICLR 2025Poster

LOB-Bench: Benchmarking Generative AI for Finance - an Application to Limit Order Book Data

ICML 2025Poster

BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games

ICLR 2025Poster

The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind

ICLR 2025Rejected

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

COLM 2025Poster

TICKing All the Boxes: Generated Checklists Improve LLM Evaluation and Generation

ICLR 2025Rejected

Towards Learning to Reason at Pre-Training Scale

ICLR 2025Rejected

Ad-Hoc Human-AI Coordination Challenge

ICML 2025Spotlight

ADIOS: Antibody Development via Opponent Shaping

ICML 2025Poster

OvercookedV2: Rethinking Overcooked for Zero-Shot Coordination

ICLR 2025Poster

The Complexity Dynamics of Grokking

ICLR 2025Rejected

Learning Loss Landscapes in Preference Optimization

ICLR 2025Rejected

CURATe: Benchmarking Personalised Alignment of Conversational AI Assistants

ICLR 2025Rejected

Ad-Hoc Human-AI Coordination Challenge

ICLR 2025Rejected

Beyond the Boundaries of Proximal Policy Optimization

ICLR 2025Rejected

Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources

ICLR 2025Rejected

Investigating Online RL in World Models

ICLR 2025Rejected

LOB-Bench: Benchmarking Generative AI for Finance - with an Application to Limit Order Book Markets

ICLR 2025Rejected

Do Symbolic or Black-Box Representations Generalise Better In Learned Optimisation?

ICLR 2025Rejected

合作者 (20)

Tim Rocktäschel

Matthew Thomas Jackson

Shimon Whiteson

博士导师6 篇