Adrià Garriga-Alonso
~Adrià_Garriga-Alonso1
8
论文总数
4.0
年均投稿
平均评分
接收情况5/8
会议分布
NeurIPS
6
ICLR
2
发表论文 (8 篇)
20255 篇
4
Among Us: A Sandbox for Measuring and Detecting Agentic Deception
NeurIPS 2025Spotlight
4
Feature Hedging: Correlated Features Break Narrow Sparse Autoencoders
NeurIPS 2025Rejected
4
Interpreting learned search: finding a transition model and value function in an RNN that plays Sokoban
NeurIPS 2025Rejected
4
Interpreting Emergent Planning in Model-Free Reinforcement Learning
ICLR 2025Oral
4
Planning in a recurrent neural network that plays Sokoban
ICLR 2025Rejected