Paper
Hub
搜索
Toggle language
暂无评分数据
ICLR 2024
The RL Perceptron: Generalisation Dynamics of Policy Learning in High Dimensions
Nishil Patel
,
Sebastian Lee
,
Stefano Sarao Mannelli
,
Sebastian Goldt
,
Andrew M Saxe
OpenReview
PDF
提交: 2023-09-22
更新: 2024-03-26
摘要
关键词
Statistical physics of learning
generalisation models
reinforcement learning
Reinforce Policy gradient
评审与讨论
暂无评审记录