No rating data available
ICLR 2024
Characterizing Training Dynamics for Finite-width Deep Neural Networks
TL;DR
Non-asymptotic bounds on the weights and activations of finite-width neural networks, and results suggesting that gradients have low rank.
Abstract
Keywords
feature learning, training dynamics, high-dimensional statistics
Reviews and Discussion
No review records available