Antonio Orvieto
~Antonio_Orvieto3
18
Total papers
9.0
Submissions per year
Average rating
Acceptance: 13/18
Venue distribution
NeurIPS
11
ICLR
5
ICML
2
Papers (18)
2025: 11 papers
4
In Search of Adam’s Secret Sauce
NeurIPS 2025 Oral
4
When recalling in-context, Transformers are not SSMs
NeurIPS 2025 Rejected
4
Geometric Inductive Biases of Deep Networks: The Role of Data and Architecture
ICLR 2025 Spotlight
4
When, Where and Why to Average Weights?
ICML 2025 Poster
4
Enhancing Optimizer Stability: Momentum Adaptation of The NGN Step-size
NeurIPS 2025 Poster
4
NIMBA: Towards Robust and Principled Processing of Point Clouds With SSMs
ICLR 2025 Rejected
4
Enhancing Optimizer Stability: Momentum Adaptation of NGN Step-size
ICLR 2025 Rejected
4
Generalized Linear Mode Connectivity for Transformers
NeurIPS 2025 Oral
3
Fixed-Point RNNs: Interpolating from Diagonal to Dense
NeurIPS 2025 Spotlight
3
Generalized Interpolating Discrete Diffusion
ICML 2025 Poster
4
Adaptive Methods through the Lens of SDEs: Theoretical Insights on the Role of Noise
ICLR 2025 Poster
2024: 7 papers
4
Theoretical Foundations of Deep Selective State-Space Models
NeurIPS 2024 Poster
5
Recurrent Neural Networks: Vanishing and Exploding Gradients Are Not the End of the Story
NeurIPS 2024 Poster
4
Recurrent Distance-Encoding Neural Networks for Graph Representation Learning
ICLR 2024 Rejected
4
Loss Landscape Characterization of Neural Networks without Over-Parametrization
NeurIPS 2024 Poster
4
Super Consistency of Neural Network Landscapes and Learning Rate Transfer
NeurIPS 2024 Poster
4
Understanding the Differences in Foundation Models: Attention, State Space Models, and Recurrent Neural Networks
NeurIPS 2024 Poster
3
SDEs for Adaptive Methods: The Role of Noise
NeurIPS 2024 Rejected