Matthieu Geist
~Matthieu_Geist1
12
论文总数
6.0
年均投稿
平均评分
接收情况9/12
会议分布
ICLR
6
NeurIPS
6
发表论文 (12 篇)
20255 篇
5
Solving robust MDPs as a sequence of static RL problems
ICLR 2025Rejected
4
Self-Improving Robust Preference Optimization
ICLR 2025Poster
4
Understanding Likelihood Over-optimisation in Direct Alignment Algorithms
ICLR 2025Rejected
4
Learning Equilibria from Data: Provably Efficient Multi-Agent Imitation Learning
NeurIPS 2025Poster
3
ShiQ: Bringing back Bellman to LLMs
NeurIPS 2025Poster
20247 篇
4
Revisiting the Static Model in Robust Reinforcement Learning
ICLR 2024Rejected
4
Periodic agent-state based Q-learning for POMDPs
NeurIPS 2024Poster
4
Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.
ICLR 2024Poster
4
Time-Constrained Robust MDPs
NeurIPS 2024Poster
4
Near-Optimal Distributionally Robust Reinforcement Learning with General $L_p$ Norms
NeurIPS 2024Poster
4
On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes
ICLR 2024Poster
5
Imitating Language via Scalable Inverse Reinforcement Learning
NeurIPS 2024Poster