Miao Lu
~Miao_Lu3
Total papers: 4
Submissions per year: 2.0
Average rating
Acceptance: 4/4
Venue distribution
NeurIPS: 2
ICLR: 2
Published papers (4)
2024 (3 papers)
4
Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithms
NeurIPS 2024 Poster
3
Benign Oscillation of Stochastic Gradient Descent with Large Learning Rate
ICLR 2024 Poster
3
Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer
NeurIPS 2024 Poster