Lei Ying
~Lei_Ying1
6
论文总数
3.0
年均投稿
平均评分
接收情况3/6
会议分布
ICLR
4
NeurIPS
2
发表论文 (6 篇)
20254 篇
4
Zeroth-Order Policy Gradient for Reinforcement Learning from Human Feedback without Reward Inference
ICLR 2025Poster
4
Achieving $\tilde{\mathcal{O}}(1/N)$ Optimality Gap in Restless Bandits through Gaussian Approximation
NeurIPS 2025Spotlight
4
Near-Optimal Regret-Queue Length Tradeoff in Online Learning for Two-Sided Markets
NeurIPS 2025Poster
7
Rethinking Behavior Regularization in Offline Safe RL: A Region-Based Approach
ICLR 2025Rejected