Heyang Zhao
~Heyang_Zhao1
8
Total papers
4.0
Submissions per year
Average rating
Acceptance: 6/8
Venue distribution
ICLR
5
NeurIPS
2
ICML
1
Published papers (8)
2025 (4 papers)
5
Logarithmic Regret for Online KL-Regularized Reinforcement Learning
ICML 2025, Poster
4
Sharp Analysis for KL-Regularized Contextual Bandits and RLHF
ICLR 2025, Rejected
4
Sharp Analysis for KL-Regularized Contextual Bandits and RLHF
NeurIPS 2025, Poster
4
Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration
ICLR 2025, Poster
2024 (4 papers)
3
A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation
NeurIPS 2024, Poster
4
A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation
ICLR 2024, Rejected
4
Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning
ICLR 2024, Poster
4
Variance-aware Regret Bounds for Stochastic Contextual Dueling Bandits
ICLR 2024, Poster