影响力指数

74.44/100

前 2%

全站排名 #1,263

发表论文11 篇

平均评分6.0

年均产出3.7 篇/年

Heyang Zhao

PhD student@Computer Science Department, University of California, Los Angeles·OpenReview

Breaking the Total Variance Barrier: Sharp Sample Complexity for Linear Heteroscedastic Bandits with Fixed Action Set

ICLR 2026Poster

Towards a Sharp Analysis of Offline Policy Learning for $f$-Divergence-Regularized Contextual Bandits

ICLR 2026Poster

Best-of-Majority: Minimax-Optimal Strategy for Pass@k Inference Scaling

ICLR 2026Poster

Logarithmic Regret for Online KL-Regularized Reinforcement Learning

ICML 2025Poster

Sharp Analysis for KL-Regularized Contextual Bandits and RLHF

NeurIPS 2025Poster

Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration

ICLR 2025Poster

Sharp Analysis for KL-Regularized Contextual Bandits and RLHF

ICLR 2025Rejected

合作者 (19)