Seungyub Han
~Seungyub_Han1
7
Total papers
3.5
Average submissions per year
Average rating
Acceptance: 4/7
Venue distribution
ICLR
3
ICML
2
NeurIPS
2
Papers (7)
2025 (6 papers)
-
Time to Truncate Trajectory: Stochastic Retrace for Multi-step Off-policy Reinforcement Learning
ICLR 2025 Withdrawn
3
Self-Alignment for Offline Safe Reinforcement Learning
ICLR 2025 Rejected
4
Bellman Unbiasedness: Toward Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation
ICML 2025 Poster
5
Bellman Unbiasedness: Toward Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation
ICLR 2025 Rejected
4
Policy-labeled Preference Learning: Is Preference Enough for RLHF?
ICML 2025 Spotlight
4
Pareto Optimal Risk-Agnostic Distributional Bandits with Heavy-Tail Rewards
NeurIPS 2025 Poster