Yujing Hu
~Yujing_Hu2
8
论文总数
4.0
年均投稿
平均评分
接收情况5/8
会议分布
ICLR
6
NeurIPS
2
发表论文 (8 篇)
20253 篇
4
Reinforcement Learning from Imperfect Corrective Actions and Proxy Rewards
ICLR 2025Poster
4
Improving Reward Models with Proximal Policy Exploration for Preference-Based Reinforcement Learning
NeurIPS 2025Poster
4
Outward Odyssey: Improving Reward Models with Proximal Policy Exploration for Preference-Based Reinforcement Learning
ICLR 2025Rejected
20245 篇
4
Bayesian Offline-to-Online Reinforcement Learning : A Realist Approach
ICLR 2024Rejected
4
Addressing Real-Time Fragmentary Interaction Control Problems via Muti-step Representation Reinforcement Learning
ICLR 2024Rejected
3
Unlock the Intermittent Control Ability of Model Free Reinforcement Learning
NeurIPS 2024Poster
4
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model
ICLR 2024Poster
4
Stylized Offline Reinforcement Learning: Extracting Diverse High-Quality Behaviors from Heterogeneous Datasets
ICLR 2024Poster