Fengshuo Bai
~Fengshuo_Bai1
9
论文总数
4.5
年均投稿
平均评分
接收情况3/9
会议分布
ICLR
7
NeurIPS
2
发表论文 (9 篇)
20254 篇
4
STAR: Efficient Preference-based Reinforcement Learning via Dual Regularization
NeurIPS 2025Poster
4
DexFlyWheel: A Scalable and Self-improving Data Generation Framework for Dexterous Manipulation
NeurIPS 2025Spotlight
4
Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs
ICLR 2025Poster
4
Iterative Training of Language Models with Opponent Modeling for Red Teaming Data Generation
ICLR 2025Rejected
20245 篇
4
SEER: Towards Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation
ICLR 2024Rejected
4
BATTLE: Towards Behavior-oriented Adversarial Attacks against Deep Reinforcement Learning
ICLR 2024Rejected
3
Measuring Value Understanding in Language Models through Discriminator-Critique Gap
ICLR 2024withdrawn
5
$\beta$-DQN: Diverse Exploration via Learning a Behavior Function
ICLR 2024Rejected
4
Zero-shot Cross-task Preference Alignment for Offline RL via Optimal Transport
ICLR 2024Rejected