Scott Niekum
~Scott_Niekum1
14
论文总数
7.0
年均投稿
平均评分
接收情况11/14
会议分布
ICLR
8
NeurIPS
4
COLM
2
发表论文 (14 篇)
20256 篇
4
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
ICLR 2025Poster
4
Influencing Humans to Conform to Preference Models for RLHF
ICLR 2025Rejected
4
Pareto-Optimal Learning from Preferences with Hidden Context
ICLR 2025Rejected
4
Pairwise or Pointwise? Evaluating Feedback Protocols for Bias in LLM-Based Evaluation
COLM 2025Poster
4
Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning
ICLR 2025Poster
4
RLZero: Direct Policy Inference from Language Without In-Domain Supervision
NeurIPS 2025Poster
20248 篇
4
Predicting Future Actions of Reinforcement Learning Agents
NeurIPS 2024Poster
3
D2PO: Discriminator-Guided DPO with Response Evaluation Models
COLM 2024Poster
4
Dual RL: Unification and New Methods for Reinforcement and Imitation Learning
ICLR 2024Spotlight
4
Hierarchical Empowerment: Towards Tractable Empowerment-Based Skill Learning
ICLR 2024Rejected
4
Contrastive Preference Learning: Learning from Human Feedback without Reinforcement Learning
ICLR 2024Poster
4
Score Models for Offline Goal-Conditioned Reinforcement Learning
ICLR 2024Poster
4
SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions
NeurIPS 2024Poster
4
Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms
NeurIPS 2024Poster