Pradeep Varakantham
~Pradeep_Varakantham1
14
论文总数
7.0
年均投稿
平均评分
接收情况8/14
会议分布
ICLR
7
NeurIPS
7
发表论文 (14 篇)
20258 篇
4
Offline Safe Policy Optimization From Human Feedback
ICLR 2025withdrawn
4
On Generalization Across Environments In Multi-Objective Reinforcement Learning
ICLR 2025Poster
4
No Experts, No Problem: Avoidance Learning from Bad Demonstrations
NeurIPS 2025Poster
4
Semantic Loss Guided Data Efficient Supervised Fine Tuning for Safe Responses in LLMs
ICLR 2025Poster
4
On Minimizing Adversarial Counterfactual Error in Adversarial Reinforcement Learning
ICLR 2025Poster
4
UNIQ: Offline Inverse Q-learning for Avoiding Undesirable Demonstrations
ICLR 2025Rejected
4
From Machine to Human Learning: Towards Warm-Starting Teacher Algorithms with Reinforcement Learning Agents
NeurIPS 2025Rejected
4
Bootstrapping Language Models with DPO Implicit Rewards
ICLR 2025Poster
20246 篇
4
Enhancing the Hierarchical Environment Design via Generative Trajectory Modeling
NeurIPS 2024Rejected
4
Safety through feedback in Constrained RL
NeurIPS 2024Poster
5
Marginal Benefit Induced Unsupervised Environment Design
ICLR 2024withdrawn
4
SPRINQL: Sub-optimal Demonstrations driven Offline Imitation Learning
NeurIPS 2024Poster
5
EduQate: Generating Adaptive Curricula through RMABs in Education Settings
NeurIPS 2024Rejected
4
Improving Environment Novelty Quantification for Effective Unsupervised Environment Design
NeurIPS 2024Oral