Aldo Pacchiano
~Aldo_Pacchiano1
12
论文总数
6.0
年均投稿
平均评分
接收情况10/12
会议分布
ICLR
6
ICML
3
NeurIPS
2
COLM
1
发表论文 (12 篇)
202510 篇
4
Second Order Bounds for Contextual Bandits with Function Approximation
ICLR 2025Poster
4
Adaptive Exploration for Multi-Reward Multi-Policy Evaluation
ICML 2025Poster
4
Sample Efficient Multiple-policy Evaluation in Reinforcement Learning
ICLR 2025Rejected
3
Principled Fine-tuning of LLMs from User-Edits: A Medley of Preference, Supervision, and Reward
NeurIPS 2025Poster
4
Feasible Action Search for Bandit Linear Programs via Thompson Sampling
ICML 2025Poster
4
Multiple-policy Evaluation via Density Estimation
ICML 2025Poster
4
High Probability Contextual Bandits for Optimal Dosage Selection
ICLR 2025Rejected
3
A Theoretical Framework for Partially-Observed Reward States in RLHF
ICLR 2025Poster
6
ORSO: Accelerating Reward Design via Online Reward Selection and Policy Optimization
ICLR 2025Poster
4
Language Model Personalization via Reward Factorization
COLM 2025Poster