Alec Koppel
~Alec_Koppel1
8
论文总数
4.0
年均投稿
平均评分
接收情况5/8
会议分布
ICLR
7
NeurIPS
1
发表论文 (8 篇)
20254 篇
4
GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-Time Alignment
ICLR 2025Poster
4
Learning in Stackelberg Mean Field Games: A Non-Asymptotic Analysis
NeurIPS 2025Poster
4
SAIL: Self-improving Efficient Online Alignment of Large Language Models
ICLR 2025Rejected
4
Collab: Controlled Decoding using Mixture of Agents for LLM Alignment
ICLR 2025Poster
20244 篇
4
PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback
ICLR 2024Poster
4
Limited-Memory Greedy Quasi-Newton Method with Non-asymptotic Superlinear Convergence Rate
ICLR 2024Rejected
4
Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning
ICLR 2024withdrawn
4
Efficient Inverse Multiagent Learning
ICLR 2024Spotlight