Kianté Brantley
~Kianté_Brantley2
12
论文总数
6.0
年均投稿
平均评分
接收情况9/12
会议分布
NeurIPS
5
ICLR
5
COLM
1
ICML
1
发表论文 (12 篇)
20259 篇
4
Accelerating RL for LLM Reasoning with Optimal Advantage Regression
NeurIPS 2025Poster
3
LLMs Are In-Context Bandit Reinforcement Learners
COLM 2025Poster
4
LLMs Are In-Context Reinforcement Learners
ICLR 2025withdrawn
5
Diffusing States and Matching Scores: A New Framework for Imitation Learning
ICLR 2025Poster
4
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
ICLR 2025Poster
5
Value-Guided Search for Efficient Chain-of-Thought Reasoning
NeurIPS 2025Poster
5
Scaling Offline RL via Efficient and Expressive Shortcut Models
NeurIPS 2025Poster
3
$Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training
ICML 2025Rejected
4
$Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training
NeurIPS 2025Poster