Zhaolin Gao
~Zhaolin_Gao1
7
Total papers
3.5
Submissions per year
Average rating
Acceptance: 6/7
Venue distribution
NeurIPS
5
ICLR
1
ICML
1
Published papers (7)
2025: 6 papers
4
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
ICLR 2025 Poster
4
Pre-trained Large Language Models Learn to Predict Hidden Markov Models In-context
NeurIPS 2025 Poster
4
Accelerating RL for LLM Reasoning with Optimal Advantage Regression
NeurIPS 2025 Poster
4
$Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training
NeurIPS 2025 Poster
5
Value-Guided Search for Efficient Chain-of-Thought Reasoning
NeurIPS 2025 Poster
3
$Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training
ICML 2025 Rejected