Jonathan Daniel Chang
~Jonathan_Daniel_Chang1
8
论文总数
4.0
年均投稿
平均评分
接收情况5/8
会议分布
ICLR
4
NeurIPS
3
ICML
1
发表论文 (8 篇)
20255 篇
4
$Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training
NeurIPS 2025Poster
5
Value-Guided Search for Efficient Chain-of-Thought Reasoning
NeurIPS 2025Poster
4
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
ICLR 2025Poster
3
$Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training
ICML 2025Rejected
4
Critique-out-Loud Reward Models
ICLR 2025Rejected