Paper
Hub
搜索
Toggle language
Milad Aghajohari
~Milad_Aghajohari1
6
论文总数
3.0
年均投稿
5.5
平均评分
接收情况
3
/
6
会议分布
ICLR
5
ICML
1
发表论文 (6 篇)
2025
3 篇
5.0
4
VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment
ICLR 2025
Rejected
6.6
4
VinePPO: Refining Credit Assignment in RL Training of LLMs
ICML 2025
Poster
7.5
4
Advantage Alignment Algorithms
ICLR 2025
Oral
2024
3 篇
4.8
4
Best Response Shaping
ICLR 2024
Rejected
4.0
5
LOQA: Learning with Opponent Q-Learning Awareness
ICLR 2024
Poster
5.2
6
Meta-Value Learning: a General Framework for Learning with Learning Awareness
ICLR 2024
Rejected
合作者 (12)
AC
Aaron Courville
6 篇
TC
Tim Cooijmans
4 篇
JD
Juan Agustin Duque
3 篇
AS
Alessandro Sordoni
2 篇
AK
Amirhossein Kazemnejad
2 篇
EP
Eva Portelance
2 篇
NR
Nicolas Le Roux
2 篇
SR
Siva Reddy
2 篇
查看全部 12 位合作者