Arian Hosseini
~Arian_Hosseini1
Total papers: 8
Submissions per year: 4.0
Average rating
Acceptance: 7/8
Venue distribution
COLM: 4
ICLR: 4
Published papers (8)
2025: 6 papers
4
Not All LLM Reasoners Are Created Equal
ICLR 2025 (Rejected)
4
Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling
ICLR 2025 (Poster)
3
Generative Verifiers: Reward Modeling as Next-Token Prediction
ICLR 2025 (Poster)
3
When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoning
COLM 2025 (Poster)
4
Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers
COLM 2025 (Poster)
4
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models
ICLR 2025 (Poster)