Weihao Zeng
~Weihao_Zeng2
5
论文总数
2.5
年均投稿
平均评分
接收情况5/5
会议分布
ICLR
4
COLM
1
发表论文 (5 篇)
20254 篇
3
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
ICLR 2025Poster
4
SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild
COLM 2025Poster
4
AgentRefine: Enhancing Agent Generalization through Refinement Tuning
ICLR 2025Poster
4
CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery
ICLR 2025Poster