Kunhao Zheng
~Kunhao_Zheng1
Total papers: 7
Average submissions per year: 7.0
Average rating
Accepted: 6/7
Venue distribution
ICML: 4
ICLR: 3
Published papers (7)
2025: 7 papers
3
What Makes Large Language Models Reason in (Multi-Turn) Code Generation?
ICLR 2025, Poster
4
RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning
ICML 2025, Spotlight
4
RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning
ICLR 2025, Rejected
6
Optimizing Language Models for Inference Time Objectives using Reinforcement Learning
ICML 2025, Poster
7
The KoLMogorov Test: Compression by Code Generation
ICLR 2025, Poster
4
Improving Diversity in Language Models: When Temperature Fails, Change the Loss
ICML 2025, Poster
4
PILAF: Optimal Human Preference Sampling for Reward Modeling
ICML 2025, Poster