Yunhua Zhou
~Yunhua_Zhou1
6
论文总数
6.0
年均投稿
平均评分
接收情况5/6
会议分布
ICLR
4
NeurIPS
2
发表论文 (6 篇)
20256 篇
4
Pre-Trained Policy Discriminators are General Reward Models
NeurIPS 2025Poster
4
Towards Universality: Studying Mechanistic Similarity Across Language Model Architectures
ICLR 2025Poster
4
BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments
ICLR 2025Poster
4
Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance
ICLR 2025Poster
4
GAOKAO-Eval: Does High Scores Truly Reflect Strong Capabilities in LLMs?
ICLR 2025Rejected
5
Implicit Reward as the Bridge: A Unified View of SFT and DPO Connections
NeurIPS 2025Poster