Xiaojun Quan
~Xiaojun_Quan1
10
论文总数
5.0
年均投稿
平均评分
接收情况6/10
会议分布
ICLR
7
NeurIPS
2
ICML
1
发表论文 (10 篇)
20258 篇
4
Advantage-Guided Distillation for Preference Alignment in Small Language Models
ICLR 2025Spotlight
5
Let Large Language Models Find the Data to Train Themselves
ICLR 2025withdrawn
3
FuseChat: Knowledge Fusion of Chat Models
ICLR 2025withdrawn
4
Lookahead Routing for Large Language Models
NeurIPS 2025Poster
3
Weighted-Reward Preference Optimization for Implicit Model Fusion
ICLR 2025Poster
4
Discriminative Policy Optimization for Token-Level Reward Models
ICML 2025Poster
4
Probabilistic Token Alignment for Large Language Model Fusion
ICLR 2025Rejected
4
Probabilistic Token Alignment for Large Language Model Fusion
NeurIPS 2025Poster