Paper
Hub
搜索
Toggle language
Hanyang Zhao
~Hanyang_Zhao1
3
论文总数
3.0
年均投稿
6.2
平均评分
接收情况
3
/
3
会议分布
ICLR
2
ICML
1
发表论文 (3 篇)
2025
3 篇
6.1
4
Score as Action: Fine Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
ICML 2025
Poster
6.0
4
RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization
ICLR 2025
Poster
6.5
4
MallowsPO: Fine-Tune Your LLM with Preference Dispersions
ICLR 2025
Poster
合作者 (9)
DY
David Yao
3 篇
WT
Wenpin Tang
3 篇
HC
Haoxian Chen
2 篇
HL
Henry Lam
1 篇
AD
Anirban Das
1 篇
GW
Genta Indra Winata
1 篇
SS
Sambit Sahu
1 篇
SZ
Shi-Xiong Zhang
1 篇
查看全部 9 位合作者