Paper
Hub
搜索
Toggle language
Yingxiang Yang
~Yingxiang_Yang2
5
论文总数
2.5
年均投稿
5.4
平均评分
接收情况
3
/
5
会议分布
ICLR
3
ICML
1
NeurIPS
1
发表论文 (5 篇)
2025
3 篇
5.2
5
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
ICLR 2025
Rejected
5.5
4
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
ICML 2025
Poster
4.3
4
How Can LLM Guide RL? A Value-Based Approach
ICLR 2025
withdrawn
2024
2 篇
5.5
4
Let Models Speak Ciphers: Multiagent Debate through Embeddings
ICLR 2024
Poster
6.3
3
Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer
NeurIPS 2024
Poster
合作者 (20)
ZW
Zhaoran Wang
5 篇
BL
Boyi Liu
4 篇
SZ
Shenao Zhang
4 篇
ZL
Zhihan Liu
4 篇
LC
Liyu Chen
2 篇
TS
Tao Sun
2 篇
YL
Yongfei Liu
2 篇
YZ
Yufeng Zhang
2 篇
查看全部 20 位合作者