Junkang Wu
~Junkang_Wu1
8
论文总数
4.0
年均投稿
平均评分
接收情况7/8
会议分布
ICML
4
NeurIPS
2
ICLR
2
发表论文 (8 篇)
20257 篇
5
RePO: Understanding Preference Learning Through ReLU-Based Optimization
NeurIPS 2025Poster
5
AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization
ICML 2025Poster
5
Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization
ICLR 2025Poster
5
$\alpha$-DPO: Adaptive Reward Margin is What Direct Preference Optimization Needs
ICLR 2025Rejected
3
DAMA: Data- and Model-aware Alignment of Multi-modal LLMs
ICML 2025Poster
6
Larger or Smaller Reward Margins to Select Preferences for LLM Alignment?
ICML 2025Poster
4
MM-RLHF: The Next Step Forward in Multimodal LLM Alignment
ICML 2025Poster