Shuang Qiu
~Shuang_Qiu2
Total papers: 6
Submissions per year: 6.0
Average rating
Accepted: 5/6
Venue distribution
ICLR: 3
NeurIPS: 2
ICML: 1
Published papers (6)
2025 (6 papers)
3
ROPO: Robust Preference Optimization for Large Language Models
ICLR 2025, Rejected
3
Online Preference Alignment for Language Models via Count-based Exploration
ICLR 2025, Spotlight
4
Tackling Data Corruption in Offline Reinforcement Learning via Sequence Modeling
ICLR 2025, Poster
4
ROPO: Robust Preference Optimization for Large Language Models
ICML 2025, Poster
4
Revisiting Multi-Agent World Modeling from a Diffusion-Inspired Perspective
NeurIPS 2025, Poster
4
Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models
NeurIPS 2025, Poster