Dong Yan
~Dong_Yan1
8
论文总数
8.0
年均投稿
平均评分
接收情况3/8
会议分布
ICLR
7
ICML
1
发表论文 (8 篇)
20258 篇
3
Uncertainty-aware Reward Model: Teaching Reward Models to Know What is Unknown
ICLR 2025withdrawn
4
Data-Driven Creativity: Amplifying Imagination in LLM Writing
ICLR 2025Rejected
4
Towards Mitigating Factual Hallucination in LLMs through Self-Alignment with Memory
ICLR 2025withdrawn
4
Boosting Deductive Reasoning with Step Signals In RLHF
ICLR 2025Rejected
4
3D-Properties: Identifying Challenges in DPO and Charting a Path Forward
ICLR 2025Poster
4
Learning LLM-as-a-Judge for Preference Alignment
ICLR 2025Poster
4
STAIR: Improving Safety Alignment with Introspective Reasoning
ICML 2025Oral
4
Reward-Robust RLHF in LLMs
ICLR 2025Rejected