Impact Index

36.19/100

Top 17.6%

Site-wide rank: #11,353

Papers published: 8

Average rating: 4.9

Average annual output: 8.0 papers/year

Dong Yan

Researcher @ Baichuan Intelligent Technology · OpenReview

Research Areas

Algorithmic Game Theory · Large-scale problem solving · Reinforcement Learning

STAIR: Improving Safety Alignment with Introspective Reasoning

Learning LLM-as-a-Judge for Preference Alignment

ICLR 2025 · Poster

3D-Properties: Identifying Challenges in DPO and Charting a Path Forward

ICLR 2025 · Poster

Towards Mitigating Factual Hallucination in LLMs through Self-Alignment with Memory

ICLR 2025 · Withdrawn

Reward-Robust RLHF in LLMs

ICLR 2025 · Rejected

Uncertainty-aware Reward Model: Teaching Reward Models to Know What is Unknown

ICLR 2025 · Withdrawn

Boosting Deductive Reasoning with Step Signals In RLHF

ICLR 2025 · Rejected

Data-Driven Creativity: Amplifying Imagination in LLM Writing

ICLR 2025 · Rejected

Collaborators (20)