影响力指数

77.34/100

前 1.7%

全站排名 #1,064

发表论文21 篇

平均评分5.5

年均产出7.0 篇/年

Yinpeng Dong

Assistant Professor@Tsinghua University·中国·OpenReview

研究方向

AI safety · deep learning · computer vision

5.0

Exploring Aleatoric Uncertainty in Object Detection via Vision Foundation Models

ICLR 2026Withdrawn

3.2

Towards the Worst-case Robustness of Large Language Models

ICLR 2026Rejected

二作

8.3

STAIR: Improving Safety Alignment with Introspective Reasoning

ICML 2025Oral

6.4

Red-Teaming Text-to-Image Systems by Rule-based Preference Modeling

NeurIPS 2025Poster

通讯

6.4

Mitigating Overthinking in Large Reasoning Models via Manifold Steering

NeurIPS 2025Poster

通讯

4.3

Towards Mitigating Factual Hallucination in LLMs through Self-Alignment with Memory

ICLR 2025Withdrawn

三作

4.3

Adaptive Strategy Evolution for Generating Tailored Jailbreak Prompts against Black-Box Safety-Aligned LLMs

合作者 (20)

Yinpeng Dong

Why Do Unlearnable Examples Work: A Novel Perspective of Mutual Information

Towards Safe Reasoning in Large Reasoning Models via Corrective Intervention

Unveiling the Basin-Like Loss Landscape in Large Language Models

Silent Leaks: Implicit Knowledge Extraction Attack on RAG Systems

Exploring Recursive Doubt in Large Language Models

Exploring Aleatoric Uncertainty in Object Detection via Vision Foundation Models

Towards the Worst-case Robustness of Large Language Models

STAIR: Improving Safety Alignment with Introspective Reasoning

Red-Teaming Text-to-Image Systems by Rule-based Preference Modeling

Mitigating Overthinking in Large Reasoning Models via Manifold Steering

Towards Mitigating Factual Hallucination in LLMs through Self-Alignment with Memory

Adaptive Strategy Evolution for Generating Tailored Jailbreak Prompts against Black-Box Safety-Aligned LLMs