Impact Index

56.36/100

Top 6%

Site-wide rank: #3,832

Papers published: 11

Average rating: 5.5

Average output: 3.7 papers/year

Yu Yang

Researcher @ OpenAI · United States · OpenReview

Research interests

machine learning

RedCodeAgent: Automatic Red-teaming Agent against Diverse Code Agents

ICLR 2026 · Poster

AIR-BENCH 2024: A Safety Benchmark based on Regulation and Policies Specified Risk Categories

ICLR 2025 · Spotlight

AutoRedTeamer: Autonomous Red Teaming with Lifelong Attack Integration

NeurIPS 2025 · Poster

Mini-batch Coresets for Memory-efficient Language Model Training on Data Mixtures

ICLR 2025 · Poster

SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI

ICLR 2025 · Rejected

RedCodeAgent: Automatic Red-teaming Agent against Code Agents

ICLR 2025 · Rejected

AutoRedTeamer: An Autonomous Red Teaming Agent Against Language Models

ICLR 2025 · Rejected

Co-authors (20)

Baharan Mirzasoleiman

PhD advisor · 4 papers