Yu Yang
~Yu_Yang4
10
Total papers
5.0
Submissions per year
Average rating
Accepted: 5/10
Venue distribution
ICLR
8
NeurIPS
2
Published papers (10)
2025: 6 papers
4
SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI
ICLR 2025 (Rejected)
4
AIR-BENCH 2024: A Safety Benchmark based on Regulation and Policies Specified Risk Categories
ICLR 2025 (Spotlight)
4
RedCodeAgent: Automatic Red-teaming Agent against Code Agents
ICLR 2025 (Rejected)
4
AutoRedTeamer: An Autonomous Red Teaming Agent Against Language Models
ICLR 2025 (Rejected)
4
Mini-batch Coresets for Memory-efficient Language Model Training on Data Mixtures
ICLR 2025 (Poster)
4
AutoRedTeamer: Autonomous Red Teaming with Lifelong Attack Integration
NeurIPS 2025 (Poster)
2024: 4 papers
-
Identifying Spurious Biases Early in Training through the Lens of Simplicity Bias
ICLR 2024 (Withdrawn)
4
SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models
NeurIPS 2024 (Poster)
3
Data Distillation Can Be Like Vodka: Distilling More Times For Better Quality
ICLR 2024 (Poster)
3
SIEVE: Multimodal Dataset Pruning using Image-Captioning Models
ICLR 2024 (Withdrawn)