Yang Zhang
~Yang_Zhang15
11
论文总数
5.5
年均投稿
平均评分
接收情况4/11
会议分布
ICLR
8
NeurIPS
2
ICML
1
发表论文 (11 篇)
20255 篇
4
Finding and Reactivating Post-Trained LLMs' Hidden Safety Mechanisms
NeurIPS 2025Poster
4
Adjacent Words, Divergent Intents: Jailbreaking Large Language Models via Task Concurrency
NeurIPS 2025Poster
4
SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation
ICLR 2025Poster
4
ACE: Attack Combo Enhancement Against Machine Learning Models
ICLR 2025withdrawn
4
The Ripple Effect: On Unforeseen Complications of Backdoor Attacks
ICML 2025Poster
20246 篇
4
Last One Standing: A Comparative Analysis of Security and Privacy of Soft Prompt Tuning, LoRA, and In-Context Learning
ICLR 2024Rejected
4
A Comprehensive Study of Privacy Risks in Curriculum Learning
ICLR 2024Rejected
5
Fine-Tuning Is All You Need to Mitigate Backdoor Attacks
ICLR 2024Rejected
4
Composite Backdoor Attacks Against Large Language Models
ICLR 2024withdrawn
4
Prompt Backdoors in Visual Prompt Learning
ICLR 2024withdrawn
4
Robustness Over Time: Understanding Adversarial Examples’ Effectiveness on Longitudinal Versions of Large Language Models
ICLR 2024Rejected