Zeming Wei
~Zeming_Wei1
5
论文总数
2.5
年均投稿
平均评分
接收情况4/5
会议分布
NeurIPS
3
ICLR
1
ICML
1
发表论文 (5 篇)
20244 篇
4
Characterizing Robust Overfitting in Adversarial Training via Cross-Class Features
ICLR 2024Rejected
3
Adversarial Representation Engineering: A General Model Editing Framework for Large Language Models
NeurIPS 2024Poster
5
Fight Back Against Jailbreaking via Prompt Adversarial Tuning
NeurIPS 2024Poster
5
A Theoretical Understanding of Self-Correction through In-context Alignment
NeurIPS 2024Poster