Sicheng Zhu
~Sicheng_Zhu1
8
Total papers
4.0
Submissions per year
Average rating
Accepted: 6/8
Venue distribution
ICLR
5
COLM
2
NeurIPS
1
Publications (8 papers)
2025: 2 papers
2024: 6 papers
4
AutoDAN: Automatic and Interpretable Adversarial Attacks on Large Language Models
ICLR 2024 Rejected
4
AutoDAN: Interpretable Gradient-Based Adversarial Attacks on Large Language Models
COLM 2024 Poster
4
Automatic Pseudo-Harmful Prompt Generation for Evaluating False Refusals in Large Language Models
COLM 2024 Poster
4
PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts
ICLR 2024 Poster
4
Like Oil and Water: Group Robustness Methods and Poisoning Defenses May Be at Odds
ICLR 2024 Poster
4
On the Possibilities of AI-Generated Text Detection: A Sample Complexity Analysis
ICLR 2024 Rejected