影响力指数

53.2/100

前 7%

全站排名 #4,534

发表论文8 篇

平均评分6.0

年均产出4.0 篇/年

Sicheng Zhu

PhD student@University of Maryland, College Park·美国·OpenReview

研究方向

machine learning; generalization; out-of-domain generalization; adversarial robustness; interpretability; invariance;

GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-Time Alignment

ICLR 2025Poster

AdvPrefix: An Objective for Nuanced LLM Jailbreaks

NeurIPS 2025Poster

AutoDAN: Automatic and Interpretable Adversarial Attacks on Large Language Models

ICLR 2024Rejected

AutoDAN: Interpretable Gradient-Based Adversarial Attacks on Large Language Models

COLM 2024Poster

Like Oil and Water: Group Robustness Methods and Poisoning Defenses May Be at Odds

ICLR 2024Poster

Automatic Pseudo-Harmful Prompt Generation for Evaluating False Refusals in Large Language Models

COLM 2024Poster

PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts

ICLR 2024Poster

On the Possibilities of AI-Generated Text Detection: A Sample Complexity Analysis

ICLR 2024Rejected

合作者 (20)

Michael-Andrei Panaitescu-Liess