Aishan Liu
~Aishan_Liu1
9
Total papers
4.5
Submissions per year
Average rating
Acceptance: 8/9
Venue distribution
NeurIPS: 4
ICLR: 4
ICML: 1
Publications (9 papers)
2025 (6 papers)
4
Detoxifying Large Language Models via Autoregressive Reward Guided Representation Editing
NeurIPS 2025 Poster
3
ICLShield: Exploring and Mitigating In-Context Learning Backdoor Attacks
ICML 2025 Poster
4
Lie Detector: Unified Backdoor Detection via Cross-Examination Framework
NeurIPS 2025 Poster
4
T2V-OptJail: Discrete Prompt Optimization for Text-to-Video Jailbreak Attacks
NeurIPS 2025 Poster
4
Exploring Semantic-constrained Adversarial Example with Instruction Uncertainty Reduction
NeurIPS 2025 Poster
4
BadRobot: Jailbreaking Embodied LLM Agents in the Physical World
ICLR 2025 Poster