Mrinank Sharma
~Mrinank_Sharma1
10
Total papers
5.0
Submissions per year
Average rating
Acceptance: 6/10
Venue distribution
ICLR
7
NeurIPS
2
ICML
1
Published papers (10)
2025: 7 papers
6
PoisonBench: Assessing Large Language Model Vulnerability to Data Poisoning
ICLR 2025 Rejected
4
PoisonBench: Assessing Language Model Vulnerability to Poisoned Preference Data
ICML 2025 Poster
4
Rapid Response: Mitigating LLM Jailbreaks With A Few Examples
ICLR 2025 Rejected
3
Adaptive Deployment of Untrusted LLMs Reduces Distributed Threats
ICLR 2025 Poster
3
Attacking Audio Language Models with Best-of-N Jailbreaking
ICLR 2025 Rejected
4
Best-of-N Jailbreaking
NeurIPS 2025 Poster
4
Failures to Find Transferable Image Jailbreaks Between Vision-Language Models
ICLR 2025 Poster