John Hughes
~John_Hughes4
Total papers: 7
Average submissions per year: 3.5
Average rating
Accepted: 6/7
Venue distribution
ICLR: 3
NeurIPS: 2
ICML: 1
COLM: 1
Published papers (7)
2025 (6 papers)
3
Attacking Audio Language Models with Best-of-N Jailbreaking
ICLR 2025 Rejected
4
Best-of-N Jailbreaking
NeurIPS 2025 Poster
3
Why Do Some Language Models Fake Alignment While Others Don't?
NeurIPS 2025 Spotlight
4
How Do Large Language Monkeys Get Their Power (Laws)?
ICML 2025 Oral
4
Looking Inward: Language Models Can Learn About Themselves by Introspection
ICLR 2025 Poster
4
Failures to Find Transferable Image Jailbreaks Between Vision-Language Models
ICLR 2025 Poster