Wenxiang Jiao
~Wenxiang_Jiao1
11
论文总数
5.5
年均投稿
平均评分
接收情况6/11
会议分布
ICLR
9
NeurIPS
2
发表论文 (11 篇)
20254 篇
5
Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training
ICLR 2025withdrawn
4
Competing Large Language Models in Multi-Agent Gaming Environments
ICLR 2025Poster
3
Chain-of-Jailbreak Attack for Image Generation Models via Editing Step by Step
ICLR 2025Rejected
4
RaSA: Rank-Sharing Low-Rank Adaptation
ICLR 2025Poster
20247 篇
4
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
ICLR 2024Poster
4
Unsupervised Sign Language Translation and Generation
ICLR 2024Rejected
5
Improving Gloss-free Sign Language Translation by Reducing Representation Density
NeurIPS 2024Poster
3
Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models
ICLR 2024withdrawn
4
Apathetic or Empathetic? Evaluating LLMs' Emotional Alignments with Humans
NeurIPS 2024Poster
4
All Languages Matter: On the Multilingual Safety of Large Language Models
ICLR 2024Rejected
3
On the Humanity of Conversational AI: Evaluating the Psychological Portrayal of LLMs
ICLR 2024Oral