Paper
Hub
搜索
Toggle language
Xinpeng Wang
~Xinpeng_Wang3
3
论文总数
1.5
年均投稿
6.3
平均评分
接收情况
3
/
3
会议分布
NeurIPS
1
ICLR
1
COLM
1
发表论文 (3 篇)
2025
2 篇
6.4
4
Refusal Direction is Universal Across Safety-Aligned Languages
NeurIPS 2025
Poster
5.8
4
Surgical, Cheap, and Flexible: Mitigating False Refusal in Language Models via Single Vector Ablation
ICLR 2025
Poster
2024
1 篇
6.8
4
Look at the Text: Instruction-Tuned Language Models are More Robust Multiple Choice Selectors than You Think
COLM 2024
Poster
合作者 (8)
BP
Barbara Plank
3 篇
CH
Chengzhi Hu
2 篇
HS
Hinrich Schuetze
1 篇
MW
Mingyang Wang
1 篇
YL
Yihong Liu
1 篇
BM
Bolei Ma
1 篇
PR
Paul Rottger
1 篇
PR
Paul Röttger
1 篇