Paper
Hub
搜索
Toggle language
Xiaomeng Hu
~Xiaomeng_Hu1
2
论文总数
1.0
年均投稿
5.6
平均评分
接收情况
2
/
2
会议分布
NeurIPS
2
发表论文 (2 篇)
2025
1 篇
6.0
4
CARE: Decoding-Time Safety Alignment via Rollback and Introspection Intervention
NeurIPS 2025
Poster
2024
1 篇
5.3
4
Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss Landscapes
NeurIPS 2024
Poster
合作者 (5)
TH
Tsung-Yi Ho
2 篇
CY
Chenhan Yuan
1 篇
FH
Fei Huang
1 篇
JL
Junyang Lin
1 篇
PC
Pin-Yu Chen
1 篇