Paper
Hub
搜索
Toggle language
Aaron Jiaxun Li
~Aaron_Jiaxun_Li1
4
论文总数
2.0
年均投稿
5.5
平均评分
接收情况
2
/
4
会议分布
ICLR
3
COLM
1
发表论文 (4 篇)
2025
1 篇
7.0
4
More RLHF, More Trust? On The Impact of Preference Alignment On Trustworthiness
ICLR 2025
Oral
2024
3 篇
5.8
4
Improving Prototypical Part Networks with Reward Reweighing, Reselection, and Retraining
ICLR 2024
Rejected
3.5
4
Certifying LLM Safety against Adversarial Prompting
ICLR 2024
Rejected
5.8
4
Certifying LLM Safety against Adversarial Prompting
COLM 2024
Poster
合作者 (8)
HL
Himabindu Lakkaraju
3 篇
AK
Aounon Kumar
2 篇
CA
Chirag Agarwal
2 篇
SF
Soheil Feizi
2 篇
SS
Suraj Srinivas
2 篇
SK
Satyapriya Krishna
1 篇
BY
Bin Yu
1 篇
RN
Robin Netzorg
1 篇