Amir Abdullah
~Amir_Abdullah1
Total papers: 3
Submissions per year (average): 1.5
Average rating: 4.6
Accepted: 2 / 3
Venue distribution
NeurIPS: 1
ICLR: 1
ICML: 1
Publications (3 papers)
2025 (1 paper)
5.5
4
Activation Space Interventions Can Be Transferred Between Large Language Models
ICML 2025
Poster
2024 (2 papers)
5.3
4
Interpreting Learned Feedback Patterns in Large Language Models
NeurIPS 2024
Poster
3.0
3
Interpreting Reward Models in RLHF-Tuned Language Models Using Sparse Autoencoders
ICLR 2024
Withdrawn
Co-authors (12)
Fazl Barez (2 papers)
Luke Marks (2 papers)
Philip Torr (2 papers)
Rauno Arike (2 papers)
Luna Mendez (1 paper)
Abir HARRASSE (1 paper)
Dhruv Nathawani (1 paper)
Michael Lan (1 paper)