Paper
Hub
搜索
Toggle language
Aleksandar Makelov
~Aleksandar_Makelov1
2
论文总数
1.0
年均投稿
6.7
平均评分
接收情况
2
/
2
会议分布
ICLR
2
发表论文 (2 篇)
2025
1 篇
7.0
4
Towards Principled Evaluations of Sparse Autoencoders for Interpretability and Control
ICLR 2025
Poster
2024
1 篇
6.3
3
Is This the Subspace You Are Looking for? An Interpretability Illusion for Subspace Activation Patching
ICLR 2024
Poster
合作者 (3)
GL
Georg Lange
2 篇
NN
Neel Nanda
2 篇
AG
Atticus Geiger
1 篇