Paper
Hub
搜索
Toggle language
Stefan Heimersheim
~Stefan_Heimersheim1
2
论文总数
2.0
年均投稿
6.0
平均评分
接收情况
1
/
2
会议分布
ICML
1
ICLR
1
发表论文 (2 篇)
2025
2 篇
6.0
5
Detecting Strategic Deception with Linear Probes
ICML 2025
Poster
-
Evaluating Synthetic Activations composed of SAE Latents in GPT-2
ICLR 2025
desk_rejected
合作者 (7)
BC
Bilal Chughtai
1 篇
MH
Marius Hobbhahn
1 篇
NG
Nicholas Goldowsky-Dill
1 篇
CM
Chatrik Singh Mangat
1 篇
GG
Giorgi Giglemiani
1 篇
JJ
Jett Janiak
1 篇
NP
Nora Petrova
1 篇