Bilal Chughtai
~Bilal_Chughtai1
Total papers: 5
Submissions per year: 2.5
Average rating: 6.2
Accepted: 3 / 5
Venue distribution
ICLR: 2
ICML: 1
COLM: 1
NeurIPS: 1
Papers (5)
2025 (2 papers)
6.0
5
Detecting Strategic Deception with Linear Probes
ICML 2025
Poster
7.8
4
Overcoming Sparsity Artifacts in Crosscoders to Interpret Chat-Tuning
NeurIPS 2025
Poster
2024 (3 papers)
5.3
4
Summing Up the Facts: Additive Mechanisms behind Factual Recall in LLMs
ICLR 2024
Rejected
5.0
3
Language Models Struggle to Explain Themselves
ICLR 2024
Rejected
7.0
4
Transformer Circuit Evaluation Metrics Are Not Robust
COLM 2024
Poster
Collaborators (12)
Neel Nanda (2 papers)
Marius Hobbhahn (1 paper)
Nicholas Goldowsky-Dill (1 paper)
Stefan Heimersheim (1 paper)
Dane Sherburn (1 paper)
Owain Evans (1 paper)
Alan Cooney (1 paper)
Caden Juang (1 paper)