No rating data available
ICLR 2025
Evaluating Synthetic Activations composed of SAE Latents in GPT-2
TL;DR
We investigate how synthetic activations composed of SAE latents compare to real model-generated activations in GPT-2, revealing insights into the importance of relationships between SAE latents and into the characteristics of activation plateaus.
Abstract
Keywords
Mechanistic Interpretability, SAEs, Activations, SAE Latents
Reviews and Discussion
Desk rejected by Program Chairs
Reason for desk rejection
Wrong formatting template.