Alessandro Stolfo
~Alessandro_Stolfo1
5
Total papers
2.5
Submissions per year
Average rating
Acceptance: 5/5
Venue distribution
NeurIPS
3
ICLR
1
ICML
1
Published papers (5)
2025: 4 papers
4
Improving Instruction-Following in Language Models through Activation Steering
ICLR 2025 Poster
4
Dense SAE Latents Are Features, Not Bugs
NeurIPS 2025 Poster
4
Transferring Linear Features Across Language Models With Model Stitching
NeurIPS 2025 Spotlight
4
MIB: A Mechanistic Interpretability Benchmark
ICML 2025 Poster