David Bau
~David_Bau1
16
Total papers
8.0
Average submissions per year
Average rating
Acceptance: 14/16
Venue distribution
ICLR
7
NeurIPS
5
COLM
3
ICML
1
Published papers (16)
2025: 10 papers
4
The Dual-Route Model of Induction
COLM 2025 Poster
4
Erasing Conceptual Knowledge from Language Models
NeurIPS 2025 Poster
3
Erasing Conceptual Knowledge from Language Models
ICLR 2025 Rejected
4
Art-Free Generative Models: Exploring Art Creation Without Prior Artistic Knowledge
ICLR 2025 Withdrawn
4
LLMs Encode Harmfulness and Refusal Separately
NeurIPS 2025 Poster
4
Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models
ICLR 2025 Oral
4
When Are Concepts Erased From Diffusion Models?
NeurIPS 2025 Poster
4
One-Step is Enough: Sparse Autoencoders for Text-to-Image Diffusion Models
NeurIPS 2025 Poster
4
NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals
ICLR 2025 Poster
4
MIB: A Mechanistic Interpretability Benchmark
ICML 2025 Poster
2024: 6 papers
4
Locating and Editing Factual Associations in Mamba
COLM 2024 Poster
3
Measuring and Controlling Instruction (In)Stability in Language Model Dialogs
COLM 2024 Poster
3
Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking
ICLR 2024 Poster
4
Function Vectors in Large Language Models
ICLR 2024 Poster
3
Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game Models
NeurIPS 2024 Poster
3
Linearity of Relation Decoding in Transformer Language Models
ICLR 2024 Spotlight