影响力指数

43.29/100

前 12.1%

全站排名 #7,805

发表论文6 篇

平均评分6.4

年均产出3.0 篇/年

Alex Tamkin

Values in the Wild: Discovering and Mapping Values in Real-World Language Model Interactions

COLM 2025Poster

Eliciting Human Preferences with Language Models

ICLR 2025Poster

Towards Measuring the Representation of Subjective Global Opinions in Language Models

COLM 2024Poster

Many-shot Jailbreaking

NeurIPS 2024Poster

Codebook Features: Sparse and Discrete Interpretability for Neural Networks

ICLR 2024Rejected

Eliciting Human Preferences with Language Models

ICLR 2024Rejected

合作者 (20)

Nicholas Schiefer