PaperHub

Percy Liang

~Percy_Liang1

26
论文总数
13.0
年均投稿
6.3
平均评分
接收情况19/26
会议分布
ICLR
17
ICML
5
NeurIPS
3
COLM
1

发表论文 (26 篇)

202519

6.5
4

Model Equality Testing: Which Model is this API Serving?

ICLR 2025Poster
6.4
4

On the Entropy Calibration of Language Models

NeurIPS 2025Poster
7.2
4

Reliable and Efficient Amortized Model-based Evaluation

ICML 2025Poster
6.5
6

Reliable and Efficient Amortized Model-based Evaluation

ICLR 2025Rejected
3.5
4

On the Entropy Calibration of Language Models

ICLR 2025withdrawn
6.4
5

Independence Tests for Language Models

ICML 2025Spotlight
6.1
4

Auditing Prompt Caching in Language Model APIs

ICML 2025Poster
5.3
4

Independence Tests for Language Models

ICLR 2025Rejected
6.0
4

Instruction Following without Instruction Tuning

ICLR 2025Rejected
6.0
5

Understanding Warmup-Stable-Decay Learning Rates: A River Valley Loss Landscape View

ICLR 2025Poster
7.8
3

Eliciting Language Model Behaviors with Investigator Agents

ICML 2025Poster
6.8
4

Audits Under Resource, Data, and Access Constraints: Scaling Laws For Less Discriminatory Alternatives

NeurIPS 2025Poster
4.8
4

VideoAgent: Self-Improving Video Generation

ICLR 2025Rejected
6.3
4

AutoBencher: Towards Declarative Benchmark Construction

ICLR 2025Poster
7.3
4

Blackbox Model Provenance via Palimpsestic Membership Inference

NeurIPS 2025Spotlight
7.2
4

Language Models May Verbatim Complete Text They Were Not Explicitly Trained On

ICML 2025Spotlight
6.4
5

BioDiscoveryAgent: An AI Agent for Designing Genetic Perturbation Experiments

ICLR 2025Poster
7.5
4

AIR-BENCH 2024: A Safety Benchmark based on Regulation and Policies Specified Risk Categories

ICLR 2025Spotlight
8.7
3

Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risks of Language Models

ICLR 2025Oral