影响力指数

32.06/100

前 22.1%

全站排名 #14,257

发表论文4 篇

平均评分4.7

年均产出2.0 篇/年

Jing-Jing Li

PhD student@University of California, Berkeley·美国·OpenReview

研究方向

Agentic AI · AI Safety · Computational Cognitive Science · Computational Neuroscience

PluriHarms: Benchmarking the Full Spectrum of Human Judgments on AI Harm

ICLR 2026Poster

STAC: When Innocent Tools Form Dangerous Chains to Jailbreak LLM Agents

ICLR 2026Withdrawn

SafetyAnalyst: Interpretable, Transparent, and Steerable Safety Moderation for AI Behavior

ICML 2025Poster

SafetyAnalyst: Interpretable, transparent, and steerable LLM safety moderation

ICLR 2025Rejected

合作者 (19)

博士导师3 篇

Valentina Pyatkin

Jana Schaich Borg

Max Kleiman-Weiner