影响力指数

79.97/100

前 1.4%

全站排名 #897

发表论文23 篇

平均评分5.3

年均产出7.7 篇/年

Niloofar Mireshghallah

Assistant Professor@Carnegie Mellon University·美国·OpenReview

研究方向

Policy and AI Governance · Natural Language Processing · Privacy in Machine Learning · Computer Architecture

CIMemories: A Compositional Benchmark For Contextual Integrity In LLMs

ICLR 2026Poster

Harnessing Optimization Dynamics for Curvature-Informed Model Merging

ICLR 2026Rejected

Operationalizing Data Minimization for Privacy-Preserving LLM Prompting

ICLR 2026Poster

Spectrum Tuning: Post-Training for Distributional Coverage and In-Context Steerability

ICLR 2026Poster

Privasis: Synthesizing the Largest "Public" Private Dataset from Scratch

ICLR 2026Rejected

Bob’s Confetti: Phonetic Memorization Attacks in Music and Video Generation

ICLR 2026Withdrawn

Privacy-Preserving LLM Interaction with Socratic Chain-of-Thought Reasoning and Homomorphically Encrypted Vector Databases

ICLR 2026Rejected

Can Large Language Models Really Recognize Your Name?

ICLR 2026Withdrawn

Fighting Fire with Fire: Assessing Test Set Contamination Through Deliberate Training on Test Data

ICLR 2026Rejected

Reinforcement Learning Improves Traversal of Hierarchical Knowledge in LLMs

ICLR 2026Withdrawn

AI as Humanity’s Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text

Exploring the limits of strong membership inference attacks on large language models

NeurIPS 2025Poster

HAICOSYSTEM: An Ecosystem for Sandboxing Safety Risks in Interactive AI Agents

COLM 2025Poster

The Surprising Effectiveness of Membership Inference with Simple N-Gram Coverage

COLM 2025Poster

HAICOSYSTEM: An Ecosystem for Sandboxing Safety Risks in Human-AI Interactions

ICLR 2025Rejected

A False Sense of Privacy: Evaluating Textual Data Sanitization Beyond Surface-level Privacy Leakage

ICLR 2025Rejected

ParaPO: Aligning Language Models to Reduce Verbatim Reproduction of Pre-training Data

COLM 2025Poster

Leveraging Set Assumption for Membership Inference in Language Models

ICLR 2025Rejected

合作者 (20)

博后导师13 篇

博后导师5 篇