Sanmi Koyejo

~Sanmi_Koyejo1

48

Total papers

24.0

Average submissions per year

5.6

Average rating

Accepted: 26/48

Venue distribution

ICLR

27

ICML

10

NeurIPS

8

COLM

3

Papers (48)

2025: 37 papers

The Utility and Complexity of In- and Out-of-Distribution Machine Unlearning

ICLR 2025 Poster

Logits are All We Need to Adapt Closed Models

ICML 2025 Poster

Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs

ICLR 2025 Withdrawn

Language Models May Verbatim Complete Text They Were Not Explicitly Trained On

ICML 2025 Spotlight

Reliable and Efficient Amortized Model-based Evaluation

ICLR 2025 Rejected

From Passive to Active Reasoning: Can Large Language Models Ask the Right Questions under Incomplete Information?

ICML 2025 Poster

Reliable and Efficient Amortized Model-based Evaluation

ICML 2025 Poster

Quantifying Variance in Evaluation Benchmarks

ICLR 2025 Rejected

Certified Unlearning for Neural Networks

ICML 2025 Poster

Lean-ing on Quality: How High-Quality Data Beats Diverse Multilingual Data in AutoFormalization

ICLR 2025 Rejected

Aligning Compound AI Systems via System-level DPO

NeurIPS 2025 Poster

Scaling Laws for Downstream Task Performance in Machine Translation

ICLR 2025 Poster

Beyond Scale: The Diversity Coefficient as a Data Quality Metric for Variability in Natural Language Data

ICLR 2025 Rejected

Sharpe Ratio-Guided Active Learning for Preference Optimization in RLHF

COLM 2025 Poster

Incidental Polysemanticity: A New Obstacle for Mechanistic Interpretability

ICLR 2025 Rejected

Attacking Audio Language Models with Best-of-N Jailbreaking

ICLR 2025 Rejected

AutoRedTeamer: An Autonomous Red Teaming Agent Against Language Models

ICLR 2025 Rejected

Putnam-AXIOM: A Functional & Static Benchmark for Measuring Higher Level Mathematical Reasoning in LLMs

ICLR 2025 Rejected

Nonmyopic Bayesian Optimization in Dynamic Cost Settings

ICLR 2025 Rejected

Best-of-N Jailbreaking

NeurIPS 2025 Poster

KGGen: Extracting Knowledge Graphs from Plain Text with Language Models

NeurIPS 2025 Poster

ZIP-FIT: Embedding-Free Data Selection via Compression-Based Alignment

ICLR 2025 Rejected

Putnam-AXIOM: A Functional & Static Benchmark for Measuring Higher Level Mathematical Reasoning in LLMs

ICML 2025 Poster

Collapse or Thrive? Perils and Promises of Synthetic Data in a Self-Generating World

ICLR 2025 Rejected

Collapse or Thrive: Perils and Promises of Synthetic Data in a Self-Generating World

ICML 2025 Poster

Context Clues: Evaluating Long Context Models for Clinical Prediction Tasks on EHR Data

ICLR 2025 Poster

MoSH: Modeling Multi-Objective Tradeoffs with Soft and Hard Bounds

ICLR 2025 Rejected

No, of Course I Can! Deeper Fine-Tuning Attacks That Bypass Token-Level Safety Mechanisms

NeurIPS 2025 Rejected

AutoRedTeamer: Autonomous Red Teaming with Lifelong Attack Integration

NeurIPS 2025 Poster

Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?

ICML 2025 Poster

Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?

ICLR 2025 Rejected

Learning to (Learn at Test Time): RNNs with Expressive Hidden States

ICLR 2025 Rejected

How Do Large Language Monkeys Get Their Power (Laws)?

Learning to (Learn at Test Time): RNNs with Expressive Hidden States

ICML 2025 Spotlight

Correlating and Predicting Human Evaluations of Language Models from Natural Language Processing Benchmarks

ICLR 2025 Rejected

Understanding challenges to the interpretation of disaggregated evaluations of algorithmic fairness

NeurIPS 2025 Poster

Failures to Find Transferable Image Jailbreaks Between Vision-Language Models

ICLR 2025 Poster

2024: 11 papers

Principled Federated Domain Adaptation: Gradient Projection and Auto-Weighting

ICLR 2024 Poster

Enhancing Robustness of Last Layer Two-Stage Fair Model Corrections

NeurIPS 2024 Poster

HIFA: High-fidelity Text-to-3D Generation with Advanced Diffusion Guidance

ICLR 2024 Poster

Sketching for Distributed Deep Learning: A Sharper Analysis

NeurIPS 2024 Poster

Beyond Scale: the Diversity Coefficient as a Data Quality Metric Demonstrates LLMs are Pre-trained on Formally Diverse Data

ICLR 2024 Rejected

Is Pre-training Truly Better Than Meta-Learning?

ICLR 2024 Rejected

Learning to (Learn at Test Time)

ICLR 2024 Withdrawn

On Fairness of Low-Rank Adaptation of Large Models

COLM 2024 Poster

Divergence at the Interpolation Threshold: Identifying, Interpreting & Ablating the Sources of a Deep Learning Puzzle

ICLR 2024 Withdrawn

Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data

COLM 2024 Poster

Enhancing Neural Network Transparency through Representation Analysis

ICLR 2024 Rejected

Collaborators (20)

Rylan Schaeffer: 16 papers

Brando Miranda: 10 papers

Joshua Kazdan: 7 papers

John Hughes: 5 papers

Berivan Isik: 4 papers

Henry Sleight: 4 papers

Carlos Guestrin: 4 papers