Impact Index

98.91/100

Top 0.1%

Site-wide rank #22

Papers published: 119

Average rating: 5.1

Average output: 39.7 papers/year

Philip Torr

Full Professor @ University of Oxford · United Kingdom · OpenReview

Research Areas

computer vision

Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training

A Guardrail for Safety Preservation: When Safety-Sensitive Subspace Meets Harmful-Resistant Null-Space

ICLR 2026 Poster

Beyond Linear Probes: Dynamic Safety Monitoring for Language Models

ICLR 2026 Poster

Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks

ICLR 2026 Poster

Eigen-Agent: Adaptive Multi-Agent Scientific Reasoning with Monitor-Based RAG

ICLR 2026 Poster

Single LLM Debate, MoLaCE: Mixture of Latent Concept Experts Against Confirmation Bias

ICLR 2026 Desk Rejected

BiasBusters: Uncovering and Mitigating Tool Selection Bias in Large Language Models

ICLR 2026 Poster

Interleaving Reasoning for Better Text-to-Image Generation

ICLR 2026 Poster

TRACEDET: HALLUCINATION DETECTION FROM THE DECODING TRACE OF DIFFUSION LARGE LANGUAGE MODELS

ICLR 2026 Poster

Safe Autoregressive Image Generation with Iterative Self-Improving Codebooks

ICLR 2026 Rejected

Learnable Sparsity for Vision Generative Models

ICLR 2026 Poster

CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards

ICLR 2026 Poster

It's a TRAP! Task-Redirecting Agent Persuasion Benchmark for Web Agents

ICLR 2026 Rejected

DIVA: Discrete Diffusion Vision-Language-Action Models for Parallelized Action Generation

ICLR 2026 Rejected

Can Large Language Models Express Uncertainty Like Human?

ICLR 2026 Rejected

Better know nothing than half-know anything: A Precise and Efficient Dataset for Scientific Reasoning in Language Models

ICLR 2026 Rejected

Towards Understanding Multimodal Fine-Tuning: A Case Study into Spatial Features

ICLR 2026 Rejected

PostAlign: Multimodal Grounding as a Corrective Lens for MLLMs

ICLR 2026 Poster

FORCE: Transferable Visual Jailbreaking Attacks via Feature Over-Reliance CorrEction

ICLR 2026 Withdrawn

Measuring Sparse Autoencoder Feature Space Similarities Across Large Language Models

ICLR 2026 Withdrawn

Learning to Fly Camera Drones by Watching Internet Videos

ICLR 2026 Withdrawn

Purifying Task Vectors in Knowledge-Aware Subspace for Model Merging

ICLR 2026 Rejected

OMNI-LEAK: Orchestrator Multi-Agent Network Induced Data Leakage

ICLR 2026 Desk Rejected

h1: Bootstrapping Models to Reason over Longer Horizons via Reinforcement Learning

ICLR 2026 Rejected

GLARE: Towards Graph-less Retrieval for Retrieval Augmented Generation on Million-scale Knowledge Graphs

ICLR 2026 Rejected

LikePhys: Evaluating Intuitive Physics Understanding in Video Diffusion Models via Likelihood Preference

ICLR 2026 Poster

Ego to World: Collaborative Spatial Reasoning in Embodied Systems via Reinforcement Learning

ICLR 2026 Withdrawn

Base Models Know How to Reason, Thinking Models Learn When

ICLR 2026 Withdrawn

Bag of Tricks for Subverting Reasoning-based Safety Guardrails

ICLR 2026 Withdrawn

RoboMonster: Compositional Generalization of Heterogeneous Multi-End Effector Embodied Agents

ICLR 2026 Withdrawn

Predicting Weak-to-Strong Generalization from Latent Representations

ICLR 2026 Rejected

Computer-Use Agents as Judges for Automatic GUI Design

ICLR 2026 Withdrawn

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation

ICLR 2026 Rejected

On the Diminishing Reliability of Reference-Free Memorization Detection in Modern Diffusion Models

ICLR 2026 Withdrawn

ToolTweak: An Attack on Tool Selection in LLM-based Agents

ICLR 2026 Rejected

Causal Discovery under Changing Mechanisms: A Unified Graphical Approach

ICLR 2026 Rejected

Towards Causal Fine-Tuning under Latent-Confounded Shift

ICLR 2026 Withdrawn

Building Scalable Real-World Robot Data Generation via Compositional Simulation

ICLR 2026 Withdrawn

AN ITERATIVE PROMPTING FRAMEWORK FOR LLM-BASED DATA PREPROCESSING

ICLR 2026 Rejected

CrystalSeg: Automating Synchrotron Tomographic Reconstruction Segmentation for Crystallography with Physically Guided Simulations

ICLR 2026 Withdrawn

Towards Agents That Know When They Don't Know: Uncertainty as a Control Signal for Structured Reasoning

ICLR 2026 Withdrawn

MATRIX: Multimodal Agent Tuning for Robust Tool-Use Reasoning

ICLR 2026 Withdrawn

Can Knowledge-Graph-based Retrieval Augmented Generation Really Retrieve What You Need?

NeurIPS 2025 Spotlight

Bi-Factorial Preference Optimization: Balancing Safety-Helpfulness in Language Models

ICLR 2025 Spotlight

Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention

NeurIPS 2025 Poster

On the Coexistence and Ensembling of Watermarks

NeurIPS 2025 Poster

Towards Interpreting Visual Information Processing in Vision-Language Models

ICLR 2025 Poster

Focus On This, Not That! Steering LLMs with Adaptive Feature Specification

ICML 2025 Poster

MALT: Improving Reasoning with Multi-Agent LLM Training

COLM 2025 Poster

Towards Reliable Identification of Diffusion-based Image Manipulations

NeurIPS 2025 Poster

Towards Certification of Uncertainty Calibration under Adversarial Attacks

ICLR 2025 Poster

Shh, don't say that! Domain Certification in LLMs

ICLR 2025 Poster

Mixture of Experts Made Intrinsically Interpretable

ICML 2025 Poster

On the Coexistence and Ensembling of Watermarks

ICLR 2025 Rejected

True Multimodal In-Context Learning Needs Attention to the Visual Context

COLM 2025 Poster

MIP against Agent: Malicious Image Patches Hijacking Multimodal OS Agents

NeurIPS 2025 Poster

Too Late to Recall: Explaining the Two-Hop Problem in Multimodal Knowledge Retrieval

NeurIPS 2025 Poster

Rethinking Safety in LLM Fine-tuning: An Optimization Perspective

COLM 2025 Poster

Sparse Autoencoders Reveal Universal Feature Spaces Across Large Language Models

ICLR 2025 Rejected

Do as I do (Safely): Mitigating Task-Specific Fine-tuning Risks in Large Language Models

ICLR 2025 Poster

Semantic Score Distillation Sampling for Compositional Text-to-3D Generation

ICLR 2025 Rejected

Focus On This, Not That! Steering LLMs With Adaptive Feature Specification

ICLR 2025 Rejected

Flex3D: Feed-Forward 3D Generation with Flexible Reconstruction Model and Input View Curation

ICML 2025 Poster

Flex3D: Feed-Forward 3D Generation with Flexible Reconstruction Model and Input View Curation

ICLR 2025 Rejected

MAD-Sherlock: Multi-Agent Debates for Out-of-Context Misinformation Detection

ICLR 2025 Rejected

PoisonBench: Assessing Large Language Model Vulnerability to Data Poisoning

ICLR 2025 Rejected

REVIP: Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge

ICLR 2025 Withdrawn

Minimalist Concept Erasure in Generative Models

ICML 2025 Poster

SynthCLIP: Are We Ready for a Fully Synthetic CLIP Training?

ICLR 2025 Rejected

Incrementally Adapting Generative Vision-Language Models with Task Codebook

ICLR 2025 Rejected

Can Editing LLMs Inject Harm?

ICLR 2025 Rejected

PoisonBench: Assessing Language Model Vulnerability to Poisoned Preference Data

ICML 2025 Poster

LLM Jailbreak Detection for (Almost) Free!

ICLR 2025 Withdrawn

FedBiP: Heterogeneous One-Shot Federated Learning with Personalized Latent Diffusion Models

ICLR 2025 Withdrawn

CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents

ICLR 2025 Rejected

OASIS: Open Agents Social Interaction Simulations on a Large Scale

ICLR 2025 Rejected

SAGE: Scalable Ground Truth Evaluations for Large Sparse Autoencoders

ICLR 2025 Withdrawn

Cracking the Collective Mind: Adversarial Manipulation in Multi-Agent Systems

ICLR 2025 Withdrawn

Learning Visual Prompts for Guiding the Attention of Vision Transformers

ICLR 2025 Rejected

A Scalable Communication Protocol for Networks of Large Language Models

ICLR 2025 Withdrawn

FAIRMINDSIM: ALIGNMENT OF BEHAVIOR, EMOTION, AND BELIEF IN HUMANS AND LLM AGENTS AMID ETHICAL DILEMMAS

ICLR 2025 Rejected

Toward Robust Real-World Audio Deepfake Detection: Closing the Explainability Gap

ICLR 2025 Rejected

Questioning Simplicity Bias Assumptions

ICLR 2025 Withdrawn

Language Models' Internal Conflicts: Layer-wise Usable Information For Detecting Model (Un)answerability

ICLR 2025 Withdrawn

Collaborators (20)

Christian Schroeder de Witt