Impact Index

88.87/100

Top 0.7%

Site-wide rank #424

Papers published: 38

Average rating: 5.1

Average output: 12.7 papers/year

Yixuan Li

Associate Professor @ University of Wisconsin - Madison · United States · OpenReview

Research areas

deep learning · out-of-distribution detection · AI safety

Detecting Distillation Data from Reasoning Models

ICLR 2026 · Rejected

Perturbations Matter: Sensitivity-Guided Hallucination Detection in LLMs

ICLR 2026 · Rejected

Combating Data Laundering in LLM Training

ICLR 2026 · Rejected

Measuring and Mitigating Identity Bias in Multi-Agent Debate via Anonymization

ICLR 2026 · Withdrawn

GeoArena: An Open Platform for Benchmarking Large Vision-language Models on WorldWide Image Geolocalization

ICLR 2026 · Withdrawn

Cognition-of-Thought Elicits Social-Aligned Reasoning in Large Language Models

ICLR 2026 · Desk Rejected

OOD Learner via In-Context Learning

ICLR 2026 · Rejected

Unsupervised Domain Adaptation with an Unobservable Source Subpopulation

ICLR 2026 · Withdrawn

Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning

ICLR 2026 · Withdrawn

Do Vision Language Models Rotate in Mind? Evaluating Spatial Transformation Reasoning

ICLR 2026 · Withdrawn

Unlocking the Pre-Trained Model as a Dual-Alignment Calibrator for Post-Trained LLMs

ICLR 2026 · Withdrawn

Your Weak LLM is Secretly a Strong Teacher for Alignment

ICLR 2025 · Poster

How Contaminated Is Your Benchmark? Measuring Dataset Leakage in Large Language Models with Kernel Divergence

ICML 2025 · Poster

DaWin: Training-free Dynamic Weight Interpolation for Robust Adaptation

ICLR 2025 · Poster

Process Reward Model with Q-value Rankings

ICLR 2025 · Poster

Steer LLM Latents for Hallucination Detection

ICML 2025 · Poster

Unsolvable Problem Detection: Evaluating Trustworthiness of Large Multimodal Models

ICLR 2025 · Rejected

Understanding and Mitigating Miscalibration in Prompt Tuning for Vision-Language Models

ICML 2025 · Poster

CONDA: Adaptive Concept Bottleneck for Foundation Models Under Distribution Shifts

ICLR 2025 · Poster

Understanding Multimodal LLMs Under Distribution Shifts: An Information-Theoretic Approach

ICML 2025 · Poster

Understanding and Mitigating Miscalibration in Prompt Tuning for Vision-Language Models

ICLR 2025 · Rejected

On the Generalization of Preference Learning with DPO

ICLR 2025 · Rejected

CodeLutra: Boosting LLM Code Generation via Preference-Guided Refinement

ICLR 2025 · Withdrawn

VLMGuard: Defending VLMs against Malicious Prompts via Unlabeled Data

ICLR 2025 · Withdrawn

Understanding Generalization of Preference Optimization Under Noisy Feedback

ICLR 2025 · Rejected

How Reliable Is Human Feedback For Aligning Large Language Models?

ICLR 2025 · Withdrawn

Collaborators (20)

Hyeong Kyu Choi