Yixuan Li
~Yixuan_Li1
27
论文总数
13.5
年均投稿
平均评分
接收情况16/27
会议分布
ICLR
19
NeurIPS
4
ICML
4
发表论文 (27 篇)
202515 篇
4
Understanding Generalization of Preference Optimization Under Noisy Feedback
ICLR 2025Rejected
4
Understanding and Mitigating Miscalibration in Prompt Tuning for Vision-Language Models
ICML 2025Poster
4
Understanding and Mitigating Miscalibration in Prompt Tuning for Vision-Language Models
ICLR 2025Rejected
5
Process Reward Model with Q-value Rankings
ICLR 2025Poster
5
Your Weak LLM is Secretly a Strong Teacher for Alignment
ICLR 2025Poster
4
On the Generalization of Preference Learning with DPO
ICLR 2025Rejected
4
DaWin: Training-free Dynamic Weight Interpolation for Robust Adaptation
ICLR 2025Poster
4
CONDA: Adaptive Concept Bottleneck for Foundation Models Under Distribution Shifts
ICLR 2025Poster
4
How Contaminated Is Your Benchmark? Measuring Dataset Leakage in Large Language Models with Kernel Divergence
ICML 2025Poster
4
How Reliable Is Human Feedback For Aligning Large Language Models?
ICLR 2025withdrawn
4
Understanding Multimodal LLMs Under Distribution Shifts: An Information-Theoretic Approach
ICML 2025Poster
3
Steer LLM Latents for Hallucination Detection
ICML 2025Poster
5
CodeLutra: Boosting LLM Code Generation via Preference-Guided Refinement
ICLR 2025withdrawn
4
VLMGuard: Defending VLMs against Malicious Prompts via Unlabeled Data
ICLR 2025withdrawn
3
Unsolvable Problem Detection: Evaluating Trustworthiness of Large Multimodal Models
ICLR 2025Rejected
202412 篇
4
Bridging OOD Detection and Generalization: A Graph-Theoretic View
NeurIPS 2024Poster
4
Out-of-Distribution Detection with Hyperspherical Energy
ICLR 2024Rejected
3
A Graph-Theoretic Framework for Joint OOD Generalization and Detection
ICLR 2024Rejected
3
How Does RLHF Shift Behavior Distributions? Distinguishability and Steerability
ICLR 2024Rejected
4
HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection
NeurIPS 2024Spotlight
4
SODA: Stream Out-of-Distribution Adaptation
ICLR 2024Rejected
4
ARGS: Alignment as Reward-Guided Search
ICLR 2024Poster
4
How Does Unlabeled Data Provably Help Out-of-Distribution Detection?
ICLR 2024Poster
4
ConjNorm: Tractable Density Estimation for Out-of-Distribution Detection
ICLR 2024Poster
3
HYPO: Hyperspherical Out-Of-Distribution Generalization
ICLR 2024Poster
4
Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models
NeurIPS 2024Poster
4
BackdoorAlign: Mitigating Fine-tuning based Jailbreak Attack with Backdoor Enhanced Safety Alignment
NeurIPS 2024Poster