Xuandong Zhao
~Xuandong_Zhao1
17
论文总数
8.5
年均投稿
平均评分
接收情况14/17
会议分布
ICLR
8
NeurIPS
3
ICML
3
COLM
3
发表论文 (17 篇)
202513 篇
5
Weak-to-Strong Jailbreaking on Large Language Models
ICLR 2025Rejected
4
Efficiently Identifying Watermarked Segments in Mixed-Source Texts
ICLR 2025withdrawn
5
Permute-and-Flip: An optimally stable and watermarkable decoder for LLMs
ICLR 2025Poster
3
Weak-to-Strong Jailbreaking on Large Language Models
ICML 2025Poster
4
Improving LLM Safety Alignment with Dual-Objective Optimization
ICML 2025Poster
5
Scalable Best-of-N Selection for Large Language Models via Self-Certainty
NeurIPS 2025Poster
4
DIS-CO: Discovering Copyrighted Content in VLMs Training Data
ICML 2025Poster
4
An Undetectable Watermark for Generative Image Models
ICLR 2025Poster
5
Multimodal Situational Safety
ICLR 2025Poster
4
LeakAgent: RL-based Red-teaming Agent for LLM Privacy Leakage
COLM 2025Poster
4
Assessing Judging Bias in Large Reasoning Models: An Empirical Study
COLM 2025Poster
5
ClinicalLab: Aligning Agents for Multi-Departmental Clinical Diagnostics in the Real World
ICLR 2025withdrawn
4
MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models
ICLR 2025Poster
20244 篇
3
Invisible Image Watermarks Are Provably Removable Using Generative AI
NeurIPS 2024Poster
4
Provable Robust Watermarking for AI-Generated Text
ICLR 2024Poster
4
Bileve: Securing Text Provenance in Large Language Models Against Spoofing with Bi-level Signature
NeurIPS 2024Poster
4
Mapping the Increasing Use of LLMs in Scientific Papers
COLM 2024Poster