Mintong Kang
~Mintong_Kang1
11
Total papers
5.5
Submissions per year
Average rating
Accepted: 8/11
Venue distribution
ICLR
8
ICML
2
NeurIPS
1
Papers (11)
2025 (9 papers)
5
C-SafeGen: Certified Safe LLM Generation with Claim-Based Streaming Guardrails
NeurIPS 2025 Poster
3
$R^2$-Guard: Robust Reasoning Enabled LLM Guardrail via Knowledge-Enhanced Logical Reasoning
ICLR 2025 Spotlight
4
AdvWave: Stealthy Adversarial Jailbreak Attack against Large Audio-Language Models
ICLR 2025 Poster
4
FairGen: Controlling Fair Generations in Diffusion Models via Adaptive Latent Guidance
ICLR 2025 Rejected
4
ShieldAgent: Shielding Agents via Verifiable Safety Policy Reasoning
ICML 2025 Poster
5
AdvWeb: Controllable Black-box Attacks on VLM-powered Web Agents
ICLR 2025 Rejected
3
AdvAgent: Controllable Blackbox Red-teaming on Web Agents
ICML 2025 Poster
5
EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage
ICLR 2025 Poster
4
MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models
ICLR 2025 Poster