Wenxuan Wang
~Wenxuan_Wang2
14
论文总数
7.0
年均投稿
平均评分
接收情况8/14
会议分布
ICLR
9
NeurIPS
4
ICML
1
发表论文 (14 篇)
20259 篇
3
Chain-of-Jailbreak Attack for Image Generation Models via Editing Step by Step
ICLR 2025Rejected
3
Insight Over Sight? Exploring the Vision-Knowledge Conflicts in Multimodal LLMs
ICLR 2025withdrawn
5
Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training
ICLR 2025withdrawn
4
Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards
NeurIPS 2025Poster
4
Competing Large Language Models in Multi-Agent Gaming Environments
ICLR 2025Poster
5
On the Resilience of Multi-Agent Systems with Malicious Agents
ICLR 2025Rejected
4
On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty Agents
ICML 2025Poster
4
Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training
NeurIPS 2025Poster
4
Time-R1: Post-Training Large Vision Language Model for Temporal Video Grounding
NeurIPS 2025Poster
20245 篇
4
All Languages Matter: On the Multilingual Safety of Large Language Models
ICLR 2024Rejected
3
On the Humanity of Conversational AI: Evaluating the Psychological Portrayal of LLMs
ICLR 2024Oral
4
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
ICLR 2024Poster
3
Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models
ICLR 2024withdrawn
4
Apathetic or Empathetic? Evaluating LLMs' Emotional Alignments with Humans
NeurIPS 2024Poster