Yichi Zhang
~Yichi_Zhang4
6
论文总数
3.0
年均投稿
平均评分
接收情况3/6
会议分布
ICLR
4
ICML
1
NeurIPS
1
发表论文 (6 篇)
20254 篇
4
STAIR: Improving Safety Alignment with Introspective Reasoning
ICML 2025Oral
4
Towards Mitigating Factual Hallucination in LLMs through Self-Alignment with Memory
ICLR 2025withdrawn
4
Mitigating Overthinking in Large Reasoning Models via Manifold Steering
NeurIPS 2025Poster
4
Adaptive Strategy Evolution for Generating Tailored Jailbreak Prompts against Black-Box Safety-Aligned LLMs
ICLR 2025Rejected