Junyuan Hong
~Junyuan_Hong1
8
Total papers
4.0
Submissions per year
Average rating
Accepted: 6/8
Venue distribution
ICLR
4
COLM
3
ICML
1
Papers (8)
2025 (5 papers)
4
SEAL: Steerable Reasoning Calibration of Large Language Models for Free
COLM 2025 Poster
4
GuardAgent: Safeguard LLM Agents via Knowledge-Enabled Reasoning
ICML 2025 Poster
4
GuardAgent: Safeguard LLM Agent by a Guard Agent via Knowledge-Enabled Reasoning
ICLR 2025 Rejected
4
LoX: Low-Rank Extrapolation Robustifies LLM Safety Against Fine-tuning
COLM 2025 Poster
4
More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment
COLM 2025 Poster