Jing Yao
~Jing_Yao4
4
论文总数
4.0
年均投稿
平均评分
接收情况1/4
会议分布
ICLR
3
NeurIPS
1
发表论文 (4 篇)
20254 篇
4
Counterfactual Reasoning for Steerable Pluralistic Value Alignment of Large Language Models
NeurIPS 2025Poster
3
Why Do You Answer Like That? Psychological Analysis on Underlying Connections between LLM's Values and Safety Risks
ICLR 2025Rejected
4
Elephant in the Room: Unveiling the Pitfalls of Human Proxies in Alignment
ICLR 2025Rejected
4
One Language, Many Gaps: Evaluating Dialect Fairness and Robustness of Large Language Models in Reasoning Tasks
ICLR 2025withdrawn