Lilian Weng
~Lilian_Weng1
Total papers: 5
Average submissions per year: 2.5
Average rating
Accepted: 3/5
Venue distribution
ICLR: 4
NeurIPS: 1
Published papers (5)
2025: 4 papers
4
Diverse and Effective Red Teaming with Auto-generated Rewards and Multi-step Reinforcement Learning
ICLR 2025 (Rejected)
7
The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions
ICLR 2025 (Rejected)
4
First-Person Fairness in Chatbots
ICLR 2025 (Spotlight)
4
MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering
ICLR 2025 (Oral)