Tomasz Korbak
~Tomasz_Korbak1
Total papers: 6
Submissions per year: 3.0
Average rating
Accepted: 5/6
Venue distribution
ICLR: 4
COLM: 1
NeurIPS: 1
Published papers (6)
2024: 5 papers
4
Compositional Preference Models for Aligning LMs
ICLR 2024 Poster
4
Towards Understanding Sycophancy in Language Models
ICLR 2024 Poster
4
Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data
COLM 2024 Poster
4
The Reversal Curse: LLMs trained on “A is B” fail to learn “B is A”
ICLR 2024 Poster
5
Many-shot Jailbreaking
NeurIPS 2024 Poster