PaperHub

暂无评分数据

ICLR 2025

Learning through Conditioning on Natural Language Feedback

OpenReviewPDF
提交: 2024-09-28更新: 2024-10-04
TL;DR

We explore whether we can finetune language models by letting them generate answers conditioned on prior feedback.

摘要

关键词
Social LearningNatural Language FeedbackInstructive Learning

评审与讨论

撤稿通知

In retrospect rushed and not ready for review, don't want to waste reviewers time