Zhihan Liu
~Zhihan_Liu1
11
Total papers
5.5
Submissions per year
Average rating
Accepted: 4/11
Venue distribution
ICLR
8
ICML
2
NeurIPS
1
Papers (11)
2025: 8 papers
5
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
ICLR 2025, Rejected
4
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
ICML 2025, Poster
4
Progressive LLM Alignments Using Two-Player Games
ICLR 2025, Rejected
-
Hindsight Planner: A Closed-Loop Few-Shot Planner for Embodied Instruction Following
ICLR 2025, Withdrawn
4
Provably Efficient and Practical Self-Play for Better LLM Alignment
ICLR 2025, Rejected
4
How Can LLM Guide RL? A Value-Based Approach
ICLR 2025, Withdrawn
-
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment
ICLR 2025, Withdrawn
3
BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning
ICML 2025, Poster