Shenao Zhang
~Shenao_Zhang1
10
论文总数
5.0
年均投稿
平均评分
接收情况3/10
会议分布
ICLR
7
ICML
2
NeurIPS
1
发表论文 (10 篇)
20257 篇
4
How Can LLM Guide RL? A Value-Based Approach
ICLR 2025withdrawn
-
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment
ICLR 2025withdrawn
5
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
ICLR 2025Rejected
4
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
ICML 2025Poster
-
Hindsight Planner: A Closed-loop few-shot planner for Embodied Instruction Following
ICLR 2025withdrawn
3
BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning
ICML 2025Poster
4
Provably Efficient and Practical Self-Play for Better LLM Alignment
ICLR 2025Rejected
20243 篇
4
Asking Before Acting: Gather Information in Embodied Decision-Making with Language Models
ICLR 2024Rejected
3
Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer
NeurIPS 2024Poster
4
Reason for Future, Act for Now: A Principled Architecture for Autonomous LLM Agents
ICLR 2024Rejected