影响力指数

52.26/100

前 7.4%

全站排名 #4,791

发表论文12 篇

平均评分5.1

年均产出4.0 篇/年

Shenao Zhang

PhD student@Northwestern University·美国·OpenReview

Beyond Markovian: Reflective Exploration via Bayes-Adaptive RL for LLM Reasoning

ICLR 2026Poster

Learning to Reason as Action Abstractions with Scalable Mid-Training RL

ICLR 2026Poster

Reward-Augmented Data Enhances Direct Preference Alignment of LLMs

ICML 2025Poster

BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning

ICML 2025Poster

Reward-Augmented Data Enhances Direct Preference Alignment of LLMs

ICLR 2025Rejected

How Can LLM Guide RL? A Value-Based Approach

ICLR 2025Withdrawn

Provably Efficient and Practical Self-Play for Better LLM Alignment

ICLR 2025Rejected

Self-Exploring Language Models: Active Preference Elicitation for Online Alignment

ICLR 2025Withdrawn

Hindsight Planner: A Closed-loop few-shot planner for Embodied Instruction Following

ICLR 2025Withdrawn

合作者 (20)