Impact Index

91.62/100

Top 0.5%

Site-wide rank: #298

Papers published: 28

Average rating: 5.8

Average output: 9.3 papers/year

Hao Peng

Assistant Professor @ University of Illinois Urbana-Champaign · United States · OpenReview

Research areas

Natural Language Processing

Executable Counterfactuals: Improving LLMs' Causal Reasoning Through Code

ICLR 2026 Poster

mCLM: A Modular Chemical Language Model that Generates Functional and Makeable Molecules

From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones

ICLR 2026 Poster

Generalization of RLVR Using Causal Reasoning as a Testbed

ICLR 2026 Poster

Process Reinforcement through Implicit Rewards

ICLR 2026 Rejected

Process Reward Models That Think

ICLR 2026 Withdrawn

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

ICLR 2026 Withdrawn

The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning

NeurIPS 2025 Poster

The Best Instruction-Tuning Data are Those That Fit

NeurIPS 2025 Spotlight

Retrieval Head Mechanistically Explains Long-Context Factuality

Reinforcement Learning Finetunes Small Subnetworks in Large Language Models

NeurIPS 2025 Poster

OpenHands: An Open Platform for AI Software Developers as Generalist Agents

ICLR 2025 Poster

Eliminating Position Bias of Language Models: A Mechanistic Approach

ICLR 2025 Poster

Advancing LLM Reasoning Generalists with Preference Trees

ICLR 2025 Poster

A Little Goes a Long Way: Efficient Long Context Training and Inference with Partial Contexts

ICLR 2025 Poster

Scaling Diffusion Language Models via Adaptation from Autoregressive Models

ICLR 2025 Poster

$PLUM$: Improving Code LMs Using On-Policy Preference Learning Powered by Automatic Test Cases

ICLR 2025 Withdrawn

Free Process Rewards without Process Labels

ICML 2025 Poster

Improving Influence-based Instruction Tuning Data Selection for Balanced Learning of Diverse Capabilities

ICLR 2025 Withdrawn

S2-Attention: Hardware-Aware Context Sharding Among Attention Heads

ICLR 2025 Rejected

FactCheckmate: Preemptively Detecting and Mitigating Hallucinations in LMs

ICLR 2025 Withdrawn

Collaborators (20)