影响力指数

72.63/100

前 2.2%

全站排名 #1,434

发表论文14 篇

平均评分5.6

年均产出4.7 篇/年

Zhiwei He

PhD student@Shanghai Jiao Tong University·中国·OpenReview

研究方向

large language model · natural language processing · machine translation

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

ICLR 2026Poster

RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents

ICLR 2026Poster

DeepCompress: A Dual Reward Strategy for Dynamically Exploring and Compressing Reasoning Chains

ICLR 2026Poster

DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning

ICLR 2026Rejected

Thoughts Are All Over the Place: On the Underthinking of Long Reasoning Models

NeurIPS 2025Spotlight

Weak-to-Strong Preference Optimization: Stealing Reward from Weak Aligned Model

ICLR 2025Spotlight

RaSA: Rank-Sharing Low-Rank Adaptation

ICLR 2025Poster

Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards

NeurIPS 2025Poster

Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training

NeurIPS 2025Poster

The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models

NeurIPS 2025Poster

Do NOT Think That Much for 2+3=? On the Overthinking of Long Reasoning Models

ICML 2025Poster

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

ICML 2025Rejected

合作者 (20)

合作者12 篇

合作者11 篇

博士导师9 篇