Zhiwei He
~Zhiwei_He1
10
论文总数
5.0
年均投稿
平均评分
接收情况7/10
会议分布
ICLR
4
NeurIPS
4
ICML
2
发表论文 (10 篇)
20258 篇
4
RaSA: Rank-Sharing Low-Rank Adaptation
ICLR 2025Poster
4
Weak-to-Strong Preference Optimization: Stealing Reward from Weak Aligned Model
ICLR 2025Spotlight
4
Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards
NeurIPS 2025Poster
4
Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training
NeurIPS 2025Poster
4
Do NOT Think That Much for 2+3=? On the Overthinking of Long Reasoning Models
ICML 2025Poster
4
The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models
NeurIPS 2025Poster
4
Thoughts Are All Over the Place: On the Underthinking of Long Reasoning Models
NeurIPS 2025Spotlight
4
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs
ICML 2025Rejected