Haitao Mi
~Haitao_Mi1
15
论文总数
7.5
年均投稿
平均评分
接收情况12/15
会议分布
NeurIPS
8
ICLR
5
ICML
2
发表论文 (15 篇)
202513 篇
4
HDFlow: Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows
ICLR 2025Rejected
4
DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search
ICLR 2025Poster
4
MPS-Prover: Advancing Stepwise Theorem Proving by Multi-Perspective Search and Data Curation
NeurIPS 2025Poster
4
Improving LLM General Preference Alignment via Optimistic Online Mirror Descent
NeurIPS 2025Spotlight
3
SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large Language Models
ICLR 2025withdrawn
4
UniGist: Towards General and Hardware-aligned Sequence-level Long Context Compression
NeurIPS 2025Poster
4
Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning
ICLR 2025Oral
4
Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards
NeurIPS 2025Poster
4
The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models
NeurIPS 2025Poster
4
Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training
NeurIPS 2025Poster
4
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs
ICML 2025Rejected
4
Do NOT Think That Much for 2+3=? On the Overthinking of Long Reasoning Models
ICML 2025Poster
4
Thoughts Are All Over the Place: On the Underthinking of Long Reasoning Models
NeurIPS 2025Spotlight