Dian Yu
~Dian_Yu3
10
论文总数
5.0
年均投稿
平均评分
接收情况6/10
会议分布
ICLR
5
NeurIPS
3
ICML
2
发表论文 (10 篇)
20257 篇
3
SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large Language Models
ICLR 2025withdrawn
4
Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning
ICLR 2025Oral
4
Improving LLM General Preference Alignment via Optimistic Online Mirror Descent
NeurIPS 2025Spotlight
4
DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search
ICLR 2025Poster
4
Do NOT Think That Much for 2+3=? On the Overthinking of Long Reasoning Models
ICML 2025Poster
4
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs
ICML 2025Rejected
4
Thoughts Are All Over the Place: On the Underthinking of Long Reasoning Models
NeurIPS 2025Spotlight