PaperHub

Dong Yu

~Dong_Yu2

32
论文总数
16.0
年均投稿
5.5
平均评分
接收情况16/32
会议分布
ICLR
21
NeurIPS
9
ICML
2

发表论文 (32 篇)

202525

4.8
4

HDFlow: Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows

ICLR 2025Rejected
4.2
5

Controllable Text-to-Speech Synthesis with Masked-Autoencoded Style Representation

ICLR 2025withdrawn
5.8
5

ParallelSpec: Parallel Drafter for Efficient Speculative Decoding

ICLR 2025Rejected
4.0
4

MultiMedia-Agent: A Multimodal Agent for Multimedia Content Generation

ICLR 2025withdrawn
6.4
4

MPS-Prover: Advancing Stepwise Theorem Proving by Multi-Perspective Search and Data Curation

NeurIPS 2025Poster
6.3
4

DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search

ICLR 2025Poster
6.3
4

LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory

ICLR 2025Poster
4.3
3

SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large Language Models

ICLR 2025withdrawn
5.2
5

DeFine: Enhancing LLM Decision-Making with Factor Profiles and Analogical Reasoning

ICLR 2025Rejected
3.7
3

Video-to-Audio generation with Hidden Alignment

ICLR 2025withdrawn
6.8
4

Improving LLM General Preference Alignment via Optimistic Online Mirror Descent

NeurIPS 2025Spotlight
6.4
4

UniGist: Towards General and Hardware-aligned Sequence-level Long Context Compression

NeurIPS 2025Poster
6.0
4

Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning

ICLR 2025Oral
6.8
4

Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards

NeurIPS 2025Poster
6.8
5

DSBench: How Far Are Data Science Agents from Becoming Data Science Experts?

ICLR 2025Poster
5.0
4

SePPO: Semi-Policy Preference Optimization for Diffusion Alignment

ICLR 2025withdrawn
6.3
4

DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects

ICLR 2025Rejected
4.3
3

Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks

ICLR 2025withdrawn
6.2
5

RepoGraph: Enhancing AI Software Engineering with Repository-level Code Graph

ICLR 2025Poster
7.8
4

LeVo: High-Quality Song Generation with Multi-Preference Alignment

NeurIPS 2025Poster
6.0
4

The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models

NeurIPS 2025Poster
4.4
4

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

ICML 2025Rejected
7.3
4

Thoughts Are All Over the Place: On the Underthinking of Long Reasoning Models

NeurIPS 2025Spotlight
4.9
4

Do NOT Think That Much for 2+3=? On the Overthinking of Long Reasoning Models

ICML 2025Poster
6.8
4

Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training

NeurIPS 2025Poster