Impact Index

90.77/100

Top 0.5%

Site-wide rank: #336

Papers published: 25

Average rating: 6.1

Average annual output: 8.3 papers/year

Kaifeng Lyu

Assistant Professor @ Tsinghua University · China · OpenReview

Research Areas

deep learning theory

Can Small Training Runs Reliably Guide Data Curation? Rethinking Proxy-Model Practice

ICLR 2026 Poster

Larger Datasets Can Be Repeated More: A Theoretical Analysis of Multi-Epoch Scaling in Linear Regression

ICLR 2026 Poster

How Learning Rate Decay Wastes Your Best Data in Curriculum-Based LLM Pretraining

The Information Bottleneck of Chain-of-Thought and How Latent CoT Overcomes It

ICLR 2026 Rejected

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

ICLR 2026 Rejected

When Bias Pretends to Be Truth: How Spurious Correlations Undermine Hallucination Detection in LLMs

ICLR 2026 Rejected

Fine-tuning VLMs Without Forgetting Is Easier Than You Think

ICLR 2026 Rejected

Safety Alignment Should be Made More Than Just a Few Tokens Deep

Adam Reduces a Unique Form of Sharpness: Theoretical Insights Near the Minimizer Manifold

NeurIPS 2025 Poster

Data Mixing Can Induce Phase Transitions in Knowledge Acquisition

NeurIPS 2025 Spotlight

Feature Averaging: An Implicit Bias of Gradient Descent Leading to Non-Robustness in Neural Networks

ICLR 2025 Poster

Efficient Stagewise Pretraining via Progressive Subnetworks

ICLR 2025 Poster

Weak-to-Strong Generalization Even in Random Feature Networks, Provably

ICML 2025 Poster

Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias

ICLR 2025 Poster

A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules

ICLR 2025 Poster

How Far Are We from Optimal Reasoning Efficiency?

NeurIPS 2025 Poster

RNNs are not Transformers (Yet): The Key Bottleneck on In-Context Retrieval

ICLR 2025 Poster

Shift is Good: Mismatched Data Mixing Improves Test Performance

NeurIPS 2025 Rejected

AI-Assisted Generation of Difficult Math Questions

ICLR 2025 Rejected

Collaborators (20)

PhD advisor · 4 papers