Shizhe Diao
~Shizhe_Diao2
13
论文总数
6.5
年均投稿
平均评分
接收情况8/13
会议分布
ICLR
6
NeurIPS
5
ICML
2
发表论文 (13 篇)
202511 篇
4
$\textbf{PLUM}$: Improving Code LMs Using On-Policy Preference Learning Powered by Automatic Test Cases
ICLR 2025withdrawn
4
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
NeurIPS 2025Poster
4
Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models
NeurIPS 2025Poster
4
Hymba: A Hybrid-head Architecture for Small Language Models
ICLR 2025Spotlight
4
Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models
ICLR 2025withdrawn
3
MA-LoT: Model-Collaboration Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem Proving
ICML 2025Poster
4
UGPhysics: A Comprehensive Benchmark for Undergraduate Physics Reasoning with Large Language Models
ICML 2025Poster
4
LongMamba: Enhancing Mamba's Long-Context Capabilities via Training-Free Receptive Field Enlargement
ICLR 2025Poster
4
GPAS: Accelerating Convergence of LLM Pretraining via Gradient-Preserving Activation Scaling
NeurIPS 2025Poster
3
Double-Checker: Enhancing Reasoning of Slow-Thinking LLMs via Self-Critical Fine-Tuning
NeurIPS 2025Rejected
3
LLM Pruning and Distillation in Practice
ICLR 2025Rejected