Impact Index

70.78/100

Top 2.5%

Site-wide rank: #1,606

Papers published: 11

Average rating: 5.9

Average output: 3.7 papers/year

Joel Hestness

Research Scientist @ Cerebras Systems, Inc · United States · OpenReview

Research Areas

Machine/Deep Learning · Computer Architecture

Scaling with Collapse: Efficient and Predictable Training of LLM Families

ICLR 2026 Poster

Predicting Training Re-evaluation Curves Enables Effective Data Curriculums for LLMs

ICLR 2026 Poster

GQA-$\mu$P: The Maximal Parameterization Update for Grouped Query Attention and Fully Sharded Data Parallel

ICLR 2026 Rejected

Don't be lazy: CompleteP enables compute-efficient deep transformers

NeurIPS 2025 Poster

Power Lines: Scaling laws for weight decay and batch size in LLM pre-training

NeurIPS 2025 Poster

Straight to Zero: Why Linearly Decaying the Learning Rate to Zero Works Best for LLMs

ICLR 2025 Poster

BLIMEY: Towards Better Routing Methods in Sparse Mixture of Experts

ICLR 2025 Withdrawn

Collaborators (20)

Nolan Simran Dey

Bin Claire Zhang

Shaheer Muhammad