Rulin Shao
~Rulin_Shao1
7
论文总数
3.5
年均投稿
平均评分
接收情况4/7
会议分布
ICLR
3
COLM
2
NeurIPS
2
发表论文 (7 篇)
20253 篇
20244 篇
6
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore
NeurIPS 2024Poster
4
DISTFLASHATTN: Distributed Memory-efficient Attention for Long-context LLMs Training
COLM 2024Poster
4
LightSeq: Sequence Level Parallelism for Distributed Training of Long Context Transformers
ICLR 2024Rejected
3
Language models scale reliably with over-training and on downstream tasks
NeurIPS 2024Rejected