Songlin Yang
~Songlin_Yang1
11
Total papers
5.5
Submissions per year
Average rating
Accepted: 9/11
Venue distribution
NeurIPS
6
ICLR
3
COLM
2
Papers (11)
2025 (8 papers)
4
PaTH Attention: Position Encoding via Accumulating Householder Transformations
NeurIPS 2025 Poster
4
Gated Delta Networks: Improving Mamba2 with Delta Rule
ICLR 2025 Poster
4
Scaling Stick-Breaking Attention: An Efficient Implementation and In-depth Study
ICLR 2025 Poster
4
A Controlled Study on Long Context Extension and Generalization in LLMs
COLM 2025 Poster
4
A Controlled Study on Long Context Extension and Generalization in LLMs
ICLR 2025 Rejected
4
Test-Time Training Done Right
NeurIPS 2025 Rejected
4
Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free
NeurIPS 2025 Oral
4
Radial Attention: $\mathcal O(n \log n)$ Sparse Attention for Long Video Generation
NeurIPS 2025 Poster