Paper
Hub
搜索
Toggle language
Sebastian Jaszczur
~Sebastian_Jaszczur1
4
论文总数
2.0
年均投稿
4.8
平均评分
接收情况
2
/
4
会议分布
ICLR
2
NeurIPS
1
ICML
1
发表论文 (4 篇)
2025
2 篇
2.5
4
Different Rates for Different Weights: Decoupled Relative Learning Rate Schedules
ICLR 2025
withdrawn
6.6
4
Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient
ICML 2025
Poster
2024
2 篇
4.0
3
Structured Packing in LLM Training Improves Long Context Utilization
ICLR 2024
Rejected
6.0
3
Mixture of Tokens: Continuous MoE through Cross-Example Aggregation
NeurIPS 2024
Poster
合作者 (17)
JK
Jakub Krajewski
3 篇
JL
Jan Ludziejewski
3 篇
MP
Maciej Pióro
3 篇
MC
Marek Cygan
3 篇
MK
Michał Krutul
3 篇
KC
Kamil Ciebiera
2 篇
JM
Jan Małaśnicki
2 篇
KA
Kamil Adamczewski
2 篇
查看全部 17 位合作者