Paper
Hub
搜索
Toggle language
Jonah Wonkyu Yi
~Jonah_Wonkyu_Yi1
2
论文总数
2.0
年均投稿
6.1
平均评分
接收情况
2
/
2
会议分布
NeurIPS
2
发表论文 (2 篇)
2024
2 篇
6.0
5
KV Cache is 1 Bit Per Channel: Efficient Large Language Model Inference with Coupled Quantization
NeurIPS 2024
Poster
6.3
4
NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free Attention
NeurIPS 2024
Poster
合作者 (4)
AS
Anshumali Shrivastava
2 篇
TZ
Tianyi Zhang
2 篇
ZX
Zhaozhuo Xu
2 篇
BY
Bowen Yao
1 篇