Tianyi Zhang
~Tianyi_Zhang6
8
Total papers
4.0
Submissions per year
Average rating
Accepted: 6/8
Venue distribution
NeurIPS
4
ICLR
3
ICML
1
Papers (8)
2025: 5 papers
4
SpaLLM: Unified Compressive Adaptation of Large Language Models with Sketching
ICLR 2025 (Rejected)
6
LeanQuant: Accurate and Scalable Large Language Model Quantization with Loss-error-aware Grid
ICLR 2025 (Poster)
4
70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float (DFloat11)
NeurIPS 2025 (Poster)
4
Sketch to Adapt: Fine-Tunable Sketches for Efficient LLM Adaptation
ICML 2025 (Poster)
4
Breaking the Frozen Subspace: Importance Sampling for Low-Rank Optimization in LLM Pretraining
NeurIPS 2025 (Poster)
2024: 3 papers
6
HashOrder: Accelerating Graph Processing Through Hashing-based Reordering
ICLR 2024 (Rejected)
5
KV Cache is 1 Bit Per Channel: Efficient Large Language Model Inference with Coupled Quantization
NeurIPS 2024 (Poster)
4
NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free Attention
NeurIPS 2024 (Poster)