Zhenyu Zhang
~Zhenyu_Zhang4
11
论文总数
5.5
年均投稿
平均评分
接收情况7/11
会议分布
ICLR
7
ICML
2
NeurIPS
1
COLM
1
发表论文 (11 篇)
20256 篇
4
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients
ICLR 2025withdrawn
4
R-Sparse: Rank-Aware Activation Sparsity for Efficient LLM Inference
ICLR 2025Poster
4
SEAL: Steerable Reasoning Calibration of Large Language Models for Free
COLM 2025Poster
4
On-the-Fly Adaptive Distillation of Transformer to Dual-State Linear Attention for Long-Context LLM Serving
ICML 2025Poster
4
From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients
ICLR 2025Rejected
3
Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More
ICML 2025Poster
20245 篇
3
Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding
NeurIPS 2024Poster
3
Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy
ICLR 2024Spotlight
5
Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity
ICLR 2024Rejected
4
JoMA: Demystifying Multilayer Transformers via Joint Dynamics of MLP and Attention
ICLR 2024Poster
3
Sparse Cocktail: Every Sparse Pattern Every Sparse Ratio All At Once
ICLR 2024Rejected