Zhihao Jia
~Zhihao_Jia2
11
论文总数
5.5
年均投稿
平均评分
接收情况7/11
会议分布
ICLR
6
NeurIPS
5
发表论文 (11 篇)
20255 篇
4
SuffixDecoding: Extreme Speculative Decoding for Emerging AI Applications
NeurIPS 2025Spotlight
4
CAT Pruning: Cluster-Aware Token Pruning For Text-to-Image Diffusion Models
ICLR 2025Rejected
4
SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning
NeurIPS 2025Poster
4
TidalDecode: Fast and Accurate LLM Decoding with Position Persistent Sparse Attention
ICLR 2025Poster
5
MagicPIG: LSH Sampling for Efficient LLM Generation
ICLR 2025Spotlight
20246 篇
4
Communication Bounds for the Distributed Experts Problem
NeurIPS 2024Poster
4
Communication Bounds for the Distributed Experts Problem
ICLR 2024Rejected
3
FedOD: Federated Outlier Detection via Neural Approximation
ICLR 2024Rejected
4
SpecExec: Massively Parallel Speculative Decoding For Interactive LLM Inference on Consumer Devices
NeurIPS 2024Poster
4
Sequoia: Scalable and Robust Speculative Decoding
NeurIPS 2024Spotlight
4
Accelerating Retrieval-augmented Language Model Serving with Speculation
ICLR 2024Rejected