Vahab Mirrokni

~Vahab_Mirrokni2

31

论文总数

15.5

年均投稿

6.0

平均评分

接收情况23/31

会议分布

ICLR

12

NeurIPS

10

ICML

7

COLM

2

发表论文 (31 篇)

202521 篇

Improving the Variance of Differentially Private Randomized Experiments through Clustering

ICML 2025Poster

ClusterGen: Token Generation in Sublinear Time and Memory with Clustering KV Cache

ICLR 2025Rejected

Titans: Learning to Memorize at Test Time

NeurIPS 2025Poster

Maximum Coverage in Turnstile Streams with Applications to Fingerprinting Measures

ICLR 2025Rejected

Procurement Auctions via Approximately Optimal Submodular Optimization

ICML 2025Spotlight

Maximum Coverage in Turnstile Streams with Applications to Fingerprinting Measures

ICML 2025Poster

Efficient Data Selection at Scale via Influence Distillation

NeurIPS 2025Poster

Nested Learning: The Illusion of Deep Learning Architectures

NeurIPS 2025Poster

Self-Boost via Optimal Retraining: An Analysis via Approximate Message Passing

NeurIPS 2025Poster

Synthetic Text Generation for Training Large Language Models via Gradient Matching

ICML 2025Poster

TRELLIS: Learning to Compress Key-Value Memory in Attention Models

COLM 2025Poster

MS-SSM: A Multi-Scale State Space Model for Efficient Sequence Modeling

COLM 2025Poster

NanoMoE: Scaling Mixture of Experts to Individual Layers for Parameter-Efficient Deep Learning

ICLR 2025Rejected

PiKE: Adaptive Data Mixing for Large-Scale Multi-Task Learning Under Low Gradient Conflicts

NeurIPS 2025Spotlight

DiSK: Differentially Private Optimizer with Simplified Kalman Filter for Noise Reduction

ICLR 2025Poster

Retraining with Predicted Hard Labels Provably Increases Model Accuracy

ICLR 2025desk_rejected

Retraining with Predicted Hard Labels Provably Increases Model Accuracy

ICML 2025Poster

DeepCrossAttention: Supercharging Transformer Residual Connections

ICML 2025Poster

Addax: Utilizing Zeroth-Order Gradients to Improve Memory Efficiency and Performance of SGD for Fine-Tuning Language Models

ICLR 2025Poster

Best of Both Worlds: Advantages of Hybrid Graph Sequence Models

ICML 2025Poster

Towards an Understanding of Graph Sequence Models

ICLR 2025Rejected

202410 篇

PolySketchFormer: Fast Transformers via Sketches for Polynomial Kernels

ICLR 2024Rejected

Learning from Aggregate responses: Instance Level versus Bag Level Loss Functions

ICLR 2024Poster

Efficiency of the First-Price Auction in the Autobidding World

NeurIPS 2024Poster

Autobidder's Dilemma: Why More Sophisticated Autobidders Lead to Worse Auction Efficiency

NeurIPS 2024Poster

High-Dimensional Geometric Streaming for Nearly Low Rank Data

ICLR 2024Rejected

Sensitivity Sampling for Coreset-Based Data Selection

ICLR 2024Rejected

HyperAttention: Long-context Attention in Near-Linear Time

ICLR 2024Poster

MUVERA: Multi-Vector Retrieval via Fixed Dimensional Encoding

NeurIPS 2024Poster

SequentialAttention++ for Block Sparsification: Differentiable Pruning Meets Combinatorial Optimization

NeurIPS 2024Poster

Understanding Transformer Reasoning Capabilities via Graph Algorithms

NeurIPS 2024Poster

合作者 (20)

Peilin Zhong11 篇

Ali Behrouz6 篇

Adel Javanmard6 篇

David Woodruff5 篇

Alessandro Epasto5 篇

Meisam Razaviyayn5 篇