PaperHub

Ion Stoica

~Ion_Stoica1

40
Total papers
20.0
Average submissions per year
6.2
Average rating
Accepted: 30/40
Venue distribution
ICLR
18
NeurIPS
10
ICML
9
COLM
3

Published papers (40)

2025 (27)

6.6
4

OR-Bench: An Over-Refusal Benchmark for Large Language Models

ICML 2025 Poster
5.0
8

OR-Bench: An Over-Refusal Benchmark for Large Language Models

ICLR 2025 Rejected
6.3
3

MPC-Minimized Secure LLM Inference

ICLR 2025 Rejected
5.7
3

A Statistical Framework for Ranking LLM-based Chatbots

ICLR 2025 Poster
4.8
4

Post-Training Sparse Attention with Double Sparsity

ICLR 2025 Rejected
3.5
4

Test-Time RAG: Enhancing Long Context Understanding in LLMs with Retrieval-Augmented Mechanisms

ICLR 2025 Rejected
6.6
4

Fast Video Generation with Sliding Tile Attention

ICML 2025 Poster
6.5
4

GameArena: Evaluating LLM Reasoning through Live Computer Games

ICLR 2025 Poster
5.8
4

Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile

ICLR 2025 Rejected
7.8
4

HashAttention: Semantic Sparsity for Faster Inference

ICML 2025 Poster
7.0
3

Faster Video Diffusion with Trainable Sparse Attention

NeurIPS 2025 Poster
3.8
4

The Berkeley Function Calling Leaderboard (BFCL): From Tool Use to Agentic Evaluation of Large Language Models

ICML 2025 Poster
8.0
4

R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents

COLM 2025 Poster
7.5
5

Twilight: Adaptive Attention Sparsity with Hierarchical Top-$p$ Pruning

NeurIPS 2025 Spotlight
6.0
5

Prompt-to-Leaderboard: Prompt-Adaptive LLM Evaluations

ICML 2025 Poster
6.1
4

From Crowdsourced Data to High-quality Benchmarks: Arena-Hard and Benchbuilder Pipeline

ICML 2025 Poster
6.5
4

JudgeBench: A Benchmark for Evaluating LLM-Based Judges

ICLR 2025 Poster
6.3
3

RouteLLM: Learning to Route LLMs from Preference Data

ICLR 2025 Poster
6.0
3

Bench-O-Matic: Automating Benchmark Curation from Crowdsourced Data

ICLR 2025 Rejected
5.5
4

Copilot Arena: A Platform for Code LLM Evaluation in the Wild

ICML 2025 Poster
6.3
4

How to Evaluate Reward Models for RLHF

ICLR 2025 Poster
6.3
4

LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code

ICLR 2025 Poster
7.0
3

Efficiently Scaling LLM Reasoning Programs with Certaindex

NeurIPS 2025 Poster
6.3
3

Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards

ICML 2025 Oral
6.1
4

Sparse Video-Gen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity

ICML 2025 Poster
7.3
4

Radial Attention: $\mathcal O(n \log n)$ Sparse Attention for Long Video Generation

NeurIPS 2025 Poster
7.3
4

Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation

NeurIPS 2025 Spotlight

2024 (13)