Ion Stoica
~Ion_Stoica1
40
论文总数
20.0
年均投稿
平均评分
接收情况30/40
会议分布
ICLR
18
NeurIPS
10
ICML
9
COLM
3
发表论文 (40 篇)
202527 篇
4
OR-Bench: An Over-Refusal Benchmark for Large Language Models
ICML 2025Poster
8
OR-Bench: An Over-Refusal Benchmark for Large Language Models
ICLR 2025Rejected
3
MPC-Minimized Secure LLM Inference
ICLR 2025Rejected
3
A Statistical Framework for Ranking LLM-based Chatbots
ICLR 2025Poster
4
Post-Training Sparse Attention with Double Sparsity
ICLR 2025Rejected
4
Test-Time RAG: Enhancing Long Context Understanding in LLMs with Retrieval-Augmented Mechanisms
ICLR 2025Rejected
4
Fast Video Generation with Sliding Tile Attention
ICML 2025Poster
4
GameArena: Evaluating LLM Reasoning through Live Computer Games
ICLR 2025Poster
4
Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile
ICLR 2025Rejected
4
HashAttention: Semantic Sparsity for Faster Inference
ICML 2025Poster
3
Faster Video Diffusion with Trainable Sparse Attention
NeurIPS 2025Poster
4
The Berkeley Function Calling Leaderboard (BFCL): From Tool Use to Agentic Evaluation of Large Language Models
ICML 2025Poster
4
R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents
COLM 2025Poster
5
Twilight: Adaptive Attention Sparsity with Hierarchical Top-$p$ Pruning
NeurIPS 2025Spotlight
5
Prompt-to-Leaderboard: Prompt-Adaptive LLM Evaluations
ICML 2025Poster
4
From Crowdsourced Data to High-quality Benchmarks: Arena-Hard and Benchbuilder Pipeline
ICML 2025Poster
4
JudgeBench: A Benchmark for Evaluating LLM-Based Judges
ICLR 2025Poster
3
RouteLLM: Learning to Route LLMs from Preference Data
ICLR 2025Poster
3
Bench-O-Matic: Automating Benchmark Curation from Crowdsourced Data
ICLR 2025Rejected
4
Copilot Arena: A Platform for Code LLM Evaluation in the Wild
ICML 2025Poster
4
How to Evaluate Reward Models for RLHF
ICLR 2025Poster
4
LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code
ICLR 2025Poster
3
Efficiently Scaling LLM Reasoning Programs with Certaindex
NeurIPS 2025Poster
3
Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards
ICML 2025Oral
4
Sparse Video-Gen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
ICML 2025Poster
4
Radial Attention: $\mathcal O(n \log n)$ Sparse Attention for Long Video Generation
NeurIPS 2025Poster
4
Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation
NeurIPS 2025Spotlight
202413 篇
3
Trustless Audits without Revealing Data or Models
ICLR 2024Rejected
4
Scaling up Trustless DNN Inference with Zero-Knowledge Proofs
ICLR 2024Rejected
4
Online Speculative Decoding
ICLR 2024Rejected
4
Are More LLM Calls All You Need? Towards the Scaling Properties of Compound AI Systems
NeurIPS 2024Poster
4
Crafting Interpretable Embeddings for Language Neuroscience by Asking LLMs Questions
NeurIPS 2024Poster
4
Efficient LLM Scheduling by Learning to Rank
NeurIPS 2024Poster
3
LLM-Assisted Code Cleaning For Training Accurate Code Generators
ICLR 2024Poster
4
RAFT: Adapting Language Model to Domain Specific RAG
COLM 2024Poster
4
DISTFLASHATTN: Distributed Memory-efficient Attention for Long-context LLMs Training
COLM 2024Poster
4
LightSeq: Sequence Level Parallelism for Distributed Training of Long Context Transformers
ICLR 2024Rejected
4
Stylus: Automatic Adapter Selection for Diffusion Models
NeurIPS 2024Oral
5
SGLang: Efficient Execution of Structured Language Model Programs
NeurIPS 2024Poster
4
LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset
ICLR 2024Spotlight