Impact Index

86.88/100

Top 0.8%

Site-wide rank: #513

Papers published: 45

Average rating: 4.8

Average annual output: 15.0 papers/year

Shuai Li

Associate Professor @ Shanghai Jiaotong University · China · OpenReview

Research Interests

bandit algorithms · online learning · reinforcement learning · diffusion models · LLM theory

Bandit Learning for Online Scheduling with Immediate Decision

ICLR 2026 · Rejected

Multi-Subspace Multi-Modal Modeling for Diffusion Models: Estimation, Convergence and Mixture of Experts

ICLR 2026 · Poster

Player-optimal Stable Regret for Bandit Learning in Many-to-one Matching Markets with Substitutability

ICLR 2026 · Rejected

Elucidating Guidance in Variance Exploding Diffusion Models: Fast Convergence and Better Diversity

ICLR 2026 · Rejected

AdaReP: Plug-and-Play Acceleration for World Model Predictive Control using Adaptive Re-Planning

ICLR 2026 · Withdrawn

On the Effect of Positional Encoding for In-context Learning in Transformers

ICLR 2026 · Rejected

The Accumulation of Score Estimation Error in Diffusion Models

ICLR 2026 · Rejected

Evaluating the Role of Great Pre-trained Diffusion Models in Few-shot Phase: Warm-up and Acceleration

ICLR 2026 · Rejected

Quantifying QiGan: What Shape Patterns Does a DNN Exploit in Go?

ICLR 2026 · Withdrawn

Overcoming Redundant Context in Auto-Regressive LLMs with Dynamic Draft Refinement

ICLR 2026 · Rejected

In-Context Stochastic Gradient Descent with Hybrid Mamba-2 and Linear Self-Attention Model

ICLR 2026 · Rejected

One-Shot Style Personalization for RL Agents via Latent Discriminator

ICLR 2026 · Rejected

Efficient Thinking via Meta Chain-of-Thought Evaluation

ICLR 2026 · Rejected

Beyond Static Retrieval Policies: Task-Aware Adaptive RAG With METAR

ICLR 2026 · Rejected

Optimal Algorithm for Max-Min Fair Bandit

ICML 2025 · Poster

Learning Preferences without Interaction for Cooperative AI: A Hybrid Offline-Online Approach

NeurIPS 2025 · Poster

Bandit Learning in Matching Markets with Indifference

ICLR 2025 · Poster

Learning Imperfect Information Extensive-form Games with Last-iterate Convergence under Bandit Feedback

ICLR 2025 · Rejected

Optimal Algorithm for Max-Min Fair Bandit

ICLR 2025 · Rejected

Online-to-Offline RL for Agent Alignment

ICLR 2025 · Poster

Learning Imperfect Information Extensive-form Games with Last-iterate Convergence under Bandit Feedback

ICML 2025 · Poster

Multi-player Multi-armed Bandits with Delayed Feedback

ICLR 2025 · Rejected

Improved Discretization Complexity Analysis of Consistency Models: Variance Exploding Forward Process and Decay Discretization Scheme

ICML 2025 · Poster

Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation

ICML 2025 · Poster

Contrastive guidance and feedback: A Suitable way to improve 3D Consistency of Multi-view Diffusion Model

ICLR 2025 · Rejected

The Convergence of Second-Order Sampling Methods for Diffusion Models

ICLR 2025 · Rejected

The Implicit Bias of Stochastic AdaGrad-Norm on Separable Data

ICLR 2025 · Rejected

Adversarial Attacks on Cooperative Multi-agent Bandits

ICLR 2025 · Rejected

The Discretization Complexity Analysis of Consistency Models under Variance Exploding Forward Process

ICLR 2025 · Rejected

Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation

ICLR 2025 · Rejected

Collaborators (20)