Shuai Li
~Shuai_Li3
31
论文总数
15.5
年均投稿
平均评分
接收情况14/31
会议分布
ICLR
20
NeurIPS
7
ICML
4
发表论文 (31 篇)
202516 篇
4
Optimal Algorithm for Max-Min Fair Bandit
ICLR 2025Rejected
4
Optimal Algorithm for Max-Min Fair Bandit
ICML 2025Poster
4
The Implicit Bias of Stochastic AdaGrad-Norm on Separable Data
ICLR 2025Rejected
5
Multi-player Multi-armed Bandits with Delayed Feedback
ICLR 2025Rejected
5
The Convergence of Second-Order Sampling Methods for Diffusion Models
ICLR 2025Rejected
4
Learning Preferences without Interaction for Cooperative AI: A Hybrid Offline-Online Approach
NeurIPS 2025Poster
4
Improved Discretization Complexity Analysis of Consistency Models: Variance Exploding Forward Process and Decay Discretization Scheme
ICML 2025Poster
4
The Discretization Complexity Analysis of Consistency Models under Variance Exploding Forward Process
ICLR 2025Rejected
4
Contrastive guidance and feedback: A Suitable way to improve 3D Consistency of Multi-view Diffusion Model
ICLR 2025Rejected
3
Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation
ICLR 2025Rejected
3
Learning Imperfect Information Extensive-form Games with Last-iterate Convergence under Bandit Feedback
ICML 2025Poster
4
Learning Imperfect Information Extensive-form Games with Last-iterate Convergence under Bandit Feedback
ICLR 2025Rejected
5
Adversarial Attacks on Cooperative Multi-agent Bandits
ICLR 2025Rejected
4
Online-to-Offline RL for Agent Alignment
ICLR 2025Poster
4
Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation
ICML 2025Poster
4
Bandit Learning in Matching Markets with Indifference
ICLR 2025Poster
202415 篇
3
The Closeness of In-Context Learning and Weight Shifting for Softmax Regression
NeurIPS 2024Poster
3
The Closeness of In-Context Learning and Weight Shifting for Softmax Regression
ICLR 2024Rejected
3
On Stationary Point Convergence of PPO-Clip
ICLR 2024Poster
4
Improved Analysis for Bandit Learning in Matching Markets
NeurIPS 2024Poster
3
Transformers Perform In-Context Learning through Neural Networks
ICLR 2024withdrawn
4
Finite Sample Analysis for Single-Loop Single-Timescale Natural Actor-Critic Algorithm
ICLR 2024withdrawn
4
Adaptive Offline Data Replay in Offline-to-Online Reinforcement Learning
ICLR 2024withdrawn
4
Calibrating Reasoning in Language Models with Internal Consistency
NeurIPS 2024Poster
4
Leveraging Drift to Improve Sample Complexity of Variance Exploding Diffusion Models
NeurIPS 2024Poster
4
The Convergence of Variance Exploding Diffusion Models under the Manifold Hypothesis
ICLR 2024Rejected
4
Learning Versatile Skills with Curriculum Masking
NeurIPS 2024Poster
4
CurrMask: Learning Versatile Skills with Automatic Masking Curricula
ICLR 2024withdrawn
4
FutureDD: Planning in POMDP with Encoded Future Dynamics
ICLR 2024withdrawn
4
Few-Shot Diffusion Models Escape the Curse of Dimensionality
NeurIPS 2024Poster
4
Towards Provably Efficient Learning of Extensive-Form Games with Imperfect Information and Linear Function Approximation
ICLR 2024Rejected