Baoxiang Wang
~Baoxiang_Wang1
27
论文总数
13.5
年均投稿
平均评分
接收情况12/27
会议分布
ICLR
19
NeurIPS
5
ICML
3
发表论文 (27 篇)
202520 篇
4
Uncoupled and Convergent Learning in Monotone Games under Bandit Feedback
NeurIPS 2025Poster
4
The adaptive complexity of parallelized log-concave sampling
ICLR 2025withdrawn
4
Uncoupled and Convergent Learning in Monotone Games under Bandit Feedback
ICLR 2025withdrawn
4
No-Regret and Incentive-Compatible Combinatorial Online Prediction
ICLR 2025Rejected
4
Improved Approximation Algorithms for $k$-Submodular Maximization via Multilinear Extension
ICLR 2025Spotlight
4
Scalable Exploration via Ensemble++
ICLR 2025withdrawn
4
Scalable Exploration via Ensemble++
NeurIPS 2025Poster
3
No MCMC Teaching For me: Learning Energy-Based Models via Diffusion Synergy
ICLR 2025Rejected
-
A Comprehensive Framework for Analyzing the Convergence of Adam: Bridging the Gap with Stochastic Gradient Descent
ICLR 2025withdrawn
4
Learning Imperfect Information Extensive-form Games with Last-iterate Convergence under Bandit Feedback
ICLR 2025Rejected
4
Verbalized Bayesian Persuasion
ICLR 2025Rejected
4
Learning to Communicate Through Implicit Communication Channels
ICLR 2025Poster
4
The Implicit Bias of Stochastic AdaGrad-Norm on Separable Data
ICLR 2025Rejected
3
Learning Imperfect Information Extensive-form Games with Last-iterate Convergence under Bandit Feedback
ICML 2025Poster
4
A Comprehensive Framework for Analyzing the Convergence of Adam: Bridging the Gap with SGD
ICML 2025Poster
3
Reward Translation via Reward Machine in Semi-Alignable MDPs
ICML 2025Poster
3
Offline Multi-agent Reinforcement Learning with Sequential Score Decomposition
ICLR 2025Rejected
4
Tackling Data Corruption in Offline Reinforcement Learning via Sequence Modeling
ICLR 2025Poster
4
ADG: Ambient Diffusion-Guided Dataset Recovery for Corruption-Robust Offline Reinforcement Learning
NeurIPS 2025Poster
3
Bayesian Persuasion Is a Bargaining Game
ICLR 2025Rejected
20247 篇
4
Finite Sample Analysis for Single-Loop Single-Timescale Natural Actor-Critic Algorithm
ICLR 2024withdrawn
3
On Stationary Point Convergence of PPO-Clip
ICLR 2024Poster
4
The Implicit Bias of Stochastic AdaGrad-Norm on Separable Data
ICLR 2024Rejected
4
Online Control with Adversarial Disturbance for Continuous-time Linear Systems
NeurIPS 2024Poster
4
Reward Translation via Reward Machine in Semi-Alignable MDPs
ICLR 2024Rejected
4
Semantically Aligned Task Decomposition in Multi-Agent Reinforcement Learning
ICLR 2024Rejected
4
Few-Shot Diffusion Models Escape the Curse of Dimensionality
NeurIPS 2024Poster