影响力指数

81.36/100

前 1.3%

全站排名 #807

发表论文35 篇

平均评分4.9

年均产出11.7 篇/年

Baoxiang Wang

Assistant Professor@The Chinese University of Hong Kong, Shenzhen·中国·OpenReview

研究方向

game theory · learning theory · reinforcement learning

Offline Multi-Agent Reinforcement Learning via Sequential Score Decomposition

ICLR 2026Rejected

Talk, Judge, Cooperate: Gossip-Driven Indirect Reciprocity in Self-Interested LLM Agents

ICLR 2026Rejected

Provable Strategic In-Context Learning of Transformers

ICLR 2026Withdrawn

On the Decomposition of Differentiable Games

ICLR 2026Rejected

When Human Data Runs Out: Self-Supervised Reasoning via Negotiation Self-Play

ICLR 2026Rejected

Information Bargaining: Bilateral Commitment in Bayesian Persuasion

ICLR 2026Rejected

Pointwise Convergence in Games with Conflicting Interests

ICLR 2026Withdrawn

Verbalized Bayesian Persuasion

ICLR 2026Withdrawn

Improved Approximation Algorithms for $k$-Submodular Maximization via Multilinear Extension

ICLR 2025Spotlight

Scalable Exploration via Ensemble++

NeurIPS 2025Poster

Uncoupled and Convergent Learning in Monotone Games under Bandit Feedback

NeurIPS 2025Poster

The adaptive complexity of parallelized log-concave sampling

ICLR 2025Withdrawn

A Comprehensive Framework for Analyzing the Convergence of Adam: Bridging the Gap with SGD

ICML 2025Poster

Tackling Data Corruption in Offline Reinforcement Learning via Sequence Modeling

ICLR 2025Poster

ADG: Ambient Diffusion-Guided Dataset Recovery for Corruption-Robust Offline Reinforcement Learning

NeurIPS 2025Poster

Learning to Communicate Through Implicit Communication Channels

ICLR 2025Poster

Reward Translation via Reward Machine in Semi-Alignable MDPs

ICML 2025Poster

Learning Imperfect Information Extensive-form Games with Last-iterate Convergence under Bandit Feedback

ICLR 2025Rejected

Learning Imperfect Information Extensive-form Games with Last-iterate Convergence under Bandit Feedback

ICML 2025Poster

Scalable Exploration via Ensemble++

ICLR 2025Withdrawn

No-Regret and Incentive-Compatible Combinatorial Online Prediction

ICLR 2025Rejected

The Implicit Bias of Stochastic AdaGrad-Norm on Separable Data

ICLR 2025Rejected

Uncoupled and Convergent Learning in Monotone Games under Bandit Feedback

ICLR 2025Withdrawn

Verbalized Bayesian Persuasion

ICLR 2025Rejected

Bayesian Persuasion Is a Bargaining Game

ICLR 2025Rejected

No MCMC Teaching For me: Learning Energy-Based Models via Diffusion Synergy

ICLR 2025Rejected

Offline Multi-agent Reinforcement Learning with Sequential Score Decomposition

ICLR 2025Rejected

A Comprehensive Framework for Analyzing the Convergence of Adam: Bridging the Gap with Stochastic Gradient Descent

ICLR 2025Withdrawn

合作者 (20)