Tong Yang
~Tong_Yang4
8
Total papers
4.0
Submissions per year
Average rating
Accepted: 7/8
Venue distribution
NeurIPS
4
ICLR
3
ICML
1
Papers (8)
2025: 4 papers
4
Incentivize without Bonus: Provably Efficient Model-based Online Multi-agent RL for Markov Games
ICML 2025 (Poster)
4
Exploration from a Primal-Dual Lens: Value-Incentivized Actor-Critic Methods for Sample-Efficient Online RL
NeurIPS 2025 (Poster)
5
Multi-head Transformers Provably Learn Symbolic Multi-step Reasoning via Gradient Descent
NeurIPS 2025 (Poster)
4
Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF
ICLR 2025 (Poster)
2024: 4 papers
4
In-Context Learning with Representations: Contextual Generalization of Trained Transformers
NeurIPS 2024 (Poster)
4
Federated Natural Policy Gradient and Actor Critic Methods for Multi-task Reinforcement Learning
NeurIPS 2024 (Poster)
4
Federated Natural Policy Gradient Methods for Multi-task Reinforcement Learning
ICLR 2024 (Rejected)
4
A Primal-Dual Approach to Solving Variational Inequalities with General Constraints
ICLR 2024 (Poster)