Zhuoran Yang
~Zhuoran_Yang1
21
论文总数
10.5
年均投稿
平均评分
接收情况12/21
会议分布
ICLR
13
ICML
6
NeurIPS
2
发表论文 (21 篇)
202513 篇
6
Provable Learning for DEC-POMDPs: Factored Models and Memoryless Agents
ICLR 2025Rejected
3
In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention
ICML 2025Poster
4
In-Context Reinforcement Learning From Suboptimal Historical Data
ICLR 2025Rejected
4
Quantile-Optimal Policy Learning under Unmeasured Confounding
ICLR 2025withdrawn
4
Can Neural Networks Achieve Optimal Computational-statistical Tradeoff? An Analysis on Single-Index Model
ICLR 2025Oral
5
Exploration in the Face of Strategic Responses: Provable Learning of Online Stackelberg Games
ICLR 2025Rejected
3
An Instrumental Value for Data Production and its Application to Data Pricing
ICML 2025Poster
4
In-Context Reinforcement Learning From Suboptimal Historical Data
ICML 2025Poster
4
Decoding Rewards in Competitive Games: Inverse Game Theory with Entropy Regularization
ICML 2025Poster
4
Steer a Crowd: Learning to Persuade a Population in a Stackelberg Game
ICLR 2025withdrawn
4
The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability
ICML 2025Poster
4
STRIDE: A Tool-Assisted LLM Agent Framework for Strategic and Interactive Decision-Making
ICLR 2025Rejected
3
BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms
ICML 2025Poster
20248 篇
3
On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games
NeurIPS 2024Poster
3
Reinforcement Learning with Human Feedback: Learning Dynamic Choices via Pessimism
ICLR 2024withdrawn
3
What and How does In-Context Learning Learn? Bayesian Model Averaging, Parameterization, and Generalization
ICLR 2024Rejected
5
Mean Field Langevin Actor-Critic: Faster Convergence and Global Optimality beyond Lazy Learning
ICLR 2024Rejected
4
Sample-efficient Learning of Infinite-horizon Average-reward MDPs with General Function Approximation
ICLR 2024Poster
5
Sample-Efficient Multi-Agent RL: An Optimization Perspective
ICLR 2024Poster
5
Symmetric Mean-field Langevin Dynamics for Distributional Minimax Problems
ICLR 2024Spotlight
4
Unveiling Induction Heads: Provable Training Dynamics and Feature Learning in Transformers
NeurIPS 2024Poster