影响力指数

88.48/100

前 0.7%

全站排名 #438

发表论文34 篇

平均评分5.0

年均产出11.3 篇/年

Zhuoran Yang

Assistant Professor@Yale University·美国·OpenReview

研究方向

Large Language Models · Reinforcement Learning

Dual-Robust Cross-Domain Offline Reinforcement Learning Against Dynamics Shifts

ICLR 2026Poster

Muon Outperforms Adam in Tail-End Associative Memory Learning

ICLR 2026Poster

Cross-domain Offline Policy Adaptation with Dynamics- and Value-aligned Data Filtering

ICLR 2026Rejected

Taming Polysemanticity in LLMs: Theory-Grounded Feature Recovery via Sparse Autoencoders

ICLR 2026Poster

Interpreting Multi-Layer Transformers for In-Context Linear Regression with Varying Covariance

ICLR 2026Rejected

How Transformers Learn Causal Structures In-Context: Explainable Mechanism Meets Theoretical Guarantee

ICLR 2026Poster

Unlocking Out-of-Distribution Generalization in Transformers via Latent Space Reasoning

ICLR 2026Rejected

AV-Odyssey Bench: From Fundamental Audio Perception to Audio-Visual Understanding

ICLR 2026Withdrawn

Learning to Incentivize on the Fly: Leader-Follower Games with Policy Recommendation

ICLR 2026Rejected

Learning in Context, Guided by Choice: A Reward-Free Paradigm for Reinforcement Learning with Transformers

ICLR 2026Rejected

On the Mechanism and Dynamics of Modular Addition: Fourier Features, Lottery Ticket, and Grokking

ICLR 2026Rejected

Mechanistic Interpretability of In-Context Learning Generalization through Structured Task Curriculum

ICLR 2026Rejected

Can VLMs Reason Through Multiple Views?

ICLR 2026Desk Rejected

Can Neural Networks Achieve Optimal Computational-statistical Tradeoff? An Analysis on Single-Index Model

Decoding Rewards in Competitive Games: Inverse Game Theory with Entropy Regularization

ICML 2025Poster

In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention

ICML 2025Poster

Provable Learning for DEC-POMDPs: Factored Models and Memoryless Agents

ICLR 2025Rejected

In-Context Reinforcement Learning From Suboptimal Historical Data

ICLR 2025Rejected

In-Context Reinforcement Learning From Suboptimal Historical Data

ICML 2025Poster

Exploration in the Face of Strategic Responses: Provable Learning of Online Stackelberg Games

ICLR 2025Rejected

The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability

ICML 2025Poster

STRIDE: A Tool-Assisted LLM Agent Framework for Strategic and Interactive Decision-Making

ICLR 2025Rejected

BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms

ICML 2025Poster

Quantile-Optimal Policy Learning under Unmeasured Confounding

ICLR 2025Withdrawn

An Instrumental Value for Data Production and its Application to Data Pricing

ICML 2025Poster

Steer a Crowd: Learning to Persuade a Population in a Stackelberg Game

ICLR 2025Withdrawn

合作者 (20)