影响力指数

89.41/100

前 0.6%

全站排名 #403

发表论文40 篇

平均评分5.0

年均产出13.3 篇/年

Yutaka Matsuo

Full Professor@The University of Tokyo·日本·OpenReview

研究方向

deep learning · web mining · social media

Does “Do Differentiable Simulators Give Better Policy Gradients?” Give Better Policy Gradients?

ICLR 2026Poster

Quantization-Aware Diffusion Models For Maximum Likelihood Training

ICLR 2026Poster

C-Voting: Confidence-Based Test-Time Voting without Explicit Energy Functions

ICLR 2026Poster

SELF-HARMONY: LEARNING TO HARMONIZE SELF-SUPERVISION AND SELF-PLAY IN TEST-TIME REINFORCEMENT LEARNING

ICLR 2026Poster

RL Squeezes, SFT Expands: A Comparative Study of Reasoning LLMs

ICLR 2026Poster

Emergence of Exploration in Policy Gradient Reinforcement Learning via Retrying

ICLR 2026Rejected

Geometry of Nash Mirror Dynamics: Adaptive $\beta$-Control for Stable and Bias-Robust Self-Improving LLM Agents

ICLR 2026Rejected

Unlocking Noise-Resistant Vision: Key Architectural Secrets for Robust Models

ICLR 2026Rejected

WorldPack: Compressed Memory Improves Spatial Consistency in Video World Modeling

ICLR 2026Rejected

Leave No Observation Behind: Real-time Correction for VLA Action Chunks

ICLR 2026Rejected

Recurrent model for Sequential reasoning

ICLR 2026Rejected

Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback

ICLR 2026Withdrawn

Emergent Misalignment from Superposition

ICLR 2026Withdrawn

Vertical Attention: Automatic Exploration of Inter-Layer Connections in Transformer-based Language Models

ICLR 2026Rejected

Provably Efficient RL under Episode-Wise Safety in Constrained MDPs with Linear Function Approximation

NeurIPS 2025Spotlight

Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search

NeurIPS 2025Poster

Topology of Reasoning: Understanding Large Reasoning Models through Reasoning Graph Properties

NeurIPS 2025Poster

Beyond Induction Heads: In-Context Meta Learning Induces Multi-Phase Circuit Emergence

ICML 2025Poster

Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form

ICLR 2025Poster

Rethinking Evaluation of Sparse Autoencoders through the Representation of Polysemous Words

ICLR 2025Poster

CityNav: Language-Goal Aerial Navigation Dataset Using Geographic Information

ICLR 2025Rejected

Bridging Lottery Ticket and Grokking: Understanding Grokking from Inner Structure of Networks

ICLR 2025Rejected

FullDiffusion: Diffusion Models Without Time Truncation

ICLR 2025Rejected

ToM-agent: Large Language Models as Theory of Mind Aware Generative Agents with Counterfactual Reflection

ICLR 2025Rejected

The Geometry of Phase Transitions in Diffusion Models: Tubular Neighbourhoods and Singularities

ICLR 2025Rejected

MMA: Benchmarking Multi-Modal Large Language Model in Ambiguity Contexts

ICLR 2025Withdrawn

RAGDP: Retrieve-Augmented Generative Diffusion Policy

ICLR 2025Rejected

Maximum Likelihood Estimation for Flow Matching by Direct Second-order Trace Objective

ICLR 2025Rejected

Curse of Instructions: Large Language Models Cannot Follow Multiple Instructions at Once

ICLR 2025Rejected

合作者 (20)

Masahiro Suzuki

Gouki Minegishi

Aleksandra Faust

Shohei Taniguchi