Impact Index

87.46/100

Top 0.8%

Site-wide rank: #491

Papers published: 32

Average rating: 5.5

Average annual output: 10.7 papers/year

Shie Mannor

Full Professor @ Technion - Israel Institute of Technology, Technion · Israel · OpenReview

Research areas

Machine learning

Task Tokens: A Flexible Approach to Adapting Behavior Foundation Models

ICLR 2026 Poster

Spectral Bellman Method: Unifying Representation and Exploration in RL

ICLR 2026 Poster

Horizon Imagination: Efficient On-Policy Rollout in Diffusion World Models

ICLR 2026 Poster

Reinforcement Learning with Discrete Diffusion Policies for Combinatorial Action Spaces

ICLR 2026 Rejected

Representative Action Selection for Large Action Space Meta-Bandits

ICLR 2026 Rejected

Hierarchical Bandits for Adversarial Online Configuration Optimization

ICLR 2026 Rejected

State Entropy Regularization for Robust Reinforcement Learning

NeurIPS 2025 Oral

Efficient Fairness-Performance Pareto Front Computation

NeurIPS 2025 Spotlight

On the Convergence of Single-Timescale Actor-Critic

NeurIPS 2025 Poster

On Bits and Bandits: Quantifying the Regret-Information Trade-off

ICLR 2025 Poster

Global Convergence of Policy Gradient in Average Reward MDPs

ICLR 2025 Poster

Non-rectangular Robust MDPs with Normed Uncertainty Sets

NeurIPS 2025 Poster

Reinforcement Learning with Segment Feedback

ICML 2025 Poster

Policy Gradient with Tree Expansion

ICML 2025 Poster

Learning Multiple Initial Solutions to Optimization Problems

ICML 2025 Rejected

Efficient Fairness-Performance Pareto Front Computation

ICLR 2025 Rejected

Reinforcement Learning with Segment Feedback

ICLR 2025 Rejected

Policy Optimized Text-to-Image Pipeline Design

NeurIPS 2025 Poster

Learning Multiple Initial Solutions to Optimization Problems

ICLR 2025 Rejected

A Classification View on Meta Learning Bandits

ICML 2025 Poster

Policy Gradient with Tree Expansion

ICLR 2025 Rejected

Uncovering Untapped Potential in Sample-Efficient World Model Agents

NeurIPS 2025 Rejected

Real Time Macro-Block Rate Control for Task-Aware Video Compression Using Reinforcement Learning

ICLR 2025 Withdrawn

Improved Sample Complexity for Global Convergence of Actor-Critic Algorithms

ICLR 2025 Withdrawn

Human-like Communication Strategies for Improved Multi-Agent Reinforcement Learning

ICLR 2025 Withdrawn

SQT -- rough conservative actor critic

ICLR 2025 Rejected

Collaborators (20)

Kfir Yehuda Levy