Shie Mannor
~Shie_Mannor2
Total papers: 26
Submissions per year: 13.0
Average rating
Accepted: 12/26
Venue distribution
ICLR: 14
NeurIPS: 8
ICML: 4
Published papers (26)
2025 (20 papers)
3
A Classification View on Meta Learning Bandits
ICML 2025, Poster
4
Real Time Macro-Block Rate Control for Task-Aware Video Compression Using Reinforcement Learning
ICLR 2025, Withdrawn
3
Efficient Fairness-Performance Pareto Front Computation
NeurIPS 2025, Spotlight
5
Efficient Fairness-Performance Pareto Front Computation
ICLR 2025, Rejected
3
Reinforcement Learning with Segment Feedback
ICLR 2025, Rejected
4
Policy Gradient with Tree Expansion
ICML 2025, Poster
4
Policy Gradient with Tree Expansion
ICLR 2025, Rejected
3
Reinforcement Learning with Segment Feedback
ICML 2025, Poster
4
On Bits and Bandits: Quantifying the Regret-Information Trade-off
ICLR 2025, Poster
4
Policy Optimized Text-to-Image Pipeline Design
NeurIPS 2025, Poster
4
Uncovering Untapped Potential in Sample-Efficient World Model Agents
NeurIPS 2025, Rejected
4
On the Convergence of Single-Timescale Actor-Critic
NeurIPS 2025, Poster
4
Learning Multiple Initial Solutions to Optimization Problems
ICML 2025, Rejected
4
Improved Sample Complexity for Global Convergence of Actor-Critic Algorithms
ICLR 2025, Withdrawn
4
Learning Multiple Initial Solutions to Optimization Problems
ICLR 2025, Rejected
4
SQT -- rough conservative actor critic
ICLR 2025, Rejected
4
Human-like Communication Strategies for Improved Multi-Agent Reinforcement Learning
ICLR 2025, Withdrawn
4
Non-rectangular Robust MDPs with Normed Uncertainty Sets
NeurIPS 2025, Poster
4
State Entropy Regularization for Robust Reinforcement Learning
NeurIPS 2025, Oral
4
Global Convergence of Policy Gradient in Average Reward MDPs
ICLR 2025, Poster
2024 (6 papers)
4
RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation
NeurIPS 2024, Poster
3
Unnormalized Density Estimation with Root Sobolev Norm Regularization
ICLR 2024, Rejected
4
Tree Search-Based Policy Optimization under Stochastic Execution Delay
ICLR 2024, Poster
4
Policy Gradient with Tree Expansion
NeurIPS 2024, Rejected
4
SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Expansion
ICLR 2024, Rejected
4
EWoK: Tackling Robust Markov Decision Processes via Estimating Worst Kernel
ICLR 2024, Rejected