Min-hwan Oh
~Min-hwan_Oh1
34
Total papers
17.0
Submissions per year
Average rating
Accepted: 25/34
Venue distribution
ICLR
15
NeurIPS
14
ICML
5
Papers (34)
2025 (27 papers)
4
Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning
ICLR 2025 Poster
4
Combinatorial Reinforcement Learning with Preference Feedback
ICML 2025 Poster
4
Improved Online Confidence Bounds for Multinomial Logistic Bandits
ICML 2025 Poster
4
Optimal and Practical Batched Linear Bandit Algorithm
ICML 2025 Poster
4
Combinatorial Reinforcement Learning with Preference Feedback
ICLR 2025 Rejected
-
Coordinated Exploration in Distributed Reinforcement Learning
ICLR 2025 Withdrawn
4
Infrequent Exploration in Linear Bandits
NeurIPS 2025 Poster
4
Minimax Optimal Reinforcement Learning with Quasi-Optimism
ICLR 2025 Poster
5
Exploration via Feature Perturbation in Contextual Bandits
NeurIPS 2025 Spotlight
4
Dynamic Assortment Selection and Pricing with Censored Preference Feedback
ICLR 2025 Poster
5
Mostly Exploration-free Algorithms for Multi-Objective Linear Bandits
ICLR 2025 Withdrawn
5
Stochastic Matching Bandits under Preference Feedback
ICLR 2025 Withdrawn
3
ADAM Optimization with Adaptive Batch Selection
ICLR 2025 Poster
5
GFlowNets Need Automorphism Correction for Unbiased Graph Generation
ICLR 2025 Rejected
3
Oracle-Efficient Combinatorial Semi-Bandits
NeurIPS 2025 Poster
4
Preference-based Reinforcement Learning beyond Pairwise Comparisons: Benefits of Multiple Options
NeurIPS 2025 Poster
4
Tractable Multinomial Logit Contextual Bandits with Non-Linear Utilities
NeurIPS 2025 Poster
4
True Impact of Cascade Length in Contextual Cascading Bandits
NeurIPS 2025 Poster
5
Thompson Sampling for Multi-Objective Linear Contextual Bandit
NeurIPS 2025 Poster
4
Symmetry-Aware GFlowNets
ICML 2025 Poster
3
Lasso Bandit with Compatibility Condition on Optimal Arm
ICLR 2025 Poster
3
Neural Dynamic Pricing: Provable and Practical Efficiency
ICLR 2025 Withdrawn
4
Revisiting Follow-the-Perturbed-Leader with Unbounded Perturbations in Bandit Problems
NeurIPS 2025 Poster
4
Linear Bandits with Partially Observable Features
ICLR 2025 Rejected
3
Linear Bandits with Partially Observable Features
ICML 2025 Poster
3
Magnituder Layers for Implicit Neural Representations in 3D
ICLR 2025 Rejected
4
EUGens: Efficient, Unified and General Dense Layers
NeurIPS 2025 Poster
2024 (7 papers)
5
Demystifying Linear MDPs and Novel Dynamics Aggregation Framework
ICLR 2024 Poster
4
Local Anti-Concentration Class: Logarithmic Regret for Greedy Linear Contextual Bandit
NeurIPS 2024 Poster
5
Nearly Minimax Optimal Regret for Multinomial Logistic Bandit
NeurIPS 2024 Poster
4
Provably Efficient Policy Optimization with Rare Policy Switches
ICLR 2024 Withdrawn
4
Improved Regret of Linear Ensemble Sampling
NeurIPS 2024 Poster
4
Queueing Matching Bandits with Preference Feedback
NeurIPS 2024 Poster
4
Randomized Exploration for Reinforcement Learning with Multinomial Logistic Function Approximation
NeurIPS 2024 Poster