PaperHub

Min-hwan Oh

~Min-hwan_Oh1

Total papers: 34
Submissions per year: 17.0
Average rating: 6.2
Accepted: 25/34
Venue distribution: ICLR 15, NeurIPS 14, ICML 5

Published papers (34)

2025 (27 papers)

7.0
4

Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning

ICLR 2025 Poster
6.1
4

Combinatorial Reinforcement Learning with Preference Feedback

ICML 2025 Poster
7.2
4

Improved Online Confidence Bounds for Multinomial Logistic Bandits

ICML 2025 Poster
5.5
4

Optimal and Practical Batched Linear Bandit Algorithm

ICML 2025 Poster
5.5
4

Combinatorial Reinforcement Learning with Preference Feedback

ICLR 2025 Rejected
-

Coordinated Exploration in Distributed Reinforcement Learning

ICLR 2025 Withdrawn
7.3
4

Infrequent Exploration in Linear Bandits

NeurIPS 2025 Poster
7.0
4

Minimax Optimal Reinforcement Learning with Quasi-Optimism

ICLR 2025 Poster
7.8
5

Exploration via Feature Perturbation in Contextual Bandits

NeurIPS 2025 Spotlight
6.5
4

Dynamic Assortment Selection and Pricing with Censored Preference Feedback

ICLR 2025 Poster
4.0
5

Mostly Exploration-free Algorithms for Multi-Objective Linear Bandits

ICLR 2025 Withdrawn
4.8
5

Stochastic Matching Bandits under Preference Feedback

ICLR 2025 Withdrawn
6.7
3

ADAM Optimization with Adaptive Batch Selection

ICLR 2025 Poster
5.6
5

GFlowNets Need Automorphism Correction for Unbiased Graph Generation

ICLR 2025 Rejected
7.0
3

Oracle-Efficient Combinatorial Semi-Bandits

NeurIPS 2025 Poster
6.8
4

Preference-based Reinforcement Learning beyond Pairwise Comparisons: Benefits of Multiple Options

NeurIPS 2025 Poster
7.3
4

Tractable Multinomial Logit Contextual Bandits with Non-Linear Utilities

NeurIPS 2025 Poster
7.8
4

True Impact of Cascade Length in Contextual Cascading Bandits

NeurIPS 2025 Poster
7.1
5

Thompson Sampling for Multi-Objective Linear Contextual Bandit

NeurIPS 2025 Poster
6.1
4

Symmetry-Aware GFlowNets

ICML 2025 Poster
6.3
3

Lasso Bandit with Compatibility Condition on Optimal Arm

ICLR 2025 Poster
4.3
3

Neural Dynamic Pricing: Provable and Practical Efficiency

ICLR 2025 Withdrawn
7.3
4

Revisiting Follow-the-Perturbed-Leader with Unbounded Perturbations in Bandit Problems

NeurIPS 2025 Poster
5.0
4

Linear Bandits with Partially Observable Features

ICLR 2025 Rejected
4.8
3

Linear Bandits with Partially Observable Features

ICML 2025 Poster
5.3
3

Magnituder Layers for Implicit Neural Representations in 3D

ICLR 2025 Rejected
6.4
4

EUGens: Efficient, Unified and General Dense Layers

NeurIPS 2025 Poster