Bo Dai
~Bo_Dai1
18
论文总数
9.0
年均投稿
平均评分
接收情况15/18
会议分布
NeurIPS
9
ICLR
7
ICML
2
发表论文 (18 篇)
202512 篇
4
Incentivize without Bonus: Provably Efficient Model-based Online Multi-agent RL for Markov Games
ICML 2025Poster
4
Exploration from a Primal-Dual Lens: Value-Incentivized Actor-Critic Methods for Sample-Efficient Online RL
NeurIPS 2025Poster
4
REINFORCE Converges to Optimal Policies with Any Learning Rate
NeurIPS 2025Poster
4
An Analysis of Causal Effect Estimation using Outcome Invariant Data Augmentation
NeurIPS 2025Spotlight
3
Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models
ICLR 2025Poster
4
Efficient Online Reinforcement Learning for Diffusion Policy
ICML 2025Poster
4
AmorLIP: Efficient Language-Image Pretraining via Amortization
NeurIPS 2025Poster
4
VideoAgent: Self-Improving Video Generation
ICLR 2025Rejected
5
EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing
ICLR 2025Poster
4
Matryoshka Pilot: Learning to Drive Black-Box LLMs with LLMs
NeurIPS 2025Poster
4
Martryoshka: Learning to Drive Black-Box LLMs with LLMs
ICLR 2025Rejected
4
Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF
ICLR 2025Poster
20246 篇
5
Small steps no more: Global convergence of stochastic gradient bandits for arbitrary learning rates
NeurIPS 2024Poster
4
Probabilistic Adaptation of Black-Box Text-to-Video Models
ICLR 2024Poster
4
UQE: A Query Engine for Unstructured Databases
NeurIPS 2024Poster
4
Diffusion Spectral Representation for Reinforcement Learning
NeurIPS 2024Poster
3
Provable Representation with Efficient Planning for Partially Observable Reinforcement Learning
ICLR 2024Rejected
3
HYDRA: Model Factorization Framework for Black-Box LLM Personalization
NeurIPS 2024Poster