Quanquan Gu
~Quanquan_Gu1
59
论文总数
29.5
年均投稿
平均评分
接收情况36/59
会议分布
ICLR
34
NeurIPS
14
ICML
11
发表论文 (59 篇)
202538 篇
4
Variance-Dependent Regret Lower Bounds for Contextual Bandits
NeurIPS 2025Rejected
4
Variance-Aware Feel-Good Thompson Sampling for Contextual Bandits
NeurIPS 2025Poster
4
Sharp Analysis for KL-Regularized Contextual Bandits and RLHF
NeurIPS 2025Poster
3
Accelerated Preference Optimization for Large Language Model Alignment
ICLR 2025Rejected
4
Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback
ICLR 2025Rejected
4
Unified Convergence Analysis for Score-Based Diffusion Models with Deterministic Samplers
ICLR 2025Poster
4
CoPS: Empowering LLM Agents with Provable Cross-Task Experience Sharing
ICLR 2025Rejected
4
Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback
ICML 2025Poster
4
Convergence of Score-Based Discrete Diffusion Models: A Discrete-Time Analysis
ICLR 2025Poster
5
Ranking with Multiple Oracles: From Weak to Strong Stochastic Transitivity
ICML 2025Poster
3
Imbalance-Regularized LoRA: A Plug-and-Play Method for Improving Fine-Tuning of Foundation Models
ICLR 2025Rejected
4
Energy-Weighted Flow Matching for Offline Reinforcement Learning
ICLR 2025Poster
4
Sharp Analysis for KL-Regularized Contextual Bandits and RLHF
ICLR 2025Rejected
4
Global Convergence and Rich Feature Learning in $L$-Layer Infinite-Width Neural Networks under $\mu$ Parametrization
ICML 2025Poster
4
CryoFM: A Flow-based Foundation Model for Cryo-EM Densities
ICLR 2025Poster
5
Mitigating Object Hallucination in Large Vision-Language Models via Image-Grounded Guidance
ICLR 2025Rejected
5
Logarithmic Regret for Online KL-Regularized Reinforcement Learning
ICML 2025Poster
4
Mitigating Object Hallucination in Large Vision-Language Models via Image-Grounded Guidance
ICML 2025Spotlight
5
Relative-Translation Invariant Wasserstein Distance
ICLR 2025Rejected
4
Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment
ICML 2025Poster
4
MARS: Unleashing the Power of Variance Reduction for Training Large Models
ICML 2025Poster
4
Decomposed Direct Preference Optimization for Structure-Based Drug Design
ICLR 2025Rejected
4
Multi-Step Preference Optimization via Two-Player Markov Games
ICLR 2025Rejected
3
Towards Simple and Provable Parameter-Free Adaptive Gradient Methods
ICLR 2025Rejected
4
Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration
ICLR 2025Poster
4
General Preference Modeling with Preference Representations for Aligning Language Models
ICLR 2025Rejected
4
An All-Atom Generative Model for Designing Protein Complexes
ICML 2025Poster
4
Simultaneous Modeling of Protein Conformation and Dynamics via Autoregression
NeurIPS 2025Poster
4
Tensor Product Attention Is All You Need
NeurIPS 2025Spotlight
3
DPLM-2: A Multimodal Diffusion Protein Language Model
ICLR 2025Poster
5
ProteinWeaver: A Divide-and-Assembly Approach for Protein Backbone Design
ICLR 2025desk_rejected
4
Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function Optimization
ICLR 2025Rejected
4
Tensor Product Attention Is All You Need
ICML 2025Rejected
4
Self-Play Preference Optimization for Language Model Alignment
ICLR 2025Poster
4
LLaVA-Critic: Learning to Evaluate Multimodal Models
ICLR 2025withdrawn
4
Elucidating the Design Space of Multimodal Protein Language Models
ICML 2025Spotlight
4
Designing Cyclic Peptides via Harmonic SDE with Atom-Bond Modeling
ICML 2025Poster
4
ProteinBench: A Holistic Evaluation of Protein Foundation Models
ICLR 2025Poster
202421 篇
4
Asynchronous Multi-Agent Reinforcement Learning with General Function Approximation
NeurIPS 2024Rejected
4
Matching the Statistical Query Lower Bound for $k$-Sparse Parity Problems with Sign Stochastic Gradient Descent
NeurIPS 2024Poster
4
Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback
NeurIPS 2024Rejected
3
A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation
NeurIPS 2024Poster
4
A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation
ICLR 2024Rejected
4
Achieving Constant Regret in Linear Markov Decision Processes
NeurIPS 2024Poster
4
Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning
ICLR 2024Poster
3
Self-Play Fine-tuning of Diffusion Models for Text-to-image Generation
NeurIPS 2024Poster
4
Understanding Transferable Representation Learning and Zero-shot Transfer in CLIP
ICLR 2024Poster
4
How Many Pretraining Tasks Are Needed for In-Context Learning of Linear Regression?
ICLR 2024Spotlight
4
Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs
ICLR 2024Poster
3
Risk Bounds of Accelerated SGD for Overparameterized Linear Regression
ICLR 2024Poster
4
Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
ICLR 2024Rejected
4
On the Power of Multitask Representation Learning with Gradient Descent
ICLR 2024Rejected
4
Fast Sampling via Discrete Non-Markov Diffusion Models with Predetermined Transition Time
NeurIPS 2024Poster
4
Antigen-Specific Antibody Design via Direct Energy-based Preference Optimization
NeurIPS 2024Poster
4
Enhancing Large Vision Language Models with Self-Training on Image Comprehension
NeurIPS 2024Poster
4
Borda Regret Minimization for Generalized Linear Dueling Bandits
ICLR 2024Rejected
4
Variance-aware Regret Bounds for Stochastic Contextual Dueling Bandits
ICLR 2024Poster
4
Fast Sampling via De-randomization for Discrete Diffusion Models
ICLR 2024Rejected
5
DecompOpt: Controllable and Decomposed Diffusion Models for Structure-based Molecular Optimization
ICLR 2024Poster