PaperHub

Quanquan Gu

~Quanquan_Gu1

59
论文总数
29.5
年均投稿
5.9
平均评分
接收情况36/59
会议分布
ICLR
34
NeurIPS
14
ICML
11

发表论文 (59 篇)

202538

6.0
4

Variance-Dependent Regret Lower Bounds for Contextual Bandits

NeurIPS 2025Rejected
6.8
4

Variance-Aware Feel-Good Thompson Sampling for Contextual Bandits

NeurIPS 2025Poster
6.8
4

Sharp Analysis for KL-Regularized Contextual Bandits and RLHF

NeurIPS 2025Poster
5.0
3

Accelerated Preference Optimization for Large Language Model Alignment

ICLR 2025Rejected
5.3
4

Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback

ICLR 2025Rejected
6.5
4

Unified Convergence Analysis for Score-Based Diffusion Models with Deterministic Samplers

ICLR 2025Poster
5.3
4

CoPS: Empowering LLM Agents with Provable Cross-Task Experience Sharing

ICLR 2025Rejected
6.1
4

Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback

ICML 2025Poster
7.0
4

Convergence of Score-Based Discrete Diffusion Models: A Discrete-Time Analysis

ICLR 2025Poster
6.0
5

Ranking with Multiple Oracles: From Weak to Strong Stochastic Transitivity

ICML 2025Poster
4.7
3

Imbalance-Regularized LoRA: A Plug-and-Play Method for Improving Fine-Tuning of Foundation Models

ICLR 2025Rejected
6.3
4

Energy-Weighted Flow Matching for Offline Reinforcement Learning

ICLR 2025Poster
4.8
4

Sharp Analysis for KL-Regularized Contextual Bandits and RLHF

ICLR 2025Rejected
6.1
4

Global Convergence and Rich Feature Learning in $L$-Layer Infinite-Width Neural Networks under $\mu$ Parametrization

ICML 2025Poster
6.5
4

CryoFM: A Flow-based Foundation Model for Cryo-EM Densities

ICLR 2025Poster
6.0
5

Mitigating Object Hallucination in Large Vision-Language Models via Image-Grounded Guidance

ICLR 2025Rejected
7.3
5

Logarithmic Regret for Online KL-Regularized Reinforcement Learning

ICML 2025Poster
6.6
4

Mitigating Object Hallucination in Large Vision-Language Models via Image-Grounded Guidance

ICML 2025Spotlight
3.8
5

Relative-Translation Invariant Wasserstein Distance

ICLR 2025Rejected
6.1
4

Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment

ICML 2025Poster
8.3
4

MARS: Unleashing the Power of Variance Reduction for Training Large Models

ICML 2025Poster
4.0
4

Decomposed Direct Preference Optimization for Structure-Based Drug Design

ICLR 2025Rejected
5.3
4

Multi-Step Preference Optimization via Two-Player Markov Games

ICLR 2025Rejected
4.0
3

Towards Simple and Provable Parameter-Free Adaptive Gradient Methods

ICLR 2025Rejected
6.0
4

Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration

ICLR 2025Poster
5.0
4

General Preference Modeling with Preference Representations for Aligning Language Models

ICLR 2025Rejected
7.2
4

An All-Atom Generative Model for Designing Protein Complexes

ICML 2025Poster
8.2
4

Simultaneous Modeling of Protein Conformation and Dynamics via Autoregression

NeurIPS 2025Poster
7.3
4

Tensor Product Attention Is All You Need

NeurIPS 2025Spotlight
6.3
3

DPLM-2: A Multimodal Diffusion Protein Language Model

ICLR 2025Poster
4.2
5

ProteinWeaver: A Divide-and-Assembly Approach for Protein Backbone Design

ICLR 2025desk_rejected
3.5
4

Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function Optimization

ICLR 2025Rejected
6.1
4

Tensor Product Attention Is All You Need

ICML 2025Rejected
6.0
4

Self-Play Preference Optimization for Language Model Alignment

ICLR 2025Poster
4.8
4

LLaVA-Critic: Learning to Evaluate Multimodal Models

ICLR 2025withdrawn
7.8
4

Elucidating the Design Space of Multimodal Protein Language Models

ICML 2025Spotlight
7.2
4

Designing Cyclic Peptides via Harmonic SDE with Atom-Bond Modeling

ICML 2025Poster
6.5
4

ProteinBench: A Holistic Evaluation of Protein Foundation Models

ICLR 2025Poster

202421

5.8
4

Asynchronous Multi-Agent Reinforcement Learning with General Function Approximation

NeurIPS 2024Rejected
6.0
4

Matching the Statistical Query Lower Bound for $k$-Sparse Parity Problems with Sign Stochastic Gradient Descent

NeurIPS 2024Poster
5.8
4

Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback

NeurIPS 2024Rejected
6.3
3

A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation

NeurIPS 2024Poster
5.3
4

A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation

ICLR 2024Rejected
5.3
4

Achieving Constant Regret in Linear Markov Decision Processes

NeurIPS 2024Poster
6.3
4

Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning

ICLR 2024Poster
6.0
3

Self-Play Fine-tuning of Diffusion Models for Text-to-image Generation

NeurIPS 2024Poster
6.5
4

Understanding Transferable Representation Learning and Zero-shot Transfer in CLIP

ICLR 2024Poster
6.8
4

How Many Pretraining Tasks Are Needed for In-Context Learning of Linear Regression?

ICLR 2024Spotlight
6.0
4

Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs

ICLR 2024Poster
6.0
3

Risk Bounds of Accelerated SGD for Overparameterized Linear Regression

ICLR 2024Poster
5.0
4

Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning

ICLR 2024Rejected
5.3
4

On the Power of Multitask Representation Learning with Gradient Descent

ICLR 2024Rejected
5.5
4

Fast Sampling via Discrete Non-Markov Diffusion Models with Predetermined Transition Time

NeurIPS 2024Poster
5.8
4

Antigen-Specific Antibody Design via Direct Energy-based Preference Optimization

NeurIPS 2024Poster
5.5
4

Enhancing Large Vision Language Models with Self-Training on Image Comprehension

NeurIPS 2024Poster
5.3
4

Borda Regret Minimization for Generalized Linear Dueling Bandits

ICLR 2024Rejected
6.0
4

Variance-aware Regret Bounds for Stochastic Contextual Dueling Bandits

ICLR 2024Poster
5.5
4

Fast Sampling via De-randomization for Discrete Diffusion Models

ICLR 2024Rejected
6.2
5

DecompOpt: Controllable and Decomposed Diffusion Models for Structure-based Molecular Optimization

ICLR 2024Poster