Impact Index

98.13/100

Top 0.1%

Site-wide rank: #47

Papers: 81

Average rating: 4.7

Average annual output: 27.0 papers/year

Heng Huang

Full Professor @ Department of Computer Science, University of Maryland, College Park · United States · OpenReview

Research areas

machine learning · data mining · bioinformatics · computer vision

Catalog-Native LLM: Speaking Item-ID dialect with Less Entanglement for Recommendation

ICLR 2026 · Poster

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

ICLR 2026 · Poster

Achieve Performatively Optimal Policy for Performative Reinforcement Learning

ICLR 2026 · Rejected

New Hybrid Fine-Tuning Paradigm for LLMs: Algorithm Design and Convergence Analysis Framework

ICLR 2026 · Poster

Zeroth-Order Methods for Stochastic Nonconvex Nonsmooth Composite Optimization

ICLR 2026 · Rejected

Riemannian Zeroth-Order Gradient Estimation with Structure-Preserving Metrics for Geodesically Incomplete Manifolds

ICLR 2026 · Poster

An Ensemble Framework for Unbiased Language Model Watermarking

ICLR 2026 · Poster

Learning to Reason via Mixture-of-Thought for Logical Reasoning

ICLR 2026 · Poster

Trade-off in Estimating the Number of Byzantine Clients in Federated Learning

ICLR 2026 · Rejected

Toward Bit-Efficient Dataset Condensation: A General Framework

ICLR 2026 · Rejected

Mitigating Data Heterogeneity Effect in Client-Reshuffling-Based Federated Learning

ICLR 2026 · Rejected

Model Correlation Detection via Random Selection Probing

ICLR 2026 · Rejected

A Multi-Fidelity Mixture-of-Expert Framework Integrating PDE Solvers and Neural Operators for Computational Fluid Dynamics

ICLR 2026 · Desk Rejected

Analyzing and Evaluating Unbiased Language Model Watermark

ICLR 2026 · Poster

Provably Learning Representations under Generalized Dependency Structure

ICLR 2026 · Rejected

An Unlearning-Enhanced General Framework for Test-Time Adaptation

ICLR 2026 · Withdrawn

Explore Data Left Behind in Reinforcement Learning for Reasoning Language Models

ICLR 2026 · Rejected

CoTGuard: Using Chain-of-Thought Triggering for Copyright Protection in Multi-Agent LLM Systems

ICLR 2026 · Desk Rejected

Defending against Model Extraction for GNNs with Model Reprogramming

ICLR 2026 · Rejected

Improving Text-to-Image Generation with Input-Side Inference-Time Scaling

ICLR 2026 · Withdrawn

Surprise-Modulated Meta-Advantages in Reinforcement Learning: Towards Language-Neutral Post-Training for Code LLMs

ICLR 2026 · Rejected

Pruning Without Fine-Tuning: Dynamic Pruning of Autoregressive Image Generation Models to Mixtures of Experts

ICLR 2026 · Withdrawn

Layer-Wise Feedback Signals: Dynamic Regulation for Continual Learning

ICLR 2026 · Withdrawn

Leave No TRACE: Black-box Detection of Copyrighted Dataset Usage in Large Language Models via Watermarking

ICLR 2026 · Withdrawn

Understanding Catastrophic Interference On the Identifibility of Latent Representations

ICLR 2026 · Rejected

Modality-Balancing Preference Optimization of Large Multimodal Models by Adversarial Negative Mining

ICLR 2026 · Withdrawn

Learning to Reason About Code Insecurity: Composite-Reinforcement Fine-Tuning for Cognitive Alignment

ICLR 2026 · Rejected

Graph Attention with Knowledge-Aware Domain Adaptation for Drug-Target Interaction Prediction

ICLR 2026 · Rejected

Understanding Weak-to-Strong Generalization: A Spectral Analysis

ICLR 2026 · Withdrawn

Advancing Drug-Target Interaction Prediction via Graph Transformers and Residual Protein Embeddings

ICLR 2026 · Desk Rejected

DDI-Aware Domain Adaptation for Cross-Domain Drug Combination Representation Learning via Contrastive Embedding

ICLR 2026 · Desk Rejected

Meta-Reinforcement Learning for Compiler Optimization: A Kernel-Embedded CompilerLLM with Verified Assumptions and Practical Guarantees

ICLR 2026 · Desk Rejected

Cost-Aware Contrastive Routing for LLMs

NeurIPS 2025 · Spotlight

On the Optimal Construction of Unbiased Gradient Estimators for Zeroth-Order Optimization

NeurIPS 2025 · Spotlight

Robust Distortion-Free Watermark for Autoregressive Audio Generation Models

NeurIPS 2025 · Poster

Bilevel ZOFO: Efficient LLM Fine-Tuning and Meta-Training

NeurIPS 2025 · Poster

Revisiting Zeroth-Order Optimization: Minimum-Variance Two-Point Estimators and Directionally Aligned Perturbations

ICLR 2025 · Spotlight

Identification of Intermittent Temporal Latent Process

ICLR 2025 · Poster

Robust Reinforcement Learning in Finance: Modeling Market Impact with Elliptic Uncertainty Sets

NeurIPS 2025 · Poster

Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models

ICLR 2025 · Poster

Revisiting Convergence: Shuffling Complexity Beyond Lipschitz Smoothness

ICML 2025 · Poster

OmnixR: Evaluating Omni-modality Language Models on Reasoning across Modalities

ICLR 2025 · Poster

Revisiting Convergence: A Study on Shuffling-Type Gradient Methods

ICLR 2025 · Rejected

Provably Mitigating Corruption, Overoptimization, and Verbosity Simultaneously in Offline and Online RLHF/DPO Alignment

ICLR 2025 · Rejected

Bilevel ZOFO: Bridging Parameter-Efficient and Zeroth-Order Techniques for Efficient LLM Fine-Tuning and Meta-Training

ICLR 2025 · Rejected

A Watermark for Order-Agnostic Language Models

ICLR 2025 · Poster

Towards Optimal Multi-draft Speculative Decoding

ICLR 2025 · Poster

Cross-Modal Few-Shot Learning: a Generative Transfer Learning Framework

ICLR 2025 · Withdrawn

De-mark: Watermark Removal in Large Language Models

ICML 2025 · Poster

LLaVA-Critic: Learning to Evaluate Multimodal Models

ICLR 2025 · Withdrawn

Escaping Saddle Point Efficiently in Minimax and Bilevel Optimizations

ICLR 2025 · Withdrawn

Data Efficient Continual Learning of Large Language Model

ICLR 2025 · Rejected

Pseudo- vs. True-Randomness: Rethinking Distortion-Free Watermarks of Language Models under Watermark Key Collisions

ICLR 2025 · Withdrawn

From Pixels to Prose: A Large Dataset of Dense Image Captions

ICLR 2025 · Withdrawn

OPTune: Efficient Online Preference Tuning

ICLR 2025 · Withdrawn

Achieve Performatively Optimal Policy for Performative Reinforcement Learning

ICML 2025 · Rejected

A Fast Federated Method for Minimax Problems with Sequential Convergence Guarantees

ICLR 2025 · Withdrawn

Hybrid Fine-Tuning of LLMs: Theoretical Insights on Generalized Smoothness and Convergence

ICLR 2025 · Withdrawn

Sharper Analysis of Data Echoing and New Communication-Efficient Algorithm for Data Parallelism

ICLR 2025 · Rejected

Provably Mitigating Corruption, Overoptimization, and Verbosity Simultaneously in Offline and Online RLHF/DPO Alignment

ICML 2025 · Rejected

Efficient Machine Unlearning for Deep Generative Models by Mitigating Optimization Conflicts

ICLR 2025 · Withdrawn

RAG$^C$: Towards Copyright Protection for Knowledge Bases of Retrieval-augmented Language Models

ICLR 2025 · Withdrawn

Rectified Robust Policy Optimization for Robust Constrained Reinforcement Learning without Strong Duality

ICML 2025 · Rejected

Collaborators (20)