影响力指数

98.73/100

前 0.1%

全站排名 #24

发表论文128 篇

平均评分5.5

年均产出42.7 篇/年

Li Shen

Associate Professor@Sun Yat-Sen University·中国·OpenReview

研究方向

artificial intelligence · reinforcement learning · deep learning · optimization

Compactness and Consistency: A Conjoint Framework for Deep Graph Clustering

Diffusion Language Model Knows the Answer Before It Decodes

GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching

ICLR 2026Poster

Convergent Differential Privacy Analysis for General Federated Learning

ICLR 2026Poster

OptMerge: Unifying Multimodal LLM Capabilities and Modalities via Model Merging

ICLR 2026Poster

EchoBench: Benchmarking Sycophancy in Medical Large Vision-Language Models

ICLR 2026Rejected

MaskPro: Linear-Space Probabilistic Learning for Strict (N:M)-Sparsity on LLMs

ICLR 2026Poster

Understanding the Dynamics of Forgetting and Generalization in Continual Learning via the Neural Tangent Kernel

ICLR 2026Poster

Joint Selection for Large-Scale Pre-Training Data via Policy Gradient-based Mask Learning

ICLR 2026Poster

TrojanTO: Action-Level Backdoor Attacks Against Trajectory Optimization Models

ICLR 2026Poster

Memory Efficient Fine-Tuning of LLMs via Forward-Only Hessian-Free Coordinate Descent

ICLR 2026Rejected

Reliable Poisoned Sample Detection against Backdoor Attacks Enhanced by Sharpness Aware Minimization

ICLR 2026Poster

NBSP: A Neuron-Level Framework for Balancing Stability and Plasticity in Deep Reinforcement Learning

ICLR 2026Rejected

MergOPT: A Merge-Aware Optimizer for Robust Model Merging

ICLR 2026Poster

Towards Optimism-Pessimism Trade-off in Model-based Offline-to-Online Reinforcement Learning

ICLR 2026Rejected

Qualitative and Quantitative Quality Assessment of Low-Light Enhanced Images: A Dataset and Benchmark Metric

ICLR 2026Withdrawn

The State of Reinforcement Finetuning for Transformer-based Agents

ICLR 2026Poster

AdaGC: Improving Training Stability for Large Language Model Pretraining

ICLR 2026Rejected

UltraHorizon: Benchmarking LLM-Agent Capabilities in Ultra Long-Horizon Scenarios

ICLR 2026Rejected

Rewiring Experts on the Fly: Continuous Rerouting for Better Online Adaptation in Mixture-of-Expert models

ICLR 2026Rejected

Merge to Remember: Sharpness-Aware Isotropic Merging for Continual Learning

ICLR 2026Rejected

LOST: Low-rank and Sparse Pre-training for Large Language Models

ICLR 2026Rejected

TeZO: Empowering the Low-Rankness on the Temporal Dimension in the Zeroth-Order Optimization for Fine-tuning LLMs

ICLR 2026Rejected

Model Unmerging: Making Your Models Unmergeable for Secure Model Sharing

ICLR 2026Rejected

SimReg: Achieving Higher Convergence and Generalization in the LLM Pretraining via Embedding Similarity Regularization

ICLR 2026Rejected

MAPLE: Masked Adapter Prototype Learning for OOD generalization

ICLR 2026Rejected

Mediater: Memory-efficient LLM Merging with Less Parameter Conflicts and Uncertainty Based Routing

ICLR 2026Rejected

SeWA: Selective Weight Average via Probabilistic Masking

ICLR 2026Rejected

Stability and Generalization of Split Learning : Sequential or Federated

ICLR 2026Withdrawn

Q-learning Penalized Transformer for Safe Offline Reinforcement Learning

ICLR 2026Rejected

Stable-SPAM: How to Stably Train Large Language Models in 4-Bit

ICLR 2026Withdrawn

ELAS: Efficient Pre-Training of Low-Rank Large Language Models via 2:4 Activation Sparsity

ICLR 2026Withdrawn

Beyond Two-Stage Training: Cooperative SFT and RL for LLM Reasoning

ICLR 2026Withdrawn

Branching Memory: Task-Specific Expansion for Continual Learning in Large Language Models

ICLR 2026Withdrawn

Stealthy Fine-Grained Editing Attack on MLLMs

ICLR 2026Withdrawn

Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning

Retrieval-Augmented Perception: High-resolution Image Perception Meets Visual RAG

Adaptive Defense against Harmful Fine-Tuning for Large Language Models via Bayesian Data Scheduler

NeurIPS 2025Spotlight

Open-Vocabulary Customization from CLIP via Data-Free Knowledge Distillation

Combatting Dimensional Collapse in LLM Pre-Training Data via Submodular File Selection

RobustMerge: Parameter-Efficient Model Merging for MLLMs with Direction Robustness

NeurIPS 2025Spotlight

NBSP: A Neuron-Level Framework for Balancing Stability and Plasticity in Deep Reinforcement Learning

NeurIPS 2025Rejected

Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More

ICML 2025Poster

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

NeurIPS 2025Spotlight

Unveiling the Power of Multiple Gossip Steps: A Stability-Based Generalization Analysis in Decentralized Training

NeurIPS 2025Spotlight

Enhancing Learning with Label Differential Privacy by Vector Approximation

ICLR 2025Spotlight

Value-Guided Decision Transformer: A Unified Reinforcement Learning Framework for Online and Offline Settings

NeurIPS 2025Poster

AlphaDecay: Module-wise Weight Decay for Heavy-Tailed Balancing in LLMs

NeurIPS 2025Poster

Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning

NeurIPS 2025Poster

Effective Policy Learning for Multi-Agent Online Coordination Beyond Submodular Objectives

NeurIPS 2025Poster

Decision Mixer: Integrating Long-term and Local Dependencies via Dynamic Token Selection for Decision-Making

ICML 2025Poster

Self-Verification Provably Prevents Model Collapse in Recursive Synthetic Training

NeurIPS 2025Poster

Mitigating the Backdoor Effect for Multi-Task Model Merging via Safety-Aware Subspace

ICLR 2025Poster

Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation

NeurIPS 2025Poster

Convergent Differential Privacy Analysis for General Federated Learning

NeurIPS 2025Rejected

Tackling Continual Offline RL through Selective Weights Activation on Aligned Spaces

NeurIPS 2025Poster

CHPO: Constrained Hybrid-action Policy Optimization for Reinforcement Learning

NeurIPS 2025Poster

Safety Reasoning with Guidelines

ICML 2025Poster

MixPrompt: Efficient Mixed Prompting for Multimodal Semantic Segmentation

NeurIPS 2025Poster

Ada-R1: Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization

NeurIPS 2025Poster

Dynamic Neural Fortresses: An Adaptive Shield for Model Extraction Defense

ICLR 2025Poster

Near-Optimal Online Learning for Multi-Agent Submodular Coordination: Tight Approximation and Communication Efficiency

ICLR 2025Spotlight

Beyond Two-Stage Training: Integrating SFT and RL for Improved Reasoning in LLMs

NeurIPS 2025Rejected

Layer as Puzzle Pieces: Compressing Large Language Models through Layer Concatenation

NeurIPS 2025Poster

Merging on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging

NeurIPS 2025Poster

Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging

NeurIPS 2025Poster

Robust Policy Expansion for Offline-to-Online RL under Diverse Data Corruption

NeurIPS 2025Poster

RoMa: A Robust Model Watermarking Scheme for Protecting IP in Diffusion Models

NeurIPS 2025Poster

R1-ShareVL: Incentivizing Reasoning Capabilities of Multimodal Large Language Models via Share-GRPO

NeurIPS 2025Poster

PEARL: Towards Permutation-Resilient LLMs

ICLR 2025Poster

Multinoulli Extension: A Lossless Yet Effective Probabilistic Framework for Subset Selection over Partition Constraints

ICML 2025Poster

Targeted Low-rank Refinement: Enhancing Sparse Language Models with Precision

ICML 2025Poster

Contextual Bandits for Unbounded Context Distributions

ICML 2025Poster

Continual Model Merging without Data: Dual Projections for Balancing Stability and Plasticity

NeurIPS 2025Poster

Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought

NeurIPS 2025Poster

Efficient Federated Learning against Byzantine Attacks and Data Heterogeneity via Aggregating Normalized Gradients

NeurIPS 2025Poster

Understanding the Stability-based Generalization of Personalized Federated Learning

ICLR 2025Poster

Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent

ICML 2025Poster

GraphCL: Graph-based Clustering for Semi-Supervised Medical Image Segmentation

ICML 2025Poster

AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization

ICLR 2025Rejected

LightSAM: Parameter-Agnostic Sharpness-Aware Minimization

ICLR 2025Withdrawn

FusionBench: A Comprehensive Benchmark of Deep Model Fusion

ICLR 2025Rejected

Towards Constraint-aware Learning for Resource Allocation in NFV-enabled Networks

ICLR 2025Rejected

Vulnerability-Aware Alignment: Mitigating Uneven Forgetting in Harmful Fine-Tuning

ICML 2025Poster

Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer

ICML 2025Poster

Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces

ICLR 2025Rejected

Continual Task Learning through Adaptive Policy Self-Composition

ICLR 2025Withdrawn

Exploring One-Shot Federated Learning by Model Inversion and Token Relabel with Vision Transformers

ICLR 2025Rejected

Learning with User-Level Local Differential Privacy

ICLR 2025Rejected

Targeted Low-rank Refinement: Enhancing Sparse Neural Networks with Precision

ICLR 2025Withdrawn

Continuous Spiking Graph ODE Networks

ICLR 2025Withdrawn

LoRA Recycle: Towards Fine-Tuning-Free Visual Foundation Model via Double-Efficient Data-Free Meta-Learning

ICLR 2025Withdrawn

Open-World Test-Time Training: Self-Training with Contrastive Learning

ICLR 2025Withdrawn

DynamicKV: Task-Aware Adaptive KV Cache Compression for Long Context LLMs

ICLR 2025Rejected

Memory Efficient Block Coordinate Descent Method for Forward-Only Second-Order Finetuning of LLM Models

ICML 2025Rejected

Simultaneous Computation and Memory Efficient Zeroth-Order Optimizer for Fine-Tuning Large Language Models

ICLR 2025Withdrawn

Memory-Efficient Block Coordinate Descent for Hessian-Informed Zeroth-Order Optimizer

ICLR 2025Withdrawn

Towards Understanding Memory buffer based Continual Learning

ICLR 2025Withdrawn

AGLP: A Graph Learning Perspective for Semi-supervised Domain Adaptation

ICLR 2025Withdrawn

Language Model for Large-Text Transmission in Noisy Quantum Communications

ICLR 2025Withdrawn

SgCG: Semantic-guided Contrastive Generalization for Medical Image Segmentation

ICLR 2025Withdrawn

合作者 (20)