PaperHub

Dacheng Tao

~Dacheng_Tao1

89
Total papers
44.5
Average submissions per year
5.9
Average rating
Accepted: 57/89
Venue distribution
ICLR
45
NeurIPS
31
ICML
13

Published papers (89)

2025 (61 papers)

6.8
4

On Geometry-Enhanced Parameter-Efficient Fine-Tuning for 3D Scene Segmentation

NeurIPS 2025 Poster
5.0
3

JustLogic: A benchmark for natural language deductive reasoning

ICLR 2025 Rejected
6.3
3

ICLShield: Exploring and Mitigating In-Context Learning Backdoor Attacks

ICML 2025 Poster
4.8
4

Physics-based Skinned Dance Generation with RL Fine-tuning

ICLR 2025 Withdrawn
6.8
4

Convergent Differential Privacy Analysis for General Federated Learning

NeurIPS 2025 Rejected
4.8
4

ER-AAE: A quantum state preparation approach based on entropy reduction

ICLR 2025 Rejected
3.0
4

Problem-dependent Quantum Circuit Design Based on Entropy Matching

ICLR 2025 Withdrawn
6.3
4

Learning system dynamics without forgetting

ICLR 2025 Poster
8.2
5

Adaptive Defense against Harmful Fine-Tuning for Large Language Models via Bayesian Data Scheduler

NeurIPS 2025 Spotlight
7.0
3

Self-Verification Provably Prevents Model Collapse in Recursive Synthetic Training

NeurIPS 2025 Poster
7.5
4

Energy-based Backdoor Defense Against Federated Graph Learning

ICLR 2025 Oral
3.0
4

Neuron-level Balance between Stability and Plasticity in Deep Reinforcement Learning

ICLR 2025 Withdrawn
6.0
5

MD-LSM: An Efficient Tool for Real-time Monitoring Linear Separability of Hidden-layer Outputs of Deep Networks

ICLR 2025 Rejected
6.5
4

A Statistical Approach for Controlled Training Data Detection

ICLR 2025 Poster
6.8
5

Safety Reasoning with Guidelines

ICML 2025 Poster
6.0
3

NoVo: Norm Voting off Hallucinations with Attention Heads in Large Language Models

ICLR 2025 Poster
6.8
5

Near-Optimal Online Learning for Multi-Agent Submodular Coordination: Tight Approximation and Communication Efficiency

ICLR 2025 Spotlight
3.8
4

Test-Time Training for Out-of-Distribution Industrial Anomaly Detection via Robust Distribution Alignment

ICLR 2025 Withdrawn
4.0
4

Hough Voting-based Self-Training for Vision-Language Model Adaptation

ICLR 2025 Withdrawn
6.7
3

A Theoretical Perspective: How to Prevent Model Collapse in Self-consuming Training Loops

ICLR 2025 Poster
5.5
4

AsymRnR: Video Diffusion Transformers Acceleration with Asymmetric Reduction and Restoration

ICML 2025 Poster
4.4
4

Memory Efficient Block Coordinate Descent Method for Forward-Only Second-Order Finetuning of LLM Models

ICML 2025 Rejected
6.8
5

Dynamic Neural Fortresses: An Adaptive Shield for Model Extraction Defense

ICLR 2025 Poster
4.0
4

Memory-Efficient Block Coordinate Descent for Hessian-Informed Zeroth-Order Optimizer

ICLR 2025 Withdrawn
7.2
4

The Energy Loss Phenomenon in RLHF: A New Perspective on Mitigating Reward Hacking

ICML 2025 Poster
8.0
4

Open-Vocabulary Customization from CLIP via Data-Free Knowledge Distillation

ICLR 2025 Oral
7.8
4

Detoxifying Large Language Models via Autoregressive Reward Guided Representation Editing

NeurIPS 2025 Poster
7.3
4

D$^2$GS: Dense Depth Regularization for LiDAR-free Urban Scene Reconstruction

NeurIPS 2025 Poster
4.5
4

LoRA Recycle: Towards Fine-Tuning-Free Visual Foundation Model via Double-Efficient Data-Free Meta-Learning

ICLR 2025 Withdrawn
5.0
4

FusionBench: A Comprehensive Benchmark of Deep Model Fusion

ICLR 2025 Rejected
8.3
4

Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning

ICML 2025 Oral
7.8
4

NBSP: A Neuron-Level Framework for Balancing Stability and Plasticity in Deep Reinforcement Learning

NeurIPS 2025 Rejected
6.8
4

VORTA: Efficient Video Diffusion via Routing Sparse Attention

NeurIPS 2025 Poster
6.8
4

AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding

NeurIPS 2025 Poster
7.6
3

SPAZER: Spatial-Semantic Progressive Reasoning Agent for Zero-shot 3D Visual Grounding

NeurIPS 2025 Poster
6.3
3

Multinoulli Extension: A Lossless Yet Effective Probabilistic Framework for Subset Selection over Partition Constraints

ICML 2025 Poster
4.9
4

Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer

ICML 2025 Poster
8.0
4

Combatting Dimensional Collapse in LLM Pre-Training Data via Submodular File Selection

ICLR 2025 Oral
5.5
4

LLM Data Selection and Utilization via Dynamic Bi-level Optimization

ICML 2025 Poster
7.2
4

Decision Mixer: Integrating Long-term and Local Dependencies via Dynamic Token Selection for Decision-Making

ICML 2025 Poster
6.8
4

T2V-OptJail: Discrete Prompt Optimization for Text-to-Video Jailbreak Attacks

NeurIPS 2025 Poster
4.8
4

Continual Task Learning through Adaptive Policy Self-Composition

ICLR 2025 Withdrawn
7.2
4

Learn from Downstream and Be Yourself in Multimodal Large Language Models Fine-Tuning

ICML 2025 Poster
6.4
4

Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning

NeurIPS 2025 Poster
7.1
5

SEGA: Shaping Semantic Geometry for Robust Hashing under Noisy Supervision

NeurIPS 2025 Poster
6.8
4

SeRL: Self-play Reinforcement Learning for Large Language Models with Limited Data

NeurIPS 2025 Poster
7.3
4

Value-Guided Decision Transformer: A Unified Reinforcement Learning Framework for Online and Offline Settings

NeurIPS 2025 Poster
7.3
4

Effective Policy Learning for Multi-Agent Online Coordination Beyond Submodular Objectives

NeurIPS 2025 Poster
6.4
4

Merging on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging

NeurIPS 2025 Poster
6.8
4

Ada-R1: Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization

NeurIPS 2025 Poster
8.3
4

Retrieval-Augmented Perception: High-resolution Image Perception Meets Visual RAG

ICML 2025 Oral
7.0
3

AiDE-Q: Synthetic Labeled Datasets Can Enhance Learning Models for Quantum Property Estimation

NeurIPS 2025 Poster
7.3
4

Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning

NeurIPS 2025 Poster
6.6
4

Be Confident: Uncovering Overfitting in MLLM Multi-Task Tuning

ICML 2025 Poster
4.8
4

Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces

ICLR 2025 Rejected
6.8
4

Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation

NeurIPS 2025 Poster
4.0
4

Hough Voting-based Prompt Learning for Segment Anything Model

ICLR 2025 Withdrawn
6.4
4

R1-ShareVL: Incentivizing Reasoning Capabilities of Multimodal Large Language Models via Share-GRPO

NeurIPS 2025 Poster
6.8
4

Tackling Continual Offline RL through Selective Weights Activation on Aligned Spaces

NeurIPS 2025 Poster
7.0
4

FreDF: Learning to Forecast in the Frequency Domain

ICLR 2025 Poster
7.8
4

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

NeurIPS 2025 Spotlight

2024 (28 papers)

6.3
4

A-FedPD: Aligning Dual-Drift is All Federated Primal-Dual Learning Needs

NeurIPS 2024 Spotlight
5.5
4

Offline Behavior Distillation

NeurIPS 2024 Poster
4.5
4

Which mode is better for federated learning? Centralized or Decentralized

ICLR 2024 Rejected
6.3
4

Deep Graph Mating

NeurIPS 2024 Poster
4.0
3

PointHR: Exploring High-Resolution Architectures for 3D Point Cloud Segmentation

ICLR 2024 Withdrawn
4.5
4

Prompt-Tuning Decision Transformer with Preference Ranking

ICLR 2024 Withdrawn
4.5
4

Graph Decision Transformer

ICLR 2024 Withdrawn
5.4
5

Topology-aware Embedding Memory for Learning on Expanding Graphs

ICLR 2024 Rejected
7.0
4

Learning Multi-Agent Communication from Graph Modeling Perspective

ICLR 2024 Poster
6.0
4

Prompt Tuning with Diffusion for Few-Shot Pre-trained Policy Generalization

ICLR 2024 Rejected
6.5
4

Convergence of Bayesian Bilevel Optimization

ICLR 2024 Spotlight
4.0
4

Solving Continual Offline Reinforcement Learning with Decision Transformer

ICLR 2024 Withdrawn
7.0
4

One For All: Towards Training One Graph Model For All Classification Tasks

ICLR 2024 Spotlight
5.3
4

MG-Net: Learn to Customize QAOA with Circuit Depth Awareness

NeurIPS 2024 Poster
3.0
4

Uniform Approximation of Equivariant/Invariant Neural Networks

ICLR 2024 Withdrawn
5.3
4

Decomposed Prompt Decision Transformer for Efficient Unseen Task Generalization

NeurIPS 2024 Poster
3.0
3

An Attention-based Approach for Bayesian Optimization with Dependencies

ICLR 2024 Withdrawn
5.0
4

GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching

NeurIPS 2024 Poster
5.5
4

InfoRM: Mitigating Reward Hacking in RLHF via Information-Theoretic Reward Modeling

NeurIPS 2024 Poster
4.8
4

Enhancing Personal Decentralized Federated Learning through Model Decoupling

ICLR 2024 Rejected
6.5
4

AdaMerging: Adaptive Model Merging for Multi-Task Learning

ICLR 2024 Poster
4.3
4

Federated Tuning for Black Box Large Models

ICLR 2024 Withdrawn
4.8
4

Task-Distributionally Robust Data-Free Meta-Learning

ICLR 2024 Withdrawn
4.0
4

FHA-Kitchens: A Novel Dataset for Fine-Grained Hand Action Recognition in Kitchen Scenes

ICLR 2024 Rejected
7.0
4

Parameter-Efficient Multi-Task Model Fusion with Partial Linearization

ICLR 2024 Poster
6.0
4

Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages

ICLR 2024 Poster
4.3
3

Instructed Diffuser with Temporal Condition Guidance for Offline Reinforcement Learning

ICLR 2024 Withdrawn
3.0
4

Are Large Language Models Really Robust to Word-Level Perturbations?

ICLR 2024 Withdrawn