Dacheng Tao
~Dacheng_Tao1
Total papers: 89
Submissions per year: 44.5
Average rating
Accepted: 57/89
Venue distribution
ICLR: 45
NeurIPS: 31
ICML: 13
Papers (89)
2025: 61 papers
4
On Geometry-Enhanced Parameter-Efficient Fine-Tuning for 3D Scene Segmentation
NeurIPS 2025 Poster
3
JustLogic: A benchmark for natural language deductive reasoning
ICLR 2025 Rejected
3
ICLShield: Exploring and Mitigating In-Context Learning Backdoor Attacks
ICML 2025 Poster
4
Physics-based Skinned Dance Generation with RL Fine-tuning
ICLR 2025 Withdrawn
4
Convergent Differential Privacy Analysis for General Federated Learning
NeurIPS 2025 Rejected
4
ER-AAE: A quantum state preparation approach based on entropy reduction
ICLR 2025 Rejected
4
Problem-dependent Quantum Circuit Design Based on Entropy Matching
ICLR 2025 Withdrawn
4
Learning system dynamics without forgetting
ICLR 2025 Poster
5
Adaptive Defense against Harmful Fine-Tuning for Large Language Models via Bayesian Data Scheduler
NeurIPS 2025 Spotlight
3
Self-Verification Provably Prevents Model Collapse in Recursive Synthetic Training
NeurIPS 2025 Poster
4
Energy-based Backdoor Defense Against Federated Graph Learning
ICLR 2025 Oral
4
Neuron-level Balance between Stability and Plasticity in Deep Reinforcement Learning
ICLR 2025 Withdrawn
5
MD-LSM: An Efficient Tool for Real-time Monitoring Linear Separability of Hidden-layer Outputs of Deep Networks
ICLR 2025 Rejected
4
A Statistical Approach for Controlled Training Data Detection
ICLR 2025 Poster
5
Safety Reasoning with Guidelines
ICML 2025 Poster
3
NoVo: Norm Voting off Hallucinations with Attention Heads in Large Language Models
ICLR 2025 Poster
5
Near-Optimal Online Learning for Multi-Agent Submodular Coordination: Tight Approximation and Communication Efficiency
ICLR 2025 Spotlight
4
Test-Time Training for Out-of-Distribution Industrial Anomaly Detection via Robust Distribution Alignment
ICLR 2025 Withdrawn
4
Hough Voting-based Self-Training for Vision-Language Model Adaptation
ICLR 2025 Withdrawn
3
A Theoretical Perspective: How to Prevent Model Collapse in Self-consuming Training Loops
ICLR 2025 Poster
4
AsymRnR: Video Diffusion Transformers Acceleration with Asymmetric Reduction and Restoration
ICML 2025 Poster
4
Memory Efficient Block Coordinate Descent Method for Forward-Only Second-Order Finetuning of LLM Models
ICML 2025 Rejected
5
Dynamic Neural Fortresses: An Adaptive Shield for Model Extraction Defense
ICLR 2025 Poster
4
Memory-Efficient Block Coordinate Descent for Hessian-Informed Zeroth-Order Optimizer
ICLR 2025 Withdrawn
4
The Energy Loss Phenomenon in RLHF: A New Perspective on Mitigating Reward Hacking
ICML 2025 Poster
4
Open-Vocabulary Customization from CLIP via Data-Free Knowledge Distillation
ICLR 2025 Oral
4
Detoxifying Large Language Models via Autoregressive Reward Guided Representation Editing
NeurIPS 2025 Poster
4
D$^2$GS: Dense Depth Regularization for LiDAR-free Urban Scene Reconstruction
NeurIPS 2025 Poster
4
LoRA Recycle: Towards Fine-Tuning-Free Visual Foundation Model via Double-Efficient Data-Free Meta-Learning
ICLR 2025 Withdrawn
4
FusionBench: A Comprehensive Benchmark of Deep Model Fusion
ICLR 2025 Rejected
4
Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning
ICML 2025 Oral
4
NBSP: A Neuron-Level Framework for Balancing Stability and Plasticity in Deep Reinforcement Learning
NeurIPS 2025 Rejected
4
VORTA: Efficient Video Diffusion via Routing Sparse Attention
NeurIPS 2025 Poster
4
AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding
NeurIPS 2025 Poster
3
SPAZER: Spatial-Semantic Progressive Reasoning Agent for Zero-shot 3D Visual Grounding
NeurIPS 2025 Poster
3
Multinoulli Extension: A Lossless Yet Effective Probabilistic Framework for Subset Selection over Partition Constraints
ICML 2025 Poster
4
Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer
ICML 2025 Poster
4
Combatting Dimensional Collapse in LLM Pre-Training Data via Submodular File Selection
ICLR 2025 Oral
4
LLM Data Selection and Utilization via Dynamic Bi-level Optimization
ICML 2025 Poster
4
Decision Mixer: Integrating Long-term and Local Dependencies via Dynamic Token Selection for Decision-Making
ICML 2025 Poster
4
T2V-OptJail: Discrete Prompt Optimization for Text-to-Video Jailbreak Attacks
NeurIPS 2025 Poster
4
Continual Task Learning through Adaptive Policy Self-Composition
ICLR 2025 Withdrawn
4
Learn from Downstream and Be Yourself in Multimodal Large Language Models Fine-Tuning
ICML 2025 Poster
4
Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning
NeurIPS 2025 Poster
5
SEGA: Shaping Semantic Geometry for Robust Hashing under Noisy Supervision
NeurIPS 2025 Poster
4
SeRL: Self-play Reinforcement Learning for Large Language Models with Limited Data
NeurIPS 2025 Poster
4
Value-Guided Decision Transformer: A Unified Reinforcement Learning Framework for Online and Offline Settings
NeurIPS 2025 Poster
4
Effective Policy Learning for Multi-Agent Online Coordination Beyond Submodular Objectives
NeurIPS 2025 Poster
4
Merging on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging
NeurIPS 2025 Poster
4
Ada-R1: Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization
NeurIPS 2025 Poster
4
Retrieval-Augmented Perception: High-resolution Image Perception Meets Visual RAG
ICML 2025 Oral
3
AiDE-Q: Synthetic Labeled Datasets Can Enhance Learning Models for Quantum Property Estimation
NeurIPS 2025 Poster
4
Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning
NeurIPS 2025 Poster
4
Be Confident: Uncovering Overfitting in MLLM Multi-Task Tuning
ICML 2025 Poster
4
Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces
ICLR 2025 Rejected
4
Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation
NeurIPS 2025 Poster
4
Hough Voting-based Prompt Learning for Segment Anything Model
ICLR 2025 Withdrawn
4
R1-ShareVL: Incentivizing Reasoning Capabilities of Multimodal Large Language Models via Share-GRPO
NeurIPS 2025 Poster
4
Tackling Continual Offline RL through Selective Weights Activation on Aligned Spaces
NeurIPS 2025 Poster
4
FreDF: Learning to Forecast in the Frequency Domain
ICLR 2025 Poster
4
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search
NeurIPS 2025 Spotlight
2024: 28 papers
4
A-FedPD: Aligning Dual-Drift is All Federated Primal-Dual Learning Needs
NeurIPS 2024 Spotlight
4
Offline Behavior Distillation
NeurIPS 2024 Poster
4
Which mode is better for federated learning? Centralized or Decentralized
ICLR 2024 Rejected
4
Deep Graph Mating
NeurIPS 2024 Poster
3
PointHR: Exploring High-Resolution Architectures for 3D Point Cloud Segmentation
ICLR 2024 Withdrawn
4
Prompt-Tuning Decision Transformer with Preference Ranking
ICLR 2024 Withdrawn
4
Graph Decision Transformer
ICLR 2024 Withdrawn
5
Topology-aware Embedding Memory for Learning on Expanding Graphs
ICLR 2024 Rejected
4
Learning Multi-Agent Communication from Graph Modeling Perspective
ICLR 2024 Poster
4
Prompt Tuning with Diffusion for Few-Shot Pre-trained Policy Generalization
ICLR 2024 Rejected
4
Convergence of Bayesian Bilevel Optimization
ICLR 2024 Spotlight
4
Solving Continual Offline Reinforcement Learning with Decision Transformer
ICLR 2024 Withdrawn
4
One For All: Towards Training One Graph Model For All Classification Tasks
ICLR 2024 Spotlight
4
MG-Net: Learn to Customize QAOA with Circuit Depth Awareness
NeurIPS 2024 Poster
4
Uniform Approximation of Equivariant/Invariant Neural Networks
ICLR 2024 Withdrawn
4
Decomposed Prompt Decision Transformer for Efficient Unseen Task Generalization
NeurIPS 2024 Poster
3
An Attention-based Approach for Bayesian Optimization with Dependencies
ICLR 2024 Withdrawn
4
GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching
NeurIPS 2024 Poster
4
InfoRM: Mitigating Reward Hacking in RLHF via Information-Theoretic Reward Modeling
NeurIPS 2024 Poster
4
Enhancing Personal Decentralized Federated Learning through Model Decoupling
ICLR 2024 Rejected
4
AdaMerging: Adaptive Model Merging for Multi-Task Learning
ICLR 2024 Poster
4
Federated Tuning for Black Box Large Models
ICLR 2024 Withdrawn
4
Task-Distributionally Robust Data-Free Meta-Learning
ICLR 2024 Withdrawn
4
FHA-Kitchens: A Novel Dataset for Fine-Grained Hand Action Recognition in Kitchen Scenes
ICLR 2024 Rejected
4
Parameter-Efficient Multi-Task Model Fusion with Partial Linearization
ICLR 2024 Poster
4
Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages
ICLR 2024 Poster
3
Instructed Diffuser with Temporal Condition Guidance for Offline Reinforcement Learning
ICLR 2024 Withdrawn
4
Are Large Language Models Really Robust to Word-Level Perturbations?
ICLR 2024 Withdrawn