Tiansheng Huang
~Tiansheng_Huang1
9
论文总数
4.5
年均投稿
平均评分
接收情况6/9
会议分布
ICLR
5
NeurIPS
3
ICML
1
发表论文 (9 篇)
20255 篇
4
Antidote: Post-fine-tuning Safety Alignment for Large Language Models against Harmful Fine-tuning Attack
ICML 2025Poster
4
Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturbation
ICLR 2025Oral
4
Probe before You Talk: Towards Black-box Defense against Backdoor Unalignment for Large Language Models
ICLR 2025Poster
4
Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation
NeurIPS 2025Poster
5
PokéLLMon: A Grounding and Reasoning Benchmark for Large Language Models in Pokémon Battles
ICLR 2025withdrawn
20244 篇
4
Vaccine: Perturbation-aware Alignment for Large Language Models against Harmful Fine-tuning Attack
NeurIPS 2024Poster
4
Lisa: Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning Attack
NeurIPS 2024Poster
4
Silencer: Pruning-aware Backdoor Defense for Decentralized Federated Learning
ICLR 2024withdrawn
3
FusionShot: Boosting Few Shot Learners with Focal-Diversity Optimized Ensemble Method
ICLR 2024withdrawn