PaperHub

Min Lin

~Min_Lin1

Total papers: 39
Submissions per year: 19.5
Average rating: 6.1
Accepted: 28/39
Venue distribution
ICLR: 23
NeurIPS: 10
ICML: 3
COLM: 3

Publications (39 papers)

2025 (20 papers)

6.1
4

PipeOffload: Improving Scalability of Pipeline Parallelism with Memory Optimization

ICML 2025 (Poster)
6.4
4

Lifelong Safety Alignment for Language Models

NeurIPS 2025 (Poster)
5.8
3

Sample-Efficient Alignment for LLMs

NeurIPS 2025 (Rejected)
7.2
4

Continual Reinforcement Learning by Planning with Online World Models

ICML 2025 (Spotlight)
5.0
4

Sample Efficient Alignment for LLMs

ICLR 2025 (Rejected)
6.0
4

LLM-based Multi-Agents System Attack via Continuous Optimization with Discrete Efficient Search

COLM 2025 (Poster)
4.0
3

Meta-Unlearning on Diffusion Models: Preventing Relearning Unlearned Concepts

ICLR 2025 (Withdrawn)
7.2
4

Improving Your Model Ranking on Chatbot Arena by Vote Rigging

ICML 2025 (Poster)
6.4
4

Optimizing Anytime Reasoning via Budget Relative Policy Optimization

NeurIPS 2025 (Poster)
5.3
4

SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction

ICLR 2025 (Rejected)
7.8
8

Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates

ICLR 2025 (Oral)
4.0
4

Denial-of-Service Poisoning Attacks against Large Language Models

ICLR 2025 (Withdrawn)
6.0
4

A Closer Look at Machine Unlearning for Large Language Models

ICLR 2025 (Poster)
4.5
4

Test-Time Backdoor Attacks on Multimodal Large Language Models

ICLR 2025 (Withdrawn)
6.5
4

Scaling up Masked Diffusion Models on Text

ICLR 2025 (Poster)
6.0
4

Bootstrapping Language Models with DPO Implicit Rewards

ICLR 2025 (Poster)
8.0
3

Understanding R1-Zero-Like Training: A Critical Perspective

COLM 2025 (Poster)
7.3
3

When Attention Sink Emerges in Language Models: An Empirical View

ICLR 2025 (Spotlight)
7.2
5

RegMix: Data Mixture as Regression for Language Model Pre-training

ICLR 2025 (Spotlight)
6.3
4

Improved Techniques for Optimization-Based Jailbreaking on Large Language Models

ICLR 2025 (Poster)

2024 (19 papers)

6.8
4

Automatic Functional Differentiation in JAX

ICLR 2024 (Poster)
7.3
4

Stochastic Taylor Derivative Estimator: Efficient amortization for arbitrary differential operators

NeurIPS 2024 (Oral)
5.3
4

Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform

ICLR 2024 (Poster)
7.3
3

Finetuning Text-to-Image Diffusion Models for Fairness

ICLR 2024 (Oral)
6.3
3

Pipeline Parallelism with Controllable Memory

NeurIPS 2024 (Poster)
6.7
3

Locality Sensitive Sparse Encoding for Learning World Models Online

ICLR 2024 (Poster)
7.0
4

Zero Bubble (Almost) Pipeline Parallelism

ICLR 2024 (Poster)
7.0
4

Amortized Eigendecomposition for Neural Networks

NeurIPS 2024 (Poster)
4.8
5

On Memorization in Diffusion Models

ICLR 2024 (Rejected)
6.3
3

Graph Diffusion Policy Optimization

NeurIPS 2024 (Poster)
6.0
3

Intriguing Properties of Data Attribution on Diffusion Models

ICLR 2024 (Poster)
3.5
4

BAFFLE: A Baseline of Backpropagation-Free Federated Learning

ICLR 2024 (Rejected)
5.8
5

Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses

NeurIPS 2024 (Poster)
5.8
5

Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs

NeurIPS 2024 (Poster)
5.3
3

A Recipe for Watermarking Diffusion Models

ICLR 2024 (Rejected)
5.3
3

LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

ICLR 2024 (Rejected)
6.5
4

LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

COLM 2024 (Poster)
6.3
3

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

NeurIPS 2024 (Poster)
4.8
4

Differentiable Optimization in Plane-Wave Density Functional Theory for Solid States

ICLR 2024 (Rejected)