Assessing Safety Risks and Quantization-aware Safety Patching for Quantized Large Language Models
We conduct a systematic safety evaluation of quantization for LLMs and propose a novel safety-patching algorithm for quantized LLMs.
Abstract
Reviews and Discussion
The paper studies an important but relatively underexplored problem. The evaluation of existing quantization approaches clearly demonstrates the safety issues of quantization, and Q-resafe delivers significant benefits according to widely accepted safety measurements. Experimental results show that Q-resafe outperforms existing methods like SFT and DPO on a purely utility-oriented dataset. It achieves comparable or better safety while being much more efficient, making it particularly suitable for resource-constrained applications.
Questions for Authors
The paper discusses the trade-off between safety and utility in quantized models. Could the authors provide more insight into the trade-offs in terms of computational efficiency, particularly when applying the proposed Q-resafe method in real-world scenarios? How might this method scale for larger models or more complex datasets?
Claims and Evidence
Yes.
Methods and Evaluation Criteria
Yes
Theoretical Claims
Yes
Experimental Design and Analysis
Yes
Supplementary Material
Yes, I reviewed the supplementary material, which includes detailed code and additional experimental results. The code is well-organized and demonstrates the implementation of the proposed methods, including the Q-resafe technique.
Relation to Existing Literature
Quantization is a crucial technique for deploying LLMs in resource-constrained environments, making the study of its impact on model safety essential. Previous works, such as [1], [2], and [3], have focused on evaluating performance and alignment of quantized models, primarily addressing post-training quantization and compression methods. These studies highlight the growing attention to quantization’s challenges, especially in terms of efficiency and alignment. This paper contributes by specifically addressing the safety implications of quantization, an area less explored in prior research. It introduces a novel approach for identifying and updating safety-critical weights, considering various quantization bit-widths and datasets. This broader analysis provides a more comprehensive understanding of the safety risks of quantization and offers practical solutions to mitigate them.
Essential References Not Discussed
A few related works [1,2,3] are essential to understanding the broader context of the paper's key contributions but are not currently cited or discussed. Specifically, the paper would benefit from referencing the following works:
[1] "Exploiting LLM Quantization." arXiv preprint arXiv:2405.18137 (2024).
[2] "HarmLevelBench: Evaluating Harm-Level Compliance and the Impact of Quantization on Model Alignment." In NeurIPS Safe Generative AI Workshop 2024.
[3] "Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression." arXiv preprint arXiv:2403.15447 (2024).
Other Strengths and Weaknesses
Strengths:
- This paper comprehensively studies the safety issues brought about by LLM quantization.
- The assessment reveals an interesting phenomenon: quantization damages safety more than utility.
- The proposed Q-resafe method is novel and effective. It examines the feasibility of identifying and updating a small portion of safety-critical weights, exploits existing tools to identify these weights, and constructs a pair of masking matrices corresponding to the LoRA variables.

Weaknesses:
- The safety assessment method is quite simple, and prefix matching on model outputs may yield false positives [4].
- The paper does not cover algorithms that are already popular, such as LLM.int8(), NF4, and FP4, implemented in the bitsandbytes library. The safety issues of these popular algorithms could have a larger impact on users.

Reference: [4] Mazeika, Mantas, et al. "HarmBench: A standardized evaluation framework for automated red teaming and robust refusal." arXiv preprint arXiv:2402.04249 (2024).
Other Comments or Suggestions
The paper is well-organized and clearly presents its focus on the safety evaluation of quantized LLMs. The key contributions and methodology are thoroughly explained. However, there are some formatting and stylistic issues that could improve clarity:
(1) In Algorithm 1, the descriptions of the input parameters "Re-evaluation interval K" and "Safety-critical threshold τ" could be made clearer. For instance, "K" could be specified as "the number of steps after which the critical weights are re-evaluated."
(2) In terms of citations, in line 46 the reference "(cop, 2023)" and in line 86 the reference "(cha, 2023)" should begin with capital letters for consistency.
(3) In lines 87 and 91, the word "moreover" is used repeatedly. Varying the transition phrases would help the flow of the text and make the writing more engaging.
(4) There is also an extra indentation in line 260 at the start of the paragraph, which should be removed to maintain consistent formatting throughout the paper.
(5) In Section 5.2, the references to figures and tables are inconsistent. For example, line 369 refers to "Table 4," while line 384 mentions "Tab 5." It would be clearer to standardize these references throughout the paper. The layout of the figures and tables could also be improved for clarity. In Figure 1, the numbers for "LLM-QAT" and "QLora" (82.9 and 83.4) are not aligned with the other values. Adjusting this alignment will improve the visual consistency of the figure.
(6) In the appendix, line 758 refers to "Fig. 3," while previous mentions use "Figure." Consistency in referring to figures and tables will improve the overall presentation.
Thank you for the positive and detailed comments. We have revised the manuscript to include [1–3], which highlight the safety challenges posed by quantization, and clarified our position relative to these works.
Responses to Weaknesses
1. Thank you for raising this important concern. To reduce the risk of false positives, we use the HarmBench classifier [5], a fine-tuned binary classifier that identifies whether a response is malicious. In addition, we follow [4] by using the harmfulness score benchmark (ranging from 1 to 5), with GPT-4 as the judge, where higher scores indicate greater harm. We compute the average harmfulness score across all evaluated instructions on AdvBench.
We re-evaluate different quantized Llama-2-7B-Chat models with the above benchmarks.
| Model | ASR (prefix match) | ASR (HarmBench) | Harmful Score |
|---|---|---|---|
| FP16 | 0.3 | 0.3 | 1.02 |
| INT4 | 42.4 | 41.5 | 2.69 |
| INT8 | 39.1 | 38.9 | 2.54 |
We will include a new table in the revised manuscript comparing ASR results measured with prefix matching versus the HarmBench classifier.
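For clarity, the sketch below shows how the two ASR variants could be computed; the refusal-prefix list, the `harm_classifier` callable, and the function names are illustrative placeholders rather than our exact evaluation code.

```python
# Illustrative sketch only: hypothetical helpers, not the exact evaluation script.
REFUSAL_PREFIXES = ("I'm sorry", "I cannot", "I can't", "As an AI")  # assumed refusal cues

def asr_prefix_match(responses):
    """Prefix-matching ASR: count a response as an attack success unless it
    begins with a known refusal prefix (prone to false positives)."""
    successes = sum(0 if r.strip().startswith(REFUSAL_PREFIXES) else 1 for r in responses)
    return 100.0 * successes / len(responses)

def asr_classifier(prompts, responses, harm_classifier):
    """Classifier-based ASR: a fine-tuned binary judge (e.g., the HarmBench
    classifier) decides whether each response actually fulfills the harmful request."""
    labels = [harm_classifier(p, r) for p, r in zip(prompts, responses)]  # 1 = harmful
    return 100.0 * sum(labels) / len(labels)
```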
2. We have conducted additional experiments using LLM.int8(), FP4, and NF4 on LLaMA-7B-Chat in Appendix C.1. Our findings indicate that Q-resafe maintains strong safety performance compared to these methods, highlighting its robustness even against established techniques from the bitsandbytes library.
Responses to Other Comments Or Suggestions
We acknowledge that this statement was missing appropriate references, and we apologize for the confusion. These studies highlight the risks associated with quantization, particularly in safety-critical scenarios. We have corrected the manuscript to include them and rephrased the statement for clarity.
Responses to Questions
We thank the reviewer for raising this important question regarding the computational trade-offs of Q-resafe in real-world applications. We initially experimented with integrating safety mechanisms during the quantization phase using techniques such as the SNIP score [6]. While this approach offered some benefits, it proved insufficient for LLMs, where dynamic interactions between activations and weights significantly influence performance [7]. Static, weight-only methods struggled to generalize across varying inputs and downstream tasks.
For these reasons, we adopted the current post-hoc safety-patching approach, which dynamically identifies and updates safety-critical weights during the model's usage. By recalculating these weights and updating LoRA-style masking matrices every K iterations, our approach ensures better adaptability to changing inputs while maintaining robust performance and safety alignment.
This approach introduces minimal computational overhead, as confirmed by our runtime benchmarks (Tables 4 and 5). Moreover, since it avoids full re-quantization or retraining, it remains scalable to large models and complex tasks.
Its modular and sparse nature also makes Q-resafe readily scalable: it can be combined with parallelization techniques for importance scoring, selective layer targeting, or low-rank adaptation frameworks for larger models and more complex tasks. We are currently exploring these extensions to further improve scalability in real-world deployments.
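To make the masking step concrete, the following is a minimal, self-contained sketch of how a pair of masking matrices for the LoRA factors could be derived from per-weight safety-importance scores; the function name, the threshold `tau`, and the row/column projection onto the LoRA factors are simplifying assumptions for illustration, not the exact implementation.

```python
import torch

def build_safety_masks(weight_importance: torch.Tensor, tau: float, rank: int):
    """Hypothetical sketch: given per-weight safety-importance scores for a
    d_out x d_in layer, flag rows/columns that contain safety-critical weights
    and build masks matching the LoRA factors B (d_out x r) and A (r x d_in),
    so that only the flagged part of the low-rank update is allowed to change."""
    critical = weight_importance >= tau                    # d_out x d_in boolean map
    row_mask = critical.any(dim=1, keepdim=True).float()   # d_out x 1
    col_mask = critical.any(dim=0, keepdim=True).float()   # 1 x d_in
    mask_B = row_mask.expand(-1, rank).clone()             # d_out x r
    mask_A = col_mask.expand(rank, -1).clone()             # r x d_in
    return mask_B, mask_A

# Toy usage: random scores stand in for the real safety-importance estimates.
d_out, d_in, r = 8, 16, 4
scores = torch.rand(d_out, d_in)
mask_B, mask_A = build_safety_masks(scores, tau=0.9, rank=r)
B, A = torch.zeros(d_out, r), torch.randn(r, d_in)
delta_W = (B * mask_B) @ (A * mask_A)                      # masked low-rank update
```

In the patching loop described above, these masks would be refreshed every K steps, whenever the safety-critical weights are re-identified.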
References
[4] Qi, Xiangyu, et al. "Fine-tuning aligned language models compromises safety, even when users do not intend to!." ICLR 2023.
[5] Mazeika, Mantas, et al. "HarmBench: A standardized evaluation framework for automated red teaming and robust refusal." NeurIPS 2024.
[6] Lee, Namhoon, et al. "SNIP: Single-shot network pruning based on connection sensitivity." ICLR 2019.
[7] Liu, Zechun, et al. "LLM-QAT: Data-free quantization aware training for large language models." arXiv preprint arXiv:2305.17888.
I have read the author's rebuttal and the comments of other reviewers. The additional experimental details provided by the authors fully address my previous concerns. The novelty and contribution of the work remain sufficient. Therefore, I maintain the recommendation to accept.
Dear Reviewer,
Thank you very much for your thoughtful follow-up and for taking the time to read our rebuttal and the comments from other reviewers. We truly appreciate your positive feedback and recognition of our efforts to address the previous concerns.
We will carefully incorporate the additional details and improvements into the final manuscript to further enhance its clarity and completeness.
Once again, thank you for your valuable feedback and support.
Best regards,
The authors of 9330.
This paper measures the safety of quantized methods and proposes Q-resafe, a method that restores the safety capabilities of quantized LLMs by adding a LoRA module.
Update after rebuttal
Thank you for providing the additional results. I will raise my score to 2. However, I still have some confusion regarding the relationship with current works in the quantization safety area, which was not addressed by the authors during the rebuttal period. Also, vector graphics are important: the current figures are difficult to interpret, which makes it hard to see the key points.
Questions for Authors
- Could you provide evaluation results for weight-activation quantization methods, which are currently missing?
- Please include a discussion on the missing references mentioned above.
- Could you update the existing quantization baselines with more state-of-the-art methods?
- Could you offer more insights into the safety results of quantized LLMs? For example, what can we learn from the data presented in Table 3?
- Does Q-resafe rely on a specific safety-patching dataset? Does its main effectiveness stem from the LoRA and DPO modules, or is it the dataset itself?
Claims and Evidence
The claims are mostly supported by the existing experiments presented in the paper.
Methods and Evaluation Criteria
The evaluation criteria are appropriate and align with the goals of the study.
Theoretical Claims
There is no detailed theory section provided in the paper.
Experimental Design and Analysis
- The experimental models, such as Llama-2-7B-Chat and Gemma-7B-Instruct, are outdated.
- The evaluated quantization settings are not comprehensive, as the paper misses the evaluation and discussion of weight-activation quantization methods.
- The evaluation baselines are not state-of-the-art in this area. Notably, methods like OmniQuant (for weight-only quantization), LoftQ, and LQ-LoRA (for combining LoRA with quantization) are absent.
Supplementary Material
I have reviewed all sections of the supplementary material.
Relation to Existing Literature
The paper does not clearly differentiate itself from existing works on quantization safety. The main differences from previous research are not well articulated.
Essential References Not Discussed
While the paper claims to evaluate the safety of quantization methods for LLMs, it does not discuss key references in the field:
[1] PB-LLM: Partially Binarized Large Language Models. ICLR 2024.
[2] BiLLM: Pushing the Limit of Post-Training Quantization for LLMs. ICML 2024.
[3] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs. NeurIPS 2024.
[4] OstQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting. ICLR 2025.
[5] SpinQuant: LLM Quantization with Learned Rotations. ICLR 2025.
Including discussions on these works would strengthen the paper’s context and relevance.
Other Strengths and Weaknesses
- The topic of safety for quantized LLMs is not novel, and the paper lacks detailed discussions of existing works. The Introduction section mentions only a few sentences on this matter.
- The analysis in Section 3.2 mainly summarizes evaluation results, but it lacks deeper insights.
- Safety is primarily assessed using ASR, which may be too limited as a single perspective.
Other Comments or Suggestions
Figure 2 is unclear, which may hinder readers' understanding. I suggest improving its clarity.
We sincerely thank the reviewer for their valuable feedback. Due to the word limit, we give a summary response here and are eager to have a more in-depth discussion.
Response to Questions:
Q1. Supplementary evaluation results
We updated the quantization baselines with state-of-the-art methods on Llama-3.1-8B-Instruct, covering various quantization strategies:
- Weight-only quantization: PB-LLM, BiLLM;
- Weight-activation quantization: SpinQuant, OmniQuant, DuQuant;
- Quantization with fine-tuning: LQ-LoRA, LoftQ.
For safety evaluation, we use the harmful score benchmark (1 to 5) from Qi et al., where higher scores indicate more harm. For utility evaluation, we report perplexity (PPL) on WikiText2.
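For reference, perplexity on WikiText-2 is commonly computed with the non-overlapping-window recipe sketched below; this is a generic sketch assuming a Hugging Face causal LM and tokenizer, and the window length and helper name are our assumptions rather than the exact script used.

```python
import torch
from datasets import load_dataset

@torch.no_grad()
def wikitext2_ppl(model, tokenizer, seq_len=2048, device="cuda"):
    """Generic perplexity evaluation on the WikiText-2 test split using
    non-overlapping windows (a common recipe in quantization papers)."""
    data = load_dataset("wikitext", "wikitext-2-raw-v1", split="test")
    ids = tokenizer("\n\n".join(data["text"]), return_tensors="pt").input_ids.to(device)
    n_windows = ids.size(1) // seq_len
    nll_sum = 0.0
    for i in range(n_windows):
        window = ids[:, i * seq_len : (i + 1) * seq_len]
        # With labels == input_ids, HF causal LMs shift internally and return
        # the mean next-token cross-entropy for the window.
        nll_sum += model(window, labels=window).loss.float().item() * seq_len
    return float(torch.exp(torch.tensor(nll_sum / (n_windows * seq_len))))
```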
Q2. Missing references
Thank you for highlighting the missing references [1-5], which cover partial binarization, post-training quantization, and dual transformation for optimization. We have integrated these methods into our updated evaluation (see Q3) for a comprehensive comparison. Results show that while these methods aim to preserve utility, they often overlook safety, which is essential for practical applications.
Q3. Update quantization baseline
| Methods | Setting | ASR | Harmful Score | PPL |
|---|---|---|---|---|
| FP16 | W16A16 | 0.2 | 1.02 | 6.14 |
| PB-LLM | W2A16 | 86.4 | 4.46 | 6.30 |
| BiLLM | W4A16 | 48.1 | 2.95 | 32.48 |
| OmniQuant | W4A4 | 79.5 | 4.18 | 6.45 |
| SpinQuant | W4A4 | 36.4 | 2.47 | 6.30 |
| DuQuant | W4A4 | 26.8 | 2.15 | 8.06 |
| LQ-LoRA | W4A16 | 34.7 | 2.40 | 6.42 |
| LoftQ | W4A16 | 65.3 | 3.64 | 6.18 |
| Q-resafe (Ours) | W4A16 | 3.4 | 1.09 | 6.35 |
Results indicate that existing methods primarily target minimizing utility loss but often ignore safety. Q-resafe shows superior safety performance while maintaining competitive utility. Due to time constraints, additional experiments with lower-bit settings will be included in the revision.
Why we initially used Llama-2-7B-Chat and Gemma-7B-Instruct: These models were chosen for their widespread use in existing quantized LLM studies, ensuring comparability. Despite being relatively outdated, they remain relevant for studying safety issues.
Q4. Insights into the safety results of quantized LLMs
From Table 3, we observe:
- Quantization Techniques: All methods degrade safety. Weight-only quantization has less impact than quantization with fine-tuning. Parameter-efficient fine-tuning (e.g., LQ-LoRA) tends to degrade safety more than full-parameter fine-tuning.
- Bit Precision: Lower-bit quantization significantly affects safety, indicating a trade-off between efficiency and safety.
- Model: Models with stronger reasoning capabilities tend to preserve safety better after quantization compared to chat-optimized models.
To validate these findings, we preserved a fraction of the most safety-critical weights as FP16 while quantizing the rest to FP4 on Llama-3.1-8B-Instruct:
| Fraction preserved (FP16) | ASR | Harmful Score |
|---|---|---|
| 0 (full FP4) | 68.5 | 3.81 |
| 0.05 | 5.4 | 1.25 |
| 0.1 | 3.7 | 1.19 |
| 0.2 | 2.5 | 1.15 |
| 0.5 | 0.4 | 1.06 |
| 1 (full FP16) | 0.2 | 1.02 |
Even preserving a small portion of safety-critical weights significantly improves safety while retaining quantization efficiency.
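As an illustration of this ablation (not the exact experimental code), the sketch below keeps the highest-scoring fraction of weights at full precision and fake-quantizes the rest with a simple symmetric round-to-nearest scheme standing in for FP4; the importance scores and function name are placeholders.

```python
import torch

def preserve_topfrac_fp16(weight: torch.Tensor, scores: torch.Tensor,
                          frac: float, n_bits: int = 4) -> torch.Tensor:
    """Illustrative ablation: keep the top `frac` fraction of weights (ranked by
    safety-importance score) at full precision and fake-quantize the remainder
    with symmetric per-tensor round-to-nearest (a stand-in for the real FP4 scheme)."""
    qmax = 2 ** (n_bits - 1) - 1
    scale = weight.abs().max() / qmax
    quantized = torch.clamp(torch.round(weight / scale), -qmax - 1, qmax) * scale
    if frac <= 0:
        return quantized
    k = max(1, int(frac * weight.numel()))
    threshold = scores.flatten().topk(k).values.min()  # score cutoff for the kept fraction
    return torch.where(scores >= threshold, weight, quantized)

# Toy usage with random scores standing in for safety-importance estimates.
w = torch.randn(256, 256)
s = torch.rand_like(w)
w_mixed = preserve_topfrac_fp16(w, s, frac=0.05)
```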
Q5. Safety patch dependency analysis
Q4 results indicate that Q-resafe's effectiveness does not depend on fine-tuning or specific safety-patching datasets. Instead, preserving safety-critical weights is crucial. Q-resafe requires minimal safety-patching samples (e.g., 10) to maintain performance. We will provide more insightful analysis as suggested.
Response to Other Strengths and Weaknesses & Suggestions:
We appreciate the suggestions and have enhanced our discussion by incorporating recent studies, analyzing the effects of different quantization strategies on safety, and including more Harmful Score and PPL results. These updates improve clarity and comprehensiveness in our revised manuscript.
The paper presents a comprehensive safety evaluation of quantized LLMs. Observing that quantized LLMs may produce harmful information, the authors propose an algorithm to enhance their safety.
Questions for Authors
- It is weird that the Q-Resafe method achieves significantly better performance compared to quantization fine-tuning methods in terms of utility scores when comparing Tables 3 and 4. What are the reasons behind this phenomenon? Additionally, if fine-tuning methods were applied to other datasets, what utility scores could the quantized models achieve?
Claims and Evidence
The claims in the paper are supported by clear and convincing evidence.
Methods and Evaluation Criteria
The proposed method and evaluation make sense for the problem.
Theoretical Claims
There do not appear to be any theoretical claims in the paper.
Experimental Design and Analysis
I have checked the soundness of the experimental designs. From my perspective, there are three possible weaknesses:
- I am concerned that the proposed Q-Resafe method may negatively affect the performance of quantized LLMs. Although MT-Bench scores are provided in Table 4, a more comprehensive evaluation (such as Common-sense QA and PPL, which are commonly used to validate quantized LLMs) would enhance the persuasiveness of the results.
- Lack of baselines. There are no baseline methods compared with Q-Resafe. The authors could consider modifying existing methods that, while not originally designed for this area, could be adapted to the settings.
- Table 3. I suggest the authors add new lines comparing the quantized models with their non-quantized counterparts.
Supplementary Material
I have reviewed the appendix.
Relation to Existing Literature
The key contributions can be summarized as follows:
- The paper conducts a risk evaluation of quantized LLMs. The results demonstrate that quantized LLMs can potentially generate harmful information, posing risks to their real-world applications.
- An effective algorithm for mitigating the risks of quantized LLMs is proposed. Experiments show that the method is both efficient and effective.
Essential References Not Discussed
To my knowledge, no essential references are missing.
Other Strengths and Weaknesses
Strengths:
- The paper comprehensively investigates the risk problem in quantized LLMs and introduces a method to mitigate it using a calibration dataset. The structure is clear and well-organized.
- The paper is well-motivated.
- The proposed method is evaluated through various experiments to verify its effectiveness and efficiency.
Weaknesses:
- It is unusual that the quantization fine-tuning method performs worse than AWQ in terms of ASR scores. The underlying reasons remain to be investigated. Additionally, I suggest the authors include more quantization methods without fine-tuning in Table 3, such as RTN (Round-To-Nearest).
Other Comments or Suggestions
- Figure 2 is somewhat unclear. The authors should use vector graphics.
We greatly value your feedback and appreciate your insightful suggestions. We have carefully considered your comments and made the necessary improvements. We are eager to have more profound discussions to further enhance our work.
For Weaknesses: Add more quantization methods without fine-tuning
Thank you for your suggestions. We have updated the quantization baselines using more state-of-the-art methods on Llama-3.1-8B-Instruct. The newly included methods cover weight-only quantization: PB-LLM (ICLR'24), BiLLM (ICML'24); and weight-activation quantization: SpinQuant (ICLR'25), OmniQuant (ICLR'24), DuQuant (NeurIPS'24). For utility evaluation, we report the perplexity (PPL) on WikiText-2.
| Methods | Setting | ASR | PPL |
|---|---|---|---|
| FP16 | W16A16 | 0.2 | 6.14 |
| RTN | W4A16 | 35.6 | 10.95 |
| PB-LLM | W2A16 | 86.4 | 6.30 |
| BiLLM | W4A16 | 48.1 | 32.48 |
| OmniQuant | W4A4 | 79.5 | 6.45 |
| SpinQuant | W4A4 | 36.4 | 6.30 |
| DuQuant | W4A4 | 26.8 | 8.06 |
| Q-resafe (Ours) | W4A16 | 3.4 | 6.35 |
These results indicate that many state-of-the-art quantization methods focus on reducing utility losses but ignore the preservation of safety.
For Question: Utility scores of different quantization methods
Thank you for highlighting this point. The utility scores presented in Table 3 are derived from fine-tuning on the harmful dataset (Risk-III), while the scores for Q-resafe in Table 4 are based on fine-tuning using the benign/utility dataset (Risk-I, UltraChat_200k). We acknowledge the potential confusion and will make the experimental settings more explicit in the revised paper.
It is important to clarify that the utility of the model produced by QAT varies depending on the data used. Table 11 shows the safety and utility comparison of fine-tuned LLMs on Risk-I examples (UltraChat_200k) after 1 epoch of training. It can be seen that the utility of LLM-QAT and Q-resafe is even higher than that of the full-precision model. The reason Q-resafe performs better is that it uses DPO, whereas LLM-QAT uses SFT.
For Comments: Use vector graphics.
Thank you for your insightful suggestions! We will update all of the figures with vector graphics in the revision of our paper.
This paper studies the effects of LLM quantization on safety risks and proposes a quantization-aware safety patching framework. The reviews were mixed to positive (2, 3, 4), with the largest and most consistent concern being the lack of engagement with benchmarks and existing literature on quantization. The authors responded to these concerns by adding more benchmarks and explanations, which swayed some reviewers (9psV) but not others (HJL8).
Given the discussion, I find the contribution notable and of value to the ICML community, and am therefore advocating for weak accept.