PaperHub
Overall rating: 5.4/10 · Rejected (5 reviewers)
Individual ratings: 6, 5, 5, 6, 5 (lowest 5, highest 6, std 0.5)
Average confidence: 4.0
Correctness: 2.6 · Contribution: 2.6 · Presentation: 2.6
ICLR 2025

One Step Diffusion-based Super-Resolution with Time-Aware Distillation

OpenReview · PDF
Submitted: 2024-09-19 · Updated: 2025-02-05

Keywords: Efficient diffusion, Super-resolution, Knowledge distillation

Reviews and Discussion

Review
Rating: 6

The authors propose a time-aware diffusion distillation method, named TAD-SR, in which a novel score distillation strategy is introduced to align the score functions between the outputs of the student and teacher models after minor noise perturbation. This distillation strategy eliminates the inherent bias in score distillation sampling (SDS) and enables the student model to focus more on high-frequency image details by sampling at smaller time steps. Furthermore, a time-aware discriminator is designed to mitigate performance limitations stemming from distillation; it distinguishes the diffused distributions of real and generated images under varying noise disturbance levels by injecting time information.

Strengths

  1. The proposed distillation strategy is simple and straightforward, which can eliminate the inherent bias in score distillation sampling (SDS) and enable the student models to focus more on high-frequency image details.
  2. The proposed time-aware discriminator can differentiate between real and synthetic data, contributing to the generation of high-quality images.
  3. This work is well written and easy to read.

Weaknesses

  1. It is confusing which is the final output of the model at inference, $z_0^{stu}$ or $\hat{z}_0^{stu}$? This is not clearly indicated in Figure 4. Please state it explicitly in the text and figure.
  2. The authors should clarify if the teacher model is used at all during inference, or if it is only used during training. If I understand correctly, only the student model samples one step, and then the teacher model is used later to sample multiple steps to get the final clean latent, so the model performance relies heavily on the performance of the teacher model, and is not exactly efficient.
  3. What is the purpose of setting the weighting function (ω = 1/CS)? Please provide intuition for why this weighting function was chosen, and what effect it has on the training process or results.
  4. To eliminate the dependence of the proposed method on the ResShift teacher model, ablation experiments should be conducted with different teacher models to validate the effectiveness of the proposed method.
  5. The experiments lack comparisons with the most relevant distillation methods, including DMD, DEQ[1], DFOSD[2], etc. Among them, DMD, a new diffusion model, utilizes similar score distillation techniques to the proposed HSD. DEQ and DFOSD are both efficient and relevant diffusion models, which require one-step diffusion distillation or even no distillation.
  6. In the experimental section, the authors compare many GAN and transformer-related methods. However, the proposed method is a diffusion model and should be compared with the most relevant diffusion models to validate its efficiency, especially accelerated diffusion models, including OSEDiff[3], DPM++[4], Unipc[5], etc.
  7. The authors claim that the method is designed to accomplish effective and efficient image super-resolution, but did not include a complexity comparison of the different methods (including parameters, sampling steps, running time, MACs, etc.), which is crucial for diffusion models. Please provide a Table to compare these computational complexity metrics with the key baselines.
  8. Are there any limiting conditions for using the method? The authors should discuss and analyze the limitations of the proposed method. It is recommended to add a discussion of potential limitations or of cases where the proposed method might not perform as well.

References

[1] Geng Z, Pokle A, Kolter J Z. One-step diffusion distillation via deep equilibrium models[C]. Advances in Neural Information Processing Systems, 2024.

[2] Li J, Cao J, Zou Z, et al. Distillation-free one-step diffusion for real-world image super-resolution[J]. arXiv preprint arXiv:2410.04224, 2024.

[3] Wu R, Sun L, Ma Z, et al. One-step effective diffusion network for real-world image super-resolution[J]. arXiv preprint arXiv:2406.08177, 2024.

[4] Lu C, Zhou Y, Bao F, et al. DPM-Solver++: Fast solver for guided sampling of diffusion probabilistic models[J]. arXiv preprint arXiv:2211.01095, 2022.

[5] Zhao W, Bai L, Rao Y, et al. UniPC: A unified predictor-corrector framework for fast sampling of diffusion models[C]. Advances in Neural Information Processing Systems, 2024.

Questions

See the Weaknesses part. The authors should carefully describe the details of the method to enhance the readability and clarity of the paper. In addition, comparisons with the most relevant methods (including complexity comparisons) should be added to clarify the innovation and effectiveness of the method, and its advancement should be demonstrated through relevant experiments.

I am inclined to raise my score if the authors can resolve my concerns.

Comment

Table 7: Complexity comparison among different SD-based SR methods. All methods are tested on the ×4 (128→512) SR task, and inference time is measured on a V100 GPU.

| Method | StableSR | PASD | SeeSR | SeeSR+UniPC | SeeSR+DPMSolver | AddSR | OSEDiff | TAD-SR |
|---|---|---|---|---|---|---|---|---|
| NFE | 200 | 20 | 50 | 10 | 10 | 1 | 1 | 1 |
| Inference time (s) | 17.76 | 13.51 | 8.4 | 2.14 | 2.13 | 0.64 | 0.48 | 0.64 |

Q8: Are there any limiting conditions for using the method? The authors should discuss and analyze the limitations of the proposed method. It is recommended to add a discussion of potential limitations or of cases where the proposed method might not perform as well.

A8: Thank you for your suggestion. Although our single-step method demonstrates strong performance, it shares a common limitation with current single-step distillation methods: increasing the number of inference steps alone does not yield better performance. Thus, developing a distillation method that matches the performance of state-of-the-art single-step approaches while enabling additional inference steps to enhance performance is a key area of our ongoing research.

References

[1] Yin, T., Gharbi, M., Zhang, R., Shechtman, E., Durand, F., Freeman, W. T., & Park, T. (2024). One-step diffusion with distribution matching distillation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 6613-6623).

[2] Hertz, A., Aberman, K., & Cohen-Or, D. (2023). Delta denoising score. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 2328-2337).

Comment

Thank you for your response. I choose to keep my score as is mainly because the performance improvement appears to be somewhat marginal (or, in some cases, the improvement in certain metrics comes at the cost of others), which also validates my previous concerns.

Comment

I appreciate the response from the authors, but I will keep my score. The authors did not fully address my concerns. First, the explanation regarding the proposed model's heavy reliance on the teacher model does not convince me, and the authors did not explain the efficiency of the proposed method. Second, the authors did not provide an intuitive reason for choosing the weighting function or explain how it affects the training process or results. More importantly, the complexity comparison should not just compare inference time; it should include other key figures for diffusion models, such as parameter count, sampling steps, and MACs.

Comment

Table 4: Quantitative results of different SR methods. The best and second best results are highlighted in bold and italic. ∗ indicates that the result was obtained by replicating the method in the paper.

| Methods | ImageNet-test LPIPS ↓ | ImageNet-test CLIPIQA ↑ | ImageNet-test MUSIQ ↑ | RealSR CLIPIQA ↑ | RealSR MUSIQ ↑ | RealSet65 CLIPIQA ↑ | RealSet65 MUSIQ ↑ |
|---|---|---|---|---|---|---|---|
| LDM-15 | 0.269 | 0.512 | 46.419 | 0.384 | 49.317 | 0.427 | 47.488 |
| ResShift-15 | 0.231 | 0.592 | 53.660 | 0.596 | 59.873 | 0.654 | 61.330 |
| SinSR-1 | 0.221 | 0.611 | 53.357 | 0.689 | 61.582 | 0.715 | 62.169 |
| SinSR*-1 | 0.231 | 0.599 | 52.462 | 0.691 | 60.865 | 0.712 | 62.575 |
| DMD*-1 | 0.246 | 0.612 | 54.124 | 0.709 | 63.610 | 0.723 | 66.177 |
| TAD-SR-1 | 0.227 | 0.652 | 57.533 | 0.741 | 65.701 | 0.734 | 67.500 |

Table 5: Generative performance on unconditional CIFAR-10. The best results are highlighted in bold.

| Method | DDPM | DDIM | EDM (Teacher) | DPM-solver2 | UniPC | CD-L2 | CD-LPIPS | DEQ | DMD | Ours |
|---|---|---|---|---|---|---|---|---|---|---|
| NFE ↓ | 1000 | 50 | 35 | 12 | 8 | 1 | 1 | 1 | 1 | 1 |
| FID ↓ | 3.17 | 4.67 | 1.88 | 5.28 | 5.10 | 7.90 | 3.55 | 6.91 | 3.77 | 2.31 |

Q6: In the experimental section, the authors compare many GAN and transformer-related methods. However, the proposed method is a diffusion model and should be compared with the most relevant diffusion models to validate its efficiency, especially accelerated diffusion models, including OSEDiff[3], DPM++[4], Unipc[5], etc.

A6: Thank you for your suggestion. Since OSEDiff is an SD-based SR method, we compared our approach to OSEDiff while distilling the SD-based SR model SeeSR. This ensures a fair comparison, as both methods were trained on the same dataset. As shown in Tables 1, 2, and 3, our method outperforms OSEDiff across most evaluation metrics.

In response to the reviewers' suggestions, we have also incorporated the dedicated sampler methods UniPC [5] and DPM++ [4] into Tables 1, 2, and 3. (Note that we did not apply these samplers to ResShift, as ResShift modifies the standard Markov chain, creating challenges for its adaptation to these samplers.) Despite this, the results clearly demonstrate that our method significantly outperforms methods employing these samplers.

Q7: The authors claim that the method is designed to accomplish effective and efficient image super-resolution, but did not include a complexity comparison of the different methods (including parameters, sampling steps, running time, MACs, etc.), which is crucial for diffusion models. Please provide a Table to compare these computational complexity metrics with the key baselines.

A7: Based on the reviewers' feedback, we have included a complexity comparison between TAD-SR and baseline methods, as presented in Tables 6 and 7. Table 6 focuses on comparisons with GAN-based methods and diffusion-based super-resolution methods trained from scratch. The results demonstrate that TAD-SR accelerates the teacher model, ResShift, to a single inference step, improving its speed by approximately tenfold. Table 7 highlights a comparison of inference time with SD-based super-resolution methods, revealing that our method's inference latency is only 7.6% of that of the teacher model, SeeSR.

Table 6: Complexity comparison among different SR methods. All methods are tested on the ×4 (64→256) SR task, and inference time is measured on an A100 GPU.

| Method | ESRGAN | RealSR-JPEG | BSRGAN | SwinIR | RealESRGAN | DASR | LDM | ResShift | SinSR | TAD-SR |
|---|---|---|---|---|---|---|---|---|---|---|
| NFE | 1 | 1 | 1 | 1 | 1 | 1 | 15 | 15 | 1 | 1 |
| Inference time (s) | 0.038 | 0.038 | 0.038 | 0.107 | 0.038 | 0.022 | 0.408 | 0.682 | 0.058 | 0.058 |

Comment

Thank you for your comments and feedback. We address your concerns here.

Q1: It is confusing which is the final output of the model at inference, $z_0^{stu}$ or $\hat{z}_0^{stu}$? This is not clearly indicated in Figure 4. Please state it explicitly in the text and figure.

A1: Thank you for pointing out this issue. $z_0^{stu}$ is the final output of the student model. $\hat{z}_0^{stu}$ denotes the clean value predicted by the teacher model after noise is re-added to the student model's output; it is used only to calculate the loss. We will revise the text and figures in the manuscript to make this clearer.

Q2: The authors should clarify if the teacher model is used at all during inference, or if it is only used during training. If I understand correctly, only the student model samples one step, and then the teacher model is used later to sample multiple steps to get the final clean latent, so the model performance relies heavily on the performance of the teacher model, and is not exactly efficient.

A2: Thank you for your suggestion. As the reviewer understands, the teacher model is only used during training. Additionally, we not only leverage the knowledge from the teacher model but also incorporate the ground truth (GT) into the distillation framework through adversarial learning to provide additional supervision for the model. Therefore, the performance of our method is not solely dependent on the teacher model's performance.

Q3: What is the purpose of setting the weighting function (ω = 1/CS )? Please provide intuition for why this weighting function was chosen, and what effect it has on the training process or results.

A3: Apologies for the confusion. What we intended to convey is that our score distillation loss is averaged over both spatial and channel dimensions, which facilitates model optimization [1][2]. However, there was an error in the formula expression, and we will correct this in the next version of the manuscript.

Q4: In order to eliminate the dependence of the proposed method on the teacher model of ResShift, the relevant ablation experiments should be conducted by replacing the different teacher models to validate the effectiveness of the proposed method.

A4: Thank you for your suggestion. We have included the results of distilling the SD-based SR method SeeSR into a single step using TAD-SR. The quantitative and qualitative experimental results are presented in Tables 1, 2, and 3. As shown in the tables, our proposed distillation method demonstrates strong generalization capabilities, effectively distilling different teacher models into a single step and generating promising results.

Table 1: Quantitative comparison with the state of the art on the RealSR dataset. Following the experimental setup of SeeSR, the LR images in the RealSR dataset were center-cropped to 128 × 128. The best and second best results are highlighted in bold and italic.

| Methods | PSNR ↑ | LPIPS ↓ | FID ↓ | NIQE ↓ | CLIPIQA ↑ | MUSIQ ↑ | MANIQA ↑ |
|---|---|---|---|---|---|---|---|
| BSRGAN | 26.49 | 0.267 | 141.28 | 5.66 | 0.512 | 63.28 | 0.376 |
| RealESRGAN | 25.78 | 0.273 | 135.18 | 5.83 | 0.449 | 60.36 | 0.373 |
| LDL | 25.09 | 0.277 | 142.71 | 6.00 | 0.430 | 58.04 | 0.342 |
| FeMaSR | 25.17 | 0.294 | 141.05 | 5.79 | 0.541 | 59.06 | 0.361 |
| StableSR-200 | 25.63 | 0.302 | 133.40 | 5.76 | 0.528 | 61.11 | 0.366 |
| ResShift-15 | 26.34 | 0.346 | 149.54 | 6.87 | 0.542 | 56.06 | 0.375 |
| PASD-20 | 26.67 | 0.344 | 122.30 | 6.06 | 0.519 | 62.92 | 0.404 |
| SeeSR-50 | 25.24 | 0.301 | 125.42 | 5.39 | 0.670 | 69.82 | 0.540 |
| SeeSR (UniPC-10) | 25.86 | 0.281 | 122.41 | 5.53 | 0.577 | 67.12 | 0.476 |
| SeeSR (DPMSolver-10) | 25.90 | 0.281 | 122.46 | 5.54 | 0.581 | 67.12 | 0.478 |
| SinSR-1 | 26.16 | 0.308 | 142.44 | 5.75 | 0.630 | 60.96 | 0.399 |
| AddSR-1 | 23.12 | 0.309 | 132.01 | 5.54 | 0.552 | 67.14 | 0.488 |
| OSEDiff-1 | 25.15 | 0.292 | 123.49 | 5.63 | 0.668 | 68.99 | 0.474 |
| TAD-SR-1 | 24.50 | 0.304 | 118.38 | 5.13 | 0.676 | 69.02 | 0.526 |

Comment

Table 2: Quantitative comparison with the state of the art on the RealLR200 dataset. The best and second best results are highlighted in bold and italic. Note that since the RealLR200 dataset lacks high-resolution images, we only computed non-reference metrics.

| Methods | NIQE ↓ | CLIPIQA ↑ | MUSIQ ↑ | MANIQA ↑ |
|---|---|---|---|---|
| BSRGAN | 4.38 | 0.570 | 64.87 | 0.369 |
| RealESRGAN | 4.20 | 0.542 | 62.93 | 0.366 |
| LDL | 4.38 | 0.509 | 60.95 | 0.327 |
| FeMaSR | 4.34 | 0.655 | 64.24 | 0.410 |
| StableSR-200 | 4.25 | 0.592 | 62.89 | 0.367 |
| ResShift-15 | 6.29 | 0.647 | 60.25 | 0.418 |
| PASD-20 | 4.18 | 0.620 | 66.35 | 0.419 |
| SeeSR-50 | 4.16 | 0.662 | 68.63 | 0.491 |
| SeeSR (UniPC-10) | 4.25 | 0.601 | 66.90 | 0.433 |
| SeeSR (DPMSolver-10) | 4.28 | 0.603 | 66.92 | 0.435 |
| SinSR-1 | 5.62 | 0.697 | 63.85 | 0.445 |
| AddSR-1 | 4.06 | 0.585 | 66.86 | 0.418 |
| OSEDiff-1 | 4.05 | 0.674 | 69.61 | 0.444 |
| TAD-SR-1 | 3.95 | 0.674 | 69.48 | 0.482 |

Table 3: Quantitative comparison with the state of the art on the DIV2K-val dataset. The best and second best results are highlighted in bold and italic.

| Methods | PSNR ↑ | LPIPS ↓ | FID ↓ | NIQE ↓ | CLIPIQA ↑ | MUSIQ ↑ | MANIQA ↑ |
|---|---|---|---|---|---|---|---|
| BSRGAN | 24.58 | 0.335 | 44.22 | 4.75 | 0.524 | 61.19 | 0.356 |
| RealESRGAN | 24.29 | 0.311 | 37.64 | 4.68 | 0.527 | 61.06 | 0.382 |
| LDL | 23.83 | 0.326 | 42.28 | 4.86 | 0.518 | 60.04 | 0.375 |
| FeMaSR | 23.06 | 0.346 | 53.70 | 4.74 | 0.599 | 60.82 | 0.346 |
| StableSR-200 | 23.29 | 0.312 | 24.54 | 4.75 | 0.676 | 65.83 | 0.422 |
| ResShift-15 | 24.72 | 0.34 | 41.99 | 6.47 | 0.594 | 60.89 | 0.399 |
| PASD-20 | 24.51 | 0.392 | 31.58 | 5.37 | 0.551 | 59.99 | 0.399 |
| SeeSR-50 | 23.68 | 0.319 | 25.97 | 4.81 | 0.693 | 68.68 | 0.504 |
| SeeSR (UniPC-10) | 24.07 | 0.339 | 27.33 | 5.00 | 0.607 | 64.97 | 0.432 |
| SeeSR (DPMSolver-10) | 24.12 | 0.338 | 27.32 | 5.03 | 0.612 | 65.07 | 0.435 |
| SinSR-1 | 24.41 | 0.324 | 35.23 | 6.01 | 0.648 | 62.80 | 0.424 |
| AddSR-1 | 23.26 | 0.362 | 29.68 | 4.76 | 0.573 | 63.69 | 0.405 |
| OSEDiff-1 | 23.72 | 0.294 | 26.33 | 4.71 | 0.661 | 67.96 | 0.443 |
| TAD-SR-1 | 23.54 | 0.311 | 25.96 | 4.64 | 0.664 | 67.01 | 0.470 |

Q5: The experiments lack comparisons with the most relevant distillation methods, including DMD, DEQ[1], DFOSD[2], etc. Among them, DMD, a new diffusion model, utilizes similar score distillation techniques to the proposed HSD. DEQ and DFOSD are both efficient and relevant diffusion models, which require one-step diffusion distillation or even no distillation.

A5: Thank you for your suggestion. We applied DMD to super-resolution tasks and compared it with our proposed method. From Table 4, it can be seen that while DMD achieves promising results when transferred to super-resolution tasks, it remains inferior to our approach. Regarding DEQ[1], its high training cost makes applying it to super-resolution tasks extremely challenging. As noted in its original paper, DEQ experiments were only conducted on the CIFAR-10 dataset due to these limitations. For DFOSD[2], we found that its code is not open source, and the training relied on a self-collected dataset that is not publicly available, making it difficult to perform a fair comparison with our method.

To further validate the effectiveness of our approach, we applied TAD-SR to unconditional generation tasks and compared it with DMD and DEQ on the CIFAR-10 dataset. The experimental results are presented in Table 5. The results demonstrate that our method performs well in unconditional generation tasks, surpassing both DMD and DEQ.

Comment

Thank you for your response. We will address your remaining concerns as follows.

Q1: The explanation that the proposed model relies heavily on the teacher model.

A1: First, we would like to clarify that during inference, only the student model performs single-step sampling to generate samples, while the teacher model supervises the student by generating samples through multi-step sampling during training.
Second, the knowledge distillation technique aims to transfer knowledge from the teacher model to the student model through training, meaning the student model's performance is inevitably influenced by the teacher. However, to prevent the student model's performance from being entirely constrained by the teacher, we have incorporated ground truth into the distillation framework through adversarial learning, providing additional supervision. Experimental results demonstrate that our method even outperforms the teacher model on certain non-reference metrics. Furthermore, in response to the reviewer's comments, we replaced the teacher model in our experiments. As shown in Tables 11, 12, and 13 of the paper, our method continues to generate high-quality images through single-step inference, clearly demonstrating its effectiveness.

Q2: The weighting function of HSD.

A2: Regarding the loss weight, we followed the approach used in DMD [1] and DDS [2], normalizing the loss across both spatial and channel dimensions (i.e., $\omega = 1/CS$). This normalization is commonly applied in the training of prior models, as it facilitates better model optimization. We have also provided results without the weighting function $\omega$ for comparison; the effectiveness of the weighting function is evident from Table 1.
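To make the $\omega = 1/CS$ weighting concrete, here is a minimal sketch of the normalization described above, assuming $(B, C, H, W)$ latent tensors (the function and variable names are ours, for illustration only):

```python
import torch

def normalized_score_loss(pred_stu: torch.Tensor, pred_tea: torch.Tensor) -> torch.Tensor:
    """Squared residual averaged over channel and spatial dims, i.e. weighted by 1/CS."""
    residual = (pred_stu - pred_tea) ** 2         # (B, C, H, W)
    per_sample = residual.flatten(1).mean(dim=1)  # sum / (C*H*W): the 1/CS weighting
    return per_sample.mean()                      # average over the batch
```

Averaging rather than summing keeps the gradient scale independent of the latent resolution, which matches the optimization benefit described above.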

Q3: Complexity comparison.

A3: Finally, we would like to emphasize that both Table 2 in the initial manuscript and Table 6 in the revised manuscript compare the sampling steps, inference time, and parameter count of our method and others. Additionally, in response to the reviewer's comments, we have included a comparison of FLOPs, with the results shown in Tables 2 and 3. Table 2 focuses on comparisons with diffusion-based super-resolution methods trained from scratch. Table 3 highlights a comparison of computational complexity with SD-based super-resolution methods.

Table 1: Ablation studies of the weighting function of HSD on RealSR and RealSet65 benchmarks. The best results are highlighted in bold.

| Settings | RealSet65 CLIPIQA ↑ | RealSet65 MUSIQ ↑ | RealSR CLIPIQA ↑ | RealSR MUSIQ ↑ |
|---|---|---|---|---|
| w/o weighting function | 0.723 | 66.242 | 0.731 | 64.425 |
| Ours | 0.734 | 67.500 | 0.741 | 65.701 |

Table 2: Complexity comparison among different SR methods. All methods are tested on the ×4 (64→256) SR task, and inference time is measured on an A100 GPU.

| Method | LDM | ResShift (teacher) | SinSR | DMD | TAD-SR |
|---|---|---|---|---|---|
| NFE | 15 | 15 | 1 | 1 | 1 |
| #Parameters (M) | 168.92 | 173.91 | 173.91 | 173.91 | 173.91 |
| Inference time (s) | 0.408 | 0.682 | 0.058 | 0.058 | 0.058 |
| FLOPs (G) | 1208.7 | 1506.75 | 100.45 | 100.45 | 100.45 |

Table 3: Complexity comparison among different SD-based SR methods. All methods are tested on the ×4 (128→512) SR task, and inference time is measured on a V100 GPU.

| Method | StableSR | PASD | SeeSR (teacher) | AddSR | OSEDiff | TAD-SR |
|---|---|---|---|---|---|---|
| NFE | 200 | 20 | 50 | 1 | 1 | 1 |
| #Parameters (M) | 1002.95 | 1333.53 | 1703.05 | 1703.05 | 1378.39 | 1703.05 |
| Inference time (s) | 17.76 | 13.51 | 8.4 | 0.64 | 0.48 | 0.64 |
| FLOPs (G) | 157294 | 28675.27 | 11488 | 488.76 | 7995.58 | 488.76 |

[1] Yin, T., Gharbi, M., Zhang, R., Shechtman, E., Durand, F., Freeman, W. T., & Park, T. (2024). One-step diffusion with distribution matching distillation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 6613-6623).

[2] Hertz, A., Aberman, K., & Cohen-Or, D. (2023). Delta denoising score. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 2328-2337).

Comment

Thank you for your response. I have no further questions and am willing to increase my score.

Comment

Thank you for carefully reviewing the discussion and deciding to increase your score. We are pleased to revise the manuscript based on your suggestions, which have made it more robust and easier to understand.

Review
Rating: 5

This paper introduces TAD-SR, a time-aware diffusion distillation method designed to enhance the efficiency and performance of diffusion-based image super-resolution (SR) models. By aligning the student and teacher models with the proposed score distillation strategy and incorporating a time-aware discriminator to distinguish real and synthetic data across varying noise levels, TAD-SR achieves strong performance across several metrics.

Strengths

  1. The topic is interesting and meaningful.
  2. Extensive experiments demonstrate that TAD-SR achieves results comparable to or exceeding those of multi-step diffusion models, especially on some non-reference IQA metrics.

Weaknesses

  1. The organization of the paper needs improvement, as it is challenging to clearly understand the core idea. For instance, Fig. 2, which aims to illustrate the paper's motivation, has a caption that provides limited information.

  2. The paper lacks essential metrics, such as PSNR and SSIM, to evaluate model fidelity. As shown in previous works, there is a trade-off between PSNR, SSIM, and CLIPIQA, MUSIQ. Reporting only LPIPS and non-reference IQA metrics is insufficient to demonstrate performance. Both the main results and ablation studies should include these metrics.

  3. Although I understand that StableDiffusionXL also employs adversarial loss, it appears less elegant to me due to the inherent limitations of GANs.

  4. In addition to the difficulty of assessing performance without PSNR and SSIM, the reported improvements seem marginal compared to existing methods.

Questions

The motivation is not clear. If the proposed method aims to achieve one-step SR, why is it important for the student model to learn how to deal with the intermediate steps?

Will increasing the inference steps contribute to improving the performance?

Comment

Q5: The motivation is not clear. If the proposed method aims to achieve one-step SR, why is it important for the student model to learn how to deal with the intermediate steps?

A5: We apologize that our description was not clear enough and caused a misunderstanding; we will enhance the readability of the paper in the revised PDF. To clarify, our student model accepts a fixed time step $T$ to generate clean samples in a single step. The intermediate time steps we sample are used solely to calculate the loss. Specifically, we leverage the pre-trained diffusion model's ability to handle intermediate time steps to constrain the single-step output of the student model. Diffusion models typically predict low-frequency information in the early stages of denoising and high-frequency information in the later stages. Therefore, we add varying levels of noise to both the clean samples generated by the student model and those generated by the teacher model, then feed them into a pre-trained diffusion model for prediction. By calculating the distance between the two predicted values, we can constrain the samples generated by the student model to match the high-frequency or low-frequency information in the teacher model's samples.
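The following is a minimal sketch of the training-time mechanism described above, assuming a standard DDPM-style forward process (ResShift actually modifies this chain); all names (`student`, `frozen_diffusion`, `alphas_cumprod`, etc.) are illustrative placeholders, not the authors' code:

```python
import torch
import torch.nn.functional as F

def add_noise(z0, noise, t, alphas_cumprod):
    # DDPM-style forward diffusion q(z_t | z_0); an assumption for illustration.
    abar = alphas_cumprod[t].view(-1, 1, 1, 1)
    return abar.sqrt() * z0 + (1 - abar).sqrt() * noise

def distill_loss(student, frozen_diffusion, z_T, z0_tea, T, alphas_cumprod, t_max):
    z0_stu = student(z_T, T)                      # one-step prediction at fixed T

    # Sample a *small* intermediate time step so the frozen model is queried
    # in its late, high-frequency denoising regime.
    t = torch.randint(1, t_max, (z_T.shape[0],), device=z_T.device)

    noise = torch.randn_like(z0_stu)              # same noise for both branches
    pred_stu = frozen_diffusion(add_noise(z0_stu, noise, t, alphas_cumprod), t)
    with torch.no_grad():
        pred_tea = frozen_diffusion(add_noise(z0_tea, noise, t, alphas_cumprod), t)

    # The distance between the two predictions constrains the student's
    # single-step output; gradients reach the student only through z0_stu.
    return F.mse_loss(pred_stu, pred_tea)
```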

Q6: Will increasing the inference steps contribute to improving the performance?

A6: This is really a good question! Normally, if only a single time step is sampled to train the student model, simply increasing the number of iterations during inference will not lead to any performance improvement. This is because the model has only learned the mapping from noisy data to clean data at that specific time step and lacks the ability to process noisy data at other intermediate time steps. This limitation is common to all single-step distillation methods. Thus, developing a distillation method that matches the performance of state-of-the-art single-step approaches while enabling additional inference steps to enhance performance is a key area of our ongoing research. We will include a discussion on this aspect in the revised PDF.

References

[1] Wang, J., Yue, Z., Zhou, S., Chan, K. C., & Loy, C. C. (2024). Exploiting diffusion prior for real-world image super-resolution. International Journal of Computer Vision, 1-21.

[2] Xie, R., Tai, Y., Zhao, C., Zhang, K., Zhang, Z., Zhou, J., ... & Yang, J. (2024). Addsr: Accelerating diffusion-based blind super-resolution with adversarial diffusion distillation. arXiv preprint arXiv:2404.01717.

[3] Wu, R., Yang, T., Sun, L., Zhang, Z., Li, S., & Zhang, L. (2024). Seesr: Towards semantics-aware real-world image super-resolution. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 25456-25467).

[4] Sauer, A., Lorenz, D., Blattmann, A., & Rombach, R. (2025). Adversarial diffusion distillation. In European Conference on Computer Vision (pp. 87-103). Springer, Cham.

[5] Xu, Y., Zhao, Y., Xiao, Z., & Hou, T. (2024). Ufogen: You forward once large scale text-to-image generation via diffusion gans. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 8196-8206).

Comment

Dear Reviewer B832,

We sincerely appreciate your response, but it seems that you have replied in the wrong place. We will continue to address your concerns here.

Our method demonstrates significant improvements over other methods in most metrics for real-world image super-resolution and blind face restoration tasks, particularly when compared to SinSR, a single-step SR technique. Additionally, we replaced the teacher model (ResShift) with an SD-based SR model (SeeSR) and conducted extensive experiments. The experimental results are presented in Tables 11, 12, and 13 of the manuscript. Our method achieved performance comparable to the teacher model and outperformed other comparison methods in most metrics, effectively validating the effectiveness of our approach. Furthermore, it is noteworthy that previous methods were also unable to consistently outperform comparative methods across all indicators and scenarios, which is a highly challenging task.

Comment

Table 2: Quantitative results of different methods on the dataset of CelebA-Test. The best and second best results are highlighted in bold and italic. ∗ indicates that the result was obtained by replicating the method in the paper.

| Methods | PSNR ↑ | SSIM ↑ | LPIPS ↓ | IDS ↓ | LMD ↓ | FID-F ↓ | FID-G ↓ | CLIPIQA ↑ | MUSIQ ↑ |
|---|---|---|---|---|---|---|---|---|---|
| DFDNET | 10.833 | 0.449 | 0.739 | 86.323 | 20.784 | 93.621 | 76.118 | 0.619 | 51.173 |
| PSFRGAN | 19.662 | 0.582 | 0.475 | 74.025 | 10.168 | 63.676 | 60.748 | 0.630 | 69.910 |
| GFPGANv1.2 | 19.558 | 0.605 | 0.416 | 66.820 | 8.886 | 66.308 | 27.698 | 0.671 | 75.388 |
| RestoreFormer | 19.604 | 0.551 | 0.488 | 70.518 | 11.137 | 50.165 | 51.997 | 0.736 | 71.039 |
| VQFR | 19.979 | 0.622 | 0.411 | 65.538 | 8.910 | 58.423 | 25.234 | 0.685 | 73.155 |
| CodeFormer | 23.576 | 0.661 | 0.324 | 59.136 | 5.035 | 62.794 | 26.160 | 0.698 | 75.900 |
| DiffFace-100 | 24.033 | 0.705 | 0.338 | 63.033 | 5.301 | 52.531 | 23.212 | 0.527 | 66.042 |
| ResShift-15 | 23.413 | 0.671 | 0.309 | 59.623 | 5.056 | 50.164 | 17.564 | 0.613 | 73.214 |
| SinSR*-1 | 22.317 | 0.640 | 0.319 | 60.305 | 4.935 | 55.292 | 21.681 | 0.634 | 74.140 |
| TAD-SR-1 | 22.614 | 0.629 | 0.341 | 59.897 | 5.050 | 41.968 | 16.779 | 0.735 | 75.027 |

Table 3: Ablation studies of the proposed methods on ImageNet-Test benchmarks. The best results are highlighted in bold.

| Score distillation | Discriminator | PSNR ↑ | SSIM ↑ | LPIPS ↓ | CLIPIQA ↑ | MUSIQ ↑ |
|---|---|---|---|---|---|---|
| SDS | – | 24.46 | 0.658 | 0.335 | 0.412 | 41.133 |
| SDS | ✓ | 24.76 | 0.670 | 0.300 | 0.469 | 46.024 |
| SDS | time-aware | 24.69 | 0.671 | 0.278 | 0.522 | 49.932 |
| HSD | – | 24.64 | 0.661 | 0.228 | 0.608 | 53.508 |
| HSD | ✓ | 23.89 | 0.640 | 0.227 | 0.649 | 57.370 |
| HSD | time-aware | 23.91 | 0.641 | 0.227 | 0.652 | 57.533 |

Q3: Although I understand that StableDiffusionXL also employs adversarial loss, it appears less elegant to me due to the inherent limitations of GANs.

A3: Recently, many diffusion-based methods [4][5] have begun integrating adversarial learning into the training process. Experimental results demonstrate that this approach can significantly enhance model performance, underscoring its potential value.

Q4: In addition to the difficulty of assessing performance without PSNR and SSIM, the reported improvements seem marginal compared to existing methods.

A4: In addition to PSNR and SSIM, our method demonstrates significant improvements over SinSR in other metrics. The table below lists the percentage improvements achieved by our method compared to SinSR.

Table 4: Quantitative comparison with the SinSR method on super-resolution tasks.

| Method | ImageNet-Test LPIPS ↓ | ImageNet-Test CLIPIQA ↑ | ImageNet-Test MUSIQ ↑ | RealSR CLIPIQA ↑ | RealSR MUSIQ ↑ | RealSet65 CLIPIQA ↑ | RealSet65 MUSIQ ↑ |
|---|---|---|---|---|---|---|---|
| SinSR* | 0.231 | 0.599 | 52.462 | 0.691 | 60.865 | 0.712 | 62.575 |
| TAD-SR | 0.227 (+1.7%) | 0.652 (+8.8%) | 57.533 (+9.7%) | 0.741 (+7.2%) | 65.701 (+7.9%) | 0.734 (+3%) | 67.5 (+7.9%) |

Comment

Thank you for your comments and feedback. We address your concerns here.

Q1: The organization of the paper needs improvement, as it is challenging to clearly understand the core idea. For instance, Fig. 2, which aims to illustrate the paper's motivation, has a caption that provides limited information.

A1: Thank you for your suggestion. We will carefully describe the details of this method in the revised manuscript to improve the readability and clarity of the paper.

Q2: The paper lacks essential metrics, such as PSNR and SSIM, to evaluate model fidelity. As shown in previous works, there is a trade-off between PSNR, SSIM, and CLIPIQA, MUSIQ. Reporting only LPIPS and non-reference IQA metrics is insufficient to demonstrate performance. Both the main results and ablation studies should include these metrics.

A2: Thank you for your suggestion. We have included PSNR and SSIM metrics in both our main experiments and ablation studies, as shown in Tables 1, 2, and 3. However, our experimental results, along with findings from previous studies, indicate that PSNR and SSIM do not always align with human perception or other indicators such as LPIPS, CLIPIQA, and MUSIQ. Specifically, when image quality improves and these perceptual indicators yield higher values, PSNR and SSIM often decrease; conversely, an increase in PSNR and SSIM typically corresponds to smoother and blurrier images. For instance, while methods such as LDM, ResShift, and DASR achieve higher PSNR and SSIM scores than others, the images they generate tend to appear smoother or blurrier (as shown in Figures 6 and 12). We infer that this discrepancy likely arises because PSNR and SSIM measure image differences in pixel space, whereas human perception and other metrics evaluate images based on perceptual quality. Therefore, we regard PSNR and SSIM as reference metrics rather than primary evaluation metrics in real-world super-resolution tasks, consistent with the conclusions of prior work [1][2][3].
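As a concrete illustration of the pixel-space nature of PSNR discussed above (a generic textbook definition, not tied to the authors' evaluation code):

```python
import numpy as np

def psnr(x: np.ndarray, y: np.ndarray, max_val: float = 255.0) -> float:
    # PSNR depends only on the pixel-wise MSE, so an over-smoothed output that
    # stays close to the ground truth on average can still score well even
    # when its perceptual quality is poor.
    mse = np.mean((x.astype(np.float64) - y.astype(np.float64)) ** 2)
    return float("inf") if mse == 0 else 10.0 * np.log10(max_val**2 / mse)
```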

Table 1: Quantitative results of different methods on the dataset of ImageNet-Test. The best and second best results are highlighted in bold and italic. ∗ indicates that the result was obtained by replicating the method in the paper.

| Methods | PSNR ↑ | SSIM ↑ | LPIPS ↓ | CLIPIQA ↑ | MUSIQ ↑ |
|---|---|---|---|---|---|
| ESRGAN | 20.67 | 0.448 | 0.485 | 0.451 | 43.615 |
| RealSR-JPEG | 23.11 | 0.591 | 0.326 | 0.537 | 46.981 |
| BSRGAN | 24.42 | 0.659 | 0.259 | 0.581 | 54.697 |
| SwinIR | 23.99 | 0.667 | 0.238 | 0.564 | 53.790 |
| RealESRGAN | 24.04 | 0.665 | 0.254 | 0.523 | 52.538 |
| DASR | 24.75 | 0.675 | 0.250 | 0.536 | 48.337 |
| LDM-15 | 24.89 | 0.670 | 0.269 | 0.512 | 46.419 |
| ResShift-15 | 25.01 | 0.677 | 0.231 | 0.592 | 53.660 |
| SinSR-1 | 24.56 | 0.657 | 0.221 | 0.611 | 53.357 |
| SinSR*-1 | 24.59 | 0.659 | 0.231 | 0.599 | 52.462 |
| DMD*-1 | 24.05 | 0.629 | 0.246 | 0.612 | 54.124 |
| TAD-SR-1 | 23.91 | 0.641 | 0.227 | 0.652 | 57.533 |
Review
Rating: 5

This paper proposes a time-aware diffusion distillation method, TAD-SR, to achieve one-step SR inference with competitive performance. It applies a score distillation strategy that works to eliminate the inherent bias of SDS and to focus more on high-frequency image details by sampling at small time steps. A time-aware discriminator is also designed to differentiate between real and synthetic data.

Strengths

  1. This paper proposes a time-aware distillation method that accelerates diffusion-based SR models into a single inference step.
  2. The writing of this paper is good.

Weaknesses

See the questions.

Questions

  1. Since this is a distillation method, please compare more diffusion-based distillation SR methods, like OSEDiff [1], quantitatively and qualitatively. (Why are the comparison with diffusion-based distillation SR methods missing in some tables and figures?)

  2. Since you claim that TAD-SR can achieve better reconstruction of high-frequency information, please present the spectrum images of the LR input, GT, baseline methods’ reconstruction, and TAD-SR’s reconstruction. Examine the differences in the high-frequency patterns around the periphery of the spectrum images.

  3. Please compare the inference time of TAD-SR and baseline methods.

  4. In Fig. 10 and Fig. 12, TAD-SR’s results appear to contain many fragmented particles, which make the images look sharper at first glance; however, this is actually due to the addition of pseudo-textures or unnatural details. Could you explain the cause of this? For instance, could it be due to the adversarial loss?

  5. Following the concern raised in my 4th question, could you please provide more qualitative comparisons that contain fine details or small textures?

[1] Rongyuan Wu, et al. One-Step Effective Diffusion Network for Real-World Image Super-Resolution.

(I apologize for my previous review comments, which were not fully aligned with your article due to a heavy review workload. I am providing corrected feedback here, and if your response addresses these points well, I will consider adjusting the score.)

Comment

Q3: Please compare the inference time of TAD-SR and baseline methods.

A3: Based on the reviewers' feedback, we have included a complexity comparison between TAD-SR and baseline methods, as presented in Tables 4 and 5. Table 4 focuses on comparisons with GAN-based methods and diffusion-based super-resolution methods trained from scratch. The results demonstrate that TAD-SR accelerates the teacher model, ResShift, to a single inference step, improving its speed by approximately tenfold. Table 5 highlights a comparison of inference time with SD-based super-resolution methods, revealing that our method's inference latency is only 7.6% of that of the teacher model, SeeSR.

Table 4: Complexity comparison among different SR methods. All methods are tested on the ×4 (64→256) SR task, and inference time is measured on an A100 GPU.

| Method | ESRGAN | RealSR-JPEG | BSRGAN | SwinIR | RealESRGAN | DASR | LDM | ResShift | SinSR | TAD-SR |
|---|---|---|---|---|---|---|---|---|---|---|
| NFE | 1 | 1 | 1 | 1 | 1 | 1 | 15 | 15 | 1 | 1 |
| Inference time (s) | 0.038 | 0.038 | 0.038 | 0.107 | 0.038 | 0.022 | 0.408 | 0.682 | 0.058 | 0.058 |

Table 5: Complexity comparison among different SD-based SR methods. All methods are tested on the ×4 (128→512) SR task, and inference time is measured on a V100 GPU.

| Method | StableSR | PASD | SeeSR | SeeSR+UniPC | SeeSR+DPMSolver | AddSR | OSEDiff | TAD-SR |
|---|---|---|---|---|---|---|---|---|
| NFE | 200 | 20 | 50 | 10 | 10 | 1 | 1 | 1 |
| Inference time (s) | 17.76 | 13.51 | 8.4 | 2.14 | 2.13 | 0.64 | 0.48 | 0.64 |
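For reference, single-image GPU latency of the kind reported above is commonly measured with warm-up iterations and explicit CUDA synchronization; a generic sketch (the `model` call stands in for any of the compared pipelines and is not the authors' benchmarking script):

```python
import time
import torch

@torch.no_grad()
def measure_latency(model, lr_image, warmup=5, runs=20):
    for _ in range(warmup):          # warm-up excludes CUDA init and cache effects
        model(lr_image)
    torch.cuda.synchronize()
    start = time.perf_counter()
    for _ in range(runs):
        model(lr_image)
    torch.cuda.synchronize()         # wait for all queued kernels to finish
    return (time.perf_counter() - start) / runs
```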

Q4: In Fig. 10 and Fig. 12, TAD-SR’s results appear to contain many fragmented particles, which make the images look sharper at first glance; however, this is actually due to the addition of pseudo-textures or unnatural details. Could you explain the cause of this? For instance, could it be due to the adversarial loss?

A4: Upon careful examination of the images generated by our method and other super-resolution approaches, we observed that various methods may produce pseudo-textures in certain images to differing extents. We found that some of the unnatural textures generated by our method exhibit the same pattern as those produced by the teacher model, which we speculate may stem from inherent properties of the diffusion model itself. Additionally, on real-world datasets, this phenomenon is likely attributable to the mismatch between the degradations seen during training and testing: degradation during training is artificially synthesized and exhibits certain statistical regularities, whereas real-world degradation is more complex and diverse, which can lead to the generation of pseudo-textures.

Q5: Following the concern raised in my 4th question, could you please provide more qualitative comparisons that contain fine details or small textures?

A5: Sure, we provide more qualitative comparisons that contain fine details in Figure 9 and Figure 15 of the revised PDF.

Comment

Dear Reviewer uBAa:

The discussion period between the authors and the reviewer is nearing its end, and we kindly request that you review our clarifications and revisions. If our response addresses your concerns, we hope you can reconsider your score.

Thank you once again for your time and consideration.

Best Wishes!

Authors of Submission 1713

Comment

Table 3: Quantitative comparison with the state of the art on the DIV2K-val dataset. The best and second best results are highlighted in bold and italic.

| Methods | PSNR ↑ | LPIPS ↓ | FID ↓ | NIQE ↓ | CLIPIQA ↑ | MUSIQ ↑ | MANIQA ↑ |
|---|---|---|---|---|---|---|---|
| BSRGAN | 24.58 | 0.335 | 44.22 | 4.75 | 0.524 | 61.19 | 0.356 |
| RealESRGAN | 24.29 | 0.311 | 37.64 | 4.68 | 0.527 | 61.06 | 0.382 |
| LDL | 23.83 | 0.326 | 42.28 | 4.86 | 0.518 | 60.04 | 0.375 |
| FeMaSR | 23.06 | 0.346 | 53.70 | 4.74 | 0.599 | 60.82 | 0.346 |
| StableSR-200 | 23.29 | 0.312 | 24.54 | 4.75 | 0.676 | 65.83 | 0.422 |
| ResShift-15 | 24.72 | 0.34 | 41.99 | 6.47 | 0.594 | 60.89 | 0.399 |
| PASD-20 | 24.51 | 0.392 | 31.58 | 5.37 | 0.551 | 59.99 | 0.399 |
| SeeSR-50 | 23.68 | 0.319 | 25.97 | 4.81 | 0.693 | 68.68 | 0.504 |
| SeeSR (UniPC-10) | 24.07 | 0.339 | 27.33 | 5.00 | 0.607 | 64.97 | 0.432 |
| SeeSR (DPMSolver-10) | 24.12 | 0.338 | 27.32 | 5.03 | 0.612 | 65.07 | 0.435 |
| SinSR-1 | 24.41 | 0.324 | 35.23 | 6.01 | 0.648 | 62.80 | 0.424 |
| AddSR-1 | 23.26 | 0.362 | 29.68 | 4.76 | 0.573 | 63.69 | 0.405 |
| OSEDiff-1 | 23.72 | 0.294 | 26.33 | 4.71 | 0.661 | 67.96 | 0.443 |
| TAD-SR-1 | 23.54 | 0.311 | 25.96 | 4.64 | 0.664 | 67.01 | 0.470 |

Q2: Since you claim that TAD-SR can achieve better reconstruction of high-frequency information, please present the spectrum images of the LR input, GT, baseline methods’ reconstruction, and TAD-SR’s reconstruction. Examine the differences in the high-frequency patterns around the periphery of the spectrum images.

A2: Thank you for your valuable suggestion. In Figure 10 of the appendix, we present the Fourier transform spectra of low-resolution (LR) images, ground truth (GT) images, and reconstructions from different super-resolution (SR) methods. From these spectra, it is evident that our method preserves more high-frequency information compared to other diffusion-based SR methods.
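For readers who want to reproduce this kind of analysis, a minimal sketch of a centered log-magnitude Fourier spectrum (generic NumPy, not the authors' plotting code; energy far from the center corresponds to high frequencies):

```python
import numpy as np

def log_spectrum(img: np.ndarray) -> np.ndarray:
    # img: 2D array (H, W), e.g. the luminance channel of an SR result.
    freq = np.fft.fftshift(np.fft.fft2(img))  # shift the DC component to the center
    return np.log1p(np.abs(freq))             # log scale makes the periphery visible
```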

Comment

Thank you for providing valuable feedback on our paper despite your busy schedule. We address your concerns here.

Q1: Since this is a distillation method, please compare more diffusion-based distillation SR methods, like OSEDiff [1], quantitatively and qualitatively. (Why are the comparison with diffusion-based distillation SR methods missing in some tables and figures?)

A1: Thank you for pointing out this issue. We have compared our method with OSEDiff, with quantitative results presented in Tables 1, 2, and 3. In the main text, we primarily use the super-resolution model ResShift trained from scratch as the teacher, enabling a fair comparison with SinSR, which also distills ResShift. In the appendix, we employ the SD-based SR method SeeSR as the teacher and mainly compare our approach with other SD-based SR methods and SD-based distillation SR methods. Due to substantial differences in the datasets used to train ResShift and SeeSR, comparisons involving SD-based SR methods are omitted from certain charts in the main text.

Table 1: Quantitative comparison with the state of the art on the RealSR dataset. Following the experimental setup of SeeSR, the LR images in the RealSR dataset were center-cropped to 128 × 128. The best and second best results are highlighted in bold and italic.

| Methods | PSNR ↑ | LPIPS ↓ | FID ↓ | NIQE ↓ | CLIPIQA ↑ | MUSIQ ↑ | MANIQA ↑ |
|---|---|---|---|---|---|---|---|
| BSRGAN | 26.49 | 0.267 | 141.28 | 5.66 | 0.512 | 63.28 | 0.376 |
| RealESRGAN | 25.78 | 0.273 | 135.18 | 5.83 | 0.449 | 60.36 | 0.373 |
| LDL | 25.09 | 0.277 | 142.71 | 6.00 | 0.430 | 58.04 | 0.342 |
| FeMaSR | 25.17 | 0.294 | 141.05 | 5.79 | 0.541 | 59.06 | 0.361 |
| StableSR-200 | 25.63 | 0.302 | 133.40 | 5.76 | 0.528 | 61.11 | 0.366 |
| ResShift-15 | 26.34 | 0.346 | 149.54 | 6.87 | 0.542 | 56.06 | 0.375 |
| PASD-20 | 26.67 | 0.344 | 122.30 | 6.06 | 0.519 | 62.92 | 0.404 |
| SeeSR-50 | 25.24 | 0.301 | 125.42 | 5.39 | 0.670 | 69.82 | 0.540 |
| SeeSR (UniPC-10) | 25.86 | 0.281 | 122.41 | 5.53 | 0.577 | 67.12 | 0.476 |
| SeeSR (DPMSolver-10) | 25.90 | 0.281 | 122.46 | 5.54 | 0.581 | 67.12 | 0.478 |
| SinSR-1 | 26.16 | 0.308 | 142.44 | 5.75 | 0.630 | 60.96 | 0.399 |
| AddSR-1 | 23.12 | 0.309 | 132.01 | 5.54 | 0.552 | 67.14 | 0.488 |
| OSEDiff-1 | 25.15 | 0.292 | 123.49 | 5.63 | 0.668 | 68.99 | 0.474 |
| TAD-SR-1 | 24.50 | 0.304 | 118.38 | 5.13 | 0.676 | 69.02 | 0.526 |

Table 2: Quantitative comparison with the state of the art on the RealLR200 dataset. The best and second best results are highlighted in bold and italic. Note that since the RealLR200 dataset lacks high-resolution images, we only computed non-reference metrics.

| Methods | NIQE ↓ | CLIPIQA ↑ | MUSIQ ↑ | MANIQA ↑ |
|---|---|---|---|---|
| BSRGAN | 4.38 | 0.570 | 64.87 | 0.369 |
| RealESRGAN | 4.20 | 0.542 | 62.93 | 0.366 |
| LDL | 4.38 | 0.509 | 60.95 | 0.327 |
| FeMaSR | 4.34 | 0.655 | 64.24 | 0.410 |
| StableSR-200 | 4.25 | 0.592 | 62.89 | 0.367 |
| ResShift-15 | 6.29 | 0.647 | 60.25 | 0.418 |
| PASD-20 | 4.18 | 0.620 | 66.35 | 0.419 |
| SeeSR-50 | 4.16 | 0.662 | 68.63 | 0.491 |
| SeeSR (UniPC-10) | 4.25 | 0.601 | 66.90 | 0.433 |
| SeeSR (DPMSolver-10) | 4.28 | 0.603 | 66.92 | 0.435 |
| SinSR-1 | 5.62 | 0.697 | 63.85 | 0.445 |
| AddSR-1 | 4.06 | 0.585 | 66.86 | 0.418 |
| OSEDiff-1 | 4.05 | 0.674 | 69.61 | 0.444 |
| TAD-SR-1 | 3.95 | 0.674 | 69.48 | 0.482 |
Review
Rating: 6

This paper introduces a time-aware diffusion distillation method named TAD-SR, which enables the student model to focus on high-frequency image details at smaller time steps and eliminates inherent biases in score distillation sampling. The authors also design a time-aware discriminator that fully leverages the teacher model’s knowledge by injecting time information to differentiate between real and synthetic data. Experimental results demonstrate the effectiveness and efficiency of the proposed method.

Strengths

  • The paper is well-written.
  • Experimental results demonstrate that the proposed method achieves state-of-the-art performance with high efficiency.

Weaknesses

  • The evaluation is not comprehensive. Some image fidelity metrics are lacking, such as PSNR and SSIM on ImageNet-Test, which the competing methods ResShift and SinSR both report.

  • The improvement over the previous single-step distillation method SinSR is minor. Considering that LPIPS is a crucial metric for perceptual quality, the increase from 0.221 to 0.227 represents a notable drop in quality, not a slight one.

  • The ablation study examines only the presence or absence of the discriminator, neglecting other important aspects—for example, the number of scales used in the discriminator.

Questions

Please refer to the weakness part.

Comment

Thank you for your comments and feedback. We address your concerns here.

Q1: The evaluation is not comprehensive. Some image fidelity metrics are lacking, such as PSNR and SSIM on ImageNet-Test, which the competing methods ResShift and SinSR both report.

A1: Thank you for your suggestion. We have incorporated PSNR and SSIM metrics into the evaluation on the ImageNet dataset. However, we want to emphasize that these two metrics are secondary in real-world super-resolution tasks [1][2][3]. For instance, while methods such as LDM, ResShift, and DASR achieve higher PSNR and SSIM scores than others, the images they generate tend to appear smoother or blurrier (as shown in Figures 6 and 12). This discrepancy likely arises because PSNR and SSIM measure image differences in pixel space, whereas humans and other metrics evaluate images based on perceptual quality. Therefore, PSNR and SSIM should be considered reference points only, which aligns with observations in previous studies [1][2][3].

Table 1: Quantitative results of different methods on the dataset of ImageNet-Test. The best and second best results are highlighted in bold and italic. ∗ indicates that the result was obtained by replicating the method in the paper.

| Methods | PSNR ↑ | SSIM ↑ | LPIPS ↓ | CLIPIQA ↑ | MUSIQ ↑ |
|---|---|---|---|---|---|
| ESRGAN | 20.67 | 0.448 | 0.485 | 0.451 | 43.615 |
| RealSR-JPEG | 23.11 | 0.591 | 0.326 | 0.537 | 46.981 |
| BSRGAN | 24.42 | 0.659 | 0.259 | 0.581 | 54.697 |
| SwinIR | 23.99 | 0.667 | 0.238 | 0.564 | 53.790 |
| RealESRGAN | 24.04 | 0.665 | 0.254 | 0.523 | 52.538 |
| DASR | 24.75 | 0.675 | 0.250 | 0.536 | 48.337 |
| LDM-15 | 24.89 | 0.670 | 0.269 | 0.512 | 46.419 |
| ResShift-15 | 25.01 | 0.677 | 0.231 | 0.592 | 53.660 |
| SinSR-1 | 24.56 | 0.657 | 0.221 | 0.611 | 53.357 |
| SinSR*-1 | 24.59 | 0.659 | 0.231 | 0.599 | 52.462 |
| DMD*-1 | 24.05 | 0.629 | 0.246 | 0.612 | 54.124 |
| TAD-SR-1 | 23.91 | 0.641 | 0.227 | 0.652 | 57.533 |

Q2: The improvement over the previous single-step distillation method SinSR is minor. Considering that LPIPS is a crucial metric for perceptual quality, the increase from 0.221 to 0.227 represents a notable drop in quality, not a slight one.

A2: Thank you for your feedback. We replicated SinSR using its open-source code, and the evaluation results are shown in the third-to-last row of Table 1. Compared to the replicated SinSR, our method improves the LPIPS metric, decreasing it from 0.231 to 0.227. Additionally, our method demonstrates significant improvements over SinSR in most other metrics. The table below lists the percentage improvements achieved by our method compared to SinSR.

Table 2: Quantitative comparison with the SinSR method on super-resolution tasks.

| Method | ImageNet-Test LPIPS ↓ | ImageNet-Test CLIPIQA ↑ | ImageNet-Test MUSIQ ↑ | RealSR CLIPIQA ↑ | RealSR MUSIQ ↑ | RealSet65 CLIPIQA ↑ | RealSet65 MUSIQ ↑ |
|---|---|---|---|---|---|---|---|
| SinSR* | 0.231 | 0.599 | 52.462 | 0.691 | 60.865 | 0.712 | 62.575 |
| TAD-SR | 0.227 (+1.7%) | 0.652 (+8.8%) | 57.533 (+9.7%) | 0.741 (+7.2%) | 65.701 (+7.9%) | 0.734 (+3%) | 67.5 (+7.9%) |

Comment

Q3: The ablation study examines only the presence or absence of the discriminator, neglecting other important aspects—for example, the number of scales used in the discriminator.

A3: Thank you for your valuable suggestion. We also conducted ablation experiments to evaluate the impact of using multi-scale features in the discriminator. We designed an experiment using only the features of the last layer of the diffusion model for discrimination, denoted as "w/o multi-scale". Now, our analysis of the discriminator includes comparisons with and without the discriminator, the incorporation of temporal information, and the use of multi-scale features. From Table 3, it can be seen that the discriminator utilizing multi-scale features and incorporating temporal information achieves the best performance.

Table 3: Ablation studies of our proposed discriminator on RealSR and RealSet65 benchmarks. The best results are highlighted in bold.

| Settings | RealSet65 CLIPIQA ↑ | RealSet65 MUSIQ ↑ | RealSR CLIPIQA ↑ | RealSR MUSIQ ↑ |
|---|---|---|---|---|
| Our discriminator | 0.734 | 67.500 | 0.741 | 65.701 |
| w/o time-aware | 0.729 | 66.904 | 0.711 | 63.550 |
| w/o multi-scale | 0.724 | 67.330 | 0.722 | 65.205 |
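To make the two ablated components concrete, here is a hedged sketch of a discriminator head that pools features from several backbone depths and injects a time embedding; the architecture and all names are our illustration of the idea, not the authors' implementation:

```python
import torch
import torch.nn as nn

class TimeAwareMultiScaleHead(nn.Module):
    def __init__(self, feat_dims=(128, 256, 512), t_dim=128):
        super().__init__()
        self.t_embed = nn.Sequential(nn.Linear(1, t_dim), nn.SiLU())
        # One linear real/fake head per feature scale ("multi-scale").
        self.heads = nn.ModuleList(nn.Linear(d + t_dim, 1) for d in feat_dims)

    def forward(self, feats, t):
        # feats: list of (B, C_i, H_i, W_i) features from different depths; t: (B,)
        te = self.t_embed(t.float().unsqueeze(-1))   # "time-aware" conditioning
        logits = [
            head(torch.cat([f.mean(dim=(2, 3)), te], dim=-1))
            for f, head in zip(feats, self.heads)
        ]
        return torch.stack(logits).mean(dim=0)       # average over scales
```

Dropping the `t` input corresponds to the "w/o time-aware" row, and keeping only the last feature scale corresponds to "w/o multi-scale".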

References

[1] Wang, J., Yue, Z., Zhou, S., Chan, K. C., & Loy, C. C. (2024). Exploiting diffusion prior for real-world image super-resolution. International Journal of Computer Vision, 1-21.

[2] Xie, R., Tai, Y., Zhao, C., Zhang, K., Zhang, Z., Zhou, J., ... & Yang, J. (2024). Addsr: Accelerating diffusion-based blind super-resolution with adversarial diffusion distillation. arXiv preprint arXiv:2404.01717.

[3] Wu, R., Yang, T., Sun, L., Zhang, Z., Li, S., & Zhang, L. (2024). Seesr: Towards semantics-aware real-world image super-resolution. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 25456-25467).

Comment

Dear Reviewer gXos:

The discussion period between the authors and the reviewer is nearing its end, and we kindly request that you review our clarifications and revisions. If our response addresses your concerns, we hope you can reconsider your score.

Thank you once again for your time and consideration.

Best Wishes!

Authors of Submission 1713

Review
Rating: 5

This paper proposes a method to distill a super-resolution diffusion model into one step by combining three losses: a direct regression loss, a GAN loss, and a modified score distillation loss. The main contribution is the score distillation part.

Strengths

  1. The paper targets an important problem: the distillation of SR diffusion models. While diffusion distillation is a popular research area, it is interesting to see insights designed specifically for SR models.

  2. The paper introduces a novel technique to reduce the bias of the score estimate of generated samples in SDS, which fits the insights from SR particularly well.

  3. Empirical results show promising improvements.

Weaknesses

  1. The biggest concern is insufficient baselines. The method is compared against a large number of non-diffusion-based methods and diffusion-based iterative methods, but it lacks comparisons against the most closely related methods: other diffusion distillation algorithms. This method distills a pre-trained SR diffusion model into one step with some SR-specific design, but there are many distillation methods designed for general diffusion models, such as consistency models and the family of distribution matching distillation. The authors should run controlled experiments with the same teacher model and different algorithms to emphasize the relative advantage. For example, I personally found that CM works well in distilling SR models into one step, and that DMD and its variants can distill the more complicated T2I models into one step. Their relative performance on SR diffusion is what we really care about.

  2. It seems like the method requires the teacher model to generate clean samples, which can be computationally expensive, even if the data is pre-computed offline.

  3. The background of SDS and how the bias is reduced are unclear to readers without prior knowledge.

Questions

N/A

Comment

Q2: It seems like the method requires the teacher model to generate clean samples, which can be computationally expensive, even if the data is pre-computed offline.

A2: The generation of samples by teacher models does incur additional computational costs; however, these costs remain within an acceptable range, particularly when generating samples offline. We compare our method with SinSR in terms of training time. As shown in Table 3, when generating clean samples online, our training time is only two hours longer than that of SinSR, and the distillation process can be completed within a day. Moreover, generating samples offline further reduces both training time and computational resource consumption. Additionally, we compare the GPU memory usage of our method during training between offline generation clean samples and online generation clean samples. The results show that the online generation of clean samples increases GPU memory usage by less than 5%, which is within an acceptable range. Furthermore, because SinSR requires learning a bidirectional mapping between noise and images, its GPU memory usage is higher than that of our method.

Table 3: A comparison of the training cost on 8 NVIDIA V100 GPUs.

| Method | Num of Iters | s/Iter | Training Time | GPU memory (GB) |
|---|---|---|---|---|
| SinSR | 30k | 2.57 | ~21 hours | 17.30 |
| Ours (Online) | 30k | 2.79 | ~23 hours | 11.72 |
| Ours (Offline) | 30k | 1.05 | ~9 hours | 11.17 |
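A minimal sketch of the offline variant referenced above (the `teacher.sample` interface and the caching format are assumptions, not the authors' code):

```python
import torch

@torch.no_grad()
def cache_teacher_samples(teacher, loader, out_path="teacher_samples.pt"):
    # Run the teacher's multi-step sampler once, up front, so the distillation
    # loop only loads cached clean latents instead of re-sampling every step.
    cache = [teacher.sample(batch) for batch in loader]
    torch.save(torch.cat(cache, dim=0), out_path)
```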

Q3: The background of SDS and how the bias is reduced are unclear to readers without prior knowledge.

A3: Thank you for your valuable suggestion. In the revised manuscript, we will include more background information on SDS and provide a clearer explanation of how we address deviations in SDS.

Comment

Dear Reviewer eiDx:

The discussion period between the authors and the reviewer is nearing its end, and we kindly request that you review our clarifications and revisions. If our response addresses your concerns, we hope you can reconsider your score.

Thank you once again for your time and consideration.

Best Wishes!

Authors of Submission 1713

Comment

Thank you for your comments and feedback. We address your concerns here.

Q1: The biggest concern is insufficient baselines. The method is compared against a large number of non-diffusion-based methods and diffusion-based iterative methods, but it lacks comparisons against the most closely related methods: other diffusion distillation algorithms. This method distills a pre-trained SR diffusion model into one step with some SR-specific design, but there are many distillation methods designed for general diffusion models, such as consistency models and the family of distribution matching distillation. The authors should run controlled experiments with the same teacher model and different algorithms to emphasize the relative advantage. For example, I personally found that CM works well in distilling SR models into one step, and that DMD and its variants can distill the more complicated T2I models into one step. Their relative performance on SR diffusion is what we really care about.

A1: Thank you for your valuable suggestion. We applied both consistency models and distribution matching distillation (DMD) to SR tasks for evaluation. Specifically, we employed consistency distillation under the L2 loss and set the same boundary conditions as consistency models: $c_{skip}(t) = \frac{\sigma_{data}^2}{(\eta_t-\eta_0)^2 + \sigma_{data}^2}$, $c_{out}(t) = \frac{\sigma_{data}(\eta_t - \eta_0)}{\sqrt{\sigma_{data}^2+\eta_t^2}}$, which clearly satisfies $c_{skip}(0) = 1$ and $c_{out}(0) = 0$. For DMD, we alternately update the fake score network and the generator, with the weights of the distribution matching distillation loss and the regression loss set to 1.
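Read literally, the boundary conditions above can be implemented as follows ($\sigma_{data} = 0.5$ and the $\eta_t$ notation follow the consistency-model convention; this is a sketch, not the exact training code):

```python
def c_skip(eta_t, eta_0, sigma_data=0.5):
    # sigma_data = 0.5 is the usual consistency-model default (an assumption here).
    return sigma_data**2 / ((eta_t - eta_0) ** 2 + sigma_data**2)

def c_out(eta_t, eta_0, sigma_data=0.5):
    return sigma_data * (eta_t - eta_0) / (sigma_data**2 + eta_t**2) ** 0.5

# At t = 0 we have eta_t = eta_0, so c_skip = 1 and c_out = 0: the consistency
# parameterization f(z, t) = c_skip(t) * z + c_out(t) * F(z, t) then reduces to
# the identity on clean samples, satisfying the required boundary condition.
```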

The experimental results are presented in Table 1. As shown in the table, the high-resolution images generated by the model using consistency distillation are significantly inferior to those produced by other super-resolution methods across all metrics, which appears to contradict the reviewer's findings. We speculate that this discrepancy may be due to ResShift modifying the standard Markov chain of the diffusion model, making it difficult to apply consistency distillation directly. While applying DMD to super-resolution tasks has yielded promising results, it still falls short of our method. To further validate the effectiveness of our approach, we also transferred it to an unconditional generation task. The results of this evaluation on CIFAR-10 are presented in Table 2. As shown, our method achieves competitive performance, even in unconditional generation tasks, outperforming both consistency models and DMD.

Table 1: Quantitative results of different SR methods. The best and second best results are highlighted in bold and italic. ∗ indicates that the result was obtained by replicating the method in the paper.

| Methods | ImageNet-test LPIPS ↓ | ImageNet-test CLIPIQA ↑ | ImageNet-test MUSIQ ↑ | RealSR CLIPIQA ↑ | RealSR MUSIQ ↑ | RealSet65 CLIPIQA ↑ | RealSet65 MUSIQ ↑ |
|---|---|---|---|---|---|---|---|
| LDM-15 | 0.269 | 0.512 | 46.419 | 0.384 | 49.317 | 0.427 | 47.488 |
| ResShift-15 | 0.231 | 0.592 | 53.660 | 0.596 | 59.873 | 0.654 | 61.330 |
| SinSR-1 | 0.221 | 0.611 | 53.357 | 0.689 | 61.582 | 0.715 | 62.169 |
| SinSR*-1 | 0.231 | 0.599 | 52.462 | 0.691 | 60.865 | 0.712 | 62.575 |
| DMD*-1 | 0.246 | 0.612 | 54.124 | 0.709 | 63.610 | 0.723 | 66.177 |
| CD-L2*-1 | 0.568 | 0.192 | 27.002 | 0.230 | 30.578 | 0.262 | 35.101 |
| TAD-SR-1 | 0.227 | 0.652 | 57.533 | 0.741 | 65.701 | 0.734 | 67.500 |

Table 2: Generative performance on unconditional CIFAR-10. The best and second best results are highlighted in bold and italic.

| Method | DDPM | DDIM | EDM (Teacher) | DPM-solver2 | UniPC | CD-L2 | CD-LPIPS | DEQ | DMD | Ours |
|---|---|---|---|---|---|---|---|---|---|---|
| NFE ↓ | 1000 | 50 | 35 | 12 | 8 | 1 | 1 | 1 | 1 | 1 |
| FID ↓ | 3.17 | 4.67 | 1.88 | 5.28 | 5.10 | 7.90 | 3.55 | 6.91 | 3.77 | 2.31 |

Comment

We thank all reviewers for their questions and constructive feedback. Based on these suggestions, we have made significant revisions to the manuscript. Key changes in the revised submission include:

  1. We have applied DMD to super-resolution tasks and compared it with our method. The results are shown in Tables 2 and 3. (Reviewer eiDX, Reviewer Rnto)

  2. We have included a more detailed explanation of the background knowledge related to score distillation sampling (SDS) technology in Section 2. (Reviewer eiDX)

  3. We have incorporated PSNR and SSIM metrics for evaluation in the main experiments included in the revised manuscript. (Reviewer gXos, Reviewer B832)

  4. We have conducted ablation experiments on the multi-scale features utilized by the discriminator, with the results presented in Table 9. (Reviewer gXos)

  5. In addition to applying our method to distill the diffusion-based SR model ResShift trained from scratch, we also distilled the SD-based SR model SeeSR and compared it with other SD-based methods, such as OSEDiff. The results are shown in Tables 11, 12, and 13. (Reviewer uBAa, Reviewer Rnto)

  6. We visualized the frequency spectra of the reconstruction results obtained by different methods through the Fourier transform to highlight the advantage of our method in generating high-frequency details. The results are presented in Figure 10. (Reviewer uBAa)

  7. We have compared the inference time of TAD-SR distillation across different super-resolution models with their respective baseline methods, and the results are presented in Tables 6 and 14. (Reviewer uBAa, Reviewer Rnto)

  8. We have provided more qualitative comparisons that contain fine details or small textures in Figures 9 and 15. (Reviewer uBAa)

  9. We have carefully revised the motivation and methodology sections of the paper to enhance readability and clarity. Furthermore, we remain committed to ongoing revisions of our manuscript to enhance its readability and comprehensibility. (Reviewer B832, Reviewer Rnto)

  10. We have provided the training process of the TAD-SR algorithm in the appendix to enhance the clarity of our method. (Reviewer B832, Reviewer Rnto)

  11. We utilized samplers such as UniPC and DPMSolver to accelerate the teacher model and compared them with our method. The experimental results are presented in Tables 11, 12, and 13. (Reviewer Rnto)

  12. We have included a discussion in the paper on the limitations of our proposed method and potential directions for future research. (Reviewer Rnto)

We hope that these changes strengthen the state of our submission.

Comment

We sincerely appreciate your valuable feedback and insightful suggestions, which have greatly helped us improve our manuscript. We have carefully addressed your concerns in our response and revised the manuscript accordingly. 

We understand that you have a busy schedule, but we would be grateful for any additional feedback or response you may have regarding our paper, as reviewer input is crucial for improving the quality and clarity of our work. Alternatively, if our revisions adequately address the issues raised, we kindly request a reconsideration of the score based on the clarifications and improvements made. 

Thank you once again for your time and consideration.

Best Wishes!

Authors of Submission 1713

AC Meta-Review

This paper receives mixed ratings of (5, 5, 5, 6, 6). The reviewers generally agree that the area this paper explores is interesting and meaningful and appreciate the simplicity of the method, while having concerns about the comparisons with and improvements over existing works. The AC carefully read the paper, reviews, and rebuttal, and agrees with the reviewers overall. In particular, in the authors' response, the improvement over OSEDiff cannot be regarded as significant given the slower speed. As a result, the effectiveness of the method could not be fully verified. While the AC agrees that this paper is an interesting exploration, the AC regretfully recommends rejection.

Additional Comments from the Reviewer Discussion

Reviewers raised concerns mainly about the comparisons and improvements, and the authors managed to resolve most of them. After reading the paper, reviews, and rebuttal, the AC feels that the effectiveness of the proposed method cannot be convincingly verified, and hence recommends rejection.

Final Decision

Reject