Feynman-Kac Correctors in Diffusion: Annealing, Guidance, and Product of Experts
Abstract
Reviews and Discussion
This paper proposes Feynman-Kac Correctors to enhance sampling from several types of compositional distributions. The main idea is based on the Feynman-Kac formulation and on converting transport and diffusion terms into a reweighting operation.
Update after rebuttal
My view has not changed, so I maintain my original score.
Questions for Authors
- Considering the relation between MCMC and ParVI, is it possible to add a particle interaction force to the proposed FKC method to mitigate the sampling covariance?
- I noticed that the reweighting update has a form similar to the Fisher-Rao gradient flow. There may be an underlying theoretical connection.
Claims and Evidence
The method of enhanced sampling from compositional distributions via reweighting is novel and reasonable. The derivation of the method is supported by valid MCMC/PDE theory.
Methods and Evaluation Criteria
The evaluation on SDXL is not fully convincing; the metrics should be tested on large-scale prompt sets (at least 1k prompts) from standard datasets like MS-COCO.
Theoretical Claims
I checked the theoretical claims. They are solid.
Experimental Design and Analysis
Sec. 5.1: sound.
Sec. 5.2: I am not familiar with the molecule generation task.
Sec. 5.3:
- The results are not convincing enough; more experiments with different CFG coefficients are needed. A Pareto curve between CLIP score and FID across different β values would indicate whether the proposed method is superior.
- Tests on large-scale prompt sets (at least 1k prompts) from standard datasets like MS-COCO are necessary. Tests on other models, such as PixArt and SD2b, would be beneficial.
- It is not clear why better adherence to the geometric-average CFG distribution would enhance image quality.
Supplementary Material
No supplementary material.
Relation to Prior Work
The proposed method contributes to the understanding of DMs and the potential enhanced performance of various downstream DM tasks.
Essential References Not Discussed
No.
Other Strengths and Weaknesses
No.
Other Comments or Suggestions
No.
We thank the reviewer for the detailed feedback and constructive suggestions. We are glad that they find the proposed idea 'novel' and its derivation 'supported by valid theory'. We are also happy to hear that our method 'contributes to the understanding of Diffusion Models.' We provide the suggested comparisons and answer the reviewer's questions below.
Testing the metrics on large-scale datasets (at least 1k prompts)
We thank the reviewer for the constructive feedback which has led to some interesting findings and a deeper understanding of this work. We evaluate FKC on two large-scale tasks: on ImageNet-1K using EDM2 [4] and on the Geneval benchmark [1] using SDXL, which evaluates the algorithms on a predefined set of prompts and measures adherence to the prompts. These additional large-scale experiments have led to two new insights:
- FKC improves image quality and prompt adherence in ambient space but not in latent space. This is supported by the following two tables, which show a substantial improvement from FKC on an ambient-space diffusion model but not on a latent diffusion model.
- EDM2 We first conduct a new set of image experiments using the EDM2 model [4], which generates outputs directly in pixel space. We compare generations from the baseline method, which uses CFG, with generations guided by FKC on ImageNet-1K, and evaluate 10k samples using two metrics: CLIP Score and ImageReward [5], which reflects how closely images align with human preferences.
| Method | Steps | Churn | CLIP Score (↑) | ImageReward (↑) |
|---|---|---|---|---|
| CFG | 32 | 40 | 28.75 | -0.24 |
| FKC | 32 | 40 | 29.00 | 0.04 |
| CFG | 48 | 10 | 28.83 | -0.18 |
| FKC | 48 | 10 | 29.02 | 0.06 |
| CFG | 64 | 80 | 28.67 | -0.21 |
| FKC | 64 | 80 | 28.88 | 0.09 |
| CFG | 128 | 40 | 28.71 | -0.19 |
| FKC | 128 | 40 | 28.94 | 0.06 |
We find that incorporating FKC improves both scores and qualitatively generates better images; we include examples in this PDF. We use the default settings for both models.
- SDXL Geneval The table below consists of three parts: Geneval benchmarks, performance of FKC on the prompts from Geneval (553 prompts), and on 1k prompts (553 + newly generated). We also rerun SDXL with the same hyperparameters as our algorithm for a fair comparison.
| Model | Guidance scale | Overall | Single object | Two object | Counting | Colors | Position | Color attribution |
|---|---|---|---|---|---|---|---|---|
| CLIP retrieval | – | 0.35 | 0.89 | 0.22 | 0.37 | 0.62 | 0.03 | 0.00 |
| SD 1.5 | – | 0.43 | 0.97 | 0.38 | 0.35 | 0.76 | 0.04 | 0.06 |
| SDXL | – | 0.55 | 0.98 | 0.74 | 0.39 | 0.85 | 0.15 | 0.23 |
| Geneval Prompts | | | | | | | | |
| SDXL (Rerun) | 7.5 | 0.57 | 0.99 | 0.80 | 0.46 | 0.86 | 0.11 | 0.22 |
| FKC | 5.5 | 0.58 | 0.99 | 0.77 | 0.49 | 0.87 | 0.10 | 0.22 |
| FKC | 7.5 | 0.57 | 0.99 | 0.78 | 0.46 | 0.83 | 0.13 | 0.23 |
| 1k Prompts | | | | | | | | |
| SDXL | 7.5 | 0.58 | 0.99 | 0.79 | 0.45 | 0.88 | 0.11 | 0.21 |
| FKC | 5.5 | 0.57 | 0.99 | 0.79 | 0.42 | 0.86 | 0.13 | 0.21 |
| FKC | 7.5 | 0.57 | 0.99 | 0.80 | 0.45 | 0.83 | 0.13 | 0.22 |
- FKC improves performance in the ambient space across tasks, whereas improvements for latent-space diffusion models are limited. This is supported by Table 2 and Figure 2 of the PDF. Here, we present new and stronger results for the molecule generation task in the coordinate space instead of the latent space.
Is it possible to add a particle interaction force to the proposed FKC method to mitigate the sampling covariance?
Indeed, one could combine the proposed method with Stein Variational Gradient Descent, targeting the intermediate marginals (the annealed density or the product of densities) similar to [2]. This would require adding a term to the weights, which, theoretically, should reduce the variance of the weights; a minimal sketch of such an interaction force is given below.
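For concreteness, below is a minimal sketch of what such an interaction force could look like: a standard SVGD update direction with an RBF kernel. This is an illustration rather than part of our method; the particle array `x` and per-particle scores are assumed inputs.

```python
import numpy as np

def svgd_force(x, score, h=1.0):
    """Standard SVGD update direction for particles x of shape (n, d).

    score: (n, d) array of grad-log-density values at each particle. Returns
    phi[i] = mean_j [ k(x_j, x_i) * score[j] + (x_i - x_j) / h**2 * k(x_i, x_j) ]:
    a kernel-smoothed drive toward high density plus a repulsive term that
    spreads particles apart and could counteract weight degeneracy."""
    diff = x[:, None, :] - x[None, :, :]              # diff[i, j] = x_i - x_j
    k = np.exp(-(diff ** 2).sum(-1) / (2 * h ** 2))   # RBF kernel matrix (n, n)
    drive = (k[:, :, None] * score[None, :, :]).mean(axis=1)
    repulse = (k[:, :, None] * diff).mean(axis=1) / h ** 2
    return drive + repulse
```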
Connection to the Fisher-Rao gradient flow
The reweighting equation does correspond to the Fisher-Rao gradient flow of a linear functional (after constraining the density to remain normalized). The Wasserstein Fisher-Rao gradient flow has a form similar to the Feynman-Kac PDE with a zero diffusion term (deterministic evolution). We have included a note about this in the Appendix of the updated revision (see, e.g., Thm. 3.1 in [3]).
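To make the correspondence concrete, here is the schematic calculation in our own notation (not the paper's exact symbols):

```latex
% Fisher-Rao gradient flow of a functional F[p] over normalized densities:
\partial_t\, p_t(x)
  = -\Big(\frac{\delta F}{\delta p}(x) - \mathbb{E}_{p_t}\Big[\frac{\delta F}{\delta p}\Big]\Big)\, p_t(x).
% For the linear functional F[p] = -\int w(x)\, p(x)\, dx, the first variation
% is \delta F / \delta p = -w, which recovers the reweighting equation:
\partial_t\, p_t(x) = \big(w(x) - \mathbb{E}_{p_t}[w]\big)\, p_t(x).
```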
Closing remarks
We thank the reviewer for suggesting this comparison, which significantly improves our empirical study. We hope that our answers address all the important questions raised by the reviewer, and we are more than happy to consider any additional questions or further suggestions.
[1] Ghosh, Dhruba, et al. "Geneval: An object-focused framework for evaluating text-to-image alignment."
[2] Corso, Gabriele, et al. "Particle guidance: non-iid diverse sampling with diffusion models."
[3] Lu, Yulong, et al. "Accelerating Langevin sampling with birth-death."
[4] Karras, Tero, et al. "Analyzing and Improving the Training Dynamics of Diffusion Models."
[5] Xu, Jiazheng, et al. "ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation."
Thanks for your reply.
My score will remain positive, and this paper is worth acceptance. The rebuttal enhances my understanding of the FKC.
However, the current theory cannot explain why FKC does better in ambient space and provides no help in latent space. I believe the latent/ambient distinction should not matter for the reweighting mechanism; the gap may be due to other model-related reasons. Moreover, its downstream application in CV may not be that promising. Therefore, I cannot give a higher score.
Thank you for your swift response! We are glad to hear that our rebuttal 'enhances your understanding' of the proposed methodology and that you believe the paper is worthy of acceptance.
We would like to highlight that the main objective of the proposed method is to sample from the modified densities rather than to directly improve the quality of image generation. This discrepancy in motivation is likely the reason why the method does not improve the performance of Stable Diffusion XL. Indeed, factors such as the quality of the VAE and of the latent diffusion model determine (a) the accuracy of the learned scores, which is crucial for our method, and (b) whether better sampling from the geometric average of densities in the latent space results in better image generations after decoding.
However, the results on the sampling task and other generative modeling tasks suggest that practice agrees with our theoretical findings, i.e., introducing FKC allows for more accurate sampling from the modified densities.
If you have any further suggestions for how we can improve our work and potentially your evaluation, please do let us know!
This paper points out that modifying the score function of pretrained generative models, for example with classifier-free guidance, can generate samples that are not from the same distribution as the training data, and that the corrector schemes used to address this problem either require infinitely many steps, demanding more computational resources, or are detrimental to sampling quality. This paper proposes Feynman-Kac PDEs, aiming to generate samples from the same training data distribution and to improve sampling efficiency.
Questions for Authors
- For equation 6, I cannot really see how it is derived. I quickly skimmed through the Appendix but did not find it explained anywhere.
Claims and Evidence
For Section 3, the paper proposes composing several diffusion models at inference time, especially in the product and geometric-average examples, but I am not sure this is a proper approach. [1] proves and empirically shows that the reverse SDE induced by a composed score function does not correspond to sampling from the composed model, and that reverse diffusion sampling will generate incorrect samples from composed distributions. Instead, they propose to sample with Langevin dynamics. Based on my understanding, Feynman-Kac PDEs have an SDE component embedded, so I suspect that simply performing the sampling process this paper introduces will give wrong samples too. I might be making the wrong connection; please correct me if I am wrong.
[1] Du, Y., Durkan, C., Strudel, R., Tenenbaum, J.B., Dieleman, S., Fergus, R., Sohl-Dickstein, J., Doucet, A. and Grathwohl, W.S., 2023, July. Reduce, reuse, recycle: Compositional generation with energy-based diffusion models and mcmc. In International conference on machine learning (pp. 8489-8510). PMLR.
Methods and Evaluation Criteria
I think the experimental designs for performance evaluation are reasonable.
Theoretical Claims
I did not spot issues in the proofs or claims, except for the one I mentioned in the "Claims and Evidence" section.
Experimental Design and Analysis
This paper proposes SMC and jump-process resampling to improve performance, but I do not find the details for each experiment. For example, the paper mentions SMC is only done when $t \in [t_{\min}, t_{\max}]$, but these values are not reported for the experiments. I also cannot find whether both resampling methods are actually used at inference time, as the tables and figures do not indicate this. An ablation study is required if so.
With these two resampling methods plugged in, I am also wondering about the latency and computational cost.
In Table 5, one experiment with FKC appears to be missing; I cannot see why this was not done.
Supplementary Material
I checked a few of the proofs for Table 1 and found them fairly repetitive, so I did not check all of them.
Relation to Prior Work
- This paper derives PDEs to describe the time-evolution of sample density under the standard SDEs. This makes the sampling process more flexible.
- This paper proposes a few composition approaches, such as annealed, product, and geometric-average distributions. Again, as I have mentioned, the correctness is in doubt, as the reverse SDEs would not give the right distribution [1].
Essential References Not Discussed
I think it is complete.
Other Strengths and Weaknesses
The paper is well-structured and complete.
Again, the main weakness is that I did not see the computational cost reported to support the claimed efficiency of the proposed approach; otherwise, the resampling process could be very costly.
Other Comments or Suggestions
No.
We thank the reviewer for the constructive feedback. We are glad that the reviewer finds our paper to be 'well-structured and complete' and the experimental design to be 'reasonable'. We also would like to thank the reviewer for bringing up reference [1] (Du et al.), as this is an excellent reference for clarifying our contributions.
Before addressing the concerns raised, we would like to clarify a potential misunderstanding about the proposed method.
- The main objective of the proposed method is to modify the target density at inference time (e.g., by sampling from the annealed distribution or a product of densities) rather than 'sampling from the training data and improving sampling efficiency'.
- In complete agreement with [1], our work shows that simulating the reverse SDEs with modified scores does not sample from the target densities. This is exactly the goal of introducing the Feynman-Kac corrector scheme, which, as we prove, allows for consistent sampling from the target densities by re-weighting and re-sampling. We have added a self-contained proof of Eq. 9 in the updated Appendix to further emphasize this claim; a schematic of the resulting weighted sampler is sketched below.
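For illustration, here is a minimal sketch of such a weighted sampler under a variance-exploding schedule. The composed score `score_fn` and the per-step log-weight increment `log_w_incr` are placeholders: the exact weight expressions are the ones derived in our propositions, which we do not reproduce here.

```python
import numpy as np

def fkc_sample(score_fn, log_w_incr, x0, ts, sigma, rng):
    """Schematic FKC sampler: Euler-Maruyama on the reverse SDE plus
    self-normalized importance weighting and resampling at each step."""
    x, n = x0, x0.shape[0]
    for t0, t1 in zip(ts[:-1], ts[1:]):        # ts decreases from T to ~0
        dt = t0 - t1                           # positive step size
        s = score_fn(x, t0)                    # e.g. a CFG/product-composed score
        x = x + sigma(t0) ** 2 * s * dt \
              + sigma(t0) * np.sqrt(dt) * rng.standard_normal(x.shape)
        log_w = log_w_incr(x, t0, dt)          # FK reweighting term (score-based)
        w = np.exp(log_w - log_w.max())        # unnormalized weights suffice
        w /= w.sum()
        x = x[rng.choice(n, size=n, p=w)]      # multinomial resampling for brevity;
                                               # systematic resampling is preferred
    return x
```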
Next, we would like to address the salient concerns raised by the reviewer individually:
The choice of t_\min and t_\max
The choice of the interval [t_\min, t_\max] depends on the hyperparameters and, especially, the noise schedule, but does not require significant tuning. In practice, we choose the interval based on the fraction of unique samples resampled at every iteration. We report the corresponding plot in Fig. 1 of the PDF for the annealing experiments. Note that the scale of the weights is proportional to the noise (see Prop 3.2), which results in a low number of unique samples close to the terminal time due to the Variance Exploding schedule.
For the molecule generation experiments in Table 3, we selected t_\max based on a validation task by sweeping over a grid of values; t_\min was always set to 0.
We have also added a new set of molecule experiments for a harder set of tasks using a model that directly predicts the 3D coordinates of the atoms in a molecule from [2] (Zhou et al.) and docks molecules to a set of target protein pairs. Again, we use a validation task to sweep over t_\max values, which we report in Table 2 of the PDF.
Ablation study for the resampling methods
We perform the ablation study of the two considered resampling methods in Table 6 of the Appendix. We find that systematic resampling always performs better or comparably and opt to use it throughout the rest of the empirical study.
Computational cost of the resampling step
The additional computational cost of the resampling step is negligible compared to the reverse SDE simulation. Indeed, all the weights depend only on scores, which are already evaluated in the forward pass. We purposefully avoid the computation of the divergence operators when deriving the integration scheme (see Lines 176-189, left column). The cost of the systematic resampling step is equivalent to generating one uniform random variable, which is negligible.
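To make this concrete, here is a minimal sketch of systematic resampling; the single `rng.uniform()` call is the only randomness required.

```python
import numpy as np

def systematic_resample(w, rng):
    """Systematic resampling with one uniform draw.

    w: normalized weights of shape (n,). Returns n ancestor indices; the
    number of copies of particle i stays within 1 of n * w[i], keeping the
    resampling variance low."""
    n = len(w)
    u = (rng.uniform() + np.arange(n)) / n    # evenly spaced points, one random shift
    return np.searchsorted(np.cumsum(w), u)
```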
Our proposed method is notably more convenient than [1], which requires changing the diffusion model parameterization and performing additional MCMC steps or Metropolis-Hastings tests.
Guidance-scale values for FKC on SDXL
As requested by the reviewer, we provide results for both guidance-scale values on the Geneval benchmark [3] (Ghosh et al.). See the detailed image-generation study in our response to Reviewer 65RJ.
| Model | Guidance scale | Overall | Single object | Two object | Counting | Colors | Position | Color attribution |
|---|---|---|---|---|---|---|---|---|
| SDXL (Rerun) | 7.5 | 0.57 | 0.99 | 0.80 | 0.46 | 0.86 | 0.11 | 0.22 |
| FKC | 5.5 | 0.58 | 0.99 | 0.77 | 0.49 | 0.87 | 0.10 | 0.22 |
| FKC | 7.5 | 0.57 | 0.99 | 0.78 | 0.46 | 0.83 | 0.13 | 0.23 |
Derivation of equation (6)
Equation (6) can be understood via separation of variables. Schematically, $p_t(x) \propto \tilde{p}_t(x)\, \exp\big(\int_0^t w_s(x)\, ds\big)$, where the exponential part corresponds to the update of the weights. Note that in eq. (9), these appear as log-weights for Self-Normalized Importance Sampling.
Note that Eq. (6) is meant to introduce the reweighting evolution. In later sections, we derive the weights that preserve a particular target-distribution evolution under transport by an SDE with a given drift and diffusion.
Closing remarks
Again, we thank the reviewer for their questions, which gave us the opportunity to clarify and improve our work. We hope that our answers fully address all the important questions raised by the reviewer, and we are happy to consider any additional questions or further suggestions.
We kindly ask the reviewer to consider increasing their score if our responses address their concerns satisfactorily.
The paper derives a suite of new tools for modifying pretrained diffusion models at inference time using particle resampling techniques. In particular, the authors use the Feynman-Kac formula to derive the evolution of particle weights when simulating the diffusion reverse SDE (or variants of it), such that resampling with these weights yields samples from 1) tempered versions of the original diffusion marginals at different diffusion times, 2) products of the marginals of two diffusion models, and 3) geometric averages of the marginals of two diffusion models. These weighting terms are derived for a plethora of different SDEs, and the paper focuses experimentally on ones where 1) the score function is scaled by a scalar, or 2) both the score and the SDE noise are scaled by specific scalars. The paper explores sequential Monte Carlo resampling methods and jump-process reweighting methods. The proposed methods are evaluated on
- different sampling problems, where the ability to change the temperature at runtime is validated
- multi-property molecule generation, where the authors use the method to generate molecules that simultaneously inhibit multiple proteins
- image generation, where the use of the method as a classifier-free guidance replacement is investigated
Questions for Authors
See “Other strengths and weaknesses”
Claims and Evidence
I think that the claims are supported by evidence.
Methods and Evaluation Criteria
I think that all of the datasets chosen make sense, each tests one clear ability of the collection of methods proposed. The evaluation criteria are sensible as well.
Theoretical Claims
I did go through Propositions B.1 and B.2 and did not find issues. But there are many more propositions in the paper.
Experimental Design and Analysis
I think that the experimental design and analysis made sense, but I did not check their validity in much detail.
Supplementary Material
I went through the beginning of Appendix B, and some of the experimental details in Appendix E.
Relation to Prior Work
The paper presents methodological progress in the context of applying inference-time controls to pretrained diffusion models using particle resampling methods. The paper presents many new tools, and, e.g., the ability to temper the target distribution has not been considered in previous work, to the best of my knowledge. This could serve as a very useful building block for diffusion methods targeting at sampling from unnormalised densities. Perhaps the most new important contribution is the systematic mathematical exploration of different reverse SDEs and their impact on the resampling schemes, and the paper could serve as a helpful reference for future work to build on.
Essential References Not Discussed
I am not aware of essential references not discussed, although I am not also particularly familiar with the literature on using SMC to guide diffusion models.
Other Strengths and Weaknesses
I think that this is overall a very nice paper and a useful contribution to the literature. The paper opens up new tools for diffusion guidance to the community, is well written and organised, and experiments are done on multiple examples, showing the versatility of the proposed tools.
A weakness of the paper is that some practical details of the experiments are a bit more complicated than the theory would imply, and some hyperparameters exist that are not discussed at length in the main paper. I highlight two parts to kick off the discussion:
From section 4: “We find resampling only over an ‘active interval’ t ∈ [tmin, tmax] useful for improving sample quality and preserving diversity, and set weights to zero outside of this interval”
- What is this interval in the experiments used? If this requires significant tuning for different data sets, that seems to be a downside of the method. Regardless, this should be detailed in the paper.
From the Appendix, regarding the compositional molecule generation experiments:
“In practice, we find that the FKC weights have a large variance during molecule generation. This is problematic, as a large number of samples are thrown away. Furthermore, we noted that the score was not always well-conditioned. To ameliorate this, we divided the weights by a set temperature term (T = 100) to reduce their variance before resampling, clipped the top 20% to account for any score instabilities, and did early-stopping (only resampled for 70% of the timesteps).”
- Do the authors have intuition on why it is necessary to divide the weights by a set term of 100 in this example? Is this done for the other experiments as well? 100 seems like a large number to use here, and a significant departure from the method implied by the mathematics seems, on the face of it, to call into question the relevance of the mathematical method definition.
- That said, I suppose it makes sense that the particle resampling may be especially high variance in high dimensions?
- Is early stopping the right phrase to use here, as usually in machine learning it refers to a regularisation method for neural network training?
The image generation results seem quite okay, but there are not a lot of practical details. Weight stabilization techniques used? Active interval choices for resampling? How many particles? I understand that this is not necessarily the highlight of the paper, but since the experiments are there, it would be useful to include the details as well.
Other Comments or Suggestions
We thank the reviewer for their detailed feedback! We are thrilled to hear that they find our paper to be a 'useful contribution to the literature', 'well written and organized' and a new tool for the diffusion sampling community. Below, we address the reviewer's questions.
What is the interval [t_\min, t_\max] in the experiments used? Does it require significant tuning?
The choice of the interval [t_\min, t_\max] depends on the hyperparameters and, especially, the noise schedule, but does not require significant tuning. In practice, we chose the interval based on the fraction of unique samples resampled at every iteration during the resampling step (this diagnostic is sketched below). We report the corresponding plot in Fig. 2 of the PDF for the annealing experiments. Note that the scale of the weights is proportional to the noise (see Prop 3.2), which results in a low number of unique samples close to where the variance of the noise is the largest due to the Variance Exploding schedule. We have added the corresponding plots and discussion to the manuscript.
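For concreteness, the diagnostic amounts to the following sketch, where `idx` denotes the ancestor indices returned by the resampler:

```python
import numpy as np

def unique_fraction(idx):
    """Fraction of distinct ancestors surviving a resampling step: values near 1
    mean gentle resampling; values near 1/n signal particle collapse."""
    return np.unique(idx).size / idx.size
```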
Why is dividing the weights by a constant factor necessary? I suppose it makes sense that the particle resampling may be especially high variance in high dimensions?
Indeed, as the reviewer points out, the variance of the weights grows with the number of dimensions, being proportional to the norm of the score. For the latent molecule generation (8000 dimensions), we had to divide the weights by a temperature T = 100 and clip the most extreme weights in order to achieve a reasonable variance. To verify that the need for such heuristics is caused by the dimensionality, we conducted a new empirical study for molecule generation in the coordinate space, where the dimension is 3 × the number of atoms in the molecule (< 100). Here, we neither divided nor clipped the weights and only chose t_\max based on a validation set. We present the new results below and have added them to the manuscript.
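For clarity, the stabilization heuristic described in the Appendix amounts to a few lines (a sketch; the constants are the task-specific values quoted by the reviewer):

```python
import numpy as np

def stabilize_log_weights(log_w, temperature=100.0, top_frac=0.2):
    """Temper the log-weights to shrink their variance, then clip the largest
    fraction to guard against occasional ill-conditioned score evaluations."""
    log_w = log_w / temperature
    cap = np.quantile(log_w, 1.0 - top_frac)   # clip the top 20%
    return np.minimum(log_w, cap)
```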
Is early stopping the right phrase to use here?
Thank you for suggesting a better explanation! The closest analog to selecting [t_\min, t_\max] is reducing the time interval for the integration of diffusion models: instead of integrating over $[0, T]$, one usually integrates over $[\varepsilon, T]$, where ε is a small constant. We have changed this explanation in the manuscript.
Practical details on image generation
For image generation, we used Stable Diffusion XL with a variance-preserving SDE and an Euler–Maruyama solver, running for 100 steps. All the computations are done with float16 precision, which is crucial for stable generation via SDXL. All images are generated in 1024x1024 resolution. In the FKC portion of our experiments, we did not apply weight rescaling; moreover, we resampled all 64 particles after each time step throughout the entire time interval.
New molecule experiments
We ran a new set of molecule experiments using a model from [1] that directly predicts the 3D coordinates of the atoms in a molecule, on a harder set of tasks: generating molecules that dock to a pair of proteins simultaneously, expanding on the promising results from Table 4 of the original submission. For 14 protein pairs, we generated 32 molecules at 5 different molecule sizes (160 molecules per pair) using the FKC product and found that it improves nicely over two SOTA methods:
Table: Docking scores of generated ligands for 14 protein target pairs (P₁, P₂). Lower docking scores are better.
| Method | FKC | (P₁ * P₂) (↑) | max(P₁, P₂) (↓) | P₁ (↓) | P₂ (↓) | Div. (↑) | Val. & Uniq. (↑) |
|---|---|---|---|---|---|---|---|
| P₁ only [2] | – | 62.77±23.74 | -7.30±1.90 | -8.38±1.51 | -7.44±1.93 | 0.89±0.01 | 0.95±0.07 |
| β = 0.5 | no [1] | 64.35±21.54 | -7.14±2.12 | -7.90±1.99 | -7.96±1.67 | 0.89±0.01 | 0.89±0.21 |
| β = 0.5 | yes | 64.05±31.21 | -6.86±3.26 | -7.89±2.90 | -7.92±2.42 | 0.88±0.02 | 0.95±0.11 |
| β = 1.0 | no | 69.03±21.61 | -7.54±1.74 | -8.24±1.71 | -8.30±1.53 | 0.89±0.01 | 0.90±0.19 |
| β = 1.0 | yes | 69.83±32.70 | -7.40±2.93 | -8.51±1.82 | -8.27±2.88 | 0.85±0.02 | 0.92±0.10 |
| β = 2.0 | no | 68.12±18.56 | -7.40±2.03 | -8.21±1.66 | -8.11±1.62 | 0.88±0.01 | 0.94±0.16 |
| β = 2.0 | yes | 75.54±23.26 | -7.91±1.62 | -8.60±1.62 | -8.66±1.55 | 0.81±0.05 | 0.88±0.09 |
[1] Zhou et al. "Reprogramming Pretrained Target-Specific Diffusion Models for Dual-Target Drug Design." NeurIPS (2024).
[2] Guan et al. ICLR (2023).
Implementation details
For molecules, we chose the time interval based on a validation set of 1 protein pair at 5 molecule lengths (32 × 5 generated molecules). We kept t_\min at 0 and swept over values of t_\max with SDE type = "Target score" and FKC = "on". We show this and additional ablations over β and the SDE type in this PDF. Setting t_\max to 0.6 gives a good tradeoff between generating molecules that perform well and maintaining diversity, so we proceed with t_\max = 0.6.
Prior work has shown that composing multiple pre-trained diffusion models is not straightforward in the context of energy-based models. This paper investigates Feynman-Kac correctors based on the Feynman-Kac formula to sample from annealed, geometric-average, and product distributions. Compared to the Fokker-Planck equation, a reweighting function is introduced, allowing for the incorporation of Sequential Monte Carlo resampling schemes. Experimental results demonstrate the effectiveness of the proposed algorithm.
Questions for Authors
- When comparing Eq. (5) and Eq. (7), an additional term is added. I am curious whether this arises from the evolving normalizing constant in the sampling scenario. In contrast, does the vanilla FP equation in Eq. (5) describe the evolution of distributions while keeping the normalizing constant unchanged?
- To obtain the FK PDE in Eq. (7), is it simply the addition of the FP equation and the reweighting equation? I suspect there may be something missing before the definition of the FK PDE.
Claims and Evidence
Yes.
Methods and Evaluation Criteria
Yes.
Theoretical Claims
Yes, I checked some of the proofs.
Experimental Design and Analysis
Yes.
Supplementary Material
Yes, I reviewed some of the proofs in the appendix.
Relation to Prior Work
This paper investigates Feynman-Kac correctors based on the Feynman-Kac formula to sample from annealed, geometric-average, and product distributions.
Essential References Not Discussed
No.
Other Strengths and Weaknesses
Strengths:
- The paper is generally well-structured and theoretically sound.
- The paper proposes Feynman-Kac correctors based on the FK formula and applies them to different scenarios, including annealed, geometric-average, and product distributions derived from pre-trained diffusion models.
Weaknesses:
- Please see the questions for the authors.
Other Comments or Suggestions
No.
We thank the reviewer for their time, feedback, and positive appraisal of our work. We are heartened to hear that the reviewer feels the paper is "theoretically sound" and "generally well-structured". We now address the questions and suggestions raised by the reviewer.
When comparing Eq. (5) and Eq. (7), an additional term is added. I am curious whether this arises from the evolving normalizing constant in the sampling scenario. In contrast, does the vanilla FP equation in Eq. (5) describe the evolution of distributions while keeping the normalizing constant unchanged?
Both PDEs in eqs. (5) and (7) preserve the normalization of the density, in the sense that $\int p_t(x)\, dx = 1$ for all $t$, although the evolution of the density changes with the addition of the weighting terms.
Note that the normalization constants defined in eq. (13) change in time to guarantee that the density on the left-hand side is normalized. This is because the density is defined only up to a multiplicative constant.
Note that, while the integral term in eq. (6) ensures normalization for the density-evolution PDE, our SMC resampling can proceed with access to only the unnormalized weights (see eq. (9)).
To obtain FK PDE in Eq.(7), is it simply the addition of the FP equation and the reweighting equation? I guess there may be something missing before the definition of the FK PDE.
The FK PDE in eq. (7) is a more general type of PDE than the FP equation in eq. (5). Indeed, the main difference between these PDEs is the reweighting term (the last term of the FK PDE, i.e., eq. (6)), which allows for simulating processes that are not possible with the FP equation. For instance, the FK PDE can change the weights of disconnected modes by reweighting the samples without transporting them, while the FP equation would have to transport samples from one mode to another (via the vector field or noise) to adjust the relative weights. We will emphasize this point in the final version of the paper; the contrast is shown schematically below.
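Schematically, in generic notation (drift $b_t$ and diffusion scale $\sigma_t$; not the paper's exact symbols):

```latex
% Fokker-Planck (eq. 5):
\partial_t\, p_t = -\nabla\!\cdot(b_t\, p_t) + \tfrac{\sigma_t^2}{2}\,\Delta p_t .
% Feynman-Kac (eq. 7): the same transport and diffusion terms plus the
% reweighting term, mean-subtracted so that \int p_t\, dx = 1 is preserved:
\partial_t\, p_t = -\nabla\!\cdot(b_t\, p_t) + \tfrac{\sigma_t^2}{2}\,\Delta p_t
                   + \big(w_t(x) - \mathbb{E}_{p_t}[w_t]\big)\, p_t .
```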
We would like to thank the reviewer for their time and feedback. We hope our answers allow the reviewer to continue to positively endorse our paper, and we would happily clarify any additional questions that may arise.
Thank you to the authors for their responses, which are clear to me.
During the rebuttal period, I have another question:
- In the appendix, Eqs. (74-75) and Eqs. (87-88) give the same marginal distributions. It is possible that, in some cases, multiple solutions exist. How should one choose between them? In this particular case, we should prefer Eqs. (74-75) over Eqs. (87-88), since Eq. (88) contains a Laplacian term?
We are happy to hear that our initial response clearly answers the reviewer's questions!
In the appendix, Eqs. (74-75) and Eqs. (87-88) give the same marginal distributions. It is possible that, in some cases, multiple solutions exist. How should one choose between them? In this particular case, we should prefer Eqs. (74-75) over Eqs. (87-88), since Eq. (88) contains a Laplacian term?
This is absolutely correct, as we discuss in Section 2.3., a given PDE can be simulated in multiple ways. This is achieved by moving the terms between the continuity equation, the diffusion equation and the reweighting equation and changing their interpretation.
In the current paper, we consider two main motivations for the choice of the simulation scheme (see Lines 176-188, left column):
- Computational cost: As the reviewer correctly points out, we should prefer (74-75), as they avoid the expensive evaluation of the Laplacian and divergence operators in (87-88) during inference.
- Sampling efficiency: In general, we should choose the scheme that maximizes sampling efficiency, i.e., minimizes the variance of the weights at minimal computational cost. This choice may depend on the specific setting and application. For example, in the annealing task (Section 5.1 and Table 2), the two schemes (tempered noise vs. target score) perform differently at different annealing temperatures, with the target score working better at lower target temperatures (more annealing). A standard diagnostic for this choice is sketched after this list.
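For reference, a standard measure of weight degeneracy (a generic SMC diagnostic, not specific to our paper) is the effective sample size of the normalized weights:

```python
import numpy as np

def effective_sample_size(w):
    """For normalized weights w, ESS = 1 / sum(w_i^2): it equals n for uniform
    weights and approaches 1 as the weights collapse onto a single particle."""
    return 1.0 / np.sum(w ** 2)
```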
We thank the reviewer for their insightful question; we will be sure to include further discussion of these choices in the revised manuscript. We are happy to provide any further clarification and answer any questions that may arise.
The paper introduces Feynman-Kac Correctors (FKCs), a powerful and versatile tool for inference-time control in diffusion models, enabling the combination of different models (product of experts), annealing, and guidance (conditional sampling). The work is a refreshing new perspective on an important problem that has mostly seen incremental advances and almost no conceptual leaps in recent years. At the same time, the paper is practical and readily applies to real problems, as evidenced by the extensive experimental section, which was further expanded during the rebuttal-discussion phase. The reviewers unanimously agree that this paper should be accepted.