PaperHub
Overall score: 6.8 / 10
Poster · 4 reviewers
Ratings: 4, 4, 4, 5 (min 4, max 5, std 0.4)
Confidence: 3.8
Novelty: 3.5 · Quality: 2.8 · Clarity: 2.8 · Significance: 3.3
TL;DR

Loong-X enables hands-free image editing using multimodal neural signals, achieving performance comparable to text-driven methods by combining BCIs with the proposed diffusion-based generative methods.

Abstract

Keywords
Brain-Computer Interface, Image Editing, Visual Generation, Generative Models, Diffusion Models

Reviews and Discussion

Review
Rating: 4

The paper introduces LoongX, an image editing approach based on neural and speech signals. The authors collect a comprehensive set of human stimuli for image editing including EEG, fNIRS, PPG, motion and speech signals. The proposed LoongX approach demonstrates image editing capabilities when conditioned on these signals, providing a new paradigm beyond textual image editing. Further, the authors provide novel ablations into the importance of different brain region signals for image manipulation.

Strengths and Weaknesses

Strengths

  • General purpose image editing with neural conditioning is a novel task and field. This paper marks a potentially seminal contribution
  • Strong user study design and data collection process. Excellent, controlled study setup, good description of participant demographics and prior IRB approval
  • LMind as strong dataset contribution
  • Experimental results look promising
  • Paper provides some interesting insights on importance of different signals and brain regions for image manipulation

Weaknesses

Missing details and strong assumptions. In many places the paper fails to provide relevant information, references, or experiments:

  • Strong architectural assumptions on the input encoding using the CS3 and DGF modules. While this setup generally seems suitable, there is no clear justification for these exact design choices. Either a strong justification, references to prior work demonstrating the effectiveness of the submodules, or comprehensive ablations would be needed.
  • Additionally, LoongX seemingly arbitrarily combines EEG/PPG with T5 embeddings and fNIRS/motion with CLIP, with no clear intuition.
  • There is virtually no information provided on the actual DiT backbone or how it is conditioned on the input image. What is the specific architecture and parameter count? Is it pre-trained or trained from scratch? What encoding/framework is used for input image conditioning?
  • L 205 mentions that existing neural-text corpora are used for pre-training, without a citation.
  • Information on pre-training and fine-tuning is sparse (and also not present in the appendix). For how many epochs/steps is the model trained? Are there any interesting observations in terms of training stability, loss curves, etc. for this new form of conditioning?
  • Obvious questions about the overall setup are also not addressed. From Eq. (10) we can see that the model is trained using a flow objective. But what kind of flow? Does the model use CFG? If so, what does the null-conditioning/negative prompt look like?
  • The visualization in Fig. 6 is incredibly hard to follow

Experimental design. Additionally, there are some issues with the experimental design and the conclusions drawn from the results:

  • The choice of metrics is unconventional/unsuitable. Specifically, the pixel-based L1 and L2 distances are ill-suited to assessing image editing since they do not actually measure human-perceived similarity. Consequently, the standard choice in the image editing literature is to use LPIPS [1] instead.
  • Similarly, reporting performance as a single value per method in a table does not accurately reflect the performance of an image editing method. There is an inherent trade-off between similarity to the input image and alignment with the edit instruction. This trade-off is much better reported as a curve over different hyperparameter configurations/conditioning strengths (see InstructPix2Pix for example).
  • It is also unclear what the DINO metric is supposed to measure here
  • OmniControl as the sole baseline for textual editing does not make for a fair comparison. Firstly, OmniControl's strength is not even in textual image editing but in strong reference usage. Consequently, the reported text-only baseline does not actually reflect the strength of current textual image editing approaches. For reference, I ran the cherry-picked examples in Fig. 7 through Flux-Kontext and it strongly outperforms all depicted examples. Consequently, the conclusions drawn when comparing against textual conditioning are heavily skewed as well. These limitations are only exacerbated by the lack of information on the DiT size and training FLOPs.
  • There are a plethora of recent textual image editing works (a lot of which the authors cited themselves) that would have given a better baseline comparison for textual image editing.

Additionally, the paper would benefit from actually demonstrating "hands-free image editing", where users interact with the trained model in the same way as during data collection. The transfer from performance reported on a held-out test set to actual downstream usability seems less obvious for this form of interaction.

Minor Comments

  • Missing related work. DreamConnect [2] also performs image editing using neural signals.

[1] Zhang et al. "The Unreasonable Effectiveness of Deep Features as a Perceptual Metric" (2018) CVPR

[2] Yasheng Sun et al. "Connecting Dreams with Visual Brainstorming Instruction" (2025) Visual Intelligence

Questions

Please address the questions posed in the weaknesses section.

Limitations

The authors discuss some key limitations in Section 5.5

Final Justification

The rebuttal addresses the majority of my concerns. While some design choices in the paper could be improved upon, I believe it makes a valuable contribution to the community and vote for acceptance.

Formatting Issues

None

Author Response

Thank you for your very detailed comments and suggestions. However, due to the 10,000-word limit, we had to heavily compress the reply and omit some content.

A1

On CS3 and DGF design choices. We agree that clearer motivation is needed. CS3 captures multi-scale temporal and structural patterns in neural signals, consistent with findings that multi-band EEG features improve intent decoding [1]. DGF performs selective multimodal fusion through dual gating, which follows prior successes of gating and normalization strategies in multimodal learning [2,3,4]. Evidence:

  1. In a 35-class editing-type classification pre-study, CS3 plus DGF achieved the best F1 (0.285) and mAP (0.411).
  2. Replacing an MLP encoder with CS3 alone raised F1 from 24.9% to 28.1%.
  3. DGF beat simple concatenation by 1.2 mAP points and cross attention by 2.2 mAP points, which is important under low-SNR neural signals.
  4. In downstream editing, adding CS3 plus DGF improved CLIP-I to 63.19% while keeping CLIP-T comparable to a simple MLP-plus-concatenation baseline. We will integrate the justification, citations, and ablations into the revised paper.
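For intuition, the sketch below shows one generic way a dual-gated fusion of two modality embeddings could be written in PyTorch. This is purely illustrative: the class name, dimensions, and gating layout are assumptions for exposition and do not reproduce the paper's actual DGF implementation.

```python
import torch
import torch.nn as nn

class DualGatedFusion(nn.Module):
    """Illustrative dual-gated fusion of two modality embeddings (hypothetical sketch,
    not the paper's DGF): each gate decides how much of its modality to pass through."""
    def __init__(self, dim):
        super().__init__()
        self.gate_a = nn.Sequential(nn.Linear(2 * dim, dim), nn.Sigmoid())
        self.gate_b = nn.Sequential(nn.Linear(2 * dim, dim), nn.Sigmoid())
        self.proj = nn.Linear(dim, dim)

    def forward(self, feat_a, feat_b):
        joint = torch.cat([feat_a, feat_b], dim=-1)   # joint view of both modalities
        g_a = self.gate_a(joint)                      # gate for modality A (e.g., EEG/PPG)
        g_b = self.gate_b(joint)                      # gate for modality B (e.g., fNIRS/motion)
        return self.proj(g_a * feat_a + g_b * feat_b)

# Example: fuse two 768-dim embeddings for a batch of 4.
fused = DualGatedFusion(768)(torch.randn(4, 768), torch.randn(4, 768))
```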

[1] Altaheri, H., Muhammad, G., & Alsulaiman, M. (2022). Physics-informed attention temporal convolutional network for EEG-based motor imagery classification. IEEE transactions on industrial informatics, 19(2), 2249-2258.

[2] Choi, Y., Uh, Y., Yoo, J., & Ha, J. W. (2020). Stargan v2: Diverse image synthesis for multiple domains. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 8188-8197).

[3] Yin, D., Ren, X., Luo, C., Wang, Y., Xiong, Z., & Zeng, W. Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph. In International Conference on Learning Representations.

[4] Rahman, W., Hasan, M. K., Lee, S., Zadeh, A., Mao, C., Morency, L. P., & Hoque, E. (2020, July). Integrating multimodal information in large pretrained transformers. In Proceedings of the conference. Association for computational linguistics. Meeting (Vol. 2020, p. 2359).


A2

On pairing EEG plus PPG with T5, and fNIRS plus Motion with CLIP. The rationale is not arbitrary. T5 provides fine-grained, token-level semantics that help precise instruction following, which complements the fast neural dynamics in EEG and the lightweight hemodynamics from PPG. CLIP provides robust global semantics that align with slower, cortex-wide fNIRS signals and intentional head motion.

Evidence from the same classification pre-study:

  1. EEG plus PPG with T5 outperformed the CLIP variant by 3.0 F1 points and 3.6 mAP points.
  2. fNIRS plus Motion with CLIP outperformed the T5 variant by 5.5 F1 points and 5.2 mAP points.
  3. In editing, the aligned pairing yielded a higher CLIP-T (25.88%) than a heterogeneous swap (24.96%). We will add this rationale and the supporting numbers.

A3

On the DiT backbone and conditioning. Backbone is FLUX.1 dev, a pretrained diffusion transformer with about 860 million parameters at 512 by 512 resolution. We integrate CS3 encoders and DGF to produce conditioning vectors that enter the DiT through cross attention, analogous to text prompts. Editing uses latent inversion of the input image and then applies neural or text conditioning during denoising. We will provide exact architecture blocks, parameter counts, and a conditioning diagram.


A4

On the missing citation for neural-text pretraining. We will add citations and clarify a two-phase modular pretraining. Phase one pretrains CS3 encoders per modality using public datasets such as Thinking Out Loud for EEG [1] and an open fNIRS resource [2], combined with our internal corpora that are disjoint from evaluation. Phase two aligns paired encoders such as EEG plus PPG and fNIRS plus Motion to instruction embeddings with a symmetric NT-Xent objective. We will list dataset names, sizes, and disjoint splits in the appendix.

[1] Nieto, N., Peterson, V., Rufiner, H. L., Kamienkowski, J. E., & Spies, R. (2022). Thinking out loud, an open-access EEG-based BCI dataset for inner speech recognition. Scientific Data, 9(1), 52.

[2] Ning, M., Duwadi, S., Yücel, M. A., Von Lühmann, A., Boas, D. A., & Sen, K. (2024). fNIRS dataset during complex scene analysis. Frontiers in Human Neuroscience, 18, 1329086.
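For clarity, here is a minimal sketch of a symmetric NT-Xent (InfoNCE-style) alignment loss of the kind named in A4 above, written in PyTorch. The function name, batch layout, and temperature value are assumptions for illustration, not the exact objective used in the paper.

```python
import torch
import torch.nn.functional as F

def symmetric_nt_xent(neural_emb, text_emb, temperature=0.07):
    """Symmetric NT-Xent: matched (neural, instruction) pairs sit on the diagonal
    of the similarity matrix and are pulled together in both directions."""
    n = F.normalize(neural_emb, dim=-1)
    t = F.normalize(text_emb, dim=-1)
    logits = n @ t.T / temperature                      # (B, B) cosine similarities
    labels = torch.arange(n.size(0), device=n.device)   # positives on the diagonal
    return 0.5 * (F.cross_entropy(logits, labels) + F.cross_entropy(logits.T, labels))

# Example: align a batch of 8 neural embeddings with their instruction embeddings.
loss = symmetric_nt_xent(torch.randn(8, 512), torch.randn(8, 512))
```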


A5

On pretraining and fine-tuning details and stability. Pretraining used about 12k steps across 2 epochs with LoRA-based adaptation. Fine-tuning used about 8k steps with a conservative schedule and optional layer freezing. Early stopping triggered at about 19k total steps. The loss dropped quickly in the first 2k steps, showed two small transients at about steps 3,001 and 4,001, then stabilized without divergence. Full configs and loss curves will be added to the appendix.


A6

On flow versus diffusion objective and CFG details. We clarify that Eq. 10 describes a diffusion objective, not a flow objective. The model uses the DDPM loss adapted to a DiT backbone. Inference uses classifier free guidance with scale w = 3.5. Null conditioning is applied by zeroing modality embeddings: text only uses an empty string, neural only passes zero vectors from the CS3 encoder, and multimodal zeros both branches. This design remains fully compatible with standard CFG pipelines. We will update the text and include concise schematics of the conditioning path for reproducibility.
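To make the described guidance concrete, the sketch below shows the standard classifier-free guidance combination with the quoted scale w = 3.5 and zeroed null embeddings. The callable `model(x_t, t, embed)` and the variable names are hypothetical placeholders, not the authors' actual interface.

```python
import torch

def cfg_prediction(model, x_t, t, cond_embed, w=3.5):
    """Standard classifier-free guidance: run the network with and without the
    condition and extrapolate by the guidance scale w (3.5 as quoted above).
    Null conditioning is a zero tensor shaped like the modality embedding."""
    null_embed = torch.zeros_like(cond_embed)        # zeroed neural/text embedding
    pred_uncond = model(x_t, t, null_embed)          # unconditional prediction
    pred_cond = model(x_t, t, cond_embed)            # conditional prediction
    return pred_uncond + w * (pred_cond - pred_uncond)
```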


A7

On Fig. 6 readability. We will revise the layout to show category groups and sub-types more clearly, add labels, and if needed split the figure into two subfigures. A short "how to read" caption will accompany the figure.


A8

On metric choice and the addition of LPIPS. We acknowledge that L1 and L2 do not capture perceptual similarity perfectly, yet they remain common in the editing literature for spatial fidelity, for example in Emu Edit and In-Context Edit. We now also report LPIPS for perceptual quality: LoongX (Neural Signals) LPIPS 0.1427; LoongX (Signals + Speech) LPIPS 0.1330. The updated table and confidence intervals will be included with references to LPIPS and recent editing works.
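As a pointer for reproducing the metric, the snippet below computes LPIPS with the public `lpips` package; the placeholder tensors stand in for edited and ground-truth images and are not part of the paper's pipeline.

```python
import torch
import lpips  # pip install lpips

# LPIPS expects float images in [-1, 1] with shape (N, 3, H, W).
loss_fn = lpips.LPIPS(net='alex')
edited = torch.rand(1, 3, 512, 512) * 2 - 1   # placeholder edited image
target = torch.rand(1, 3, 512, 512) * 2 - 1   # placeholder ground-truth image
print(loss_fn(edited, target).item())          # lower = more perceptually similar
```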


A9

On reporting trade-off curves rather than single numbers. We added an interpolation analysis following Imagic [1] and inspired by InstructPix2Pix. Let $e_{\mathrm{ori}}$ be the reconstruction anchor and $e_{\mathrm{tgt}}$ the instruction anchor extracted from neural plus speech signals. The conditioning is $\bar{e} = \eta\, e_{\mathrm{tgt}} + (1-\eta)\, e_{\mathrm{ori}}$ with $\eta \in [0, 1]$. Over 150 held-out samples, $\eta \approx 0.75$ gives the best alignment-fidelity balance. We will include CLIP-T, CLIP-I, and LPIPS curves and the summary table in the supplement.

| η Value | CLIP-Text Similarity (↑) | CLIP-Image Similarity (↑) | LPIPS (↓) |
| --- | --- | --- | --- |
| 0.00 | 0.2306 | 0.6819 | 0.2056 |
| 0.25 | 0.2481 | 0.6530 | 0.2082 |
| 0.50 | 0.2523 | 0.6450 | 0.2111 |
| 0.75 | 0.2565 | 0.6380 | 0.2230 |
| 1.00 | 0.2588 | 0.6374 | 0.2383 |

[1] Kawar, B., Zada, S., Lang, O., Tov, O., Chang, H., Dekel, T., ... & Irani, M. (2023). Imagic: Text-based real image editing with diffusion models. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 6007-6017).
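A minimal sketch of the interpolation sweep behind the table above, assuming generic embedding tensors; the anchor values and dimensions are placeholders, not the actual anchors used in the study.

```python
import torch

def interpolate_condition(e_ori, e_tgt, eta):
    """A9's conditioning interpolation: e_bar = eta * e_tgt + (1 - eta) * e_ori."""
    return eta * e_tgt + (1 - eta) * e_ori

e_ori, e_tgt = torch.randn(768), torch.randn(768)   # placeholder anchors
for eta in (0.0, 0.25, 0.5, 0.75, 1.0):             # sweep used for the trade-off curve
    e_bar = interpolate_condition(e_ori, e_tgt, eta)
```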


A10

On what DINO measures. DINO features capture fine-grained structural similarity that correlates with perceived preservation of identity, pose, and local geometry. This complements CLIP-I, which emphasizes global semantics. In our ablations, higher DINO aligned better with perceived success on shape- and motion-sensitive edits. We will add a brief definition and a pointer to recent evidence on DINO-guided structure preservation.
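For reference, a DINO similarity of this kind can be computed as the cosine similarity between DINO ViT features of the edited and ground-truth images. The sketch below uses the public facebookresearch/dino torch.hub entry with placeholder tensors; whether this matches the paper's exact preprocessing is an assumption.

```python
import torch
import torch.nn.functional as F

# Load a public DINO ViT-S/16 backbone from torch.hub (requires internet access).
model = torch.hub.load('facebookresearch/dino:main', 'dino_vits16')
model.eval()

with torch.no_grad():
    # Placeholder images; real use would apply ImageNet resizing/normalization first.
    f_edit = model(torch.rand(1, 3, 224, 224))
    f_gt = model(torch.rand(1, 3, 224, 224))
score = F.cosine_similarity(f_edit, f_gt).item()   # higher = better structure preservation
```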


A11

On OmniControl as the only text baseline and the fairness of comparisons. We chose OmniControl for its adaptable multi-conditional design and favorable speed-quality trade-off. To strengthen fairness, we now include ICEdit, Bagel, and Flux-Kontext on the same data and also report LoongX integrated with these backbones. As shown in the A2/A3 table of our response to Reviewer ZTwA, LoongX on Flux-Kontext with neural signals plus speech reaches L1 0.1905, L2 0.0804, CLIP-I 0.8078, CLIP-T 0.273. We also report model sizes and training FLOPs. OmniControl-based LoongX uses about 12.5B parameters and about 3.04 × 10^9 training FLOPs. Bagel-based LoongX uses about 7.8B. Flux-Kontext is about 12.6B. Gains come mainly from multimodal conditioning rather than brute-force scaling. Full tables and configs will be added.


A12

On adding more textual editing baselines. We agree. We have added recent state of the art text editing methods and have also provided LoongX results when integrated into these SOTA backbones for a fair comparison.


A13

On demonstrating hands-free image editing in practice. Our live protocol mirrors data collection. The main differences are buffering latency and attention variability, which we mitigate using temporal smoothing of neural embeddings and a lightweight confidence filter that drops weak signals. A real-time demo is under finalization and will be released with the camera-ready version. In cross-subject tests with 5 new users, live success reached about 92% of intra-subject performance, which indicates strong transfer.
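A minimal sketch of what the described temporal smoothing plus confidence filtering could look like; the EMA coefficient and norm threshold are hypothetical values, not the authors' settings.

```python
import torch

def smooth_and_filter(embeddings, alpha=0.8, min_norm=0.1):
    """Exponential moving average over per-window neural embeddings, followed by a
    simple confidence filter that drops low-magnitude (weak) signals."""
    kept, ema = [], None
    for e in embeddings:
        ema = e if ema is None else alpha * ema + (1 - alpha) * e
        if ema.norm() >= min_norm:       # keep only confident, smoothed embeddings
            kept.append(ema)
    return kept

windows = [torch.randn(512) for _ in range(10)]   # placeholder streaming embeddings
stable = smooth_and_filter(windows)
```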


A14

On missing related work DreamConnect. We will add DreamConnect to related work and clarify differences in sensing. DreamConnect uses fMRI, which is less accessible for everyday interaction. Our focus is on practical EEG and fNIRS. The conceptual link on neurosemantic alignment is valuable, and we plan to explore similar alignment analyses in future work.

Comment

Dear Reviewer,

Thank you for your detailed and constructive feedback. For your convenience, we summarize our key responses below:

  1. Architectural justification: We clarified the rationale for using CS3 and DGF modules, referencing both prior literature and our comprehensive ablation studies. Results show CS3+DGF outperforms baseline and alternative approaches in editing-type classification, supporting our design choices.
  2. Modality pairing (EEG/PPG–T5, fNIRS/Motion–CLIP): These pairings are guided by theoretical reasoning and preliminary classification results, which demonstrate that our matching strategy yields better discriminability and downstream editing performance than alternatives.
  3. Model and conditioning details: The editing model is based on the pretrained FLUX.1-dev DiT backbone (~12B params, 512×512), conditioned via our proposed CS3 and DGF modules. Input image editing uses DDPM inversion for latent alignment.
  4. Pretraining citations: We have added all missing references for pretraining datasets and clarified our two-phase (public and proprietary) neural-text pretraining strategy.
  5. Training details and stability: The model was trained for ~22k steps over ~2 epochs with LoRA. Training was numerically stable, with loss curves and config details to be included in the appendix.
  6. CFG/conditioning setup: We use standard diffusion objectives with classifier-free guidance (CFG). Null-conditioning is implemented by zeroing embeddings.
  7. Visualization and figure clarity: We will revise confusing figures and improve layout/readability in the final version.
  8. Evaluation metrics: While L1/L2 remain standard for editing literature, we also report LPIPS, CLIP-I, CLIP-T, and DINO (for fine-grained feature similarity), ensuring a comprehensive evaluation.
  9. Trade-off analysis: Following Imagic/InstructPix2Pix, we report trade-off curves between fidelity and semantic alignment, varying conditioning strength.
  10. Baselines and fair comparison: We expanded comparisons to include recent state-of-the-art text-based methods (ICEdit, Bagel, Flux-Kontext) and reported model sizes/FLOPs for fair benchmarking.
  11. Hands-free usability: Our pipeline closely matches practical usage. Additional cross-subject results are provided and a real-time demo is being developed.
  12. Missing related work: DreamConnect and other latest works are now discussed and cited in the revised related work section.

We hope this summary addresses your main concerns and provides clarity on our key improvements.

Looking forward to your feedback!

Comment

Thank you for the detailed response which addresses most of my concerns. However, I have 3 clarifying questions:

Backbone is FLUX.1 dev, a pretrained diffusion transformer with about 860 million parameters

I presume this is a typo? The public Flux.1[dev] checkpoint is roughly 12B parameters. Or did you use a heavily pruned version?

We clarify that Eq. 10 describes a diffusion objective, not a flow objective. The model uses the DDPM loss adapted to a DiT backbone

That would mean you are completely changing the training objective from the pre-trained checkpoint? Since Flux.1 is trained on a rectified flow objective and not DDPM you would throw away the majority of the pre-training benefit by not keeping a rectified flow objective

Input image editing uses DDIM inversion for latent alignment.

DDIM inversion seems like an objectively bad choice given significant reconstruction errors and the high number of steps required. Why not use a perfect reconstruction technique like edit-friendly DDPM or LEDITS++ (for standard DMs) or RFInversion for flow models? Otherwise, basing the model on Flux.Kontext instead, which provides an image editing framework through VAE input, would be a suitable choice as well.

Comment

Dear Reviewer,

We are very grateful for your continued engagement and apologize once again for any confusion caused. Please allow us to address each question in detail.

1. Model Size Typo

We sincerely apologize for this typo caused by careless content compression. Flux.1-Dev is indeed the full ∼12 B-parameter checkpoint, not 860 M. We fine-tune the unpruned 12 B model with LoRA. Please refer to the complete experiment table in our A2/A3 reply to Reviewer ZTwA (R2) for the accurate information.

2. Training Objective

We freeze the pre-trained Flux.1-dev weights and train only the LoRA adapters using the DDPM loss. Because the backbone parameters remain fixed, the original flow-based pre-training benefits are fully preserved.

In our trial experiments, switching the LoRA adapters to a flow-matching objective did not yield the significant performance gains typically observed in flow-matching-based pre-training or full-parameter fine-tuning, whereas DDPM enabled more stable and efficient fine-tuning.

3. Image Inversion Strategy

We must apologize again for this inversion-method typo. Our reference to "DDIM inversion" was a typo; we intended to refer to "DDPM inversion." We are extremely grateful for your suggestions regarding edit-friendly DDPM, LEDITS++, and RFInversion, and we plan to conduct empirical comparisons with these techniques in our next iteration. We have already corrected the mistakes in the brief summary (caused by content compression) and have thoroughly reviewed the entire summary to ensure no other typos remain.

Thank you again for your invaluable feedback. We welcome any further questions or suggestions and will be happy to clarify them.

Sincerely,
The Authors

Comment

Thanks for clearing these questions up. I'm still not sure what "DDPM Inversion" is supposed to be since SDE sampling cannot be trivially inverted. Hence the need for methods like edit-friendly DDPM, LEDITS++ and RFInversion in the first place.

I'd urge the authors to provide a detailed description of the inversion methodology in the final paper.

Comment

Dear Reviewer,

We defined inversion as a trajectory that transports a clean sample $x_0 \sim p_0(x)$ to a noisy latent $x_t \sim p_t(x)$. Within the DDPM framework, the forward process is described as:

$$x_t = \sqrt{\bar\alpha_t}\, x_0 + \sqrt{1-\bar\alpha_t}\,\varepsilon, \qquad \bar\alpha_t = \prod_{i=1}^{t}\alpha_i, \quad \alpha_i = 1-\beta_i, \quad \varepsilon \sim \mathcal{N}(0, I).$$

First, we formulate a pure stochastic SDE that follows the forward diffusion to gradually add noise, and then run the time-reversed SDE to retrieve an editable reconstruction, similar to the philosophy of SDEdit [1].

Second, a probability-flow ODE treats diffusion via a score-based velocity field, replacing the random noise with a deterministic velocity field $v(x_\tau)$ proportional to the score $\nabla_{x_\tau}\log p_\tau(x_\tau)$:

$$x_0 = x_t - \int_0^t v(x_\tau)\,\mathrm{d}\tau, \qquad x_t = x_0 + \int_0^t v(x_\tau)\,\mathrm{d}\tau = x_0 - \int_t^0 v(x_\tau)\,\mathrm{d}\tau.$$

A continuum between these two extremes is obtained by interpolating the stochastic and deterministic contributions with a parameter $\eta \in [0, 1]$:

$$x_t = \sqrt{\bar\alpha_t}\, x_0 + \sqrt{1-\bar\alpha_t}\,\bigl[\eta\,\varepsilon + (1-\eta)\, u_t\bigr], \qquad u_t = \int_0^t \frac{c_\tau\, v(x_\tau, \tau)}{\sqrt{1-\bar\alpha_\tau}}\,\mathrm{d}\tau,$$

where $\varepsilon \sim \mathcal{N}(0, I)$ and $c_\tau$ is a schedule-dependent factor that aligns the units of the velocity term with standard DDPM dynamics. Choosing $\eta = 0$ recovers the deterministic ODE path, whereas $\eta = 1$ yields the fully stochastic SDE path, and intermediate values trade deterministic guidance for stochasticity.

Our flow-aware inversion belongs to the deterministic end. As Flux.1-dev predicts rectified-flow velocity rather than a DDPM score, we insert a lightweight rank-128 LoRA adapter $W$ that maps the frozen backbone's predicted velocity $\boldsymbol{\epsilon}_\phi(x_\tau, \tau)$ into the DDPM score domain through:

$$v(x_\tau) = \sigma_\tau\, W\bigl(\boldsymbol{\epsilon}_\phi(x_\tau, \tau)\bigr).$$

The time-dependent coefficient $\sigma_\tau$ helps bridge the rectified-flow velocity and the DDPM score scale, while the linear bridge preserves the benefits of flow pre-training and enables faithful one-to-one reconstructions, in a similar spirit to edit-friendly DDPM or LEDITS++.
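For readers who prefer code to notation, here is a minimal sketch of the deterministic ($\eta = 0$) end of the inversion described above: forward Euler integration of a velocity field from a clean latent toward noise. The `velocity_fn` callable, step count, and time grid are assumptions for illustration and stand in for the mapped velocity $\sigma_\tau W(\boldsymbol{\epsilon}_\phi(x_\tau, \tau))$.

```python
import torch

@torch.no_grad()
def invert_deterministic(x0, velocity_fn, num_steps=50):
    """Euler integration of the probability-flow ODE from a clean latent x0 to a
    noisy latent x_T; editing then reruns the reverse path with new conditioning."""
    x = x0.clone()
    ts = torch.linspace(0.0, 1.0, num_steps + 1)
    for i in range(num_steps):
        dt = ts[i + 1] - ts[i]
        x = x + velocity_fn(x, ts[i]) * dt   # forward Euler step toward noise
    return x
```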

Thank you so much for your advice! Our camera-ready manuscript will include a dedicated subsection on the flow-aware DDPM inversion we use, the closed-form derivation of the schedule re-calibration, and a comparison with the mentioned methods. We hope this addresses the remaining concerns. Please let us know if further details would be helpful.

Sincerely,

The Authors

[1] Meng, C., He, Y., Song, Y., Song, J., Wu, J., Zhu, J. Y., and Ermon, S. SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations. In International Conference on Learning Representations.

Review
Rating: 4

This paper proposes an image editing method based on multimodal neural signals, LoongX. The method breaks the dependence on language cues and physical interaction, allowing image editing through neurophysiological signals such as EEG, fNIRS, and PPG. The authors also propose a large-scale dataset, L-Mind, which contains 23,928 image editing samples paired with collected neural signals. Two key modules, CS3 and DGF, are combined with a diffusion transformer (DiT) to decode editing intent and generate images. Experiments show that LoongX can match and even surpass text-based editing methods on multiple metrics, especially when combined with speech input.

Strengths and Weaknesses

Strengths

  1. This work is the first to integrate EEG, fNIRS, PPG, and motion signals for image editing, introducing a novel and promising direction of neural-driven editing.

  2. The authors present a new large-scale multimodal dataset and commit to releasing both data and code, which will benefit future research and advance the field.

  3. The proposed LoongX framework is well-designed, combining the CS3 encoder and DGF fusion module to efficiently extract and integrate multimodal neural features.

  4. The experiments are comprehensive, including ablation studies, comparative baselines, and evaluations across multiple editing types, demonstrating the method’s effectiveness and robustness.

Weaknesses

  1. The method relies on strong assumptions about the correspondence between neural signals and editing intent, without sufficient validation. During data collection, participants merely read editing instructions aloud, assuming they simultaneously imagine the edits. This unverified reliance on internal mental imagery raises concerns about whether the captured signals truly reflect editing semantics. Moreover, EEG and fNIRS signals may be dominated by language processing and speech-related activity, potentially confounding the neural representation of editing intent. No control experiments (e.g., reading unrelated text or silent imagination) are included to isolate editing-specific neural patterns.

  2. The claim that neural signals outperform text prompts in intuitive visual tasks (e.g., background editing) is contradicted by qualitative results. In Figure 7 (row 1), the Neural and Neural+Speech methods fail to execute the core instruction “place the cat above it,” while the Text-based method achieves the intended spatial manipulation. This inconsistency suggests that neural signals may struggle with structural understanding.

  3. The approach depends on specialized hardware (e.g., fNIRS, PPG, EEG), limiting its practicality and scalability in real-world settings.

  4. The dataset includes only 12 participants, limiting generalizability. Broader evaluation across diverse populations (e.g., users with disabilities, different ages, or cultural backgrounds) is necessary to assess robustness and applicability.

Questions

  1. I am not an expert in neural signal processing, so I am not sure whether relying solely on participants reading the editing instructions aloud is sufficient to ensure that the collected signals are truly related to the editing intent. Please clarify how to ensure that the collected neural signals represent editing intent rather than language comprehension or speech production.

  2. In Figure 7 (row 1), the neural-based methods do not fulfill the instruction's spatial constraint (“place the cat above it”). Please address this inconsistency with the claim that neural signals are superior for background editing.

  3. Discuss the practicality of your approach given the reliance on EEG, fNIRS, and PPG hardware.

Limitations

yes

Final Justification

The authors’ response has addressed most of my concerns. I have decided to retain my original score.

Formatting Issues

NA

Author Response

Q1 & W1

The method relies on strong assumptions about the correspondence between neural signals and editing intent, without sufficient validation. During data collection, participants merely read editing instructions aloud, assuming they simultaneously imagine the edits. This unverified reliance on internal mental imagery raises concerns about whether the captured signals truly reflect editing semantics. Moreover, EEG and fNIRS signals may be dominated by language processing and speech-related activity, potentially confounding the neural representation of editing intent. No control experiments (e.g., reading unrelated text or silent imagination) are included to isolate editing-specific neural patterns.

A1

To ensure that the recorded neural signals truly reflected editing-specific mental states rather than mere language processing, we provided explicit instructions and guided training sessions to familiarize participants with the process of vivid mental imagery. Specifically, participants were encouraged not only to read the prompts aloud but to actively and vividly simulate the visual editing actions in their mind's eye. This imaginative engagement was reinforced through a brief practice session prior to recording, helping establish a consistent attention and cognitive strategy across subjects and ensuring that the captured brain activity corresponded to deliberate editing intentions.

Moreover, as presented in Table 4 in Section A Technical Appendix, we tracked participants’ attention levels using alpha-to-theta EEG ratios, which is an established neural index of attentional engagement, and found that most participants maintained moderate to high attention levels throughout the task. These attention scores serve as an additional quality control signal, indirectly validating that participants were cognitively invested in the task, beyond passive reading.
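As an illustration of how such an attention index can be computed, the sketch below estimates the alpha/theta band-power ratio from a raw EEG channel with SciPy's Welch estimator; the sampling rate and band edges are typical values, not necessarily the exact settings used in the appendix.

```python
import numpy as np
from scipy.signal import welch

def alpha_theta_ratio(eeg, fs=250):
    """Alpha (8-13 Hz) to theta (4-8 Hz) band-power ratio as a simple attention index."""
    freqs, psd = welch(eeg, fs=fs, nperseg=fs * 2)
    alpha = psd[(freqs >= 8) & (freqs < 13)].sum()
    theta = psd[(freqs >= 4) & (freqs < 8)].sum()
    return alpha / theta

print(alpha_theta_ratio(np.random.randn(250 * 10)))   # placeholder 10 s EEG segment
```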

Additionally, we performed an unseen silent EEG experiment with new participants (N=2). The results show that silent EEG from the non-imagined, read-only condition had significantly lower performance (0.42 CLIP-I) than the imagined condition (0.63 CLIP-I), supporting the correspondence between neural signals and editing intent.

Q2 & W2

The claim that neural signals outperform text prompts in intuitive visual tasks (e.g., background editing) is contradicted by qualitative results. In Figure 7 (row 1), the Neural and Neural+Speech methods fail to execute the core instruction “place the cat above it,” while the Text-based method achieves the intended spatial manipulation. This inconsistency suggests that neural signals may struggle with structural understanding.

A2

Regarding the observation in Figure 7 (row 1): while the neural and neural+speech methods performed better than the text-based method in several tasks, the text-based method did better in spatial manipulation tasks such as "place the cat above it." However, our method excels at object manipulations such as making the cat in the figure look down, which the text-based method fails to achieve.

Our neural signal-based method can indeed surpass the text-based method in several scenarios, e.g., the object editing task showed significant improvement with CLIP-I scores of 0.6605 compared to 0.6350 for the text-based method.

We will further clarify in the revised manuscript that the neural signals are better suited for low-level visual edits and that neural signal-based and text-based methods can offer complementary strengths, not necessarily one being superior across all tasks.

Q3 & W3

The approach depends on specialized hardware (e.g., fNIRS, PPG, EEG), limiting its practicality and scalability in real-world settings.

A3

While our current approach utilizes specialized equipment such as fNIRS, PPG, and EEG, we strongly believe that the rapid evolution of wearable brain-computer interface (BCI) technologies will soon make such devices as ubiquitous and user-friendly as today’s wearable smart watches. In the near future, lightweight and affordable neural headbands are likely to become part of everyday life, enabling not only health monitoring but also seamless interaction with digital systems. In this context, basic applications like neural-driven image editing will no longer be constrained by hardware limitations. Enhancing accessibility has always been a central motivation of our research, and we envision our system evolving hand-in-hand with advances in wearable BCI to deliver practical, real-world value.

Q4

The dataset includes only 12 participants, limiting generalizability. Broader evaluation across diverse populations (e.g., users with disabilities, different ages, or cultural backgrounds) is necessary to assess robustness and applicability.

A4

We appreciate the reviewer’s concern regarding generalizability. While our current dataset consists of 12 primary participants, this sample size is comparable to or even larger than several published works in neural signal-driven generation. For instance, EEG2Video [Li et al., 2023] utilized data from only 6 participants, and DreamDiffusion [Liu et al., 2024] involved 10 participants, yet both demonstrated compelling results.

To further assess robustness, we conducted cross-subject experiments involving 7 new participants who were not included in model training. Notably, this group includes individuals from diverse age ranges (a middle school student and an elderly participant aged 67), as well as one participant with a physical disability (missing fingers on the dominant hand). Despite these variations, our model achieved 60.49% CLIP-I and 44.47% DINO on these unseen subjects, close to its intra-subject performance, demonstrating encouraging generalization across populations. More detailed information about all test data can be found in our response to Reviewer 1, Question 1.

Table: Subject-Level Metrics – Unseen without Speech (Silence Experiment)

| Test Data Type | Subject | Gender | Age | Samples | Attention | L1 | L2 | CLIP-I | DINO | CLIP-T |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Unseen without Speech (Silence experiment) | Subject 18 | Male | 23 | 200 | 0.0316 | 0.2432 | 0.7740 | 0.5156 | 0.3606 | 0.2159 |
| Unseen without Speech (Silence experiment) | Subject 19 | Female | 25 | 200 | 0.0872 | 0.2293 | 0.0775 | 0.5238 | 0.4830 | 0.2112 |

Table: Evaluation Metrics – Silence Subjects

| Participants | Method | Conditioning | L1 ↓ | L2 ↓ | CLIP-I ↑ | DINO ↑ | CLIP-T ↑ |
| --- | --- | --- | --- | --- | --- | --- | --- |
| 2 Silence Subjects | OmniControl | Text-only | 0.2417 | 0.1021 | 0.6551 | 0.4650 | 0.2542 |
| 2 Silence Subjects | LoongX (OmniControl) | Neural Signals | 0.2362 | 0.0837 | 0.5197 | 0.3218 | 0.2135 |
Comment

Thank you to the authors for their response. They have addressed most of my concerns. I will consider my score based on the significance of the weaknesses raised by the other reviewers.

Comment

Dear Reviewer,

We are deeply grateful for your thoughtful review and constructive feedback. Our sincere hope is that, by addressing your concerns along with those from other reviewers, we can clarify our contributions and present a stronger case for the significance of this work. If you have any further questions or feedback, please feel free to let us know and we are more than happy to discuss and resolve them. Thank you!

We especially appreciate your recognition of the novelty and promise of neural-driven image editing. At the same time, we acknowledge the validity of your concerns regarding data collection assumptions, hardware practicality, and generalizability. In our revision, we have:

  1. Added silent-control experiments and attention tracking to validate that neural signals reflect editing intent beyond language processing.
  2. Expanded cross-subject evaluations with diverse participants to demonstrate generalization.
  3. Included fairer comparisons with strong text-based backbones (Flux-Kontext, Bagel, ICEdit), showing that LoongX integration yields consistent gains.
  4. Discussed the practical trajectory of wearable BCI hardware, highlighting how our design can evolve with these technologies.

We hope these revisions resolve your concerns while also addressing those raised by other reviewers. We look forward to deeper discussions with the community as we continue refining this framework, one of the earliest for neural-driven visual editing.


Brief Summary of Additional Responses to Other Reviewers

  • Generalization & Dataset Protocol: Expanded dataset protocol details and conducted cross-subject experiments on 5 new participants, confirming strong generalization to unseen users. All participants provided informed consent under approved ethics protocols.
  • Architectural & Methodological Justification: Provided empirical and theoretical justification for core modules (CS3, DGF), supported by extensive ablation and classification experiments. Explained principled modality pairings (EEG/PPG–T5, fNIRS/Motion–CLIP) and design rationale.
  • Model Details & Scalability: Clarified use of FLUX.1-dev pretrained DiT backbone with LoRA finetuning and other detailed training protocols. Reported model size and FLOPs for fair comparison.
  • Comparative Evaluation: Benchmarked against state-of-the-art text-based editing baselines (e.g., Flux-Kontext, ICEdit, Bagel), with LoongX achieving competitive or superior results and complementing text-only paradigms.
  • Comprehensive Evaluation & Trade-offs: Expanded metrics to include LPIPS, DINO, CLIP-I, and CLIP-T, and provided trade-off curves (fidelity vs. semantic alignment) per reviewer suggestion. Also included detailed analysis of failure cases and editing task categories.
  • User Study & Usability: Conducted double-blind human evaluations confirming superior editability and content preservation. Demonstrated hands-free usability with cross-subject live tests and a forthcoming real-time demo.
  • Transparency & Related Work: Supplemented all missing citations, clarified pretraining data sources, and added recent related works.
Review
Rating: 4

The paper proposes the very first framework for image editing conditioned on neural brain signals.

Strengths and Weaknesses

Strength

  1. This is the very first work of its kind and overall a very interesting work.

Weakness

Although the diffusion model architecture and training setup are sound and plausible, there are major issues which must be addressed.

  1. Dataset bias problem. The authors collected a paired dataset using 12 subjects and set up training and test splits. Are the subjects of the training set and the test set independent? If the training and test sets share the same subjects, then this is an intra-subject experiment, which lacks generalizability. Please elaborate on the dataset collection protocol.

  2. Usage of speech signals. In the method, the conditional model takes speech signals along with the neural signals. Also, in the results, the output from neural signals only shows significantly lower performance compared to neural+speech. It seems that the speech signal plays the major role in editing. Please elaborate on this part and explain why the neural-signal-only setup does not work.

I will raise my score if these two points are properly addressed, as they are critical to the reliability and meaning of the overall proposed work.

Minor issue: During data collection, did the authors properly obtain approval from an ethics board?

Questions

See weakness

Limitations

No

Final Justification

The rebuttal addressed most of my concerns; therefore, I raise my score.

Formatting Issues

No

Author Response

Q1

Dataset bias problem. The authors collected a paired dataset using 12 subjects and set up training and test splits. Are the subjects of the training set and the test set independent? If the training and test sets share the same subjects, then this is an intra-subject experiment, which lacks generalizability. Please elaborate on the dataset collection protocol.

A1

1. Test data

We are deeply grateful for your insightful and highly constructive feedback, which has significantly enhanced the completeness and scientific rigor of our paper. Regarding the dataset bias issue, our original dataset was collected from 12 participants (6 female, 6 male, mean age 24.5 ± 2.5 years), each contributing around 2,000 paired samples under carefully controlled experimental conditions (see Appendix A for detailed protocol). While our initial split ensured training/test separation, we acknowledge that the possibility of subject overlap could limit generalizability. To address this, we performed additional cross-subject evaluations with 5 new participants (3 male, 2 female, ages 13–63). The results confirm that the model maintains strong generalization when applied to unseen individuals, with performance trends on CLIP-I, DINO, and CLIP-T remaining consistent with those from the original test set. This provides evidence that our approach is not overly reliant on subject-specific neural signatures, but instead captures transferable semantic representations.

Table 1. Performance comparison on the original test set with 12 subjects and the unseen test set with 5 new subjects.

| Test Dataset | Method | Conditioning | L1 ↓ | L2 ↓ | CLIP-I ↑ | DINO ↑ | CLIP-T ↑ |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Original Test Set | OmniControl | Text | 0.2632 | 0.1161 | 0.6558 | 0.4636 | 0.2549 |
| Original Test Set | OmniControl | Speech | 0.2714 | 0.1209 | 0.6146 | 0.3717 | 0.2501 |
| Original Test Set | LoongX (OmniControl) | Neural Signals | 0.2509 | 0.1029 | 0.6605 | 0.4812 | 0.2436 |
| Original Test Set | LoongX (OmniControl) | Signals + Speech | 0.2594 | 0.1080 | 0.6374 | 0.4205 | 0.2588 |
| Unseen Test Set | OmniControl | Text only | 0.2581 | 0.1133 | 0.6528 | 0.4655 | 0.2553 |
| Unseen Test Set | OmniControl | Speech | 0.2779 | 0.1271 | 0.6221 | 0.3942 | 0.2508 |
| Unseen Test Set | LoongX (OmniControl) | Neural Signals | 0.2574 | 0.1090 | 0.6019 | 0.4037 | 0.2403 |
| Unseen Test Set | LoongX (OmniControl) | Signals + Speech | 0.2668 | 0.1146 | 0.6049 | 0.4447 | 0.2568 |

The basic background information and sample size of the participants, as well as the specific individual performance based on LoongX (Signals+Speech), are as follows:

Table 2. Basic information and performance of all subjects using LoongX (Signals+Speech).

| Test Dataset | Subject | Gender | Age | Samples | Attention | L1 | L2 | CLIP-I | DINO | CLIP-T |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Original Test Set | Subject 1 | Female | 25 | 2003 | 0.0887 | 0.2657 | 0.1109 | 0.6370 | 0.4890 | 0.2196 |
| Original Test Set | Subject 2 | Female | 29 | 2000 | 0.0817 | 0.2416 | 0.0950 | 0.6575 | 0.5021 | 0.2249 |
| Original Test Set | Subject 3 | Female | 26 | 2001 | 0.1340 | 0.2448 | 0.0963 | 0.6660 | 0.4878 | 0.2337 |
| Original Test Set | Subject 4 | Female | 22 | 1999 | 0.0739 | 0.2533 | 0.1005 | 0.6394 | 0.4606 | 0.2270 |
| Original Test Set | Subject 5 | Female | 28 | 1992 | 0.1218 | 0.2552 | 0.1031 | 0.6144 | 0.4157 | 0.2260 |
| Original Test Set | Subject 6 | Female | 29 | 1964 | 0.0822 | 0.2511 | 0.1000 | 0.6449 | 0.4564 | 0.2213 |
| Original Test Set | Subject 7 | Male | 22 | 1988 | 0.0851 | 0.2711 | 0.1160 | 0.6515 | 0.4634 | 0.2234 |
| Original Test Set | Subject 8 | Male | 23 | 1993 | 0.1105 | 0.2528 | 0.1017 | 0.6638 | 0.4833 | 0.2242 |
| Original Test Set | Subject 9 | Male | 22 | 1988 | 0.1500 | 0.2497 | 0.0998 | 0.6355 | 0.4571 | 0.2212 |
| Original Test Set | Subject 10 | Male | 24 | 2000 | 0.1298 | 0.2657 | 0.1144 | 0.6194 | 0.4240 | 0.2220 |
| Original Test Set | Subject 11 | Male | 24 | 2000 | 0.0954 | 0.2744 | 0.1151 | 0.6386 | 0.4339 | 0.2299 |
| Original Test Set | Subject 12 | Male | 22 | 2000 | 0.0971 | 0.2551 | 0.1034 | 0.6213 | 0.4323 | 0.2250 |
| Unseen Test Set | Subject 13 | Male | 35 | 500 | 0.1210 | 0.2681 | 0.1174 | 0.6022 | 0.4418 | 0.2594 |
| Unseen Test Set | Subject 14 | Female | 30 | 500 | 0.0775 | 0.2688 | 0.1179 | 0.6051 | 0.4405 | 0.2553 |
| Unseen Test Set | Subject 15 | Male | 13 | 200 | 0.0727 | 0.2618 | 0.1001 | 0.6055 | 0.4472 | 0.2576 |
| Unseen Test Set | Subject 16 | Female | 62 | 100 | 0.0441 | 0.2660 | 0.1141 | 0.6196 | 0.4611 | 0.2472 |
| Unseen Test Set | Subject 17 | Male | 63 | 100 | 0.0520 | 0.2610 | 0.1133 | 0.6017 | 0.4588 | 0.2595 |

2. Dataset collection protocol

Dataset collection protocol details have been supplemented into Section C Supplementary Dataset Details in the supplementary PDF file (Main text 3.1; A.2).

Q2

Usage of speech signals. In the method, the conditional model takes speech signals along with the neural signals. Also, in the results, the output from neural signals only shows significantly lower performance compared to neural+speech. It seems that the speech signal plays the major role in editing. Please elaborate on this part and explain why the neural-signal-only setup does not work.

A2

We appreciate the reviewer’s attention to the role of speech signals in our method. In our framework, speech serves as a complementary modality rather than the dominant one, supplying explicit high-level linguistic cues often absent from neural signals alone. EEG, fNIRS, and PPG effectively capture intent with fine temporal resolution, but they encode semantics implicitly and intrinsically. Compared with these neural signals, speech supplies auxiliary semantic information for more abstract tasks or specific text editing tasks. For example, a short spoken instruction (e.g., “make the sky look like heaven”) provides a low-entropy prior that helps disambiguate editing intent, while neural signals control the global style and fine-grained visual effects. Importantly, our experiments show that neural signals alone yield robust and competitive performance, especially for low-level visual edits such as global style adjustments. For instance, the neural-only setup achieves a CLIP-I score of 0.6605, which is slightly higher than the 0.6374 observed when speech is included.

However, when it comes to tasks requiring more nuanced semantic comprehension, combining speech with neural signals leads to better alignment with the intended instruction, as indicated by an increase in the CLIP-T score from 0.2436 (neural-only) to 0.2588 (neural+speech), an absolute improvement of about 0.015. This indicates that each modality brings unique strengths: neural signals excel at pixel-level control and global editing, while speech boosts complex semantic alignment. Overall, the gap between the two settings is moderate rather than substantial, suggesting that neural signals are already effective but can be further enhanced by incorporating speech. We believe this underscores the complementary role of speech in our system, and we appreciate the opportunity to clarify this point.

Table 1: Automatic Evaluation of Editing Methods

| Method | Conditioning | L1 ↓ | L2 ↓ | LPIPS ↓ | CLIP-I ↑ | DINO ↑ | CLIP-T ↑ |
| --- | --- | --- | --- | --- | --- | --- | --- |
| OmniControl | Text | 0.2632 | 0.1161 | 0.7482 | 0.6558 | 0.4636 | 0.2549 |
| OmniControl | Speech | 0.2714 | 0.1209 | 0.7625 | 0.6146 | 0.3717 | 0.2501 |
| LoongX (OmniControl) | Neural Signals | 0.2509 | 0.1029 | 0.7227 | 0.6605 | 0.4812 | 0.2436 |
| LoongX (OmniControl) | Signals + Speech | 0.2594 | 0.1080 | 0.7245 | 0.6374 | 0.4205 | 0.2588 |

We appreciate the reviewer’s careful assessment and the opportunity to clarify these critical aspects. With these additional results and protocol details, we believe the work now provides a more complete and reliable account of both methodological soundness and empirical validity. We hope these clarifications help the reviewer better appreciate the strength and significance of the proposed work.

Minor Q3

During data collection, did the authors properly obtain approval from an ethics board?

A3

Yes, we have obtained ethical approval from the relevant institutional ethics review board before conducting the experiments, as stated in the manuscript and supplementary materials. All participants were informed of the procedure and gave their consent before participation. We will attach the specific institutional information to the camera-ready version.

Comment

Dear Reviewer,

We sincerely appreciate your thoughtful and constructive feedback. To save your time, we summarize our rebuttal as follows:

  1. Dataset generalizability: We clarified and supplemented our dataset collection protocol, and conducted additional cross-subject experiments with 5 new participants. Results confirm that our model maintains robust generalization to unseen individuals, with consistent performance trends across all key metrics.

  2. Role of speech signals: We provided a detailed analysis showing that speech acts as a complementary modality, supplying explicit semantic cues for more abstract editing tasks, while neural signals alone already achieve strong performance for global and low-level edits. Combining modalities yields moderate but meaningful improvements for complex instructions (e.g., CLIP-T: 0.2436 → 0.2588).

  3. Ethical approval: We confirm that our study received prior approval from the relevant institutional ethics review board, and all participants provided informed consent.

All background details, protocol supplements, and full results are included in the revision and supplementary material. We truly hope that these substantial revisions and the new evidence not only resolve your critical concerns, but also reflect our commitment to ensuring reliability and clarity for this first work in the field. We would be excited if these improvements could help you reconsider the overall evaluation.

Looking forward to hearing from you!

Comment

Thank you for the rebuttal. Most of my concerns have been addressed. I was a bit surprised by the additional experiments with more subjects. Please elaborate on the experiment settings and additional comparisons in the final version.

Comment

Thank you very much for confirming that our responses have addressed your concerns! In the final manuscript, we will include detailed participant demographics and procedures, signal-processing protocols, data splits, and statistical analyses. We will also provide new comparison tables and figures showing results for both the original and new subjects. We appreciate any further feedback you may have.

Review
Rating: 5

This paper introduces LoongX, which is a multimodal image editing approach that goes beyond the limits of text and image based editing. The authors acknowledge that the current way of instructing the models may be insufficient for certain types of edits, and offer a solution that is empowered with multiple signals from different modalities such as speech, EEG signals and head motion. The proposed model involves a diffusion transformer, which accepts multimodal signals instead of text only. As a part of the proposed method, LoongX proposes two modules to effectively encode different modalities, which are labeled as Cross-Scale State Space Module and Dynamic Gated Fusion Module. Given the quantitative evaluations with multi-modal editing methods such as OminiControl and qualitative editing results, LoongX is able to perform edits effectively while broadening the range of editing capabilities of diffusion transformers.

Strengths and Weaknesses

Strengths

  • The proposed method expands the multi-modal inputs used in editing from image-text pairing to multiple signals involving inputs such as EEG, head motion and speech. This is a significant effort towards achieving true multimodality, and in expanding the editing tasks available.
  • LoongX introduces two modules that are crucial in interpreting multi-modal inputs where general purpose encoders are not available (which is not the case for image and text representations).
  • Over the qualitative results, the proposed method shows that edits that cannot easily be expressed with text prompts are made possible, such as in Figure 10 (a). This is a significant expansion over existing editing methods.
  • Quantitative results show that the proposed approach is competitive with, and even surpasses, the baseline on certain metrics when compared with competing multi-modal approaches.

Weaknesses

  • The details of the baseline diffusion transformer are not clear; the authors should provide more details on the architecture and whether it is fine-tuned from an existing model or trained from scratch.
  • While acknowledging that the proposed method may fall short if compared with image editing methods trained on massive amounts of text and images, it would still be helpful to compare the proposed approach with text-based editing methods, in addition to multi-modal ones. It is acceptable that the proposed approach may not outperform them, but the gap should be clear for possible future work.
  • In addition to qualitative comparisons for the use of different modalities, comparisons with competing methods should also be presented.
  • The authors report $L_1$ and $L_2$ distances as evaluation metrics for content preservation. Given the presence of DINO- and CLIP-I-based evaluation, this seems redundant. The authors could make their evaluation more extensive by conducting a user study instead, providing perceptual evaluation this way (which can cover both editability and content preservation). Since image editing can be subjective, incorporating such a study would be valuable.
  • The details of the evaluations set, and the types of editing tasks are not precisely explained. The authors are encouraged to provide sufficient clarifications and discuss the failure cases of their approach.

Questions

  • Is the diffusion transformer trained from scratch, or is a pretrained network such as FLUX or SD3 used? In addition, what are the architectural details of the transformer? This would help in understanding the scalability of the method and its further potential.
  • Are there any limitations of the method in terms of the edits that are performed? What are the common failure cases? The authors are encouraged to discuss such limitations with examples (or evaluations), which would impact my judgement positively.
  • What are the details of the quantitative evaluation? Which types of edits are evaluated?

Limitations

The authors provided a short discussion on limitations, addressing that abstract concepts are still hard to edit. While acknowledging this, authors are encouraged to expand this discussion. As an example, in Figure 7 (a) it appears that the color information of the cat cannot be preserved. If there are such limitations in certain types of edits, the authors should be transparent in their discussion, by including comparisons with the baseline method (OminiControl).

Final Justification

The questions that I had in my preliminary justification has been addressed by the authors. Given the multimodal nature of the method and the supplementary results, I keep my rating as positive.

Formatting Issues

The supplementary is attached to the main paper document. While the paper obeys all of the stated formatting rules except this, it is up to the judgement of the area chairs and program chairs whether this paper violates the submission policy. Normally, the supplementary and the main paper should be separate, but they are uploaded together (probably to provide more visibility to the supplementary material).

Author Response

A1

Thank you for your insightful question. The Diffusion Transformer (DiT) used in our experiments is based on the Flux.1-Dev pretrained model, and fine-tuned on our dataset using LoRA (Low-Rank Adaptation). We trained it for 12,000 steps over 2 epochs, using an Adam optimizer with a learning rate of 1e-4. The number of LoRA layers is 10, with a total of 28M parameters. We will add more detailed information on the model's architecture and the fine-tuning strategy in the revised manuscript.
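As an illustration of the LoRA-based adaptation described above, here is a minimal LoRA linear layer in PyTorch. The rank, scaling, and class name are hypothetical and do not reflect the exact adapter configuration used for FLUX.1-dev.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Minimal LoRA adapter around a frozen linear layer: the base weights stay
    fixed and only the low-rank update (lora_a, lora_b) is trained."""
    def __init__(self, base: nn.Linear, rank=16, alpha=16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad_(False)           # freeze the pretrained weights
        self.lora_a = nn.Linear(base.in_features, rank, bias=False)
        self.lora_b = nn.Linear(rank, base.out_features, bias=False)
        nn.init.zeros_(self.lora_b.weight)    # adapter starts as a no-op
        self.scale = alpha / rank

    def forward(self, x):
        return self.base(x) + self.scale * self.lora_b(self.lora_a(x))

# Example: wrap a 512->512 projection and train only the adapter parameters.
layer = LoRALinear(nn.Linear(512, 512))
out = layer(torch.randn(4, 512))
```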

A2/A3

We expanded our evaluation to include state-of-the-art text-based editing methods, with a particular focus on Flux-Kontext, which represents one of the strongest open-sourced text-driven editing frameworks at present. This addition provides a clearer perspective on where our multimodal approach stands relative to established text-only paradigms.

As the results in Table 1 demonstrate, text-only Flux-Kontext indeed performs strongly in semantic alignment (CLIP-T = 0.2728) and overall visual fidelity. However, our LoongX framework integrated into the same Flux-Kontext backbone (fine-tuned on L-Mind) further elevates performance across nearly all metrics, achieving the highest CLIP-I (0.8078) and CLIP-T (0.2730).

| Method | Conditioning | Params (B) | Training FLOPs (×10^9) | L1 ↓ | L2 ↓ | LPIPS ↓ | CLIP-I ↑ | DINO ↑ | CLIP-T ↑ |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| OmniControl | Text | 12.5 | 2.88 | 0.2632 | 0.1161 | 0.7482 | 0.6558 | 0.4636 | 0.2549 |
| OmniControl | Speech | 12.5 | 2.88 | 0.2714 | 0.1209 | 0.7625 | 0.6146 | 0.3717 | 0.2501 |
| ICEdit | Text | 12.6 | - | 0.2457 | 0.1071 | 0.6418 | 0.6979 | 0.5129 | 0.2572 |
| Bagel | Text | 7.8 | - | 0.1921 | 0.0878 | 0.5551 | 0.8035 | 0.6894 | 0.2715 |
| Flux-Kontext | Text | 12.6 | - | 0.2282 | 0.0986 | 0.6238 | 0.7729 | 0.6553 | 0.2728 |
| LoongX (OmniControl) | Neural Signals | 12.5 | 3.02 | 0.2509 | 0.1029 | 0.7227 | 0.6605 | 0.4812 | 0.2436 |
| LoongX (OmniControl) | Signals + Speech | 12.5 | 3.04 | 0.2594 | 0.1080 | 0.7245 | 0.6374 | 0.4205 | 0.2588 |
| LoongX (Bagel) | Signals + Speech | 7.8 | 3.04 | 0.1968 | 0.0905 | 0.5625 | 0.7998 | 0.6831 | 0.2729 |
| LoongX (Flux-Kontext) | Signals + Speech | 12.6 | 3.04 | 0.1905 | 0.0804 | 0.4498 | 0.8078 | 0.7031 | 0.2730 |

A4

Thank you for your constructive advice. We have conducted a systematic user study to comprehensively evaluate both Editability (how well the system followed the user's intention) and Content Preservation (how well the original content of the image was preserved after editing), as recommended. We recruited 10 annotators (5 male, 5 female, ages 18-40), each of whom independently rated the edited images generated under different conditioning methods (see Table 2 and Table 3). Methods included both baselines (OmniControl, Flux-Kontext) and our proposed LoongX framework under different conditions. The scoring was performed in a double-blind manner and on a 0–5 scale, where 0 indicates "completely unacceptable" and 5 indicates "excellent".

The combination of neural signals and speech (Editability: 4.63, Content Preservation: 4.40) outperforms either modality alone. While text inputs offer high editability (4.55) due to their explicit semantic guidance, neural signals exhibit superior content preservation (4.28) by capturing implicit visual preferences. By integrating both, our method achieves a better balance between user intention and visual fidelity.

| Method (Conditioning) | Original edited image | OmniControl (Text Only) | LoongX (OmniControl) (Neural Signal Only) | LoongX (OmniControl) (Neural Signal + Speech) | Flux-Kontext (Text Only) | LoongX (Flux-Kontext) (Neural Signal + Speech) |
|---|---|---|---|---|---|---|
| Editability | 4.87 | 3.99 | 2.60 | 4.34 | 4.48 | 4.63 |
| Content Preservation | 4.44 | 4.02 | 4.17 | 4.09 | 4.34 | 4.40 |
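
As an illustration of the aggregation, the toy snippet below averages double-blind annotator ratings per method; the column names and example scores are made up for illustration only.

```python
import pandas as pd

# Hypothetical raw ratings; in the study each of the 10 annotators scored every
# edited image on a 0-5 scale without knowing which method produced it.
ratings = pd.DataFrame({
    "annotator":            [1, 1, 2, 2],
    "method":               ["LoongX (Flux-Kontext)", "Flux-Kontext"] * 2,
    "editability":          [5, 4, 4, 5],
    "content_preservation": [4, 4, 5, 4],
})

# Per-method means give table entries of the kind reported above.
print(ratings.groupby("method")[["editability", "content_preservation"]].mean())
```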

A5

We have expanded the explanation of our evaluation setup and the editing task categories, and we have also added further discussion of failure cases, as detailed in Section A.5 of the supplementary material.

Our evaluation framework consists of four representative categories of image editing tasks, as visualized in Figures 8–11 of the supplementary material:

  1. Global Editing – e.g., changing the overall color tone, saturation, or lighting of the entire image.
  2. Background Editing – e.g., modifying or replacing background elements while keeping the subject unchanged.
  3. Object Editing – e.g., altering specific foreground elements like clothing color, object size, or replacing an item.
  4. Text Editing – e.g., modifying embedded textual content within images.

Each category is evaluated both quantitatively and qualitatively to assess alignment with user intent and perceptual fidelity.

In addition, we have carefully analyzed and reported failure cases in the supplementary material. These cases highlight limitations such as:

(1) Highly imaginative prompts that lie far outside the training distribution (e.g., “long-legged space creature”),

(2) Ambiguous instructions lacking critical semantic detail (e.g., whether the original background should be retained), and

(3) Non-standard input formats (e.g., panoramic aspect ratios) that can challenge spatial reasoning.

These failure cases highlight current limitations in generalization and input robustness, and we view them as opportunities for future improvement.

A6

Our diffusion transformer is not trained from scratch; instead, we initialize it using the publicly available FLUX.1-dev pretrained weights from Black Forest Labs. This model adopts a U-Net-style DiT architecture with transformer blocks applied at each resolution scale. The full model consists of approximately 860 million parameters, operating at 512×512 resolution, and is pretrained on high-quality text-to-image datasets. Leveraging this foundation allows us to benefit from its robust visual representation capability while focusing our training on the unique neural-conditioning aspects of our task.

To adapt FLUX to our multimodal setting, we employ LoRA (Low-Rank Adaptation) for parameter-efficient fine-tuning. This enables us to fine-tune the model on our neuro-conditioned dataset without the need for full backpropagation through all layers, preserving generalization while reducing computational cost. Our conditioning pipeline feeds into the FLUX transformer via our proposed DGF fusion module, which supports both text embeddings (via T5-XXL and CLIP) and neural encodings (via CS3 encoder), as described in Section A.4 of the supplementary material.
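
As an illustration of the kind of conditioning fusion involved, the sketch below shows a simple dynamically gated mixture of a text embedding and a neural-signal embedding. It is only a schematic stand-in for the DGF module (whose exact architecture is described in the paper and supplementary material, not reproduced here), and the shapes and encoder names in the comments are assumptions.

```python
import torch
from torch import nn

class GatedFusion(nn.Module):
    """Toy per-dimension gate that mixes two conditioning streams."""

    def __init__(self, dim: int):
        super().__init__()
        self.gate = nn.Sequential(nn.Linear(2 * dim, dim), nn.Sigmoid())

    def forward(self, text_emb: torch.Tensor, neural_emb: torch.Tensor) -> torch.Tensor:
        g = self.gate(torch.cat([text_emb, neural_emb], dim=-1))  # mixing weights in (0, 1)
        return g * text_emb + (1 - g) * neural_emb                # fused conditioning tokens

fusion = GatedFusion(dim=768)
text_emb = torch.randn(1, 77, 768)     # e.g., T5/CLIP text tokens
neural_emb = torch.randn(1, 77, 768)   # e.g., CS3-encoded EEG/fNIRS features projected to the same shape
print(fusion(text_emb, neural_emb).shape)   # torch.Size([1, 77, 768])
```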

A7

We appreciate the reviewer’s suggestion and agree that recognizing the limitations of our method is essential for a balanced evaluation. In the supplementary material (Section A.5: More Failure Cases), we expand on failure cases with qualitative examples in Figure 12.

Our analysis highlights three main scenarios where limitations emerge:

  1. Abstract or highly imaginative instructions: For concepts far beyond the training distribution (e.g., “morph the dog into a long-legged space creature”), outputs become semantically inconsistent or implausible due to the lack of concrete grounding.
  2. Vague or under-specified prompts: Ambiguity in instructions (Figure 12b) can cause mismatches with user intent, such as uncertainty over whether to retain or replace a background. This underscores the need for improved intent disambiguation via feedback or uncertainty modeling.
  3. Non-standard input formats: With unusual aspect ratios (Figure 12c), spatial misalignment or degraded quality may occur, likely due to training data bias. Future work could address this with augmentation or scale-invariant architectures.

A8

The quantitative evaluation covered tasks including global texture changes, object removal, and background modifications. We report the following results:

  • Global texture editing: CLIP-I: 0.6605
  • Object removal: DINO: 0.4812
  • Text edits: CLIP-T: 0.2588

A9

We acknowledge the issues raised regarding the color preservation in Figure 7, where certain edits, like "place the cat above," were not fully captured by our model. We have provided an extended limitations discussion to include more examples like this, where certain fine-grained details or semantic richness were not fully preserved. Additionally, we are addressing the issues causing these artifacts, which are largely due to the backbone model's capacity and the inherent challenges in handling complex image semantics.

A10

As confirmed, the submission of supplementary material alongside the main paper was in accordance with the conference's submission guidelines. Thank you for your detailed review.

Comment

Dear Reviewer,

Thank you very much for your thorough and constructive feedback. To save you time, we summarize our rebuttal as follows:

  1. Baseline model and training details: Our diffusion transformer is based on the FLUX.1-dev pretrained model and fine-tuned with LoRA on our dataset. Training and architecture details are now included in the revision.
  2. Comparison with new text-based methods: We added the SOTA method Flux-Kontext as a text-based editing baseline. Results show LoongX achieves complementary improvements over text-only approaches, clarifying the performance gap and potential for multimodal fusion.
  3. User study: A human evaluation was conducted (10 annotators, double-blind), confirming our method’s advantage in both editability and content preservation.
  4. Evaluation setup and task details: We expanded descriptions of editing tasks and included more details on failure cases in the supplementary material.
  5. Limitations and scalability: We discussed limitations (e.g., abstract or ambiguous edits) and clarified model scalability using the FLUX backbone.
  6. Quantitative evaluation coverage: We detailed the specific editing tasks evaluated (e.g., global texture, object removal, background changes) and provided results for key metrics, as requested.
  7. Supplementary material: Submission format follows the conference policy, with all supporting details provided for clarity.

All revised and new materials, including extended results, additional comparisons, and supplementary analyses, have been incorporated to address your points. We hope these clarifications and enhancements meet your expectations and further demonstrate the methodological soundness and transparency of our work.

Looking forward to your feedback!

Comment

Thanks to the authors for the detailed rebuttal and the supplementary experiments (presented despite the time limitation of the rebuttal period). In addition to the experiments, my concerns about the reported metrics have been addressed, along with the experimental details. The authors are strongly encouraged to include these experiments and details in the camera-ready version, together with qualitative examples across different backbones. As in my preliminary assessment, I believe this work is impactful in achieving multimodal representations for the editing task, and I keep my score positive.

Comment

Dear Reviewer,

We are deeply grateful for your encouraging words and constructive suggestions. Your positive assessment of our work and recognition of its impact on multimodal editing mean a great deal to us.

We sincerely appreciate your acknowledgment of the additional experiments and clarifications, and as advised, we will include extended results, detailed setups, and qualitative comparisons with different backbones in the final version.

Thank you again for your thoughtful engagement and kind support!

Warm Regards,

The Authors

Final Decision

This paper introduces LoongX, an image editing system conditioned on neural and speech signals. The authors collect a multimodal dataset (EEG, fNIRS, PPG, motion, speech) and show that these signals can guide editing, extending beyond text prompts. Ablations reveal contributions of different brain regions.

The rebuttal addressed concerns about metrics and experiment details with additional evidence. While some design choices could be refined, the contribution is clear and novel. This is a valuable step toward multimodal human–AI interaction for editing. Therefore, the paper is recommended for acceptance.