Only diffusion-/score matching-based generative models are discussed in this work. Could you please provide some similar derivations for the recent flow matching-based generative models, such as SD3 and FLUX? I believe similar conclusions stand for flow-based methods, and it will make this work more comprehensive.
Is it possible to disentangle two noises used in the denoising and renoising process? For example, we can set different hyperparameters and for , and use this term in different weights for denoising and renoising respectively. Setting is the same as CFG and setting is the same as CFG++.
Is it possible to provide derivations of CFG++ for other diffusion solvers, except DDIM? Extensions of CFG++ to other solvers in Appendix A are more like intuitive understanding, rather than derivations from inverse problems and SDS loss, as done for DDIM.
Essentially, CFG++ can be written as reweighted CFG whose varies along the sampling process (let and for easier LaTeX rendering in OpenReview):

For DDIM CFG:

For DDIM CFG++:

So,

As discussed in Sec. 5, previous studies also propose adjusting the guidance scale across timesteps. However, it may be incorrect to claim that "these findings are orthogonal to ours and keep the sampling trajectory the same ... CFG++ is designing a different trajectory".