We would like to express our sincere gratitude for your time in reviewing the Rebuttal for our paper and for providing your valuable comments. We are fully committed to improving our paper to the best of our ability within the given time constraints. Your feedback is greatly appreciated and will significantly contribute to enhancing our work.

We appreciate your pointing out the issue with the current proof. We agree with your comment that in general mask absorbing state discrete diffusion, while approaches the Prior , this is not completely achieved depending on the parameter settings. (However, in practical cases, as described in Appendix C.2, in the VQ-Diffusion setting we use, the elements of become [MASK] tokens with a probability of 99.999%, which is why we stated this assumption in the main text and used it in the proof.)

After your comment and upon further consideration, we have decided to proceed with the proof of Lemma B.3 without relying on the assumption that (or ) coincides with . As the period for modifying the main text has already concluded, we will provide an outline of this revised proof here.

First, the conditional distribution in is identical to the conditional distribution of the star-shaped noise process. At this point, both distributions are exactly the same.

From here on, we can show that for each , starting from and proceeding in the order , it matches as a marginal distribution. This can be confirmed in terms of marginal distributions, given that uses the conditional distribution of the star-shaped noise process. For example, when , we have .

It is important to note that and match only when we focus on each and marginalize over the other variables . G2D2 enables posterior sampling by omitting the complex dependencies of .