PaperHub
3.5 / 10
Withdrawn · 4 reviewers
Ratings: 3, 3, 3, 5 (min 3, max 5, std dev 0.9; average 3.5)
Confidence · Correctness 2.3 · Contribution 1.8 · Presentation 2.5
ICLR 2025

FDA: Generating Fair Synthetic Data with Provable Trade-off between Fairness and Faithfulness

Submitted: 2024-09-25 · Updated: 2024-11-18
TL;DR

We propose a novel framework called FDA for generating Fair synthetic data through Data Augmentation, offering the first method with provable trade-off guarantee between fairness and faithfulness.

Abstract

We propose a novel framework called FDA for generating Fair synthetic data through Data Augmentation, offering the first method with a provable trade-off guarantee between fairness and faithfulness. Unlike other existing methods, our approach utilizes a novel joint model that consists of two sub-models: one focused on enforcing strict fairness constraints and the other dedicated to preserving fidelity to the original data, coupled with a tuning mechanism that provides explicit control over the trade-off between fairness and faithfulness. Specifically, our FDA framework enables explicit quantification of the extent to which the generated fair synthetic data preserve faithfulness to the original data, while achieving an intermediate level of fairness determined by a user-specified parameter $\alpha \in [0, 1]$. Theoretically, we show that the resulting fair synthetic data converge to the original data in probability when $\alpha$ tends to 1, thereby implying convergence in distribution. Our framework can also be combined with some GAN-based fair models, such as DECAF, to further improve the utility of the resulting synthetic data in downstream analysis, while carefully balancing fairness. Furthermore, we obtain an upper bound on the unfairness measurement for downstream models trained on the generated fair synthetic data, which can help users choose an appropriate $\alpha$. Finally, we perform numerical experiments on benchmark data to validate our theoretical contributions and to compare our FDA with other methods.
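To make the mechanism described above concrete, the following is a minimal, illustrative sketch of an $\alpha$-controlled mixture of a "faithful" label model and a "fair" label model. It is not the paper's actual FDA construction: the function names, the Bernoulli label model, and the convex mixture of the two sub-models' predicted probabilities are assumptions made purely for exposition.

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_fair_labels(p_faithful, p_fair, alpha):
    """Draw synthetic binary labels from an alpha-mixture of two label models.

    p_faithful : P(Y=1 | x) from a model fit to the original labels.
    p_fair     : P(Y=1 | x, s) from a model trained under a strict fairness
                 constraint (e.g., demographic parity).
    alpha      : mixing weight in [0, 1]; alpha = 1 recovers the faithful
                 model, alpha = 0 the fully fair one.
    """
    p_mix = alpha * p_faithful + (1.0 - alpha) * p_fair
    return rng.binomial(1, p_mix)

# Toy usage: five individuals with hypothetical outputs from the two sub-models.
p_faithful = np.array([0.9, 0.2, 0.7, 0.1, 0.8])
p_fair = np.array([0.5, 0.5, 0.5, 0.5, 0.5])
print(sample_fair_labels(p_faithful, p_fair, alpha=0.7))
```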
Keywords
Fairness, Fair synthetic data generation, Joint modeling, Faithfulness, Trade-off between fairness and faithfulness

Reviews and Discussion

Review
Rating: 3

This paper presents an augmentation-based method for generating fair data, applied specifically to the label space. In addition to fairness, the authors also focus on faithfulness, which is defined as the difference between the distribution of the generated labels and that of the original labels. The proposed algorithm is flexible, allowing control of the trade-off between faithfulness and fairness. The modeling used for the augmentation is theoretically sound. Several experimental results are provided, comparing the proposed method to existing generative methods for fairness.

Strengths

  1. The presentation is clear and easy to follow.
  2. The proposed algorithm can control the trade-off between faithfulness and fairness.

Weaknesses

  1. The major concern is that the proposed method generates only the labels, excluding the input features and sensitive variables. This means that the approach does nothing but augment the labels of given (input feature, sensitive variable) pairs. Generating the label alone has several critical issues:
  • (a) There is a lack of scenarios illustrating the generation of marginal data (i.e., labels only). In what situations would such generated data be needed (i.e., when only labels are generated)? Why is it important and meaningful to generate labels alone?
  • (b) The distribution of $(\hat{Y}_i, X_i, S_i)$ would not be similar to the distribution of $(Y_i, X_i, S_i)$. Therefore, the goal of generative modeling (i.e., estimating the distribution) is not achieved.
  • (c) Assume that the confidence for the ground-truth response $Y_i$ of a given instance $X_i$ is high. If $Y_i$ is augmented by a large margin (i.e., $\hat{Y}_i$ and $Y_i$ are significantly different) and a model fits this augmented data well (i.e., its prediction for $X_i$ is very similar to $\hat{Y}_i$), then the prediction performance on $X_i$ will be poor.
  2. This problem also affects the experimental parts:
  • (a) The baselines (e.g., FairGAN) generate not only the label $Y$ but also the input and sensitive attributes ($X$ and $S$). Hence, the experimental comparison may not be fair.
  • (b) Furthermore, fairness and faithfulness are only measured in the label space, without considering the generated (joint) distribution (e.g., the similarity between the distribution of the generated samples $(\hat{Y}_i, X_i, S_i)$ and the original distribution of $(Y_i, X_i, S_i)$).
  3. In equation (6), if $M \rightarrow \infty$, it seems that the distribution degenerates to a point mass at 0, not at $Y_i$. Is equation (6) correct? If I missed some details, please let me know. If not, the proposed distribution is not valid.

  4. (Minor) Following the author guide on the ICLR 2025 website (https://iclr.cc/Conferences/2025/AuthorGuide), it seems better to add a 'reproducibility statement' at the end of the main text.

Questions

  1. Consider the following scenario. First, train a fair model $g$. Then, set $\hat{Y}_i$ to the prediction of $g$ (i.e., $g(X_i)$). In this way, the augmented dataset $\{(\hat{Y}_i, X_i, S_i)\}_{i=1}^{n}$ would be fair. What is the main difference between this approach of generating $\hat{Y}_i$ with a trained fair model and the proposed method? (A minimal sketch of this baseline is given after these questions.)

  2. (Related to the main concern) What is the main advantage of generating $Y$ only, compared to generating $Y, X, S$ jointly?

  3. DECAF generates input data $X$, while $M_{fair}$ generates the label $Y$. In Remark 3.3, the authors claim that FDA can be combined with DECAF by using DECAF as $M_{fair}$. How is this combination possible?
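For reference, a minimal sketch of the baseline described in Question 1 (relabelling the data with the predictions of a trained fair model $g$) might look as follows. The specific choice of logistic regression plus group-specific thresholds enforcing equal positive rates is a hypothetical instantiation of "a fair model", not something taken from the paper or the review.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def relabel_with_fair_model(X, y, s, target_rate=None):
    """Fit a model g, then set Y_hat = g(X) after post-processing g's scores with
    group-specific thresholds so both groups get roughly the same positive rate
    (a simple demographic-parity post-processing)."""
    g = LogisticRegression(max_iter=1000).fit(X, y)
    scores = g.predict_proba(X)[:, 1]
    if target_rate is None:
        target_rate = y.mean()  # keep the overall base rate of the original labels
    y_hat = np.zeros_like(y)
    for group in np.unique(s):
        idx = (s == group)
        # Threshold chosen so that a fraction `target_rate` of this group is positive.
        thr = np.quantile(scores[idx], 1.0 - target_rate)
        y_hat[idx] = (scores[idx] >= thr).astype(int)
    return y_hat

# Toy data: X features, s a binary sensitive attribute, y the original labels.
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 5))
s = rng.integers(0, 2, size=500)
y = (X[:, 0] + 0.8 * s + rng.normal(scale=0.5, size=500) > 0).astype(int)
print("original positive rates:", [round(float(y[s == g].mean()), 2) for g in (0, 1)])
y_hat = relabel_with_fair_model(X, y, s)
print("relabelled positive rates:", [round(float(y_hat[s == g].mean()), 2) for g in (0, 1)])
```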

Review
Rating: 3

This paper proposes a fair synthetic data generation process with provable guarantees on the control of the fairness and faithfulness of the generated data. The proposed method, FDA, jointly uses two models, one to control fidelity to the original data and the other to control fairness, combined to provide a level of fairness matching a user-specified target. The study provides theoretical guarantees that fairness and faithfulness can be controlled by the parameter $\alpha$, which is also supported by the empirical results.

Strengths

  1. The paper is well-written and well-organized.
  2. The study introduces a novel fair synthetic data generation method with a controllable balance between fairness and faithfulness to the original distribution. This addresses a timely problem.
  3. Theoretical results are sound and empirical results demonstrate the effectiveness of the proposed method.

Weaknesses

  1. Related work should be more thoroughly discussed, and the position of the proposed method relative to the relevant literature should be stated more clearly; in particular, the difference from the work by Jiang, Bei, et al. [1] and from the line of work on fair representation learning.

  2. The method relies on a fair model (Line 203) to generate fair synthetic data with a controllable trade-off between fairness and "faithfulness" to the original target label. The benefit of using the proposed method is unclear, especially when the fair model used already has a controllable trade-off between fairness and predictive performance. Many state-of-the-art fairness-enhancing algorithms, such as Agarwal, Alekh, et al. [2], allow a controllable trade-off between fairness and accuracy.

  3. The experiments are conducted on a limited set of datasets (e.g., Adult Income and COMPAS). Additional testing on a broader range of datasets, especially from different domains (e.g., image datasets), is necessary to assess the proposed method's generalizability.

  4. The proposed method focuses only on statistical parity, limiting its broader usage.

  5. Some terminology used in the paper could be misleading and should be better motivated. In particular, the abstract says the proposed method performs data augmentation; however, the problem statement (Line 149) shows that the proposed method aims to transform the target variable $Y_i$ so that it satisfies the desired fairness property. It is unclear where data augmentation occurs, since the new dataset has the same size as the original dataset. The type of data augmentation used should be better clarified.

  6. The output dataset is not fully synthetic. The synthetic data obtained by the proposed method differ from the original data only in the target variable, which limits their applicability compared to existing methods such as FairGAN and TabFairGAN.

[1] Jiang, Bei, et al. "Balancing inferential integrity and disclosure risk via model targeted masking and multiple imputation." Journal of the American Statistical Association 117.537 (2021): 52-66.

[2] Agarwal, Alekh, et al. "A reductions approach to fair classification." International Conference on Machine Learning. PMLR, 2018.

Questions

In addition to the weaknesses above, some inconsistencies need to be addressed and some points clarified:

  1. Since the proposed method does not intend to learn the data distribution from which to sample fair synthetic data, is it fair to compare it with methods that actually learn distributions (i.e., FairGAN, TabFairGAN)?

  2. As the proposed method only “transforms” the target distribution, how does it differ from the work on fair representation learning, which aims to transform the input data into a fairer representation? 

  3. Regarding the definition of faithfulness used in the paper, is it equivalent to accuracy when the variable is categorical? In that case, the term "faithfulness to the original data" used throughout the paper is overstated; it should be modified to reflect that it refers to "faithfulness" to the target variable, given that the other features remain unchanged.

  4. In the experiments (Figs. 2 & 3), how many values of the reduction fraction $\alpha$ were used? If the figure does not depict the Pareto front, why is there no fairness value at $\alpha = 0.1$, for example?

  5. Figure 2 compares the proposed method with other baselines and shows a constant fairness level for all the baselines across different values of $\alpha$. Do the considered baselines use the value of $\alpha$? If not, it would be better to represent them with straight lines.

  6. In Figure 3, adding a state-of-the-art fairness-enhancing method with a controllable tradeoff between fairness and accuracy might strengthen the contribution of the paper.     

  7. Line 206 says that $f$ can be any fair model; how can the proposed method guarantee that the user-specified fairness level ($\alpha$) is achieved if the fair model cannot reach the expected fairness level, e.g., DP = 0?

Review
Rating: 3

This paper focuses on data-centric methods for mitigating unfairness. It proposes a framework for generating fair data, consisting of two components, $M_{faithful}$ and $M_{fair}$, which ensure that the distribution of the generated data aligns with the target distribution and promote DP fairness for the downstream task. The authors provide recommendations for tuning the parameters in practice. In the experimental section, two tabular datasets are used to demonstrate that this method achieves superior performance compared to existing methods in terms of data quality and fairness.

Strengths

  1. Fair synthetic data generation offers a promising approach to addressing fairness issues; I believe it provides a fundamental solution by tackling these issues from a data perspective.

  2. The author demonstrates how their method differs from others, clearly stating their contributions.

Weaknesses

1. Incremental innovation: After reviewing [1], I believe this paper represents incremental work relative to [1]. Concretely, the modeling of $M_{faithful}$ and $M_{fair}$ closely resembles $M_{assoc}$ and $M_{mask}$ from [1], as evidenced by the similarity between equation (4) in this manuscript and Eq. (4) from [1], and between Eq. (2) in this manuscript and Eqs. (1, 2) in [1]. Additionally, Figure 1 in this manuscript versus Figure 1 in [1], as well as Algorithm 1 in this manuscript compared to Steps 0-3 of Algorithm 1 in [1], further demonstrate the similarities.

2. Lack of evaluation: The authors limit their comparisons to existing work on only two tabular datasets (UCI Adult and COMPAS, as detailed in the Appendix), which is not convincing to me. Additionally, the comparison appears unfair to the other approaches: the authors present a hyperparameter-tuning plot in a figure where only their method is varied, while the other methods remain constant. From the perspective of fair data generation, it would be beneficial to include image datasets such as CelebA or Waterbirds to demonstrate broader applicability and enhance the utility of the work.

3. Presentation quality: The paper's presentation is suboptimal. The notation is not clear to me; for example, in line 222, $m \in M$, the meaning of $M$ is not introduced beforehand. Furthermore, the paper does not adequately explain how to replicate the methodology or the specific functions $M_{fair}$ and $M_{faithful}$. The details provided in lines 925-931 offer some insight, but they fall short of enabling replication of the work. I found the authors' code, but this weakness in the presentation remains.

[1] Jiang, Bei, et al. "Balancing inferential integrity and disclosure risk via model targeted masking and multiple imputation." Journal of the American Statistical Association 117.537 (2021): 52-66.

Questions

  1. Would this method achieve a better fairness-DP tradeoff than the post-processing method [1]?
  2. Would this method be able to quantify this trade-off?
  3. Would this method be beneficial for other fairness metrics like Min-max fairness?

[1] Xian, Ruicheng, Lang Yin, and Han Zhao. "Fair and optimal classification via post-processing." International Conference on Machine Learning. PMLR, 2023.

Details of Ethics Concerns

NA

Review
Rating: 5

This paper introduces a framework called FDA. The framework is designed to produce synthetic data that balances fairness and faithfulness to the original data with provable guarantees. The framework leverages a joint model that incorporates two sub-models:

  • M_fair: Focuses on enforcing strict fairness constraints.
  • M_faithful: Dedicated to preserving fidelity to the original data.

The trade-off between fairness and faithfulness is controlled by the parameter $\alpha$: $\alpha = 0$ means maximum fairness but lower faithfulness, and $\alpha = 1$ means maximal faithfulness with no fairness modifications.

Strengths

  • One strong point is the ability to explicitly balance fairness and faithfulness, which is supported by theoretical proofs.
  • The framework can be combined with GAN-based methods (e.g., DECAF). This can be used to improve synthetic data quality in various downstream tasks.
  • The paper provides a mathematical proof that fairness can be systematically reduced or increased based on α, with bounds on unfairness for downstream models trained on the synthetic data.

Weaknesses

  • The model relies on some hyperparameters ($\sigma^2$ and $M$). The authors claim that generation takes 1.3 seconds on the Adult dataset with n = 63,000 instances in the training set, using a Core i5-12600K CPU (16 threads). Does the code run multi-threaded? Assuming it does, this is roughly 330 µs per instance and, with 14 features, roughly 23 µs per feature (1.3 s / (63,000/16) / 14); the arithmetic is spelled out after this list. If we extrapolate to an image dataset with 784 features, the method may face scalability issues on large or high-dimensional datasets where traditional methods are more efficient.
  • While the paper's theoretical contributions are strong, the empirical evaluation is limited to benchmarks like the UCI Adult and COMPAS datasets. What about image datasets, etc.?
  • While FDA shows improved performance over FairGAN and DECAF, a more extensive comparison with other fairness-focused synthetic data generation methods would highlight FDA’s advantages and limitations better.
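The timing argument in the first weakness above can be checked with a short calculation. The snippet below reproduces the reviewer's back-of-the-envelope numbers under the stated assumptions (perfect 16-thread scaling and runtime linear in the number of features); it says nothing about the measured behaviour of the authors' code.

```python
# Reviewer's arithmetic: 1.3 s for 63,000 instances, 16 threads, 14 features.
n, threads, n_features, total_sec = 63_000, 16, 14, 1.3

per_instance = total_sec / (n / threads)   # ~3.3e-4 s, i.e. ~330 us per instance
per_feature = per_instance / n_features    # ~2.4e-5 s, i.e. ~23.6 us per feature
print(f"per instance: {per_instance * 1e6:.0f} us, per feature: {per_feature * 1e6:.1f} us")

# Extrapolation to a 784-feature dataset (e.g., 28x28 images) of the same size:
# 784 / 14 = 56x more per-feature work, so roughly 56 * 1.3 s ~ 73 s in total.
extrapolated_total = per_feature * 784 * (n / threads)
print(f"extrapolated total: {extrapolated_total:.0f} s")
```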

Questions

  • Could you provide practical guidelines for setting the hyperparameters $\sigma^2$ and $M$ to achieve optimal trade-offs? Are there heuristics that practitioners could follow?
  • How does the FDA framework perform with high-dimensional datasets? Are there computational shortcuts or approximations that could be applied to maintain efficiency?
  • The paper uses Demographic Parity (DP) as the primary fairness measure. How would FDA perform with alternative fairness measures, such as Equalized Odds or Predictive Parity?
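For context on the fairness measures mentioned in the last question, the sketch below gives standard sample estimates of the demographic parity gap and the Equalized Odds gap for a binary sensitive attribute. These are the usual textbook definitions, not the paper's evaluation code.

```python
import numpy as np

def demographic_parity_diff(y_pred, s):
    """|P(Y_hat = 1 | S = 0) - P(Y_hat = 1 | S = 1)| for a binary sensitive attribute."""
    return abs(y_pred[s == 0].mean() - y_pred[s == 1].mean())

def equalized_odds_diff(y_true, y_pred, s):
    """Largest gap across groups in TPR (y_true = 1) and FPR (y_true = 0)."""
    gaps = []
    for y in (0, 1):
        rates = [y_pred[(s == g) & (y_true == y)].mean() for g in (0, 1)]
        gaps.append(abs(rates[0] - rates[1]))
    return max(gaps)

# Toy usage with random labels, predictions, and a binary sensitive attribute.
rng = np.random.default_rng(0)
s = rng.integers(0, 2, 1000)
y_true = rng.integers(0, 2, 1000)
y_pred = rng.integers(0, 2, 1000)
print(demographic_parity_diff(y_pred, s))
print(equalized_odds_diff(y_true, y_pred, s))
```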
Withdrawal Notice

I have read and agree with the venue's withdrawal policy on behalf of myself and my co-authors.