Fair Image Generation from Pre-trained Models by Probabilistic Modeling

Mahdi Ahmadi,John Leland,Agneet Chatterjee,YooJung Choi

OpenReview PDF

提交: 2024-09-28更新: 2024-11-15

摘要

关键词

Image GenerationFairnessProbabilistic Modeling

评审与讨论

审稿意见

评分: 3置信度: 42024-10-29

This work proposes a scheme to help with fair image generation. The main idea is to introduce probabilistic circuits (PCs) to guide the model in generating fair images. A critical improvement is that the sensitive attributes of the fair dataset are introduced into the training of the PCs. The advantage of the method is that it is not required to retrain the generation model. Experimental results on CelebA show that the proposed method can improve the fairness of the gender distribution.

优点

The method is a pluggable one that does not require retraining or special network architectures, which makes the method valuable for various tasks.
The background of the method is described in detail. Thus, readers can follow the idea easily.

缺点

For the contributions of this work, introducing PCs and introducing the sensitive attributes in PC training are the main improvements. However, there could be shortcomings in both contributions:

The former one may not provide a very overwhelming improvement since there are previous methods tried to introduce other models, as described in Sec. 2, while the PCs are also an existing method.
The latter contribution is not detailly described. The formal definition of the sensitive attributes $S$ is not clarified. Since it is determined by the sampling algorithm, it is not clear how to ensure that the PCs can correctly learn the attributes. In the experiments, the dataset is divided by the gender attribute. But among images of one gender, the other attributes could also be not fair.

For the experiments, it can be observed that the method works well on CelebA for gender fairness. However, there could be many other cases not evaluated:

The performance for some other attributes is not evaluated. For CelebA, attributes such as race, age, or hair color are not considered. The performance on different datasets, such as CIFAR-10 and LSUN, which contain more attributes is not evaluated. It may be necessary because the results can demonstrate that the PCs can process different attributes' features.
The performance for different models is not evaluated. Since one advantage of the method is that it is easy to apply without the requirement of retraining, experiments only on VQ-GAN may not be enough to prove its effectiveness. On the one hand, widely utilized GAN models, such as StyleGAN, should be considered. On the other hand, different generative models, such as VAEs, flow-based models, and diffusion models should also be considered.

问题

Based on the Weaknesses, I recommend the authors provide a more comprehensive analysis of the improvements. Meanwhile, more experiments should be conducted to support the assertion of the advantage. Additionally, there are some minor issues:

In lines 156-157, it seems that the fair dataset is $D_{fair}$ instead of $D_{bias}$ .
In the tables, it may be better to use XX et al. (year) to represent previous methods rather than use (XX et al., year). (Please use \citet if you are using latex)
In Algorithm 2, the input train set is $D_{train}$ instead of $D_{val}$ .

审稿意见

评分: 3置信度: 42024-11-03

This work presents a fair image generation approach using a pre-trained model and a small fair reference dataset. Overall, the idea of using a probabilistic circuit to manipulate the latent space to make a fair representation is unique. However, the work needs to be compared with state-of-the-art models, and the writing of the whole paper needs to be improved.

优点

The proposed method only manipulates the latent space, thus taking less computational resources.
A critical finding in this work is that even if fair data is used to retrain pre-trained models, the resulting model still shows biased output.
Using a probabilistic circuit to transform the latent space to fair representation.

缺点

The introduction is not well organized
1. Need some background of probabilistic circuits in the introduction before mentioning in the contribution sections.
2. Use reference when making some claims, i.e.
  1. On page 1, line 47, "Rather, the users and ML practitioners for downstream tasks may be interested in the distribution of the generated samples and ensuring that it is not biased with respect to some sensitive attributes", - add reference.
In the proposed method, as the authors tried to manipulate the latent space using both latent space and sensitive attributes, a comparison should made with [1]; they also manipulate the latent space by minimizing the correlation distance between the sensitive and non-sensitive attributes. Besides this, the authors should also compare their work with more recent work [2].
The overall writing should be improved
1. In the Related Works section, some examples of probabilistic circuit-based generative models should be given
2. The author should also mention this work's limitations and future works

References

[1] Liu, Ji, et al. "Fair representation learning: An alternative to mutual information." Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2022.

[2] Teo, Christopher TH, Milad Abdollahzadeh, and Ngai-Man Cheung. "Fair generative models via transfer learning." Proceedings of the AAAI conference on artificial intelligence. Vol. 37. No. 2. 2023.

问题

Have you tried this proposed approach for fair tabular data generation?

审稿意见

评分: 1置信度: 52024-11-04

The paper address the problem of fair image generation - controlling a pre-trained model to generate balanced sets of images w.r.t. certain sensitive attributes - following a reference dataset. They propose to use probabilistic circuits (PCs) to learn the distribution of a reference fair dataset. From the learned PC, they can sample fairly the latent embeddings to generate fair distributions.

优点

Ensuring fairness in generative model is critical. The writing is easy to follow, for the most part. The proposed method is fairly simple.

缺点

The overall quality of the paper is quite low due to several critical issues:

Presentation and Clarity: The paper's presentation requires significant improvement to enhance clarity. Specifically:

There is no clear motivation for the use of PC and why it is particularly relevant for the task. What advantages does it offer over existing methods that also aim to learn the distributions of fair latent codes, such as Gaussian Mixture Models (GMM)? Furthermore, it is unclear if integrating PC into this problem poses any unique challenges, or if it is merely a straightforward "plug-and-play" approach.
The method lacks a concise, high-level summary, making it difficult to follow. Nowhere in the methodology section is it explicitly explained how PC is integrated with different generative models. Based on my understanding, the authors use PC to learn the distribution of latent codes to match a reference distribution. However, the description of this approach is vague, with statements like "The training algorithm is the same as the previous case" or "An overview of the proposed method can be seen in Algorithm 1," which fail to provide sufficient detail.

Limited Contribution: The paper’s contribution appears to be minimal, as it primarily focuses on applying PC to learn a set of latent codes without introducing substantial innovation or advancement beyond this.
Inadequate Evaluation: The experimental evaluation is limited in scope, considering only a single dataset, a single attribute, and a single model. This restricted setup weakens the generalizability and robustness of the findings. Some other models should be considered such as Stable Diffusion or StyleGAN. Some attributes for face generation should be considered such as "BlackHair Young Smiling Moustache".
Lack of Analysis and Ablation Study: The paper does not include any analysis or ablation study that would help elucidate the specific effects and impact of the proposed method.

问题

N/A. The paper requires substantial revisions before it can be considered ready for publication. There are no possible response from the authors that could change my opinion.

撤稿通知

2024-11-15

I have read and agree with the venue's withdrawal policy on behalf of myself and my co-authors.