5.5

/10

Poster4 位审稿人

最低4最高6标准差0.9

3.8

置信度

正确性3.3

贡献度2.5

表达3.0

NeurIPS 2024

The Fairness-Quality Tradeoff in Clustering

Rashida Hakim,Ana-Andreea Stoica,Christos Papadimitriou,Mihalis Yannakakis

OpenReview PDF

提交: 2024-05-16更新: 2024-11-06

TL;DR

We propose new algorithms for recovering the Pareto Front of the clustering problem with fairness considerations.

摘要

关键词

clusteringalgorithmic-fairnessmultiobjective-optimization

评审与讨论

审稿意见

评分: 6置信度: 42024-07-09

The paper studies fair clustering from a novel and quite fresh perspective. So far in the literature, fairness in clustering has been mostly defined as an additional constraint the solution should satisfy. Given that definition, algorithms are trying to optimize the clustering objective (usually some metric objective) subject to the fairness constraints (with the exception of the Esmaeili et al. NeurIPS 2021 paper that reverses the roles of fairness and objective). On the other hand, the paper at hand tries to create the Pareto front of the clustering solutions. This is merely a set that contains all solutions that are not dominated at at least one objective (either clustering cost or fairness value) by any other solution.

At first the authors define the necessary characteristics of the fairness notions their framework can capture, while giving plenty of examples of specific notions that can be handled by their algorithm. Then, they given a DP algorithm that solves exactly the assignment version of the problem (cluster centers are fixed). This algorithm runs in exponential time. Moving forward, they show how any approximation algorithm for the vanilla clustering problem can be combined with the DP algorithm in order to give approximations of the Pareto front for the general problem. Furthermore, they show that in the general case you cannot hope to get an algorithm that runs in polynomial time for their problem. Finally, for a newly introduced fairness notion they demonstrate that the Pareto front can be computed efficiently.

优点

I believe that this new perspective on fair clustering is interesting.
The theoretical results are solid and are adequately accompanied by experimental evaluation.
The presentation is clean.

缺点

The paper would benefit from more motivation on why the Pareto front would be helpful in real life applications of fair clustering.
The main results are not really practical from a runtime perspective.
It seems like the framework cannot capture any notions of individual fairness.

问题

Do you think that your framework can be expended to capture notions of individual fairness as well?

局限性

N/A

作者回复

2024-08-07

We thank the reviewer for the comments and positive feedback!

Individual fairness definitions: To first answer the question: we think our framework is best suited for notions of group fairness, where there is some additional information about the nodes aside from their features used in the clustering objective. For individual fairness definitions, the only information available are the features themselves, making individual fairness and clustering objectives correlated (improving individual fairness in a clustering also makes the clustering cost smaller). Under some such definitions, the problem can be thought of as a single-objective optimization problem (e.g. Jung et al 2019 [37], Ghadiri et al 2019 [29]). For such problems, there is no Pareto front. For recent adaptations of individual fairness under bi-criteria optimization problems (Mahabadi and Vakilian 2020 [46]), one can apply a repeated approximation algorithm under an upper bound on one of the objectives (similar to the repeated-FCBC algorithm we proposed). In doing so, one would obtain loose approximation guarantees (see for example the $(\beta,\gamma)$ -approximation guarantee for individual fairness proved by Mahabadi and Vakilian 2020 [46]). Dynamic programming approaches would not directly apply, however, we think this would be an excellent avenue for future work.

Motivation for computing the Pareto front: Computing the Pareto front would be helpful for central decision makers who are balancing different objectives: for example, in facility location problems where a central agent decides on the location of facilities (e.g. buses, hospitals, etc) based on the location of individuals within a certain region. It is well-known that people of lower socio-economic status often travel longer distances [1] and have fewer health facilities in their region [2]. A decision maker has multiple objectives to balance: minimal travel time for the entire population and equal access to facilities for different populations. The Pareto front allows the decision maker to balance these objectives in the optimal way: how much improvement can a group of people gain by moving a facility slightly away from the solution given by a single objective (e.g. by performing k-means)?

Computing the Pareto front for facility location problems with multiple objectives has a long history motivated by such questions, with real-world case studies in the Singapore road network [3] and ambulance placement in Barbados [4]. Further applications are found in extensive surveys [5]. We would be happy to include a more comprehensive motivation including real-world applications in the final version of the paper.

[1] Preston, John, and Fiona Rajé. "Accessibility, mobility and transport-related social exclusion." Journal of transport geography 15, no. 3 (2007): 151-160.

[2] Felland, Laurie E., Johanna R. Lauer, and Peter J. Cunningham. "Suburban poverty and the health care safety net." Washington (DC): Center for Studying Health System Change, 2009.

[3] Huang, Bo, P. Fery, L. Xue, and Y. Wang. "Seeking the Pareto front for multiobjective spatial optimization problems." International Journal of Geographical Information Science 22, no. 5 (2008): 507-526.

[4] Harewood, S. I. (2002). Emergency ambulance deployment in Barbados: A multi-objective approach. Journal of Operations Research Society, 53, 185–192.

[5] Farahani, Reza Zanjirani, Maryam SteadieSeifi, and Nasrin Asgari. "Multiple criteria facility location problems: A survey." Applied mathematical modelling 34, no. 7 (2010): 1689-1709.

Runtime: We think exploring faster algorithms is an excellent direction for the future. In this first analysis of the Pareto front for clustering and fairness objectives, we uncover the following complexities:

Some objectives are more difficult than others: fairness objectives that sum over clusters do not allow a good poly-time approximation (Thm 7.1 [26]), whereas sum/max of imbalances objectives allow poly-time computation of the Pareto front through a reduction to a matching problem (Thms 3.6 and A.5 in our work).
Overall, our main theoretical contribution gives tight approximation bounds to the true Pareto front, needing only some general conditions on the objective functions. In other words, we are agnostic to the specific objective functions used.
We explored faster heuristics by adapting approximation algorithms that optimize fairness under a bounded clustering cost to recover the entire Pareto front; while such algorithms are polynomial, the approximation bounds are worse than what we obtain by dynamic programming; in practice, such algorithms obtain a very loose approximation of the Pareto front. We consider this a contribution of our paper: we believe that this is the first work to start exploring the tradeoff between approximation guarantees and the runtime complexity for computing the Pareto front.

审稿意见

评分: 4置信度: 42024-07-13

The paper is concerned with finding the Pareto front in fair clustering. Specifically, the clustering cost along with a fairness objective are considered simultaneously. Algorithms that can approximate the Pareto front are shown. The run-time is exponential however for the special fairness notion of sum of imbalances it can be done in polynomial time. The papers also conducts experiments on real world datasets and compares against some baselines.

优点

-The objective of the paper is interesting and arguably is what one would like to see in fairness work as it matures. Instead of just satisfying the fairness constraint we would have a full trace of the Pareto front.

-As shown in section 2, the paper considers a number of fairness notions.

缺点

-The major weakness of the paper is that the results are somewhat unsatisfying theoretically. In particular, the run-time of the algorithm is exponential. It is argued in section 3.3 that this intractability is inherent to the problem. I suspect that some hardness result actually exists but the argument here is not formal. What would be better is to give a theorem that states something like even when the Pareto front is of polynomial size, it cannot be approximated to (2+alpha,1) in polynomial time.

-Further, since the run-time is exponential. Why not consider algorithms that approximate the fairness objective as well, i.e. the approximation would be (2+alpha,beta) where beta is not equal to 1. Unfortunately, the run-time of Theorem 3.2 seems very expensive $n^{O(k)}$ so such approximations might run much faster. Although, there might still exist a hardness result that even forbids such approximations, but it would be very interesting to prove them.

The following points are more minor:

-Sum of Imbalances objective is a bit restrictive since it ignores proportionality. For example, it would achieve the same value even if the imbalance is large over a large cluster (so small proportional violation) or large over a small cluster (large proportional violation).

-Does this paper have any new implications to the setting of Esmaeili [26]? For example, better approximations or faster run-time?

-FCBC algorithm in page 7 is not elaborated on? It refers to the algorithm in [26], correct?

问题

Please respond to the issues under the Weaknesses section above.

局限性

Limitations are adequately addressed.

作者回复

2024-08-07

We thank the reviewer for the thoughtful questions and comments!

Theoretical results: We appreciate the reviewer’s suggestions and we also think that an excellent direction would be some negative/hardness results that can decisively address what approximations can and cannot be obtained in poly-time. Currently, such results are difficult to obtain for the clustering problem for the following reasons:

First bottleneck: much of the literature obtains theoretical approximation bounds by solving the assignment objective as a proxy for the clustering objective. In doing so, we obtain the $(2 + \alpha)$ -approximation on the clustering cost. Subject to this method, we know of theoretical results stating the we cannot get a $(2+\alpha,1)$ -approximation on the Pareto front for (clustering objective, fairness objective) in poly-time for a range of fairness objectives (Thm 7.1 [26] for fairness objectives that sum across the clusters).
Second bottleneck: the assignment problem is also computationally hard (Thm 5.1 in [26]). This result implies that even a polynomially-sized Pareto curve is hard to compute for the (assignment, fairness) objectives. Even approximating fairness objectives with an additive approximation of $O(n^{\delta})$ , $\delta \in [0,1)$ is NP-hard (Thm 7.1 [26]), therefore, an approximation of $(1, O(n^{\delta}))$ for the (assignment, fairness) objectives is hard.

In other words, all the guarantees we get are based on not relaxing the cost approximation from $(2 + \alpha)$ to something looser. Allowing a relaxation on the cost approximation may lead to theoretical guarantees on the fairness approximation (e.g. obtaining a $(c + \alpha, 1)$ -approximation for $c > 2$ ). We think this is an excellent direction for future work, but would require fundamentally different techniques for solving the clustering objective that do not necessarily use the assignment objective as a proxy. We believe this would make for an entire paper on its own, and as such, this is beyond the scope of our paper.

Potential approximations on the fairness objective: The repeated-FCBC algorithm does indeed involve an approximation on the fairness objective as well, $(2+\alpha, c + \eta)$ . Its purpose is precisely to explore the tradeoff between approximations and runtime: what we gain in runtime (FCBC is polynomial), we lose in approximation guarantees. The additive approximation eta on the fairness objective provided by FCBC has no known theoretical bound. We describe this issue further down in this response. In particular, no objectives that sum a measure of unfairness across the clusters allow a poly-time approximation of the form $O(n^{\delta})$ (Thm 7.1 [26]). Other objectives, such as sum and max of imbalances, allow even a polynomial time algorithm for computing the entire Pareto front (Thms 3.6 and A.5, our work). Such differences in the complexity of different objectives makes it difficult to have a general result that encompasses all objectives. Instead, our dynamic programming technique provides sufficient conditions on the objectives to get a good approximation guarantee of the Pareto front.

For this paper, we believe that this is the first work to start exploring the tradeoff between approximation guarantees and the runtime complexity for computing the Pareto front. We think the reviewer’s suggestion of further exploring this tradeoff is an excellent avenue for future work.

For the more minor points:

Sum of Imbalances: indeed, this objective ignores proportionality. Its purpose is to demonstrate that the complexity in computing the Pareto front comes exactly from requiring proportionality in the fairness objective. The reason for this difficulty can be explained by a reduction of fair assignment from the exact 3-cover problem [26], which shows that it is NP-hard to find an assignment of minimum cost where all clusters have red and blue points in proportion to the dataset ratio of 1:3. In contrast, simpler objectives that only compute the difference between the number of nodes in each sensitive group across clusters allow poly-time algorithms.

FCBC algorithm: described in lines 312-317, 326-328, FCBC is a polynomial-time algorithm for computing a clustering that has a cost bounded by $(2 + \alpha)U$ , for an upper bound on the cost $U$ that is fixed a priori, and a fairness additive approximation, using an LP-type method, indeed proposed by [26] as the reviewer noted. This algorithm computes a single point on the approximated Pareto front. We will include a detailed description of the FCBC algorithm in the final version of the paper.

Implications for Esmaeili et al [26]: A contribution of our paper is to adapt FCBC for computing the entire Pareto front by allowing the cost upper bound $U$ to vary: in doing so, we explore the runtime-approximation tradeoff. Although FCBC has a polynomial runtime, in practice, the approximations are far from the true Pareto front. Figure 2 has the following implications: we can gain in runtime by adapting the FCBC algorithm to compute an approximated Pareto front; however, the approximation is quite loose in practice, with no guarantees on the fairness additive approximation factor. The additive approximation factor depends on a particular quantity, $L(U)$ , which is the size of the smallest cluster across all clusterings of cost not exceeding $U$ . This quantity has no theoretical guarantees and is computationally very expensive to compute in practice (it means iterating over all possible clusterings). In our experiments, repeated-FCBC leads to a very different approximated Pareto front than the one computed by our dynamic programming approach (Figs 2 and 7 from our paper). We consider this insight a contribution of our paper: we adapted FCBC to compute the Pareto front, and we compare it in practice with the Pareto front obtained by dynamic programming.

审稿意见

评分: 6置信度: 42024-07-13

The authors consider an important problem in the realm of fair clustering in this work-- is it possible to output the entire Pareto fairness-utility frontier when undertaking fair clustering? This can allow practitioners to answer questions of the form-- "how much utility can be sacrificed to improve fairness of the model?" and constitutes a problem setting where the practitioner is willing to be suboptimal in one objective to allow for gains in the other. This is an overlooked yet pertinent problem and the authors provide novel approaches for outputting the entire Pareto fairness-quality curve for fair clustering. More specifically, the authors define two general classes to cover fairness and utility notions in clustering, and then propose exponential algorithms for tracing the Pareto frontier under these definitions. Notably, they provide an intractability result with regards to their algorithm taking exponential time in the worst case. Finally, they also propose a polynomial time algorithm under a specific fairness notion for the assignment (not clustering) problem, where the cluster centers are assumed to be fixed.

优点

The paper formulates and studies an important problem of outputting the entire fairness-utility Pareto frontier. The authors propose a number of novel algorithms for the clustering/assignment problems under general definitions they provide (pattern based objectives and mergeable objectives) that generally encapsulate fairness notions.
I also appreciate the authors providing the intractability result showcasing that exponential time is the best we can do in the worst case for the general problem, even if the Pareto frontier itself is polynomial.
The experiments conducted validate the efficacy of the proposed algorithms.

缺点

I am somewhat unsure of how general the definitions of the fairness notions are-- for instance, can the authors discuss whether other popular notions such as social fairness by Ghadiri et al (https://arxiv.org/pdf/2006.10085) fall into the proposed categorization? Currently, the definitions seem to mostly apply to Balance and the notions covered in Esmaeili et al (https://arxiv.org/pdf/2106.07239), so more discussion on what other fairness definitions (given the vast literature in fair clustering) follow the assumptions of the paper (pattern based objectives and mergeable objectives) would be useful information to have for readers.
In the abstract the authors state: "for perhaps the most natural fairness objective: minimizing the sum, over all clusters, of the imbalance between the two groups in each cluster". I am curious about where this statement comes from? To my best knowledge, no other prior work has considered this definition of fairness in fair clustering, but I would be happy to be corrected if I am mistaken. Either way, it would be good to discuss why the authors make this statement.
While a minor issue, multiple idiosyncracies and issues have been identified with the Adult dataset (https://arxiv.org/pdf/2108.04884) and it would be better to use more recent versions of the US Census data (such as from the folktables package) if possible.

问题

Please see the weaknesses section. Each of the points above can be considered a question. Also note that owing to the strengths and weaknesses listed above, the paper's contributions currently constitute a technically solid, moderate-to-high impact work, which forms the basis for my score.

局限性

The authors have sufficiently discussed limitations of their work.

作者回复

2024-08-07

We thank the reviewer for the questions and positive feedback!

Definitions of fairness: The notions of fairness we discuss cover a range of fairness definitions used in a significant part of the literature, focused on group fairness notions. We would be glad to include a discussion on additional definitions in the final version of our paper. For some definitions, the notion of a Pareto front does not apply: for example, the socially fair k-means clustering notion proposed by Ghadiri et al 2021 [29] defines a single optimization objective that already incorporates fairness (by minimizing the max distance between points in each group and their cluster centers). With this definition, the fairness objective and the clustering objective are not separated in a multi-objective optimization problem, but rather combined in a single objective. Therefore, there is no Pareto front, since this notion applies to a multi-objective optimization problem. For Ghadiri et al, the problem is about how close an algorithm can approximate the optimal solution. For other definitions of individual fairness, such as recent adaptations of individual fairness under bi-criteria optimization problems (Mahabadi and Vakilian 2020 [46]), one can apply a repeated approximation algorithm under an upper bound on one of the objectives (similar to the repeated-FCBC algorithm we proposed). In doing so, one would obtain loose approximation guarantees (see for example the $(\beta, \gamma)$ -approximation guarantee for individual fairness proved by Mahabadi and Vakilian 2020 [46]). Dynamic programming approaches would not directly apply, however, we think this would be an excellent avenue for future work.

Sum/max of imbalances: The sum of imbalances and max of imbalances objectives are indeed not previously defined in the literature: we introduce them as simple examples of objectives for which the Pareto front can be computed in polynomial time using a novel approach that adapts a graph matching argument to our clustering setting. We denoted them as natural due to their simplicity and we use them to exemplify that not all fairness objectives render the Pareto front problem difficult. We are happy to clarify this point in the final version of the paper.

US Census datasets: We appreciate the reviewer’s suggestion of using newer versions of the Census data — we would be glad to include an empirical analysis on the data obtained from the folktables package in the final version of the paper.

2024-08-11

I would like to thank the authors for their rebuttal. After going through it, I believe the paper's contributions still currently constitute a technically solid, moderate-to-high impact work as I had mentioned in my original review. I will hence keep my score.

审稿意见

评分: 6置信度: 32024-07-13

This paper considers the Pareto-front between quality and fairness in clustering problems, which captures the trade-off between these two objectives. A clustering is on the Pareto front if its clustering cost and fairness objectives are strictly worse than the pair of other clustering.

They first consider the assignment problem. Given n data points with group labels and k centers, the goal is to assign data points to centers to form clustering on the Pareto front. They provide a dynamic programming algorithm which computes the Pareto front of the assignment problem in O(kn^{l(k-1)}) time.

Next, they consider the clustering problem. They first use a vanilla clustering algorithm that achieves an alpha approximation on the cost and then use the algorithm for the assignment problem on computed centers. They show that this algorithm achieves a (2+alpha, 1) approximation on the Pareto front, which means for each algorithm solution, there exists a solution on the Pareto front such that the cost of the algorithm solution is 2+alpha approximation and the fairness objective is strictly better.

They also show that if the fairness objective is the sum of imbalances for two groups, the Pareto front of the assignment problem can be computed in poly time.

优点

The paper is well-written and easy to follow.
This paper provides a XP algorithm for the Pareto front for the assignment problem and utilizes it to achieve a (2+alpha, 1) approximation for the Pareto front for the clustering problem.
The algorithm works for a general metric based cost objective and pattern based fairness objective.

缺点

The DP algorithm for the assignment problem is quite straightforward and has a large running time. Although it is NP-hard even for computing the minimum cost clustering with a fairness objective constraint, previous fairness clustering work provide approximation algorithms and FPT algorithms for some fairness objectives.
The sum of imbalances and the max imbalances objectives seems quite restrictive since they make sense to me for almost balanced two groups.

Minor:

Line 254 cost min_i (d(u,i)+d(v,i)) should be min_i (d(u,s_i)+d(v,s_i))
Line 546, formula second inequality d(phi(x), phi^(x)) should be d(x, phi^(x))

after the response: The comparison of their DP algorithm and the FCBC algorithm for the Pareto front is fair and shows the strength of their algorithm in computing the Pareto front task.

问题

For the FCBC algorithm, can it be modified such that it can optimize the fairness objective subject to both upper and lower bound on the clustering cost?

局限性

Yes, they mentioned the limitations of their work.

作者回复

2024-08-07

We thank the reviewer for the comments and question!

Question about FCBC: we’re not sure why a lower bound on the clustering cost would be desirable, since the clustering cost objective is a minimization objective (hence, lower cost is better). To modify FCBC with a lower bound on the cost guarantee, one can turn the cost minimization objective into a cost maximization objective by switching the sign of the objective and solving k-means clustering by gradient ascent methods; with the inverted objective, FCBC should follow a similar analysis. We note that the focus of our paper is not the FCBC algorithm per se, rather, we adapt it to explore the approximated Pareto front that it can obtain.

Runtime: while previous work provides approximation algorithms for optimization clustering under a fairness constraint, none offers a method for computing the Pareto front. We show in Section 4.2 that we can adapt such polynomial-time methods to compute an approximation of the Pareto front; however, what we gain in runtime, we lose in approximation bounds: the repeated FCBC algorithm has an additive approximation on the fairness objective that can get large in practice (we get a different Pareto front, with points quite far away in the objective cost, compared to the dynamic programming approach), and with no theoretical bounds on the additive approximation.

Our main theoretical contribution gives tight approximation bounds to the true Pareto front, needing only some general conditions on the objective functions. In other words, we are agnostic to the specific objective functions used. Furthermore, for some fairness objectives, prior work has shown that there is no polynomial-time algorithm that can offer a $O(n^{\delta})$ -approximation: Thm 7.1 [26] for fairness across clusters objectives.

While prior work provides approximation algorithms for clustering objectives with fairness constraints, these methods do not all readily extend to computing the Pareto front. A contribution of our paper is to adapt one such method for computing the entire Pareto front: we adapted the FCBC algorithm [26] that optimizes fairness under a bounded clustering cost by varying the bound on the clustering cost. In doing so, we explore the runtime-approximation tradeoff. We consider this a contribution of our paper: we adapted FCBC to compute the Pareto front, and we compare it in practice with the Pareto front obtained by dynamic programming. Although FCBC has a polynomial runtime, in practice, the approximations are far from the true Pareto front.

Sum/max of imbalances: We agree with the reviewer that the sum of imbalances and max imbalances objectives are quite restrictive; in fact, we only give them as examples of objectives for which there are poly-time algorithms for computing the Pareto front. These objectives exemplify where the difficulty of computing the Pareto front arises from: fairness objectives that require clusters to match the group proportions present in the general population are difficult to optimize for even with approximation algorithms (Esmaeili et al [26] and our work). The reason for this difficulty can be explained by a reduction of our problem from the exact 3-cover problem, detailed by Esmaili et al [26] which shows that it is NP-hard to find an assignment of minimum cost where all clusters have red and blue points which match a dataset ratio of 1:3. In contrast, simpler objectives that only compute the difference between the number of nodes in each sensitive group across clusters allow poly time algorithms. We would be glad to clarify this point in the final version of the paper.

Minor comments: we thank the reviewer for noticing these minor issues, we will fix them in the final version of the paper.

2024-08-10

Thanks the authors for their detailed response. For the question, I was thinking about in Figure 2, FCBC only provides discrete points instead of a continuous Pareto front and they mentioned it is problematic when FCBC only recovers the vanilla clustering while the practitioner is willing to tradeoff the clustering cost for better fairness. I wonder whether it is because only one side constraint is imposed. Now, I figured out it does not make sense to add the lower bound. Thanks for explaining more about the running time and objectives. I think the comparison with the previous method is fair enough and shows the merit of the proposed dynamic programming in computing the Pareto front. Overall, I am willing to raise my score.

2024-08-13

Thank you to the reviewer for the reply and positive feedback! From our intuition, the problem the reviewer brings up might indeed arise from the one-sided constraint imposed + looser approximations (i.e. even if the cost constraint $\leq U$ , is imposed, the FCBC might recover a point with cost $<< U$ .

作者回复

2024-08-07

We appreciate all reviewers’ suggestions and positive comments. In particular, reviewers found our proposed problem novel and interesting, and with solid theoretical contributions.

In this general response, we’d like to comment on a few main threads opened by reviewers:

Runtime: one of the main comments is on the runtime of our proposed dynamic programming algorithm in computing the Pareto front. We note the following:

While previous work provides approximation algorithms for optimization clustering under a fairness constraint, none offers a method for computing the Pareto front. On the question ‘can we have faster algorithms with worse approximation bounds?’, the answer is yes. Our paper provides (the first, to our knowledge) such exploration: We show in Section 4.2 that we can adapt such polynomial-time methods to compute an approximation of the Pareto front; however, what we gain in runtime, we lose in approximation bounds: the repeated FCBC algorithm has an additive approximation on the fairness objective that can get large in practice (we get a different Pareto front, with points quite far away in the objective cost, compared to the dynamic programming approach), and with no theoretical bounds on the additive approximation.
For fixed number of clusters and number of sensitive attributes, the runtime of our dynamic programming algorithm is polynomial; the complexity arises from allowing the number of attributes and the number of clusters to vary. In related work on computing Pareto fronts in multiobjective optimization problems, objectives considered in these problems do not run the problem of an additional varying quantity, such as the number of clusters in our case. As a side note, recent faster approaches such as MOEA/D reduce the running time by dividing the problem in subproblems and solving them simultaneously (Zhang and Li 2007 [60]); this approach works best for objectives with continuous and convex Pareto fronts, as well as for linearly separable objectives. In our problems, we do not benefit from such properties, since the Pareto front need not be convex, and the clustering cost is not separable: reassigning a point to a different cluster changes the centers of both the original cluster and the new cluster. We would be glad to include a more extensive discussion on related work and running time in the final version of the paper.
Overall, our main theoretical contribution gives tight approximation bounds to the true Pareto front, needing only some general conditions on the objective functions. In other words, we are agnostic to the specific objective functions used. Both theoretical and empirical analysis highlight the complexity of this problem: certain objectives do not allow even a good poly-time approximation (objectives that sum fairness across clusters, as seen in previous work), whereas other objectives allow a poly-time algorithm (objectives such as sum/max of imbalances, as exemplified by our work).

Limitations of sum/max of imbalances: A few reviewers have pointed out limitations of the sum/max of imbalances objectives. We agree that such objectives are simple and only apply in limited settings; we do not propose them as ideal objectives, but rather give them as examples for which we can compute the Pareto front in polynomial time. These objectives exemplify where the difficulty of computing the Pareto front arises from: fairness objectives that require clusters to match the group proportions present in the general population are difficult to optimize for even with approximation algorithms (Esmaeili et al [26] and our work); in contrast, simpler objectives that only compute the difference between the number of nodes in each sensitive group across clusters allow poly-time algorithms.

Our paper is agnostic to specific objectives chosen; we highlight the main contribution of our paper as giving sufficient conditions for general classes of fairness and clustering objectives in order to be able to compute the Pareto front — the choice of which objectives may be of most use lies with a central decision-maker who can adjust the objective design based on of specific applications of clustering with fairness considerations.

We agree with all reviewers that this work opens several follow-up questions, from obtaining faster algorithms with even looser approximation guarantees on either the clustering or fairness objective, to extending this work to individual fairness objectives. We believe this to be a testament to an interesting problem that opens new avenues for research.

评论- Please comment on the rebuttal

2024-08-12

Dear review eL7E and 95VK,

Can you please comment on the responses from the authors? Please also try to read the responses to other reviewers, in addition to yours.

Regards, Area Chair

最终决定Accept (poster)

2024-09-25

This paper initiates the study of reporting the Pareto front of the fairness-quality tradeoff for fair clustering, under various notions of fairness.

As agreed by the reviewers, the topic is new and well motivated. However, as also pointed out by the reviewers, the major weakness is that the main result, which is an algorithm to report the (approximate) Pareto front, is exponential time. This makes the algorithm completely impractical.

Nonetheless, the suggested quantitative study of the tradeoff between fairness and quality is important, and may potentially generate high impact/many followup works. The conceptual value of the proposed new direction deserves to be presented in the conference.