PaperHub
7.3
/10
Poster4 位审稿人
最低4最高5标准差0.5
4
4
5
5
4.5
置信度
创新性2.8
质量3.5
清晰度3.5
重要性3.0
NeurIPS 2025

Improved Algorithms for Overlapping and Robust Clustering of Edge-Colored Hypergraphs: An LP-Based Combinatorial Approach

OpenReviewPDF
提交: 2025-05-12更新: 2025-10-29
TL;DR

This paper presents improved algorithms for overlapping and robust clustering of edge-colored hypergraphs; our algorithms combine the strengths of LP with the efficiency of combinatorial algorithms, efficiently producing high-quality solutions.

摘要

关键词
overlapping edge-colored clusteringrobust edge-colored clusteringedge-colored clusteringhypergraph clusteringprimal-dual methodsapproximation algorithms

评审与讨论

审稿意见
4

This paper studies the recently-introduced clustering problems on edge-colored hypergraphs, generally called Edge-Colored Clustering (ECC). In ECC, we are given an edge-colored hypergraph, where each hyperedge has a single color, and asked to assign colors to nodes so as to minimize the number of mistakes, i.e., the number of hyperedges that contain a node that does not have the hyperedge's color. The authors specifically consider the following three problems: Local ECC, Robust ECC, and Global ECC. Local ECC and Global ECC formulate the overlapping nature of clustering. Both problems consider assigning possibly multiple colors to each node; the former imposes an upper bound blocalb_\text{local} on the number of colors that each node can receive (i.e., Local), while the latter imposes an upper bound bglobalb_\text{global} on the total number of colors that the nodes can receive beyond the single colors (i.e., Global). On the other hand, Robust ECC is capable of taking into account the existence of outliers in hypergraphs, where we can ignore at most brobustb_\text{robust} nodes. The contribution of this paper is to develop primal-dual approximation algorithms for the above three problems. For Local ECC, the proposed algorithm achieves the state-of-the-art approximation ratio of blocal+1b_\text{local}+1. For Global ECC and Robust ECC, the proposed algorithms have the approximation ratios of 2(bglobal+1)2(b_\text{global}+1) and 2(brobust+1)2(b_\text{robust}+1), respectively. The authors also complement their positive results with some inapproximability results and integrality-gap results. Computational experiments demonstrate that the proposed algorithms work well in terms of both solution quality and running time.

优缺点分析

Strengths: This is a technically solid paper. Clustering is one of the central topics in data mining and machine learning, and particularly, ECC has recently been actively studied in the literature. The proposed algorithms are well motivated. Existing algorithms for ECC fall into two classes: (i) scalable combinatorial algorithms that only achieve poor approximation ratios (i.e., the maximum size of hyperedges) and (ii) LP rounding algorithms that achieve a good (bicriteria) approximation but require much time complexity. The proposed algorithms bridge this gap: The algorithms are scalable and recover (or sometimes discover) the best-known approximation ratios. The authors also complement their positive results with some inapproximability results. In particular, the authors show that Local ECC is (blocal+1ϵ)(b_\text{local}+1-\epsilon)-inapproximable and (blocalϵ)(b_\text{local}-\epsilon)-inapproximable under UGC and P != NP, respectively, which (almost) match the proposed algorithm's approximation ratio. The technical claims are all validated with rigorous mathematical discussions. Computational experiments are performed under reasonable setups, and it is observed that the proposed algorithms outperform the baseline methods.

Weaknesses: The novelty is slightly limited. As the problems studied are all existing problems, there is no essential novelty in the problem formulations. The only novelty can be found in the algorithm design and analysis, but all algorithms follow the standard technique for designing approximation algorithms, i.e., the primal-dual method, and the approximation-ratio analysis is an expected outcome. The reviewer does not say those are straightforward, but still, their novelty is not that significant. At least, the current statements like "we present a new algorithmic framework..." are exaggerations. Indeed, the authors do not present a new algorithmic framework; they follow the well-established algorithmic framework (i.e., the primal-dual method) to design approximation algorithms.

Minor comments:

  • The problem generalizations for which the authors present the proposed algorithms should be mentioned in the problem formulations.
  • Algorithm 1 should return σ\sigma.
  • Theorems 3.7 and 3.8 should be modified so that they refer to the concrete algorithms.
  • Figure 1 should be enlarged for readability.

问题

  • In the problem formulations, Local ECC and Global ECC look similar; however, in the algorithm section, it is stated that the proposed algorithms are similar for Global ECC and Robust ECC. Could the authors give a brief explanation why this happens? The reviewer imagines it is due to the global nature of the constraints, but some additional comments would be helpful.
  • In the algorithm descriptions, the concept of "unit rate" is a bit ambiguous. Why don't the authors explicitly specify such rates using the notation in the constraints?
  • Also, it is stated that the node selection ordering is arbitrary to prove the approximation ratio. However, the selection might be critical in the practical performance. Which selection rule do the authors employ in the experiments? Don't the authors investigate the effect of the selection in the practical performance? Figure 1(b) shows that the relative error comes almost solely from the mistakes of the algorithms, and thus, there might be some room for improving the practical performance.

局限性

yes.

最终评判理由

My original assessment was borderline accept. After carefully reviewing the authors’ responses and the discussions with the other reviewers, I find the proposed revisions to be reasonable. Although I still consider the novelty of the work to be somewhat limited, I maintain a positive overall view of the paper.

格式问题

N/A

作者回复

Thank you for your valuable comments and questions. They will be very helpful in improving this paper, especially in refining how we describe our algorithms and enhancing its clarity.

We view the main contributions of our paper as (i) providing improved algorithms for Local ECC, Robust ECC, and Global ECC that are both fast and produce high‑quality solutions, and (ii) answering previous questions about whether the existing algorithms could be improved, thereby offering a clearer picture of these problems. Using the primal-dual method, in conjunction with existing or new LPs, we obtain a family of algorithms that solves all three problems. Of course, we do not claim credit for the primal–dual method itself. If our choice of the word "framework" may give rise to such a misunderstanding (and we see how that might happen), we will find a more appropriate expression. At present, we are considering the term "collection of algorithms", but we would welcome suggestions for better alternatives.

We now address the minor comments and questions one by one.

  • The problem generalizations ...

While the algorithms we present in Section 3 (and in the appendix) solve slightly generalized versions of the problems, we defined the problems without these generalizations in Section 2. We initially made this choice because the paper did not include an explanation of the contexts in which such generalizations arise, and moreover, the existing literature had only considered the versions defined in Section 2. However, we agree with the comment that the generalized version of the problem definition should be introduced earlier rather than waiting until we present the algorithms. If the paper is accepted, we will include in Section 2 of the camera-ready version an explanation of these generalizations. Thank you for this suggestion.

  • Algorithm 1 should return sigma\\sigma.

We will fix this in the camera-ready version.

  • Theorems 3.7 and 3.8 should ...

Theorems 3.7 and 3.8 provide lower bounds on the integrality gaps. An integrality gap is a property of an LP relaxation itself, rather than of any algorithm using the relaxation. It measures how much relaxing the integrality constraints of an ILP can change the optimal value, revealing an inherent limitation on the performance of any algorithm whose analysis compares its output against the LP optimum. The two theorems did not refer to a concrete algorithm as they concern integrality gaps.

  • Figure 1 should be enlarged for readability.

Thank you for bringing this issue to our attention. Another review also raised a related point. As suggested in that review, we will redraw Figure 1(a) using a log scale. We will also enlarge both Figure 1(a) and Figure 1(b) in the camera-ready version.

  • In the problem formulations, Local ECC and Global ECC look similar; ...

That is exactly correct. Both problems involve global constraints, and as a result their LP formulations become somewhat similar. Consequently, the algorithm for Global ECC is closer to that for Robust ECC than to that for Local ECC, which is why we presented the first two algorithms together. As suggested, we will add a comment in the paper explaining this similarity. We appreciate this helpful suggestion.

  • In the algorithm descriptions, the concept of "unit rate" is ...

Thank you for sharing this feedback. We have carefully considered throughout the preparation of this manuscript how to best present the algorithms—balancing between an intuitive process‑over‑time presentation and a fully mechanical discretized one. The process‑over‑time view does not directly correspond to how the algorithm is mechanically implemented; in this view, only the relative rates matter, and their absolute values do not. For example, suppose two variables x1x_1 and x2x_2 start at 0 and increase at rates 1 and 2, respectively, until one reaches 1. After 0.5 units of imaginary "time", we obtain x1=0.5x_1 = 0.5 and x2=1x_2 = 1. If the rates were instead 2 and 4, the elapsed time would be 0.25, but the resulting values of x1x_1 and x2x_2 would be identical. This illustrates that the specific absolute values of the rates are irrelevant, which is why we used "unit rates" to keep the presentation (and, more importantly, the analysis) simple.

That said, your comment is especially helpful, as it highlights a point that was not adequately emphasized in the paper. While the process‑over‑time presentation is intuitive, it hides details of how the algorithm is actually implemented. This may cause confusion. For this reason, we provided a discretized version (see, e.g., Algorithm 2) that makes all details explicit enough to be directly translated into code, but this point was not sufficiently emphasized in the paper. We will clarify this point in the camera‑ready version. Thank you for raising this issue.

  • Also, it is stated that the node selection ordering is ...

We are not entirely certain whether we have correctly understood this question. We presume it concerns our algorithm for Local ECC, since it is the only algorithm that involves a node ordering. If "node selection ordering" refers to the order by which vertices are considered in the for loop of Algorithm 1, then it is correct that this choice leaves some flexibility in implementation. Initially, we did not think that this ordering would be important, and our implementation of Algorithm 1 simply processed the vertices in the order they appear in the input.

In response to this comment, we conducted an additional experiment where the vertices were selected in a random order. For each instance, we performed 20 independent runs and observed that the solution quality is improved by <2.1% on average compared to the original implementation, suggesting that the potential for optimization is relatively limited. We will add a discussion of this point to the camera‑ready version of the paper.

We hope our responses have adequately addressed your comments. We greatly appreciate your thoughtful review and consideration.

评论

Thank you for addressing my comments. I have carefully reviewed the authors’ response and found it reasonable.

Theorems 3.7 and 3.8 provide lower bounds on the integrality gaps...

My apologies, I intended to refer to Theorems 3.5 and 3.6 instead of Theorems 3.7 and 3.8. In these theorems, the authors only state the existence of algorithms; however, they indeed have the algorithms, and the theorems should be modified to explicitly mention this.

Although the novelty of the work is somewhat limited, I hold a positive view of the paper.

评论

Thank you for the clarification and nice suggestion. We will explicitly refer to the concrete algorithms in Theorems 3.5 and 3.6.

审稿意见
4

The edge-coloring clustering problem is given a hypergraph (V,E)(V,E) along with a coloring of the edges c:E[C]c:E\rightarrow [C]. Given a coloring of the nodes σ:V[2C]\sigma: V\rightarrow [2^C], we say that σ(v)\sigma(v) induces a mistake on eEe\in E if vev\in e and c(e)σ(v)c(e)\notin \sigma(v).

The goal is to find σ\sigma such that the number of mistakes are minimized. Since a trivial solution making no mistakes colors all nodes with all colors, the authors consider constrained versions of the problem where each node can have at most bb many colors (LECC) or the total number of colors is at most V+b|V|+b (GECC). Finally, the authors also consider a variant where σ:V[C]\sigma: V\rightarrow [C], but nodes may be removed from the graph (RECC).

For all of these problems, the authors design approximation algorithms. Specifically for LECC, the authors propose a primal dual algorithm that with an approximation ratio of b+1b+1, which is optimal assuming the unique games conjecture and nearly optimal assuming PNPP\neq NP. For the other two algorithms the authors devise approximation algorithms that match the integrality gap of their proposed linear program.

优缺点分析

I am not sure how to rate a paper like this in the context of NeurIPS. To me this is a classic approximation algorithms paper that belongs at a theory conference. The connection to clustering or machine learning is fairly strenuous, in my opinion, regardless of whether or not this problem has been previously in a paper published at similar conferences.

As is often the case with theory papers submitted to NeurIPS, nearly all technical details are in the appendix. While it is hard to fault the authors for this, they might want to consider how to distribute the pages that the reviewers are expected to read. I read the LECC algorithms and lower bounds, as well as the RECC algorithm as this was the chronological order in which they were presented in the main body and also seemed to be how the authors themselves valued the results. If for some reason the GECC result was the most important one (which I didn't read), the authors may want to consider revising the paper.

I would also encourage the authors to not "waste" three pages of main body on experiments with a rather dubious quality. The main take-away message seems to be that the previous greedy algorithm (that algorithm is never specified), is worse than the LP-based algorithm. I would have believed that with significantly less space, and preferred more theoretical content in the main body. If the authors did this because some previous reviewer did not engage with the proofs and instead complained about a lack of experiments, then they have my sympathies, but the criticism remains.

In any case, as far as approximation algorithms go, the algorithm for LECC is technically fairly basic, being a standard primal dual algorithm that commits to colors when complimentary slackness conditions are satisfied. People familiar with similar algorithms for set cover and facility location should be able to derive these results very quickly without looking at this paper. The hardness results for LECC are also standard reductions to vertex cover in hypergraphs. The RECC problem is likewise solved with a primal dual approach. I did not read [19,20], but apparently it is based on the same LP relaxation save for adding the constraint that encodes that every node is assigned at most one color or it is removed. Why the previous algorithms did not consider this constraint is puzzling, as without it the integer linear program can assign multiple colors to a node. Then even the ILP will have an unbounded gap to the value of the optimal solution. In any case, including both LP formulations in the main body would have been nice as while it is not to difficult for the reader familiar with primal dual algorithms to formulate the LPs, I do not believe that merely stating the dual in the main body will be of much use to readers that are not familiar.

If I were to rate this paper for a top algorithms conference, it would be a reject as the techniques are well known. The problem has been studied previously, so it seems that there might be some interest in having a solution, though I still doubt that this algorithm or for that matter the problem are practically viable. I eventually talked myself into a minor positive rating for this paper, as it is a thorough treatment of the problem.

问题

No questions.

局限性

yes

最终评判理由

Nothing changed, so my opinion also has not changed.

格式问题

none

作者回复

Thank you for your valuable comments. They will be very helpful in improving this paper. In particular, they have been insightful in guiding us on how to reorganize the paper for the camera-ready version that will be prepared if the paper is accepted, where one additional page will be allowed.

Although the Questions section did not contain any questions, we would still like to briefly give one mathematical clarification and share a few explanations of our perspectives.

The clarification is regarding the comment "the ILP [of Crane et al.] will have an unbounded gap". Our LP relaxation for Robust ECC is as follows:

\\begin{aligned} \\min \\ & \\sum_{e \\in E} w_e y_e \\\\ \\text{s.t. } \\ & z_v + \\sum_{c \\in C} x_{v,c} \\le 1, & & \\forall v \\in V, \\\\ & z_v + x_{v,c_e} + y_e \\ge 1, & & \\forall e \\in E, \\ v \\in e, \\\\ & \\sum_{v \\in V} z_v \\le b_{\\mathrm{robust}}, & & \\\\ & x_{v,c} \\ge 0, & & \\forall v \\in V, c \\in C, \\\\ & y_e \\ge 0, & & \\forall e \\in E, \\\\ & z_v \\ge 0, & & \\forall v \\in V, \\end{aligned}

whereas Crane et al. [19] uses the following LP (slightly modified for easier comparison):

\\begin{aligned} \\min \\ & \\sum_{e \\in E} y_e \\\\ \\text{s.t. } \\ & \\sum_{c \\in C} x_{v,c} \\ge |C|-1, & & \\forall v \\in V, \\\\ & x_{v,c_e} - z_v \\le y_e, & & \\forall e \\in E, \\ v \\in e, \\\\ & \\sum_{v \\in V} z_v \\le b_{\\mathrm{robust}}, & & \\\\ & x_{v,c} \\in [0,1], & & \\forall v \\in V, c \\in C, \\\\ & y_e \\in [0,1], & & \\forall e \\in E, \\\\ & z_v \\in [0,1], & & \\forall v \\in V. \\end{aligned}

(The two LPs use opposite senses for the "binary" variable xx.) Note that the ILPs obtained by adding the integrality constraint xv,c,ye,zvZx_{v,c},y_{e},z_{v}\in\mathbb{Z} to the above two LPs both yield the exact optimal value of the original problem. To put it differently, if we take all integral feasible solutions to each LP and project them onto the space of yy's and zz's, the two LPs give the same set, whereas doing the same for all fractional feasible solutions results in different sets (as is often the case between two lifted LP relaxations for an optimization problem with differing integrality gaps).

There was a comment that the volume of experimental analysis in the main body is too large compared to the theoretical content. Thank you for the comment. We agree that the theoretical analysis did not receive ideal coverage in the main body of the paper. While we provided an almost complete sketch of the analysis for our algorithm for Local ECC in Section 3, we were not able to do the same for the other algorithms. In determining the relative amount of theoretical and experimental analysis, we also referred to the literature as the three problems were previously studied; however, as the reviewer noted, our main consideration was what should be presented within the first nine pages. Since the primary contribution of this paper is in giving new algorithms that outperform existing ones, it was crucial to demonstrate the advantages of the new methods. We felt that not all readers would readily be convinced of these improvements without thorough experimental results on both solution quality and running time. That said, if the paper is accepted, the camera-ready version allows one additional page, which will help address the concern. Additionally, the description of the experimental setup currently occupies about one page of the main body; we plan to condense this part and reallocate space to bring part of the theoretical analysis from the appendix into the main body. Thank you for this helpful comment.

Another point made in the review was that the hardness proof is relatively simple. We agree that this proof is not technically challenging, and we do not consider it to be a primary contribution of the paper. However, we felt it was important to make this hardness result part of the published record, as it was not known in the existing literature and, in fact, questions had even been posed that this hardness result rules out. To settle these questions and to provide a better landscape of the problems, we thought it worthwhile to point out that a relatively simple argument can be used to show the desired hardness results. For the same reason—to provide a better landscape—we also reported integrality gap results.

The review also pointed out that, in Section 3.2, we presented only the dual LP without the primal. We fully agree that additionally including the primal would be preferable, and we will do so in the camera-ready version. In that section, we provided only (an outline of) the algorithm without detailed analysis due to space constraints, and the dual alone was "sufficient" for the mechanical description of the algorithm. However, we completely agree that presenting both the primal and dual LPs would be much better, and we will make this improvement in the camera-ready version with the additional page allowed.

Overall, taking the reviewer’s valuable comments into account and making use of the additional page available in the camera-ready version, we will make adjustments to the organization of the paper—particularly regarding what belongs in the main body versus the appendix.

We hope our responses have adequately addressed your comments. We greatly appreciate your thoughtful review and consideration.

审稿意见
5

This paper studies overlapping and robust edge-colored clustering (ECC) problem on hypergraphs. It is a node multi-coloring task such that the total number (even weight) of mistaken hyperedges whose colors are not covered by their nodes is minimized. The authors investigate three versions named Local, Global and Robust ECCs that have some kinds of budgeted number of node colors or node deletion, and present several algorithmic and lower bound results on them. The main technical contributions include new characterizations of ECCs by linear programming, and combinatorial algorithms based on the primal-dual schema that brings improvements in solution quality. Since the new algorithmic framework is combinatorial and doesn’t need to actually solve LP, it also brings great improvement in time complexity (theoretically). The proofs of lower bound results, including the reductions and instance designs, are constructive. The solution quality and efficiency of their algorithms are evaluated by experiments.

优缺点分析

Strengths:

  • This paper has provided a new framework of primal-dual schema for the three kinds of ECC problems.
  • It is well written and easy to follow.
  • The experimental results are good.

Weaknesses:

  • I did not find many weaknesses except that the six subfigures of Figure 1(a) are hard to read. The running times of Greedy and Proposed methods are indistinguishable. It is better to have logarithmic scaling on the y-axis also.

问题

N/A.

局限性

Yes.

最终评判理由

This paper proposes combinatorial algorithms for ECC based on the classic primal-dual schema. I think it should be accept.

格式问题

No major formatting issues.

作者回复

Thank you for your valuable comments. They will be very helpful in improving this paper, especially in the presentation of the experimental results.

As you suggested, we replotted Figure 1(a) with the y-axis in log scale, which makes the differences between the lines much clearer. We will update the figure, and indicate in its caption that the y-axis is in log scale.

We hope our responses have adequately addressed your comments. We greatly appreciate your thoughtful review and consideration.

评论

Thank the authors' response. Although I have noticed the methodological novelty issue raised by other reviewers, I still think that this paper is worth publishing. I stick to my score.

审稿意见
5

The authors consider several variants of edge coloring clustering (ECC) in hypergraphs. The goal of the problem is to given colored edges, color vertices minimizing disagreements. The variants are where we allow multiple colors for vertices but restricting the number of allowed colors. The authors propose approximation algorithms for their LP-based problems, and provide inapproximability results. The authors also combine additional experiments.

优缺点分析

  • The approximation results are interesting, and the inapproximability results provide a more complete results.
  • The paper is well-written, though relatively dense.
  • As the algorithms are LP-based, the running times are not clear. Are these algorithms scalable? What is the running time for these? Can one do a combinatorial algorithm derived from LP?
  • The practical impact of these problems may be limited, nevertheless the relaxation variants are worth studying.

问题

What is the computational complexity of the original ECC? Can we develop combinatorial algorithms for the proposed variants?

局限性

yes

最终评判理由

In my view, this is a good submission, and the upsides outweigh downsides.

格式问题

No issues

作者回复

Thank you for your valuable comments and questions. They will be very helpful in improving this paper. In particular, multiple comments of yours prompted us to reconsider what should be included in the main body of the paper rather than in the appendix.

While we did present the asymptotic running times of all our algorithms, most of them were in the appendix and not very visible:

  • Our algorithm for Local ECC runs in linear time, i.e., O(sumvinVdv)O(\\sum_{v\\in V}d_v) time. Note that the length of the description of a hypergraph is O(sumvinVdv)O(\\sum_{v\\in V}d_v). This is the only algorithm whose running time was mentioned in the main body of the paper, on page 5. The analysis is, however, in the appendix, Lemma B.3.

  • Our algorithm for Robust ECC and our algorithm for Global ECC both run in O(EsumvinVdv)O(|E|\\sum_{v\\in V}d_v) time. This fact was not mentioned in the main body. The analysis is also in the appendix, Lemma D.3.

We will include a short discussion on these asymptotic running times in the main body of the paper.

A summary of previous work was presented also in the appendix (Appendix A), illustrating how ECC and its overlapping and robust generalizations are applied and how they relate to correlation clustering (CC). We will move this summary to the main body as well.

While the aforementioned theoretical running time bounds of our algorithms are small, their scalability can be fully established only after verifying their empirical performance as well. We empirically evaluated our algorithms' running times in Section 4. The experiments confirmed that our algorithms run significantly faster than previous LP-rounding algorithms, demonstrating their scalability. This is because our algorithm does not solve an LP: only in its analysis LPs are used. This is exactly the reason why all our algorithms are indeed combinatorial.

Another question was on the computational complexity of the ECC. Angel et al. [8] showed the NP-hardness of the "vanilla" version of ECC as we mentioned in Section 1, and Veldt [48] showed its APX-hardness.

We hope our responses have adequately addressed your comments. We greatly appreciate your thoughtful review and consideration.

评论

Thank you for your response. I keep my scores unchanged as they were already positive.

最终决定

Reviewer fUgt provides a perfect summary of this submission: “This paper studies overlapping and robust edge-colored clustering (ECC) problem on hypergraphs. It is a node multi-coloring task such that the total number (or weight) of mistaken hyperedges whose colors are not covered by their nodes is minimized. The authors investigate three versions named Local, Global and Robust ECCs that have some kinds of budgeted number of node colors or node deletion, and present several algorithmic and lower bound results on them. The main technical contributions include new characterizations of ECCs by linear programming, and combinatorial algorithms based on the primal-dual schema that brings improvements in solution quality. Since the new algorithmic framework is combinatorial and doesn’t need to actually solve LP, it also brings great improvement in time complexity (theoretically). The proofs of lower bound results, including the reductions and instance designs, are constructive. The solution quality and efficiency of their algorithms are evaluated by experiments.”

The general opinion about this submission is quite positive. The considered problem is interesting, and the paper provides a thorough and comprehensive treatment of the problem. The only weakness mentioned by reviewers HJYD and oQ2L is that the novelty of the work is somewhat limited because the methods (e.g. primal-dual approximation algorithms) are well-known. On a technical level this submission is not a breakthrough, however, the problem and the results are interesting.