Rhomboid Tiling for Geometric Graph Deep Learning
摘要
评审与讨论
This paper proposes Rhomboid Tiling (RT) clustering, a hierarchical clustering method designed for geometric graph deep learning. The method leverages higher-order Voronoi structures to improve graph pooling and demonstrates competitive performance across multiple benchmark datasets.
给作者的问题
Please see weaknesses and comments.
论据与证据
The claims in the submission are generally supported by mathematical proofs, theoretical analysis, and empirical results, but the lack of computational complexity discussion weakens the justification of its scalability and key design choices.
方法与评估标准
The proposed RT clustering method and RTPool model are well-aligned with the problem of geometric graph learning, and the evaluation criteria, including benchmark datasets from molecular and bioinformatics domains are appropriate; however, broader dataset diversity could further validate generalizability.
理论论述
The theoretical proofs, particularly Theorems 3.1, 3.2, and 3.3, appear mathematically sound, but the lack of computational complexity analysis raises concerns about scalability.
实验设计与分析
The experimental design, including comparisons across 7 benchmark datasets and ablation studies, is generally sound,
补充材料
The supplementary material was not explicitly provided in the reviewed content, but the appendix includes proofs of theoretical claims and experimental details, which were examined for mathematical soundness and experimental validity.
与现有文献的关系
The paper builds on prior work in graph pooling (e.g., DiffPool, MinCutPool), computational geometry (Voronoi tessellation, Delaunay complexes), and topological data analysis.
遗漏的重要参考文献
The paper thoroughly discusses related works in graph pooling (DiffPool, MinCutPool), computational geometry (Voronoi tessellation, Delaunay complexes), and topological methods (Wit-TopoPool, SIN), but it lacks citations to recent advances in topological data analysis (e.g., persistent homology for graph learning) and efficient geometric clustering methods, which could provide additional context for its contributions.
其他优缺点
Strengths:
1.The paper provides solid theoretical support for the proposed method.
2.The authors have released the code to ensure reproducibility.
3.Extensive experiments validate the effectiveness of the proposed approach.
Weaknesses:
1.The paper lacks a detailed description of the datasets, making it difficult to determine the data dimensions and raising concerns about the scalability of the proposed method.
2.The comparative analysis does not include recent studies from 2024, making it challenging to demonstrate the effectiveness of the proposed approach against more recent advancements.
3.The paper lacks complexity analysis, both theoretically and experimentally.
4.It would be beneficial to include additional visualization experiments to illustrate how RT clustering performs hierarchical clustering.
其他意见或建议
-
Clarity of Notation and Mathematical Definitions – Some mathematical formulations, particularly in Section 3 (RT Clustering Definition), are complex and may benefit from more intuitive explanations or examples. A clearer breakdown of key notations and geometric interpretations would improve readability, especially for non-experts in computational geometry.
-
Baseline Selection and Fair Comparisons – While the paper compares against 19 methods, it is unclear whether hyperparameters for all baselines were tuned fairly or if default settings were used. A discussion on how baseline models were optimized would ensure a fair evaluation of the proposed method’s advantages.
-
Hyperparameter Sensitivity Analysis – The paper does not explore how sensitive the method is to key hyperparameters, such as the number of pooling layers. A hyperparameter sensitivity study would help assess the robustness of the approach across different datasets.
We sincerely thank the reviewer for the thoughtful and constructive feedback. Below we address each concern raised.
1. Dataset Description
To address the reviewer’s concern, we provide a summary of the datasets used in our classification experiments:
Table: Description of classification datasets
| Dataset | BZR | COX2 | MUTAG | PTC_MR | PTC_MM | PTC_FM | PTC_FR |
|---|---|---|---|---|---|---|---|
| No. graphs | 405 | 467 | 188 | 344 | 336 | 349 | 351 |
| Avg. nodes | 35.75 | 41.22 | 17.93 | 25.56 | 24.25 | 25.00 | 24.96 |
We also include additional experiments on regression tasks and social network datasets. Their statistics are summarized below. Please refer to our response to Reviewer FBWn for performance results.
Table: Description of regression and social datasets
| Dataset | Esol | FreeSolv | Lipo | IMDB-BINARY | IMDB-MULTI |
|---|---|---|---|---|---|
| No. graphs | 1128 | 642 | 4200 | 1000 | 1500 |
| Avg. nodes | 26.00 | 18.00 | 49.00 | 19.77 | 13.00 |
2. Comparison with Recent Studies
We thank the reviewer for this suggestion. To address it, we included two pooling methods published in 2024:
- Hop-Pool (Zhang et al. (2024), Multi-hop graph pooling adversarial network for cross-domain remaining useful life prediction, Reliability Engineering & System Safety, 244, 109950.)
- Mv-Pool (Ma et al. (2024), GraphADT: empowering interpretable predictions of acute dermal toxicity with multi-view graph pooling and structure remapping, Bioinformatics, 40(7), btae438.)
We followed the default settings from their original papers and conducted a grid search around those values to fairly tune the hyperparameters. The results of these methods compared to ours are summarized below:
Table: Performance of different models on benchmark datasets (mean ± std). Best results are in bold.
| Model | BZR | COX2 | MUTAG | PTC_MR | PTC_MM | PTC_FM | PTC_FR |
|---|---|---|---|---|---|---|---|
| Hop-Pool | 85.37±4.36 | 85.11±3.74 | 94.74±4.76 | 65.71±2.85 | 73.59±5.27 | 64.15±4.62 | 65.71±3.71 |
| Mv-Pool | 78.05±3.38 | 82.98±5.24 | 89.64±2.43 | 68.58±2.61 | 70.65±4.83 | 62.86±3.37 | 65.72±2.14 |
| RTPool | 88.29±0.98 | 89.36±2.33 | 94.74±3.33 | 76.57±1.14 | 82.94±2.20 | 77.72±1.14 | 82.29±2.80 |
These results show that RTPool consistently outperforms or matches state-of-the-art methods from 2024 on all benchmark datasets.
3. Computational Complexity
Due to space constraints, please refer to our detailed theoretical and empirical complexity analysis provided in our response to Reviewer kfVb. In short, our RT pooling model have overall complexity of in , and our runtime comparisons confirm the model’s efficiency.
4. Additional Visualization
We appreciate the suggestion. We have added a visualization example using the molecular graph of Formaldehyde to illustrate how RTpool performs hierarchical clustering pooling. The visualization is available at the anonymous link: RTPool_Visual
5. Clarity of Notations
We have revised our notation and mathematical expressions to improve consistency and clarity—e.g., clarifying the usage of matrices and . Additionally, we added an example in the appendix using the molecular graph of Formaldehyde to illustrate the construction of its rhomboid tiling in a step-by-step manner to improve the readability.
6. Baseline Comparisons
The baseline results are taken from the Wit-TopoPool paper, which also serves as a strong recent benchmark. According to the authors, all baselines in that work, including Wit-TopoPool itself, were tuned using grid search over a fixed set of hyperparameter choices, and evaluated under the same cross-validation protocol. Our model uses the same setting, and this ensures a fair and consistent comparison.
7. Hyperparameter Sensitivity
We thank the reviewer for pointing this out. We have already included a sensitivity analysis of the key structural hyperparameters and the number of pooling layers in our response to Reviewer z8We.
In addition, we performed sensitivity experiments on the learning rate and the final dropout rate. The results are summarized below.
Table: Sensitivity to learning rate (LR)
| Dataset | LR | Accuracy |
|---|---|---|
| COX2 | 0.0002 | 87.23±1.50 |
| 0.0005 | 87.66±0.95 | |
| 0.001 | 89.36±2.33 | |
| 0.002 | 88.93±1.78 | |
| MUTAG | 0.0002 | 89.47±0.00 |
| 0.0005 | 91.57±2.88 | |
| 0.001 | 94.74±3.33 | |
| 0.002 | 93.68±2.10 | |
| PTC_MR | 0.0001 | 73.72 ± 2.39 |
| 0.0002 | 76.57 ± 1.57 | |
| 0.0005 | 72.57 ± 1.40 | |
| 0.001 | 70.29 ± 1.40 |
Table: Sensitivity to final dropout rate
| Dataset | Dropout | Accuracy |
|---|---|---|
| COX2 | 0.4 | 88.51 ± 1.91 |
| 0.5 | 89.36 ± 2.33 | |
| 0.6 | 88.93 ± 1.78 | |
| MUTAG | 0.4 | 92.63 ± 2.88 |
| 0.5 | 94.74 ± 3.33 | |
| 0.6 | 93.69 ± 2.36 | |
| PTC_MR | 0.2 | 74.86 ± 1.27 |
| 0.3 | 76.57 ± 1.14 | |
| 0.4 | 75.43 ± 1.56 |
These results show that RTPool is relatively robust to variations in learning rate and dropout.
This paper proposes a geometry-aware graph clustering algorithm for enhancing geometric graph classification performance across diverse datasets. The method captures high-order structural information of geometric graphs through high-order Voronoi tessellation and Delaunay complexes. Building on this foundation, this paper introduces a hierarchical graph pooling operation that constructs a normalized clustering matrix via Rhomboid incidence matrices. This operation aggregates lower-order cluster features into higher-order representations and updates node features using highly expressive GNNs. The underlying graph structure can be flexibly configured as either Delaunay graphs or generated graphs expanded from chemical bonds to incorporate structural priors. Extensive experiments on multiple benchmark datasets demonstrate comparable or competitive results against state-of-the-art methods.
给作者的问题
How does the incremental construction mechanism of rhomboid tiling mentioned in the paper handle geometric conflicts between newly added clusters and existing clusters?
论据与证据
The motivation of this paper is clearly articulated. Traditional connectivity-based graph clustering methods lack effective geometric information and fail to adequately capture the inherent spatial geometric features in geometric graphs. Furthermore, topology-driven clustering approaches struggle to identify geometric substructures in dense regions and cannot effectively characterize spatial proximity relationships between atoms or molecules, thereby limiting the model's ability to learn complex geometric patterns. Through rigorous analysis and theoretical proofs, this work confirms that the proposed Rhomboid Tiling-based clustering method effectively addresses these limitations.
方法与评估标准
This paper proposes a geometric-aware hierarchical clustering method based on the rhomboid tiling structure and a corresponding graph pooling model. The method constructs high-order geometric clusters through spherical partitioning of the spatial domain, where atomic nodes in geometric graphs are hierarchically aggregated based on spatial proximity. Lower-order clusters are formed by small-scale atomic groups, while higher-order clusters emerge by merging geometrically adjacent lower-order clusters. A rhomboid cell depth-based weighting mechanism is introduced to quantify the importance of subclusters. The model incorporates molecular chemical bonds to construct hierarchical graph structures, leverages GNNs to hierarchically extract geometric features across layers, and ultimately obtains global representations through multi-layer geometric pooling. This approach preserves 3D spatial relationships while enabling efficient hierarchical information abstraction. The method's design is well-founded and should enable efficient clustering of data with complex geometric structures.
理论论述
I have reviewed the proofs and reasoning in the main text, which are clear and comprehensible, and the relevant formulas are consistent with the descriptions in the methodology section of the paper.
实验设计与分析
The proposed method is validated on datasets including molecular property prediction and protein-protein interactions, with experimental results demonstrating that the proposed approach achieves SOTA performance. Ablation studies further confirm the critical impact of the proposed modules on performance enhancement. The experiments in this work are comprehensive, and the method exhibits promising predictive capabilities even on more challenging data scenarios.
补充材料
I carefully reviewed the proof process in the appendix of the paper and conducted a systematic derivation.
与现有文献的关系
The efficient clustering algorithm proposed in this paper holds significant implications for adaptive research in point cloud testing. The experiments conducted on complex data such as proteins have validated its feasibility, suggesting that it may also be beneficial for the clustering and construction of neighborhood relationships in point clouds, as discussed in [1].
[1]. Yang S, Wang Y, Van de Weijer J, et al. Trust your good friends: Source-free domain adaptation by reciprocal neighborhood clustering[J]. IEEE Transactions on pattern analysis and machine intelligence, 2023, 45(12): 15883-15895.
遗漏的重要参考文献
I believe the methodologies referenced and the literature discussed in this paper demonstrate relatively comprehensive coverage.
其他优缺点
This paper proposes a highly intriguing graph clustering algorithm. By introducing a novel Rhomboid Tiling clustering method, it effectively handles diverse types of data with complex geometric information, achieving highly accurate classification. I believe this work will provide significant inspiration for research in point cloud feature extraction, unsupervised adaptation.
其他意见或建议
None.
We appreciate the reviewer's positive recognition of our rhomboid tiling clustering framework and its potential for broader applications.
1.Question about geometric conflicts
Thank you for the insightful question.
Our method adopts a layer-wise hierarchical clustering strategy. Given a set of clusters corresponding to nodes in the layer of order- Delaunay complex, we apply RT clustering to map these clusters onto a higher-order structure—specifically, the new layer of order- Delaunay complex—where . During this process, we do not incrementally modify or extend the existing lower-order clusters. Instead, we directly construct an entirely new layer of clusters at order-.
As you pointed out, a possible concern is whether clusters in the new layer (order-) may geometrically conflict—e.g., a point being assigned to multiple clusters, or clusters overlapping in space. This can indeed occur at the point level; a point may belong to multiple higher-order clusters. However, this is not problematic.
In particular, if and are point sets forming two distinct clusters from the new layer at order-, then even if they share some points (i.e., ), their corresponding convex hulls and only intersect at their boundaries. Their interiors remain disjoint. This structural property ensures that the rhomboid tiling construction preserves geometric consistency, and does not introduce significant geometric conflicts during hierarchical clustering.
This paper introduces Rhomboid Tiling (RT) clustering, a novel hierarchical clustering method for geometric graph deep learning. The RT clustering approach is based on the rhomboid tiling structure, which extends Voronoi tessellation and Delaunay complex theory to efficiently capture high-order geometric relationships within graph-structured data. Based on RT clustering, the authors propose RTPool, a new graph clustering pooling model tailored for graph classification tasks.
给作者的问题
see the above
论据与证据
While the paper presents some experimental results, some claims require additional clarification and justification:
-
The geometric advantage of RT clustering over existing pooling methods is not extensively analyzed.
-
The computational complexity of RTPool is not explicitly compared against existing graph pooling models, making it unclear whether the method is scalable.
方法与评估标准
Yes, I think the choice of graph classification datasets is reasonable, as they involve molecular graphs where geometric properties are crucial.
理论论述
The paper presents several theoretical formulations related to high-order Voronoi tessellation, Delaunay complexes, and Rhomboid Tiling clustering. The main theorems (Theorems 3.1, 3.2, 3.3) are well-structured, but there is no empirical validation that confirms these theoretical claims in real-world datasets. Also, the weighting mechanism for RT clustering (Theorem 3.3) is introduced without much intuition on its impact on pooling quality.
实验设计与分析
The experimental setup is well-defined, following the same train-test split strategies as previous works.
补充材料
Yes, I have checked the proofs but (to be honest) not fully understand the details.
与现有文献的关系
I am not clear about this work's relationship to existing hierarchical clustering techniques, spectral pooling, and adaptive pooling. Authors may consider providing a more comprehensive review of related work and explicitly highlight how this method differs from classical pooling techniques.
遗漏的重要参考文献
N/A
其他优缺点
S1: The paper presents a novel geometric clustering approach based on rhomboid tiling, which has not been extensively explored in the context of graph pooling.
S2: The proposed RTPool method consistently outperforms existing state-of-the-art graph pooling approaches across seven benchmark datasets, including molecular and biochemical graphs.
W1: For readability, I feel that some mathematical notations and derivations are quite dense, making it challenging for readers unfamiliar with computational geometry.
W2: The paper does not clearly position its key contributions within the broader graph learning and pooling literature. While rhomboid tiling is novel, its relationship to existing hierarchical clustering techniques, spectral pooling, and adaptive pooling is not sufficiently discussed.
其他意见或建议
-
The choice of underlying graph representations (Delaunay graphs vs. generated graphs) is not fully justified.
-
A deeper analysis of how different choices of k-values impact clustering performance is needed.
updated after reading the rebuttal and responses
All my concerns have been clarified. Therefore, I am happy to increase my score to '4: Accept'.
We thank the reviewer for their valuable and constructive feedback. Below we respond to the key concerns raised.
1. Readability
We have revised our notation and mathematical expressions to improve consistency and clarity—e.g., clarifying the usage of matrices and . Additionally, we added an example in the appendix using the molecular graph of Formaldehyde to illustrate the construction of its rhomboid tiling in a step-by-step manner.
2. Positioning of RTpool
RTpool is a hierarchical clustering pooling method. Unlike traditional methods such as DiffPool, which learn how to group nodes via a trainable assignment matrix, RTpool performs clustering purely based on the geometric structure of the input graph.
This geometry-driven approach offers several advantages:
- It naturally incorporates higher-order geometric information from the graph, which contributes to the superior performance of RTpool.
- It avoids additional learnable parameters or training overhead, making the training process efficient.
- The clustering results depend only on the input graph's geometry and are less affected by noise or quality variations across the training dataset.
A known limitation is that RTpool does not support adaptive pooling to a target size. However, we can mitigate this by leveraging the "birthtime" of each cluster, which is naturally generated during the RT clustering process and reflects its importance. By removing clusters with low importance after pooling, we can control the final graph size if needed.
3. Choice of Underlying Graph
Thank you for pointing this out. We clarify our reasoning as follows:
-
RT clustering already relies on the geometric structure of the input graph. Since Delaunay-connected nodes are often clustered together, reusing Delaunay graphs after pooling adds limited new information.
-
RT clustering depends solely on geometric positions and does not fully capture edge-level connectivity of the initial graph. However, in domains like molecular graphs, edge connections (e.g., chemical bonds) carry crucial information. To compensate, we use the original molecular graph to construct generated graphs after pooling, allowing the model to retain important high-order connectivity information of initial graph.
Therefore, we adopt generated graphs as the underlying graph after each pooling layer to enhance representation learning.
4. Choice of
In RTpool, the choices of order is determined by two parameters: the difference and the number of pooling layers.
- can be viewed as the "step size": at each pooling layer, features are aggregated from order to order .
- The number of pooling layers determines how many such "steps" we perform in total.
We conducted a sensitivity analysis for both parameters. The results are shown below:
Table: Sensitivity to
| Dataset | Accuracy | |
|---|---|---|
| COX2 | 1 | 89.36 ± 2.33 |
| 2 | 92.76 ± 1.90 | |
| 3 | 86.38 ± 1.90 | |
| MUTAG | 1 | 94.74 ± 3.33 |
| 2 | 89.64 ± 2.36 | |
| 3 | 88.42 ± 2.10 | |
| PTC_MR | 1 | 76.57 ± 1.14 |
| 2 | 78.86 ± 1.57 | |
| 3 | 69.71 ± 1.56 |
Table: Sensitivity to the number of pooling layers
| Dataset | #Pooling Layers | Accuracy |
|---|---|---|
| COX2 | 1 | 89.36 ± 2.33 |
| 2 | 88.50 ± 1.17 | |
| MUTAG | 1 | 94.74 ± 3.33 |
| 2 | 90.52 ± 2.35 | |
| PTC_MR | 1 | 76.57 ± 1.14 |
| 2 | 72.57 ± 2.56 |
We observe that RTPool performs robustly when or , and shows degraded performance when . This aligns with our theoretical analysis, which suggests that setting may result in a loss of node feature information during the pooling process, thus hurting performance. Therefore, we recommend choosing or in practice. While our original model used by default, these experiments suggest that using can yield better results on some datasets.
The number of pooling layers functions similarly to the number of layers in an MLP—deeper models can increase capacity, but may also introduce overfitting or unnecessary compression on small datasets. For most of the small-to-medium-sized benchmarks in our experiments, we find that a single pooling layer is sufficient and often preferable in practice.
The authors have clarified my previous concerns. Thanks for the efforts. I have no further comments and will increase my score accordingly.
We sincerely appreciate the reviewer's positive feedback and support.
The author proposes a new pooling methgod inspired by the the rhomboid tiling structure. The author starts from “High-Order Voronoi Tessellation”, which is a method to partitate space based on points. The Voronoi cell Q indicates that all points blong to Q are clustered together and separated with other points. All the partitions are called order-K Voronoi tessellation. And the high-order Deluanay Complex, which is defined as the nerve of Voronoi Tessellation, encodes the relationship between these clusters. And Rhomboid Tiling can be used to build the relationship between different orders of Deluanay Complex.
With these theories as foundation, the author porpose the RT clustering, which basically use spheres to partition points. The partitions can be used to define geometric relationships or further clustering. And the author also provides methods on two important subproblem, how to choose step size between order k and how to assign weights to different points. Furthermore, authors propose RTPool, which applies the mentioned methods as pooling methods. The author also provides extensive experiments to prove the efficiency of the proposed methods.
给作者的问题
See weakness.
论据与证据
Yes.
方法与评估标准
1: The paper is clear to read. The authors clearly states the theory and algorithm of their methods.
2: The idea is novel. Pooling based on cluster is interesting.
3: The experiment result is good.
理论论述
No
实验设计与分析
No issues.
补充材料
No.
与现有文献的关系
The approach proposed is potentially benefit other scientific domains.
遗漏的重要参考文献
No.
其他优缺点
Question and concerns:
1: It looks like the authors does not talk about the efficiency of their methods. These operations seems introduce extra overheads to the pooling operation. It can also be helpful to provide experiment results to show efficiency, comparing with other pooling methods.
2: In section 3.3, the author talk about C_l, which is clustering matrix for layer l. But previously, the author only talk about C_k. How does C_l and C_k connect with each other? And how to choose the value of k, what will the maximum value of k for each pooling? Please provide some explanation.
其他意见或建议
Please see the weakness.
We thank the reviewer for their helpful feedback. Below, we provide detailed responses to each of the points raised.
1. Model Efficiency
To address the reviewer’s concern, we provide both the theoretical time complexity of RTPool and empirical evidence comparing it with other pooling methods.
Theoretical Time Complexity
When the input geometric graph is embedded in , both the construction of the rhomboid tiling and the pooling operation based on this structure have theoretical time complexity , where is the number of input nodes.
Conclusion 1:
The total time complexity of computing rhomboid tiling up to order in is:
as induced by result in Corbet et al. (2021) and Corbet et al. (2023). And in our practice, dimension , and maximun order or , the complexity reduces to .
Conclusion 2:
The number of nodes in the -th RT layer (order- Delaunay complex) is (Lee, 1982), and with small and sparse input graphs, this ensures the total time complexity of RTPool remains .
Each pooling layer consists of:
- Step 1: Matrix multiplication between clustering matrix () and node features ():
- Step 2: GIN layer on sparse graph with nodes:
Thus, total per-layer complexity is .
References
- Corbet et al. (2023), Computing the Multicover Bifiltration, Discrete & Computational Geometry.
- Corbet et al. (2021), Computing the Multicover Bifiltration, arXiv:2103.07823.
- Lee (1982), On k-nearest neighbor Voronoi diagrams in the plane, IEEE Trans. Computers.
Empirical Efficiency
We also benchmarked RTPool against other pooling methods such as DiffPool, MinCutPool, HaarPool, etc.
RTPool demonstrates comparable efficiency.
Table 1: Accuracy over 5 runs × 100 epochs
| Model | BZR | COX2 | MUTAG | PTC_MR | PTC_MM | PTC_FM | PTC_FR |
|---|---|---|---|---|---|---|---|
| MinCutPool | 76.47±2.32 | 79.86±2.47 | 69.47±2.11 | 66.86±2.91 | 72.94±1.18 | 56.74±5.63 | 62.86±1.81 |
| DiffPool | 78.54±0.98 | 77.93±3.18 | 73.68±3.33 | 69.14±2.80 | 67.06±2.20 | 68.57±3.61 | 68.57±1.81 |
| HaarPool | 78.05±0.00 | 80.64±4.58 | 68.42±0.38 | 62.29±4.20 | 64.71±3.72 | 61.14±6.66 | 65.67±2.25 |
| Wit-TopoPool | 80.98±2.39 | 80.43±1.59 | 85.32±2.58 | 71.84±1.14 | 72.82±4.71 | 68.56±4.84 | 68.57±4.04 |
| Hop-Pool | 78.05±1.39 | 80.00±1.70 | 87.56±4.41 | 61.71±2.91 | 70.59±0.78 | 58.29±1.40 | 63.43±1.14 |
| Mv-Pool | 75.60±1.98 | 79.68±1.27 | 73.68±10.53 | 64.57±4.64 | 68.24±1.18 | 57.14±0.00 | 65.71±1.77 |
| RTPool | 84.39±1.19 | 85.96±1.04 | 83.16±5.16 | 72.86±3.65 | 72.94±3.43 | 68.86±8.59 | 67.71±7.75 |
Table 2: Total Runtime (seconds) over 5 runs × 100 epochs
| Model | BZR | COX2 | MUTAG | PTC_MR | PTC_MM | PTC_FM | PTC_FR |
|---|---|---|---|---|---|---|---|
| MinCutPool | 2297.74 | 2554.96 | 1999.89 | 2016.22 | 2482.84 | 2470.79 | 2355.13 |
| DiffPool | 2507.53 | 2813.80 | 2144.75 | 2235.34 | 2637.06 | 2621.80 | 2529.79 |
| HaarPool | 3238.15 | 2666.93 | 1787.79 | 1412.01 | 1407.39 | 2075.04 | 2022.91 |
| Wit-TopoPool | 4512.64 | 4904.08 | 7475.70 | 7390.13 | 7332.05 | 7370.51 | 7357.56 |
| Hop-Pool | 1483.80 | 1420.63 | 1190.00 | 1171.09 | 1450.05 | 1421.54 | 1355.92 |
| Mv-Pool | 11244.74 | 9137.48 | 5828.78 | 8935.74 | 8327.48 | 8326.58 | 8907.54 |
| RTPool (constructor) | 1616.06 | 2356.41 | 189.59 | 341.39 | 321.39 | 331.57 | 360.00 |
| RTPool (model) | 1117.62 | 1053.07 | 1041.59 | 673.77 | 1268.01 | 1708.35 | 306.10 |
Despite running for only 100 epochs, RTPool achieves strong results with limited training time, even when including the overhead of rhomboid tiling construction.
2. Relationship between and
denotes the clustering matrix from the order- Delaunay complex to order-. When the pooling layer index , we use as the clustering matrix . So and refer to the same object with different notations. We thank the reviewer for pointing this out and will revise our notation to avoid confusion.
The maximum value of is set to 2 or 3 in our experiments, which is equal to #pooling layers+1. (The choice of #pooling layers is provided in the appendix). Due to space limits, please refer to our response to Reviewer z8We for detail on the choice of .
The paper introduces a novel hierarchical clustering method—Rhomboid Tiling (RT) clustering—for geometric graph deep learning. Unlike traditional clustering-based pooling methods that mainly rely on graph connectivity, RT clustering leverages high-order geometric structures derived from concepts such as alpha shapes, higher-order Voronoi tessellations, and Delaunay complexes. Using these ideas, the authors design RTPool, a graph pooling model that constructs hierarchical representations by clustering vertices using the rhomboid tiling structure. The model is validated on seven benchmark datasets (including chemical and molecular graphs), where it outperforms 19 state-of-the-art competitors. The paper also provides theoretical analysis, including necessary and sufficient conditions for cluster membership and a weighting mechanism based on the frequency of shared high-dimensional rhomboids.
给作者的问题
Could you elaborate on the sensitivity of RTPool to the choice of clustering parameters (k₁ and k₂)? In particular, how robust is the performance when these parameters are varied within the recommended range?
Have you considered applying RT clustering to graph tasks beyond classification, such as regression or link prediction? If so, what challenges might arise in those contexts?
Can you provide more details on the computational complexity of the RT clustering procedure, especially in high-dimensional spaces, and how it scales with graph size?
In cases where the underlying graph is not strictly geometric (e.g., social networks), do you foresee adaptations of RT clustering that could still capture meaningful structure?
论据与证据
The main claims—that RT clustering can capture higher-order geometric information and that RTPool yields superior performance—are backed by both rigorous theoretical derivations and extensive empirical evaluations. The experiments compare RTPool against a broad spectrum of baselines using standard benchmarks and include ablation studies to isolate the contribution of individual components. The theoretical claims are supported by proofs (presented in the supplementary material) that derive conditions under which clusters form and justify the weight definition, though these proofs might benefit from further independent validation.
方法与评估标准
The proposed method is well-motivated: it addresses the limitations of connectivity-only approaches in graph pooling by incorporating geometric information. The derivation of RT clustering from higher-order Voronoi and Delaunay structures is conceptually sound and aligns with existing computational geometry ideas. In terms of evaluation, the use of established benchmark datasets (e.g., MUTAG, PTC variants) and comparisons with a wide range of baselines (from graph kernel methods to modern GNN pooling techniques) are appropriate for the problem setting.
理论论述
The paper includes proofs for its main theoretical results, such as Theorem 3.1 (characterizing when a vertex belongs to a cluster) and Theorem 3.2 (guiding the choice of clustering parameters k₁ and k₂). These proofs appear detailed and correct on inspection, though their full validity would benefit from additional scrutiny by experts in computational geometry. The weighting mechanism based on counting shared depth-(d+1) rhomboids is also justified through a theorem that connects it to geometric proximity.
实验设计与分析
The experimental design is solid. The authors test their model on multiple datasets with different characteristics (chemical vs. molecular graphs) and conduct an ablation study to assess the impact of replacing RTPool with trivial pooling, varying the choice of GNN architectures for feature updates, and selecting different underlying graph constructions. The consistent performance gains reported and the detailed breakdown of results suggest that the experimental analysis is sound and that the improvements are not merely due to incidental factors.
补充材料
The supplementary material has been reviewed and includes detailed proofs of the theoretical claims as well as additional experimental settings and ablation study results. These materials provide further clarity on the derivation of the clustering conditions and the implementation details that support the empirical results.
与现有文献的关系
The paper builds on several important threads in the literature. It extends ideas from traditional graph clustering pooling methods (e.g., DiffPool, MinCutPool) by incorporating geometric insights from alpha shapes, Voronoi tessellations, and Delaunay complexes. This connection to classical computational geometry distinguishes it from methods that rely solely on graph connectivity. The work also relates to recent advances in topological and geometric deep learning, positioning its contributions within both the graph neural network and computational geometry communities.
遗漏的重要参考文献
The paper cites a wide range of relevant works in topological pooling or geometric deep learning have been considered.
其他优缺点
Strengths: • Innovative integration of Rhomboid Tiling clustering into GNN pooling. • Strong theoretical grounding that clarifies the clustering mechanism. • Comprehensive experimental evaluation demonstrating significant performance gains.
Weaknesses: • The complexity of the geometric constructions and associated proofs may pose challenges for reproducibility and practical implementation. • Sensitivity to hyperparameter choices (e.g., the difference between k₂ and k₁) might require careful tuning in practice. • The approach is tailored to geometric graphs; it is less clear how well it generalizes to non-geometric settings.
其他意见或建议
It would be useful for the authors to provide further insights into the computational overhead of the RT clustering process and to discuss potential strategies for scaling to larger graphs. Additionally, clarifying the limits of parameter sensitivity could help practitioners better understand when and how to deploy the method.
We sincerely thank the reviewer for their thoughtful and constructive comments. Below, we provide detailed responses to each of the points raised.
1. Hyperparameter sensitivity
To address this concern, we have conducted a detailed sensitivity analysis by varying the value of . The results are summarized in Table 1 below:
Table: Sensitivity to .
| Dataset | Accuracy | |
|---|---|---|
| COX2 | 1 | 89.36 ± 2.33 |
| 2 | 92.76 ± 1.90 | |
| 3 | 86.38 ± 1.90 | |
| MUTAG | 1 | 94.74 ± 3.33 |
| 2 | 89.64 ± 2.36 | |
| 3 | 88.42 ± 2.10 | |
| PTC_MR | 1 | 76.57 ± 1.14 |
| 2 | 78.86 ± 1.57 | |
| 3 | 69.71 ± 1.56 |
From the results above, we observe that RTPool performs robustly when or , and shows degraded performance when . This aligns with our theoretical analysis, which suggests that setting leads to a loss of node feature information during the pooling process, thus hurting performance. Therefore, we recommend choosing or in practice. While our original model used by default, these experiments suggest that using can yield better results on some datasets.
2. RTPool for Graph Regression Tasks
We believe that our model is not only suitable for graph classification tasks, but also for graph regression. This is because the RTPool process incorporates geometric information into the pooling operation, ensuring that the final graph-level representation captures essential geometric structures. As a result, this learned representation can benefit a variety of downstream tasks—regardless of whether the objective is classification or regression.
To support this claim, we have conducted additional experiments on regression datasets. The results, in terms of RMSE, are reported in the following table:
Table: RMSE comparison on graph regression datasets
| Model | Esol | FreeSolv | Lipo |
|---|---|---|---|
| MinCutPool | 2.1913 ± 0.0374 | 4.0111 ± 0.0170 | 1.3481 ± 0.0224 |
| StructPool | 2.1749 ± 0.0411 | 4.0077 ± 0.0150 | 1.3422 ± 0.1264 |
| DiffPool | 3.7699 ± 0.2035 | 5.2877 ± 0.2049 | 2.7431 ± 0.0753 |
| HaarPool | 2.1035 ± 0.0340 | 3.8892 ± 0.0098 | 1.3361 ± 0.0627 |
| Wit-TopoPool | 1.8783 ± 0.1628 | 4.2159 ± 0.0816 | 1.0916 ± 0.5007 |
| Hop-Pool | 2.4831 ± 0.0760 | 4.0030 ± 0.0940 | 1.3725 ± 0.0738 |
| Mv-Pool | 2.5691 ± 0.0484 | 4.0627 ± 0.1048 | 1.3746 ± 0.0682 |
| RTPool | 2.0195 ± 0.5318 | 3.6666 ± 0.2112 | 1.0789 ± 0.0496 |
As shown in the table, RTPool achieves the best or near-best performance across all three datasets, demonstrating its effectiveness in graph regression tasks compared to other pooling baselines.
3. Computational Complexity of the RT Clustering
Theoretically, the construction of the Rhomboid Tiling structure has complexity bounds , and the associated pooling operation in RTPool has complexity bounds . Here is the number of nodes, and is the dimenstion of space in which the graph is embedded. We have also conducted experiments to empirically assess the efficiency of RTPool compared to other pooling baselines.
Due to space constraints, we kindly refer the reviewer to our detailed response to Reviewer kfVb, where we provide both the theoretical proof and empirical runtime results
4. RTPool for Social Network Graphs
To adapt RTPool to non-geometric graphs such as social networks, one key step is to find an appropriate way to embed the graph nodes into . A feasible approach is to compute the graph Laplacian and use the combination of eigenvectors corresponding to the three smallest non-zero eigenvalues as coordinates for each node.
Based on this idea, we applied RTPool to social network datasets which contain only graph connectivity. The results are shown below:
Table: Accuracy (%) on social network datasets (mean ± std).
| Model | IMDB-BINARY | IMDB-MULTI |
|---|---|---|
| MinCutPool | 70.77 ± 4.89 | 49.00 ± 2.83 |
| DiffPool | 68.60 ± 3.10 | 45.70 ± 3.40 |
| SAGPool | 74.87 ± 4.09 | 49.33 ± 4.90 |
| HaarPool | 73.29 ± 3.40 | 49.98 ± 5.70 |
| Wit-TopoPool | 78.40 ± 1.50 | 53.33 ± 2.47 |
| Hop-Pool | 68.04 ± 2.04 | 49.33 ± 5.09 |
| Mv-Pool | 69.75 ± 3.61 | 51.67 ± 0.74 |
| RTPool | 73.06 ± 3.84 | 53.33 ± 1.26 |
As shown above, RTPool achieves competitive performance despite lacking enough geometric information
The paper proposes a hierarchical clustering algorithm for geometric deep learning. The main contribution is RTPool, a pooling model method where nodes are clustered using a rhomboid tiling structure. The paper is also accompanied by a theoretical analysis showing sufficient conditions for cluster membership.
The paper has been generally well-received by all reviewers (1 Accept, 4 weak accepts). The contribution is considered novel, the theoretical analysis motivates the contribution, and the experiments appear convincing both in width (7 benchmarks, comparison with 19 existing methods) and depth (ablation analyses).
Some reviewers didn't participate in the discussion after rebuttal, but reviewing the author's responses, I can confirm that their concerns have been mostly resolved. In all likelihood, had these reviewers participated in the rebuttal discussion, the scores could have been even higher.
I recommend weak acceptance (subject to ICML's quota on acceptable papers).