N2GON: Neural Networks for Graph-of-Net with Position Awareness
Abstract
Review and Discussion
This paper introduces Graph-of-Net (GON), a novel graph structure where each node is itself a graph, enabling multi-level modeling of complex systems that involve hierarchical relationships. Examples include biological networks (e.g., protein-protein interactions, where individual proteins are represented as graphs within a larger network) and citation networks (where papers, modeled as text graphs, are interconnected). The authors propose N2GON, a position-aware neural network designed to learn node representations in GONs by jointly modeling intra-graph (within-node) and inter-graph (between-node) connections.
Questions for Authors
Refer to the "Other Strengths and Weaknesses" part.
Claims and Evidence
The claims made in the paper are generally supported by clear evidence.
Methods and Evaluation Criteria
The methods and evaluation criteria are largely appropriate for the problem of multi-level graph learning, with strengths in architectural design and empirical breadth.
Theoretical Claims
n/a
Experimental Design and Analysis
The experimental designs and analyses presented in the paper generally appear to be sound and robust, supporting the claims made about the Graph-of-Net (GoN) model.
Supplementary Material
I have reviewed all of the supplementary material.
Relation to Prior Work
- The key contributions of the paper, namely the introduction of the Graph-of-Net (GoN) structure and the position-aware neural network model, build upon several foundational ideas and methods from the broader graph learning and neural network literature. The paper situates itself within a well-established body of work, while introducing novel elements that extend existing techniques.
- The idea of nodes being graphs themselves shares conceptual similarities with hypergraphs and heterogeneous graphs. In these graph structures, edges (or hyperedges) can connect more than two nodes, and the nodes can represent different types of entities or relationships. GoN extends this idea by introducing a more general notion where nodes are not just connected through edges but are, in fact, entire graphs with their own structure.
Important Missing References
n/a
Other Strengths and Weaknesses
Strengths:
- The paper is well-written and easy to understand.
- The proposed Graph-of-Net (GoN) structure is an innovative contribution. By representing each node as a graph, GoN enables multi-level representations of individual nodes.
- The "position-aware neural network" mechanism introduced in the paper is a meaningful enhancement to model capabilities. This mechanism allows the model to not only focus on node features but also process interactions and dependencies between nodes.
- The paper provides thorough experimental validation on multiple datasets, covering a range of domains such as social networks, citation networks, and biomedical networks. The experimental results are impressive.
Weaknesses:
- In the introduction, the definition of Graph-of-Net (GoN) is somewhat vague. Although a general conceptual understanding is provided, a more precise mathematical definition may be needed. For instance, the definition mentions that "each node is itself a graph," but the theoretical aspects of how the "graph" size, structure, and features are defined might not be sufficiently clear. Further clarification of how GoN differs from traditional graphs, especially in terms of modeling multi-dimensional relationships in hierarchical node structures, would be helpful.
- In some formulas (e.g., the PPR algorithm), the derivation process could benefit from more detailed explanations. For new readers, the PPR algorithm may not be immediately familiar, and providing a more comprehensive derivation would aid in understanding.
Other Comments or Suggestions
- The explanation of the GoN concept could be further strengthened, especially with regard to how the complexity of "graphs within nodes" is handled during the modeling process. For example, it would be helpful to clarify how GoN manages the hierarchical structures within the graphs of nodes.
Q1. The Introduction's definition of Graph-of-Net (GON) is vague and needs a precise mathematical formulation of each node-as-graph (size, structure, features) and clarification of how it differs from traditional graphs in modeling hierarchical, multi-dimensional relationships.
R1: Thank you for your insightful comments. In our framework, GON is designed as a hierarchical graph where each node is not a single data point but an entire graph. Formally, let the top-level graph (network) be defined as $\mathcal{G} = (\mathcal{V}, \mathcal{E})$, where $\mathcal{V}$ is the set of graph nodes and $\mathcal{E}$ denotes the edge set. For each node $v_i \in \mathcal{V}$, we associate a graph $G_i = (V_i, E_i, X_i)$. Here, $V_i$ represents the nodes within the graph, $E_i$ is the set of edges among these nodes, and $X_i$ is the feature matrix corresponding to the graph nodes. The size of each graph is determined by $|V_i|$, which depends on the inherent structure or the domain-specific construction of the graph. The graph structure, represented by $E_i$, captures the internal relationships among the nodes in the graph. This structure may vary depending on the level of detail or domain-specific insights desired. Each graph is equipped with a feature representation $X_i$ that encodes the characteristics of the nodes in $V_i$. These features can be derived from raw data attributes or from a prior processing step.
Traditional graph models represent data as a general graph where each node corresponds to an atomic data point. In contrast, GON captures multi-dimensional relationships by explicitly modeling two levels of interaction. This dual-level representation allows GON to be particularly effective for complex hierarchical data, as it provides the capacity to model nested relationships and capture both local and global patterns within the data.
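For concreteness, the two-level structure described above can be sketched as a pair of container types. This is a hypothetical rendering for illustration only, not the paper's actual data structures; the names `NodeGraph` and `GraphOfNet` are ours:

```python
from dataclasses import dataclass, field

import numpy as np


@dataclass
class NodeGraph:
    """One node of the top-level network, itself a graph G_i = (V_i, E_i, X_i).

    `edges` holds (i, j) index pairs into `features`, whose rows are the
    per-node feature vectors.
    """
    edges: list
    features: np.ndarray  # shape: (num_inner_nodes, feature_dim)

    @property
    def num_nodes(self) -> int:
        return self.features.shape[0]


@dataclass
class GraphOfNet:
    """Top-level graph whose nodes are entire NodeGraph objects."""
    node_graphs: list                                  # the node set
    edges: list = field(default_factory=list)          # top-level edge set


# Toy example: two node-graphs connected by one top-level edge.
g0 = NodeGraph(edges=[(0, 1)], features=np.zeros((2, 4)))
g1 = NodeGraph(edges=[(0, 1), (1, 2)], features=np.ones((3, 4)))
gon = GraphOfNet(node_graphs=[g0, g1], edges=[(0, 1)])
```

The point of the sketch is that each top-level node carries its own internal edge set and feature matrix, rather than a single feature vector.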
Q2. In some formulas (e.g., the PPR algorithm), the derivation process could benefit from more detailed explanations. Providing a more comprehensive derivation would aid in understanding.
R2: The Personalized PageRank algorithm provides a way to measure how “close” or “important” one node is relative to another within a network. Imagine a random walker who starts at a specific node in the network. Instead of wandering the network entirely at random, the walker follows a rule: at each step, they decide either to move to one of the neighboring nodes or to jump back to the starting node. This jump-back mechanism ensures that the influence of the starting node remains strong throughout the walk.
What makes PPR particularly useful is that it considers not only the direct connections between nodes but also the broader network structure. In simple terms, it captures the idea that even if two nodes share the same label or initial property, they can have varying levels of relatedness depending on how they are connected within the network.
By adopting this method, our approach goes beyond simply saying, "nodes with the same label are similar." Instead, we are able to gauge the subtle nuances in how closely nodes are related based on both their labels and their positions within the graph structure. This allows our model to better capture complex relationships and provides a more refined way of measuring similarity among node graphs.
Q3. It would be helpful to clarify how GON manages the hierarchical structures within the graphs of nodes.
R3: Thank you for the valuable feedback. In a Graph-of-Net (GON), each high-level node represents an entire graph that can have its own structure and detailed relationships. Instead of treating every graph as a monolithic object, we decompose the problem into two levels: 1) At the lower-level, we focus on extracting meaningful representations from the individual graphs. We utilize graph neural networks to process each graph, effectively summarizing its properties into a fixed-size embedding. 2) Once each graph has been transformed into a representation, these embeddings serve as the nodes for the higher-level graph. The interrelations among these nodes are then modeled, combining both the abstract representation of the graphs and the positional information within the larger network. The two-stage process helps us manage the inherent complexity of graphs that reside within nodes. We will add above description in subsequent revisions of the paper.
I appreciate the authors' detailed reply. My concerns have been largely addressed, so I would like to raise my score. Please include the relevant discussions in the new version of this paper.
We are pleased to have addressed the reviewers' concerns and are grateful for your recognition of our work. We will incorporate the above discussion into the revised version of the manuscript.
The paper introduces a novel framework called Graph-of-Net (GON), which extends traditional graph structures by modeling each node as a graph itself, creating a multi-level perspective on relationships between objects. This approach enables the capture of both the internal structure of individual nodes and the broader network of dependencies. To learn node representations within GON, the paper proposes a position-aware neural network model, which processes both intra-graph and inter-graph connections. The model incorporates dual encoders and graph constructors to build and refine a constraint network, where nodes are adaptively arranged based on their positions, as determined by the network’s constraint system.
Questions for Authors
See Weaknesses.
Claims and Evidence
The claims made in the paper are generally well-supported by clear and convincing evidence. The novelty of the Graph-of-Net (GON) structure is justified through the discussion and examples from general networks and biological systems. The position-aware neural network model is supported by the detailed methodology, including the dual encoders and graph constructors, which capture intra- and inter-graph relationships. The effectiveness of GON is empirically validated through extensive experiments on 16 network datasets, outperforming state-of-the-art methods.
Methods and Evaluation Criteria
Yes, the proposed methods and evaluation criteria are well-suited for the problem.
Theoretical Claims
N/A
Experimental Design and Analysis
Yes, I examined the soundness and validity of the experimental design and analyses, and overall, they are largely systematic.
Supplementary Material
Yes, I have reviewed the appendices, including Appendices A, B, and C.
Relation to Prior Work
The paper successfully bridges gaps in hierarchical graph representation learning. Its innovations—particularly the GON structure and PPR constraints—address challenges in modeling multi-scale systems, positioning it as a meaningful contribution to the broader graph learning literature.
Important Missing References
N/A
Other Strengths and Weaknesses
Strengths
- The concept of treating each node as a graph is interesting. The GON structure extends traditional graph nodes into subgraphs (Graph-as-Node), enabling a shift from single-level (node-edge) modeling to multi-level modeling (subgraph internal structure + global network topology). This design is important in biological networks (such as protein-protein interactions).
- The paper is well-written and easy to follow.
- The experimental results are promising. The datasets are comprehensive and well-executed, covering a variety of domains including social networks, citation networks, and biomedical networks.
Weaknesses
- The term "Position Awareness" is not strictly defined in the paper. A formal definition could be added for clarity.
- The distinction between GON and existing hierarchical graph structures (e.g., Hierarchical Graph Networks, Hypergraphs) is not clearly quantified.
Other Comments or Suggestions
- In the description below Equation (1) in the algorithm section, "funcitons" should be corrected to "functions."
- The similarity function $\text{sim}(\cdot, \cdot)$ in Equation (6) lacks a clear explanation of its basis (e.g., if using cosine similarity, the vector normalization step should be specified).
- "node graph" → It is recommended to standardize this term as "node-graph" (with a hyphen).
- The paper would benefit from additional discussion on future research directions. For example, investigating how GON could be integrated with other deep learning techniques (e.g., reinforcement learning, meta-learning) to adapt to new or evolving graphs could open up interesting lines of inquiry.
Q1. The term "Position Awareness" is not strictly defined in the paper. A formal definition could be added for clarity.
R1: Thank you for highlighting the need for a more formal definition of the term "Position Awareness." In our work, we use "Position Awareness" to refer to the model’s ability to capture and integrate the relative placement and connectivity of nodes within the overall network structure. Position Awareness is the property of a network model whereby each node's representation explicitly encodes its structural context within the graph. This encoding captures not only the node’s intrinsic features (e.g., labels or attributes) but also its topological characteristics—such as how it is connected to other nodes, and its overall influence in the network. In our framework, this is operationalized by leveraging a scoring mechanism (i.e., via the Personalized PageRank algorithm) that quantifies the relative importance or influence of nodes.
By assigning a score that measures the likelihood of reaching one node from another through random walks with restarts, our approach formalizes a node’s “position” within the network. This score effectively differentiates nodes that might share similar labels but occupy distinct roles depending on their connectivity pattern and centrality in the graph. These scores are then integrated into the node representations, ensuring that position-dependent information contributes to the final embedding. As a result, the model is better equipped to capture nuanced relationships that go beyond simple label similarity.
We will include this formal definition in the revised manuscript to enhance the clarity and rigor of our contribution.
Q2. The distinction between GON and existing hierarchical graph structures (e.g., Hierarchical Graph Networks, Hypergraphs) is not clearly quantified.
R2: Thank you for your comment. We distinguish our Graph-of-Net (GON) framework from other hierarchical graph structures as follows: 1) Explicit Two-Level Representation: In GON, each node is a complete graph that retains its full internal structure. This is different from many hierarchical graph networks, which typically process standard graphs where the nodes are represented as vectors rather than graphs, potentially losing fine-grained structural details. 2) Preservation of Intra-Node Complexity: Instead of merging all details into a single representation, GON processes each graph independently (using specialized graph neural networks) and then integrates these detailed embeddings into the higher-level network. This two-stage approach effectively captures both local (intra-node) and global (inter-node) relationships.
We will add these distinctions in the revised manuscript.
Q3. In the description below Equation (1) in the algorithm section, "funcitons" should be corrected to "functions."
R3: Thank you for catching the typo. We appreciate your attention to detail, and we will correct it in the revised manuscript.
Q4. The similarity function $\text{sim}(\cdot, \cdot)$ in Equation (6) lacks a clear explanation of its basis (e.g., if using cosine similarity, the vector normalization step should be specified).
R4: Thank you for your comment. We clarify that in Equation (6), we directly compute the cosine similarity on the final representations without any additional normalization. We will update the manuscript to clearly state this.
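For clarity, the computation described in R4 amounts to plain cosine similarity on the final representations, where the norm division makes a separate normalization pass unnecessary:

```python
import numpy as np


def cosine_sim(u: np.ndarray, v: np.ndarray) -> float:
    """sim(u, v) = u·v / (||u|| ||v||).

    Dividing by both norms already rescales the vectors, so no extra
    normalization step is needed beforehand.
    """
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))


a = np.array([1.0, 2.0, 0.0])
b = np.array([2.0, 4.0, 0.0])  # same direction as a, larger magnitude
c = np.array([0.0, 0.0, 3.0])  # orthogonal to a
```

Vectors pointing the same way score 1 regardless of magnitude, and orthogonal vectors score 0, which is why scale-sensitive preprocessing is not required.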
Q5. "node graph" → It is recommended to standardize this term as "node-graph" (with a hyphen).
R5: Thank you for your comment. We appreciate your suggestion to standardize the term. We will update the manuscript to use "node-graph" consistently.
Q6. The paper would benefit from additional discussion on the future research directions. For example, investigating how GON could be integrated with other deep learning techniques (e.g., reinforcement learning, meta-learning) to adapt to new or evolving graphs could open up interesting lines of inquiry.
R6: Thank you for the insightful suggestion. We plan to explore several integration approaches with GON in our future work. For example, integrating reinforcement learning could allow for dynamic graph adaptation, where RL agents iteratively rewire inter-graph connectivity in applications like drug discovery and evolving recommender systems. Additionally, we are considering meta-learning approaches to pre-train GON encoders on diverse tasks—enabling rapid adaptation to new scenarios, especially when encountering limited labeled data. Moreover, we can extend GONs to handle temporal dynamics by incorporating architectures for gradually updating both intra- and inter-graph connections in evolving systems. We believe these strategies will further enhance GON’s flexibility and robustness across various challenging, real-world applications.
The paper N2GON presents a new approach to graph learning, with a focus on the Graph-of-Net structure and a position-aware neural network model. The comprehensive experimental evaluation and detailed methodology are significant strengths. However, the paper could be further improved by including runtime comparisons, expanding the related work section, and correcting some grammatical errors.
Questions for Authors
In addition to the weaknesses mentioned earlier, I have the following specific questions:
- In Algorithm 1, the phrase "Sample all node graphs" seems contradictory. "Sample all" suggests sampling the entire set, whereas "Sample mini-batch" would indicate a subset. Could you clarify whether full-batch training is used or update the terminology accordingly?
- The paper does not specify the weighting coefficient between the constraint loss and the NLL loss. How is this coefficient set and adjusted in the algorithm?
Claims and Evidence
Yes, the claims made in the paper are well-supported by the evidence.
Methods and Evaluation Criteria
Yes, the proposed methods and evaluation criteria align with the problem's core challenges.
Theoretical Claims
N/A.
Experimental Design and Analysis
The experimental designs and analyses in the paper are sound.
Supplementary Material
I reviewed all the supplementary material provided with the paper.
Relation to Prior Work
The key contribution of this paper extends traditional graph models by representing each node as a graph itself, enabling a more sophisticated multi-level representation, GON. It builds on prior work in graph learning, which has been used to model complex relationships in social and biological networks. However, existing graph models do not fully capture the hierarchical dependencies present in real-world systems. GON provides a more flexible and general framework, making it a valuable work in graph representation learning.
Important Missing References
N/A.
Other Strengths and Weaknesses
Pros
- I appreciate the motivation behind this paper. The concept of representing each node as a graph within a larger network is innovative and extends traditional graph structures.
- The proposed algorithm integrates both intra-graph and inter-graph connections, and incorporating PPR to capture the relative position of node graphs is interesting.
- The experimental results also seem to validate the effectiveness of the proposed model.
Cons
- Although the paper provides a complexity analysis in the appendix, it lacks experimental comparisons in terms of runtime performance. Including an intuitive comparison of computational efficiency could enhance the paper.
- The related work section is not comprehensive enough. More discussion on position-awareness in graph learning should be included.
- In the experimental section, although multiple datasets are mentioned, the paper does not provide sufficient details on their specific characteristics (e.g., the number of nodes, edges, and class distribution).
Other Comments or Suggestions
- It would be helpful to expand the discussion on real-world applications, particularly in complex domains like drug discovery, bioinformatics, or social networks. While the paper covers some biomedical datasets, offering a broader perspective on potential use cases (including possible limitations in these domains) would give readers a more complete understanding of the impact of GON.
- There are some grammatical errors and typos in the paper: Line 23: "an" should be changed to "a". Line 60 (left column): "applicable" should be removed. Line 123 (right column): "are" should be "is". Line 155 (right column): "possess" should be in plural form. Line 267 (left column): "usally" should be corrected to "usually". Line 261 (right column): "is" should be "are".
Q1. It is recommended to add runtime comparisons.
R1: Thank you for your comment. We conducted the runtime experiments, and the results (average elapsed time per epoch), summarized in the tables below, indicate that on benchmark graphs our runtime is comparable with that of SOTA baselines, while on biomedical datasets our method is significantly more efficient than traditional algorithms.
Table I. Running Time per epoch on Benchmark Graph Datasets (in seconds)
| Algorithms | Cora | Citeseer | PubMed | Cornell | Texas | Actor | Squirrel | Chameleon |
|---|---|---|---|---|---|---|---|---|
| Ours | 0.055 | 0.062 | 0.239 | 0.039 | 0.043 | 0.09 | 0.071 | 0.055 |
| H2GCN | 0.005 | 0.005 | 0.016 | 0.005 | 0.004 | 0.011 | 0.088 | 0.043 |
| DAGNN | 0.012 | 0.013 | 0.01 | 0.011 | 0.019 | 0.01 | 0.012 | 0.013 |
| HopGNN | 0.01 | 0.01 | 0.024 | 0.012 | 0.009 | 0.013 | 0.013 | 0.012 |
Table II. Running Time per epoch on Biological Datasets (in seconds)
| Algorithms | PEP-MHC | PPI | TCR | MTI |
|---|---|---|---|---|
| Ours | 2.00 | 0.502 | 1.17 | 1.15 |
| ConjointTraid | 82.08 | 31.96 | 6.01 | 175.96 |
| Quasi-Seq | 117.15 | 30.91 | 5.97 | 174.99 |
| ESPF | 131.97 | 25.99 | 7.02 | 177.98 |
| CNN | 1023.49 | 310.96 | 76.96 | 1103.97 |
| Transformer | 351.97 | 88.49 | 25.29 | 1040.97 |
Q2. More discussion on position-awareness in graph learning should be included.
R2: Thank you for the feedback. In our work, position-awareness is achieved by leveraging the Personalized PageRank algorithm, which computes the topological influence of node graphs on each other, implicitly revealing their relative positions in the network. In essence, once all pairwise similarities are determined, each node inherently carries information about its relative position within the similarity network structure.
Q3. Provide dataset details (e.g., node/edge counts) for clarity.
R3: Thank you for your valuable suggestion. We have now added detailed descriptions of the benchmark datasets in the table below.
| Datasets | Texas | Wisconsin | Actor | Squirrel | Chameleon | Cornell | Citeseer | Pubmed | Cora |
|---|---|---|---|---|---|---|---|---|---|
| #Nodes | 183 | 251 | 7,600 | 5,201 | 2,277 | 183 | 3,327 | 19,717 | 2,708 |
| #Edges | 295 | 466 | 26,752 | 198,493 | 31,421 | 280 | 4,676 | 44,327 | 5,278 |
| #Classes | 5 | 5 | 5 | 5 | 5 | 5 | 7 | 3 | 6 |
Q4. Discuss more real-world applications (e.g., drug discovery) and potential limitations to better illustrate GON's impact.
R4: Thank you for your valuable suggestions. We will further expand the discussion of GON’s practical applications in the manuscript. For example, in drug discovery, GON can model drug molecules as graphs of atoms while representing proteins as residue graphs, capturing the binding patterns between them through a global network. In bioinformatics, GON enables multi-scale analysis of protein interaction networks, such as identifying functional modules at the residue level. For social networks, GON can model user communities (e.g., user-centric social subgraphs) and the relationships across communities, though real-world deployment must address privacy concerns and data sparsity issues. Common challenges include the cost of data construction (e.g., the need for expert annotations for molecular graphs) and computational overhead.
Q5. There are some grammatical errors and typos.
R5: Thank you for your detailed feedback. We will review the manuscript and correct all the mentioned grammatical errors and typos.
Q6. Could you clarify if Algorithm 1 uses full-batch training or mini-batch sampling?
R6: Thank you for your valuable feedback. For the GON datasets, we use full-batch training—processing the entire data at once—since it is feasible to handle these datasets in one go. In the revised manuscript, we will clarify this point.
Q7. How is the weighting coefficient between the constraint loss and the NLL loss set in the algorithm?
R7: Thank you for raising this important point. In our work, we deliberately did not introduce a weighting coefficient between the constraint loss and the NLL loss. We found that both loss components naturally operate on comparable scales, allowing us to simply sum them without additional tuning. This design choice not only simplifies the loss function but also reduces the number of hyperparameters, streamlining the training process.
Thank you for the detailed response. The responses have largely addressed my concerns regarding the aforementioned issues. Additionally, concerning the biomedical datasets, I would like to better understand how the node features in the constructed graph were derived. Although Section 4.2 provides a general explanation, it does not specifically clarify how the atomic-level feature vectors are generated for drug molecules—are they based on SMILES sequences, for example? Providing a more detailed description of this process would enhance the clarity of the paper.
We are pleased to have addressed most of the reviewers' concerns and appreciate the recognition of our work. For the node features of drug molecule construction, we can use the transformation methods provided by the biomedical domain library Therapeutics Data Commons (tdcommons.ai) to extract node features from SMILES sequences. For example, each node (atom) feature is composed of five concatenated parts: a one-hot encoding of the atomic symbol (type), a one-hot encoding of the number of bonds connected to the atom (degree), a one-hot encoding of the atom's formal charge, a one-hot encoding of the chiral tag, and a binary feature indicating whether the atom is aromatic. We thank the reviewer for these constructive suggestions and will include the above discussion in the paper.
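A hypothetical sketch of the five-part concatenation described above; the vocabularies (`SYMBOLS`, `DEGREES`, etc.) are illustrative stand-ins for the ones defined by the actual TDC featurizer:

```python
import numpy as np


def one_hot(value, choices):
    """One-hot over a fixed vocabulary; unknown values map to all zeros."""
    vec = np.zeros(len(choices))
    if value in choices:
        vec[choices.index(value)] = 1.0
    return vec


# Illustrative vocabularies only; the real featurizer defines its own.
SYMBOLS = ["C", "N", "O", "S", "F"]
DEGREES = [0, 1, 2, 3, 4]
CHARGES = [-1, 0, 1]
CHIRAL_TAGS = [0, 1, 2, 3]


def atom_features(symbol, degree, charge, chiral_tag, is_aromatic):
    """Concatenate the five parts: symbol, degree, charge, chirality, aromaticity."""
    return np.concatenate([
        one_hot(symbol, SYMBOLS),
        one_hot(degree, DEGREES),
        one_hot(charge, CHARGES),
        one_hot(chiral_tag, CHIRAL_TAGS),
        np.array([1.0 if is_aromatic else 0.0]),
    ])


# An aromatic ring carbon with two bonds and no formal charge.
feat = atom_features("C", 2, 0, 0, True)  # length 5 + 5 + 3 + 4 + 1 = 18
```

In practice, such a vector would be computed for every atom parsed from the SMILES string, yielding the node feature matrix of the molecule's graph.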
This paper introduces Graph-of-Net (GON), a novel graph structure where each node itself is a graph, enabling multi-level representation and analysis of complex real-world systems. To effectively learn representations within GONs, the authors propose N2GON, a position-aware neural network that captures both intra-graph and inter-graph interactions using dual graph encoders and an implicit constraint network. Extensive experiments demonstrate that N2GON outperforms state-of-the-art models in graph learning tasks.
Questions for Authors
None
Claims and Evidence
Yes
Methods and Evaluation Criteria
Yes. However, while the proposed method is designed for GON, the way this structure is partitioned in the paper does not seem entirely reasonable. This issue is particularly evident in datasets such as CiteSeer, where the partitioning is performed directly using KNN, resulting in computed outcomes that merely replicate the information propagation process of GNN.
Theoretical Claims
N/A
Experimental Design and Analysis
In the first part of the experiments, the partitioning of GON does not seem entirely reasonable, whereas in the later part, the partitioning in the chemical datasets appears to be more appropriate.
Supplementary Material
N/A
Relation to Prior Work
Yes
Important Missing References
None
Other Strengths and Weaknesses
The paper investigates GON and proposes a more generalizable approach. However, the experimental setup does not seem sufficiently comprehensive. I believe the authors provided excellent examples of GON in the introduction, but it is unclear whether experiments on related datasets are feasible. Of course, obtaining relevant data may be challenging.
Other Comments or Suggestions
The full name of GON is mentioned twice in the paper.
Q1. The partitioning of GON, particularly in datasets like CiteSeer, appears questionable: partitioning is performed directly using KNN, so the computed outcomes merely replicate the information propagation process of a GNN.
R1: Thank you for your feedback. Below, we provide clarification as follows:
- Rationale for the $k$-hop Sampling Strategy. We would like to clarify that we adopt $k$-hop sampling (instead of KNN, which only involves one hop) to construct a graph for each node. This method is motivated by the observation that in many real-world scenarios, a node's full identity is not encapsulated solely by its features but also by the structural context provided by its neighboring nodes.
For example, 1) in citation networks such as CiteSeer, a paper's identity is defined not only by its content but also by its references and cited-by relationships. By sampling $k$-hop neighbors, we explicitly model a node's "extended identity," which includes its local academic context (e.g., related works). This aligns with the GON philosophy of representing nodes as hierarchical structures. 2) In social networks, a user's social graph (friends, followers) is a natural extension of their identity, and $k$-hop sampling preserves this contextual information. Therefore, sampling the $k$-hop neighborhood effectively captures the local context.
The reviewer noted that, in the case of CiteSeer, direct partitioning by $k$-hop neighborhoods might induce outcomes very similar to standard GNN information propagation. We believe this observation is, in fact, supportive of our methodology. The successful aggregation of neighbor information by GNNs substantiates that a node's neighborhood plays a crucial role in the learning process. Our method does not merely replicate the GNN propagation process but rather formalizes the node's expanded representation through explicit graph construction. This ensures that our Graph-of-Net structure inherently reflects the multi-level composition of nodes: each node graph is a manifestation of both its own features and the collective representation of its neighbors.
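The construction described above can be sketched as a breadth-first expansion out to a hop limit, followed by taking the induced subgraph; this is an illustrative sketch, not the paper's implementation:

```python
from collections import deque


def k_hop_subgraph(adj: dict, center: int, k: int):
    """Return the nodes within k hops of `center` and the induced edges.

    The result plays the role of the node-graph associated with `center`
    in a k-hop construction.
    """
    dist = {center: 0}
    queue = deque([center])
    while queue:
        u = queue.popleft()
        if dist[u] == k:       # stop expanding at the hop limit
            continue
        for v in adj.get(u, []):
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    nodes = set(dist)
    # Induced edges: keep each undirected edge once (u < v).
    edges = [(u, v) for u in nodes for v in adj.get(u, [])
             if v in nodes and u < v]
    return nodes, edges


# Toy citation graph as an adjacency list: 0-1, 0-2, 2-3, 3-4.
adj = {0: [1, 2], 1: [0], 2: [0, 3], 3: [2, 4], 4: [3]}
nodes, edges = k_hop_subgraph(adj, center=0, k=2)
```

With `k=2`, node 4 (three hops away) is excluded, so the resulting node-graph captures exactly the local context around node 0.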
- Alternative Graph Construction Method. Beyond $k$-hop sampling, we explored constructing graphs from raw node data. We appreciate the reviewer's note on the challenge of accessing original data. In response, we made significant efforts and successfully obtained the raw textual data for the CiteSeer data through the GitHub project (https://github.com/sivaramanl/Information-Retrieval/tree/master/Text%20Processing/citeseer). This repository provided us with nearly all the title and abstract information for each paper. For this alternative graph construction, we processed each paper's text as follows: 1) Text Preprocessing: We used NLTK to split the text of each paper into sentences, treating each sentence as a node in the paper's graph. 2) Feature Extraction: We generated embeddings for each sentence using the popular sentence-transformer model all-MiniLM-L6-v2, which provided the node attributes. For papers where the original information is missing, we default to representing the paper as a single node with an all-zero attribute vector. 3) Edge Construction: Edges were formed by computing the cosine similarity between sentence embeddings and applying a threshold of 0.7 to retain only strong connections.
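The edge-construction step of this pipeline can be sketched as follows. In the actual pipeline the embedding rows would come from a sentence encoder such as all-MiniLM-L6-v2; here fixed toy vectors stand in, so the sketch only illustrates the thresholding step:

```python
import numpy as np


def build_sentence_graph(emb: np.ndarray, threshold: float = 0.7):
    """Connect sentence nodes whose embedding cosine similarity exceeds
    the threshold (0.7 in the described construction)."""
    norms = np.linalg.norm(emb, axis=1, keepdims=True)
    unit = emb / np.where(norms == 0, 1, norms)  # zero rows stay zero
    sim = unit @ unit.T
    n = emb.shape[0]
    return [(i, j) for i in range(n) for j in range(i + 1, n)
            if sim[i, j] > threshold]


# Three toy "sentence embeddings": 0 and 1 are near-parallel, 2 is orthogonal.
emb = np.array([[1.0, 0.0],
                [0.9, 0.1],
                [0.0, 1.0]])
edges = build_sentence_graph(emb)
```

Only the near-parallel pair clears the 0.7 threshold, so the resulting sentence graph keeps strong semantic links and drops weak ones.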
We then conducted experiments on this newly constructed data. The resulting data statistics and performance comparisons are presented in the tables below. The comparable performance of the $k$-hop and text-based GONs (81.27% vs. 81.82%) demonstrates that both methods validly capture hierarchical semantics. The $k$-hop approach is a pragmatic and effective proxy when raw data is unavailable, while text-based construction confirms GON's flexibility for domains with explicit substructures.
Table I: Resulting GON Statistics on Citeseer
| Metric | Value |
|---|---|
| Avg. Nodes per Graph | 7.72 |
| Max Nodes | 336 |
| Min Nodes | 1 |
| Avg. Edges per Graph | 156.36 |
| Max Edges | 106,286 |
| Min Edges | 0 |
| Node Feature Dimension | 384 |
Table II: Accuracy on CiteSeer
| Algorithm | Acc (%) |
|---|---|
| APPNP | 77.06 ± 1.73 |
| GPRGNN | 75.56 ± 1.62 |
| MixHop | 76.26 ± 1.33 |
| FAGCN | 74.86 ± 2.42 |
| DAGNN | 76.44 ± 1.97 |
| HopGNN | 76.69 ± 1.56 |
| N2GON ($k$-hop) | 81.27 ± 1.30 |
| N2GON (Text-Based) | 81.82 ± 1.46 |
Q2. The full name of GON is mentioned twice.
R2: Thank you for pointing that out. We will remove the redundant expansion of the full name.
This paper introduces N2GON, a novel neural network framework designed for Graph-of-Net (GON), a hierarchical graph structure where each node is itself a graph. The proposed model leverages position-aware learning to capture both intra-graph (within-node) and inter-graph (between-node) relationships, integrating node labels and topological constraints via dual encoders and a refined constraint network. The authors demonstrate the effectiveness of N2GON through extensive experiments on 16 datasets spanning social networks, citation networks, and biomedical applications, showing significant improvements over state-of-the-art baselines.
The paper received generally positive reviews, with reviewers acknowledging its novelty, theoretical grounding, and empirical effectiveness. While initial concerns were raised, all four reviewers expressed that the responses addressed their concerns, and they consider this paper worthy of acceptance.