“Methods: ...The problem formulation is overly general, ... ”

The reason our problem formulation is deliberately general is that our primary motivation is to rigorously analyze and broadly improve the expressive power of existing KGFMs, including widely used models like ULTRA. Our theoretical framework (MOTIF) thus intentionally generalizes beyond specific architectures.

“Experimental Designs: …It would be beneficial for the authors to provide similar examples using real-world data ….” “W4. The experimental results on real-world data are not very convincing....”

To provide more concrete insight into why higher-order motifs are necessary, we specifically analyzed scenarios where higher-order motifs excel in downstream tasks. Empirically, we have conducted Synthetic Experiments in Table 1 to pinpoint the exact place where binary motifs are not enough, proven by the failure of ULTRA on all constructed datasets. For real-world experiments, we found that MOTIF, equipped with higher-order motifs, significantly outperforms binary motifs on datasets containing relatively few relations, such as WordNet-based datasets, which contain only around 10 relations. This can be reflected in the average performance from WN v1-WN v4 here from zero-shot and end-to-end results:

Setting	Model	MRR	H@10
Zero-shot	ULTRA	0.575	0.679
Zero-shot	MOTIF	0.601	0.701
End-to-End	ULTRA	0.480	0.656
End-to-End	MOTIF	0.607	0.717

In Sec. 7.3, we conduct a detailed investigation showing that these cases revealed that binary motifs (as used by ULTRA) often construct relation graphs with structurally very similar nodes, thereby failing to distinguish different relations adequately, as shown in Fig. 8. In contrast, higher-order motifs frequently break such node invariances, leading to more discriminative and informative relation representations. We will discuss these and demonstrate the necessity of higher-order motifs using real-world examples.

“Theoretical Claims: I did not thoroughly verify the proofs, as the theoretical claims are not critical to this paper..."

We would like to respectfully emphasize that one of the primary motivations and key contributions of this work is precisely to establish a rigorous and systematic theoretical framework to analyze and understand the expressive power of existing KGFMs, such as ULTRA. We believe these theoretical results significantly deepen our fundamental understanding of why certain KGFM variants outperform others in practice, and thus we respectively suggest that they should not be overlooked.

Strengths and Weaknesses:

“W1. The definition of relational hypergraphs is problematic… W2. … more intuitive explanations of link/relation invariants are needed. W3. More generally, the writing could be improved for better clarity,...”

We thank the reviewer for carefully pointing out these areas for improvement. We agree that the clarity and precision of our definitions, explanations, and examples can be improved for better presentation. We will explicitly clarify the definitions of relational hypergraphs (W1), provide clearer and more intuitive explanations for link/relation invariants (W2), and enhance overall readability with additional illustrative examples (W3).

“W4. “The experimental results on real-world data are not very convincing....”

Please see our detailed response on experimental design and analysis.

Other comments:

“S1. In line 56, it would be helpful to provide a clearer explanation of the term "binary motifs," as it appears to be a key concept.”

We will clarify this in the revised manuscript by explicitly defining a binary motif as a motif with a motif graph containing exactly two relation types ().

“S2. More generally, how are different entities and relations matched across various KGs? ...”

Our framework inherently matches similar relations across different KGs by constructing similar relational hypergraphs based on their structural roles. Specifically, relations with structurally similar contexts in different KGs will yield similar embeddings thanks to conditional MPNN since they can effectively capture the induced similar hypergraph neighborhoods without manual matching of entities and relations IDs. The same principle applies to entities: entities embedded in structurally similar local neighborhoods involving similar relations will naturally obtain similar embeddings across KGs.

Fig. 1 in our manuscript already illustrates how structurally similar relations across different KGs (e.g., provide ↔ supply, research ↔ produce) receive similar embeddings due to similar relational contexts. We will further expand this figure with additional explanatory details in the revised manuscript to ground this key concept.

“S3. A minor suggestion: In lines 111–113...”

We will modify these in the updated manuscripts.