Summary of Revision:

We sincerely thank all the reviewers for their insightful reviews and valuable comments, which are instructive for us to improve our paper further.

The reviewers generally held positive opinions of our paper, in that the proposed approach is “technically reasonable”, “well presented”, “clearly explained”, and we “clearly describes the motivation for the paper and the methods used”; this paper is “well-organized”, “clearly presented”, “well-written”, and “easy to follow”; we “demonstrate effectiveness through both qualitative and quantitative analyses”, the experiment results are “remarkable”; and the proposed approach “achieves state-of-the-art (SOTA) performance”.

The reviewers also raised insightful and constructive concerns. We made every effort to address all the concerns by clarifying DECRL’s distinctions from related methods and the ability to address diverse relation semantics. We supplement with new experiments on Wikidata, YAGO, and GDELT datasets, along with additional ablation and case studies.

Q1: DECRL’s distinctions from related methods

While entity groups, hypergraphs, and clustering can all model high-order correlations, our approach offers unique advantages:

Firstly, methods using entity groups and hypergraphs require learning entity assignment mappers at each timestamp, a process that is complex to update and maintain, resulting in significant computational overhead. Our clustering-based approach, however, adapts more flexibly to dynamic data changes and is lightweight, facilitating easier integration with other techniques.

Secondly, we use a fuzzy clustering algorithm with a fuzzy smoothing hyperparameter that controls node membership distribution, preventing clusters with very few nodes. This approach allows for effective cluster construction even for nodes with limited interactions. Using entity graphs or hypergraphs to achieve a similar advantage would be much more computationally expensive and resource-intensive.

In addition, our experiments demonstrate the effectiveness of our approach compared to methods using entity groups and hypergraphs. For example, the DECRL-w/o-fusion variant, which uses only clustering for representation learning, achieves MRR, Hits@1, Hits@3, and Hits@10 scores of 57.98, 41.90, 66.97, and 92.00, respectively. These results outperform the hypergraph-based method, i.e., DHyper, which scores 56.15, 43.76, 65.46, and 85.89 on the same metrics. This superior performance can be partly attributed to fuzzy clustering’s ability to prevent the formation of extremely small clusters.

Q2: The ability to address diverse relation semantics

Firstly, it is important to note that TKG datasets do not provide explicit semantic descriptions of relations like “leave from” or “transfer to”. The training process typically uses only entity and relation IDs. However, we do model different relation types using a Relation-Aware Graph Convolutional Network, capturing distinct characteristics of various relation types even without explicit semantic descriptions.

Moreover, DECRL is designed to capture the temporal evolution of high-order correlations, which indirectly addresses the issue of diverse relation semantics. For example, if entities consistently interact over a continuous period, they have a higher probability of being clustered together at each timestamp. By capturing the temporal evolution of high-order correlations, our approach reinforces the closeness of their relationship over time. Conversely, if entities do not consistently interact over time, they have a lower probability of being clustered together. The temporal evolution component allows for the gradual distancing of these entities in the representation space. Therefore, DECRL can effectively handle scenarios where different relations may indicate varying levels of future interaction, without relying on explicit semantic information.

To illustrate this capability, we would like to draw attention to the comparison between Figure 2d (Final DECRL) and Figure 2f (Final DECRL-w/o-fusion, which only models high-order correlations without capturing their temporal evolution). This comparison clearly illustrates that capturing the temporal evolution of high-order correlations leads to superior entity representations, as evidenced by the larger inter-cluster distances and tighter intra-cluster entity groupings. Furthermore, by comparing the first and third columns of Figure 2 in the manuscript, we can observe the progression of training. This comparison demonstrates that capturing the temporal evolution of high-order correlations gradually increases the separation between clusters while simultaneously tightening the grouping of entities within clusters.

Q3: Performance on Wikidata, YAGO, and GDELT datasets.

We have conducted additional experiments on WIKI, YAGO, and GDELT datasets. The results of these experiments are presented in Tables 1 and 3 of the attached rebuttal PDF. We are pleased to report that our approach has achieved the SOTA relation prediction performance across all these datasets, demonstrating the effectiveness and robustness of our approach.

New experimental results (see the rebuttal PDF):

Performance on different datasets: Tables 1 and 3 in the rebuttal PDF show the relation prediction performance of DECRL on WIKI, YAGO, and GDELT.
Performance of entity prediction task: Table 2 in the rebuttal PDF shows the entity prediction performance of DECRL on GDELT.
The contributions of attentive temporal encoder and the fuzzy c-means clustering method: Table 4 in the rebuttal PDF shows the performance comparison of DECRL and its variants on ICEWS14.
Model efficiency: Figure 1 in the rebuttal PDF illustrates the training time comparison with DHyper on ICEWS14 (in seconds).
Case study: Figure 2 in the rebuttal PDF illustrates the entity representations of DHyper on ICEWS14C.