PaperHub
Overall rating: 6.1 / 10
Poster · 4 reviewers (scores: 3, 3, 4, 3; min 3, max 4, std 0.4)
ICML 2025

ROS: A GNN-based Relax-Optimize-and-Sample Framework for Max-$k$-Cut Problems

OpenReview | PDF
Submitted: 2025-01-23 · Updated: 2025-07-24

Abstract

The Max-$k$-Cut problem is a fundamental combinatorial optimization challenge that generalizes the classic $\mathcal{NP}$-complete Max-Cut problem. While relaxation techniques are commonly employed to tackle Max-$k$-Cut, they often lack guarantees of equivalence between the solutions of the original problem and its relaxation. To address this issue, we introduce the Relax-Optimize-and-Sample (ROS) framework. In particular, we begin by relaxing the discrete constraints to the continuous probability simplex form. Next, we pre-train and fine-tune a graph neural network model to efficiently optimize the relaxed problem. Subsequently, we propose a sampling-based construction algorithm to map the continuous solution back to a high-quality Max-$k$-Cut solution. By integrating geometric landscape analysis with statistical theory, we establish the consistency of function values between the continuous solution and its mapped counterpart. Extensive experimental results on random regular graphs and the Gset benchmark demonstrate that the proposed ROS framework effectively scales to large instances with up to $20,000$ nodes in just a few seconds, outperforming state-of-the-art algorithms. Furthermore, ROS exhibits strong generalization capabilities across both in-distribution and out-of-distribution instances, underscoring its effectiveness for large-scale optimization tasks.
Keywords
Max-k-Cut · Learning to Optimize · Graph Neural Networks · Pre-train and Fine-tune · Sampling

Reviews and Discussion

Review (Rating: 3)

This paper proposes ROS, a GNN-based L2O method, to obtain high-quality Max-k-Cut solutions. The one-hot encoding of each node is relaxed to continuous variables, a GNN performs the node classification task, i.e., assigning nodes to k partitions, and the continuous GNN output is then used to construct a feasible solution via a random sampling step. A theoretical result guarantees the existence of a feasible Max-k-Cut solution when a globally optimal continuous solution is found. Numerical results on various benchmarks show that ROS can indeed provide high-quality solutions efficiently.

Update after rebuttal

My concerns are resolved by the extra experimental results and clarifications. A revised version with those details would be acceptable.

Questions for Authors

I already asked all my questions in the previous sections. They mainly concern clarifications of the experiments, to better evaluate the importance of ROS, and how well the theoretical contribution aligns with practice.

Claims and Evidence

Claim 1 [the consistency of function values between the continuous solution and its mapped counterpart]: theoretically supported in Theorem 3.2, but lacking empirical evidence. The reason is that Theorem 3.2 requires a global optimum for the relaxation, whereas in practice the solution is learned by a GNN. It would be more convincing if the authors could show the difference between the objective values of continuous solutions and their integer counterparts.

Claim 2 [ROS can efficiently scale to large instances]: supported in Section 4.

Claim 3 [ROS exhibits strong generalization capabilities]: supported in Section 4.

Methods and Evaluation Criteria

The overall idea of ROS makes sense to me. My only question is about the initial node embeddings. Random embeddings seem quite casual and make the random seed affect both training and evaluation. Would it be more reasonable to use more meaningful embeddings, e.g., ones that include some neighborhood information?

Theoretical Claims

I roughly checked all the proofs and they look correct.

Experimental Designs or Analyses

I have several concerns about the experiments.

  • The extra cost of preparing the training dataset is important for evaluating the effectiveness of an L2O method, but it is missing from the paper.

  • The experiments only test k=2,3, making it unclear whether ROS could be applied for larger k.

  • For the weighted benchmark, the edge weights are constrained to ±1 with 10% perturbations. Is there a reason for choosing such a specific setting? Especially given that ROS achieves its best performance relative to other methods in this setting.

Supplementary Material

I reviewed all supplementary materials.

Relation to Broader Scientific Literature

ROS follows a standard GNN-based L2O setting. The idea of solving a relaxation and then retrieving a nearby integer solution is commonly used in integer programming. The random sampling step shown in Algorithm 1 already exists in the literature; see, e.g., https://arxiv.org/pdf/2404.17452.

Essential References Not Discussed

The discussion of Max-k-Cut is quite sufficient, but a discussion of the relax-optimize-and-sample idea in other fields is missing.

Other Strengths and Weaknesses

Strengths

  • The paper is well-written and easy to follow.

  • Theoretical results are solid.

  • Simple setting results in good performance over various scenarios.

Weaknesses

I have already stated most of my concerns above. Additionally, the experimental results do not show that ROS generates better solutions than other methods, only that it is faster. The word "high-quality" is quite vague. Intuitively, as the objective value of a solution approaches the global optimum, it becomes much harder to improve further. It is unclear whether ROS already gives meaningful or useful solutions to any practical problems.

Other Comments or Suggestions

  • In lines 197-199, after executing Algorithm 1 T times, should one choose the solution with the highest objective value, if the objective is still defined as in Eq. (1)?

  • The order of the references in the second column of lines 201-202 should be corrected.

Author Response

Claims And Evidence

Response to C1: Theorem 3.2 requires a global optimum, which is why we introduce Theorem 3.3. It theoretically establishes the expected equivalence between relaxed and integer solutions for all feasible points, not just the global optimum. Our sampling algorithm and experiments rely on Theorem 3.3, ensuring practical relevance. Moreover, to address your concern, we add one row for the continuous objective function to Tables 10 and 12 in the ablation study (updated in Tables 1 and 2 in the anonymous link), explicitly comparing the continuous objective values and their integer counterparts.
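To make the expectation-preserving property of Theorem 3.3 concrete, here is a minimal numerical sketch (ours, not the authors' code; the k×N shape convention and the zero-diagonal assumption on W are ours): each column of a feasible relaxed point X defines a categorical distribution over the k parts, and the average objective of sampled one-hot assignments should approximately match the continuous objective f(X) = Tr(XWX^T).

```python
import numpy as np

def f(X, W):
    # Relaxed objective f(X) = Tr(X W X^T); X is k x N with
    # each column on the probability simplex.
    return np.trace(X @ W @ X.T)

def sample_assignment(X, rng):
    # Treat each column of X as a categorical distribution over the
    # k parts, draw one label per node, and one-hot encode the labels.
    k, N = X.shape
    labels = np.array([rng.choice(k, p=X[:, i]) for i in range(N)])
    X_hat = np.zeros((k, N))
    X_hat[labels, np.arange(N)] = 1.0
    return X_hat

rng = np.random.default_rng(0)
N, k = 200, 3
W = rng.random((N, N))
W = (W + W.T) / 2
np.fill_diagonal(W, 0.0)                    # no self-loops, as for a graph
X = rng.dirichlet(np.ones(k), size=N).T     # an arbitrary feasible relaxed point
vals = [f(sample_assignment(X, rng), W) for _ in range(500)]
print(f(X, W), np.mean(vals))               # the two values should nearly coincide
```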

Methods And Evaluation Criteria

Response to M1: See "Response to Q1" for Reviewer zJRf.

Experimental Designs Or Analyses

Response to E1: Since ROS is unsupervised, we only generate graphs, not ground truth. As stated in Section 4.1, the training dataset consists of 500 regular graphs, which can be generated within 1 second. The pre-training process runs for only one epoch, requiring just 8.75 seconds in total on our device. In contrast, other L2O baselines, such as ECO-DQN and ANYCSP, demand significantly longer training times, ranging from several hours to multiple days. This highlights the efficiency of ROS in both dataset preparation and model training.
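For reference, a training set of this kind can be generated in a single line per graph with networkx. This is a hypothetical reconstruction; the node count n=100 is an assumed illustrative value, as the response specifies only 500 regular graphs:

```python
import networkx as nx

# Unsupervised training set: 500 random 3-regular graphs
# (no labels are needed, since ROS trains without ground truth).
graphs = [nx.random_regular_graph(d=3, n=100, seed=s) for s in range(500)]
```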

Response to E2: We evaluate ROS for larger k (specifically, k=10) in our experiments on the real-world Bitcoin-OTC dataset. The corresponding results are provided in Table 1 in "Response to R1" for Reviewer bfPu.

Response to E3: We selected the [0.9, 1.1] perturbation range to highlight that ANYCSP struggles even with minimal weight variations, while ROS remains robust across different weight settings in the Max-k-Cut problem. To further address your concern, we conduct additional experiments with larger perturbation scales ([0, 10] and [0, 100]) on the weighted Gset benchmark (a sketch of the weighting scheme follows the list below). The results in Tables 3–6 in the anonymous link show that:

  • ROS consistently achieves the best performance in terms of both solution quality and computational efficiency.
  • Even under extreme perturbations, ROS maintains its advantage over baselines, demonstrating its robustness in handling arbitrary edge weights.
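As an aside, here is a minimal sketch of how such a perturbed weighted instance could be generated. This is our reading of the setup (±1 signs scaled by a multiplicative factor from the stated range); the variable names and edge count are illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)
n_edges = 1000                                 # illustrative edge count
signs = rng.choice([-1.0, 1.0], size=n_edges)  # base +/-1 edge weights
scale = rng.uniform(0.9, 1.1, size=n_edges)    # 10% multiplicative perturbation
weights = signs * scale                        # perturbed weighted instance
```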

Relation To Broader Scientific Literature and Essential References Not Discussed

While [1] also derives relaxation-based approaches and sampling methods for discrete Bayesian optimization, our sampling step is theoretically derived rather than borrowed directly from prior work. Theorem 3.2 establishes the relationship between continuous and discrete solutions at the global optimum, which motivates the design and analysis of our sampling strategy: each feasible continuous solution defines a categorical distribution over discrete assignments, and sampling from it preserves the expected objective value (Theorem 3.3). This makes relaxation and sampling inherently connected rather than an arbitrary choice. We acknowledge similar ideas in other fields and will expand the discussion in our paper.

Weakness

As shown in Table 3 of the manuscript and Tables 3–6 in the anonymous link, ROS consistently produces the highest-quality solutions while maintaining the fastest computational time in the weighted experiments. Additionally, results on the real-world Bitcoin-OTC dataset (Table 1, response to Reviewer bfPu) demonstrate that ROS effectively handles practical problems, confirming its applicability to weighted Max-k-Cut beyond synthetic benchmarks.

Other Comments Or Suggestions

Response to C1: We clarify that the correct selection criterion is to choose the solution with the lowest objective value of f(X), as defined in Problem (P) (right column, Line 131), where f(X) = Tr(XWX^T). This corresponds to the highest objective value of the original optimization problem in Equation (1) due to a constant shift and sign inversion. We will revise the manuscript to explicitly state this selection criterion to avoid confusion.
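A self-contained sketch of this selection rule (our illustration; T, the helper name, and the k×N shapes are placeholders):

```python
import numpy as np

def best_of_T(X, W, T, rng):
    # Draw T sampled assignments from the columns of X and keep the one
    # minimizing f(X_hat) = Tr(X_hat W X_hat^T), i.e. the best cut value
    # after the constant shift and sign inversion mentioned above.
    k, N = X.shape
    best, best_val = None, np.inf
    for _ in range(T):
        labels = np.array([rng.choice(k, p=X[:, i]) for i in range(N)])
        X_hat = np.zeros((k, N))
        X_hat[labels, np.arange(N)] = 1.0
        val = np.trace(X_hat @ W @ X_hat.T)
        if val < best_val:
            best, best_val = X_hat, val
    return best, best_val
```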

Response to C2: We will swap the order of the two references.

Questions for Authors

  • We have added extensive experiments, including tests on real-world weighted Max-k-Cut instances and perturbation studies across different ranges. These results, detailed in Table 1 in the response to Reviewer bfPu and Tables 3–6 (anonymous link), further highlight the importance of ROS.
  • Regarding the theoretical contribution, ROS is not a direct adaptation from other fields; its components are tightly integrated. The relaxation and sampling steps ensure consistency between the relaxed and discrete solutions, while the powerful GNN effectively bridges the optimization gap. This demonstrates a strong alignment between theory and practical performance.

Reference

[1] Michael R. et al. A Continuous Relaxation for Discrete Bayesian Optimization.

Reviewer Comment

Thanks for your response. Those extra experimental results and clarifications resolve most of my concerns. I will raise my rating to 3 and suggest the authors add those results properly in a revised version.

Author Comment

Comment: Thanks for your response. Those extra experimental results and clarifications resolve most of my concerns. I will raise my rating to 3 and suggest the authors add those results properly in a revised version.

Reply: We appreciate your efforts in reviewing our paper and rebuttal. Thank you for your feedback and for raising the rating. We will incorporate the additional experimental results and clarifications into the revised version to further strengthen the paper. Thank you again for your constructive comments!

Review (Rating: 3)

The paper introduces ROS, a GNN-based framework for solving the Max-k-Cut problem by relaxing the discrete optimization problem into a continuous optimization task. A graph neural network (GNN) optimizes the relaxed problem, followed by a sampling-based algorithm to obtain a discrete solution. The authors integrate geometric landscape analysis with statistical theory to establish the consistency of function values between the continuous solution and its mapped discrete counterpart.

Questions for Authors

The key reasons for my current rating are below. I would be happy to revisit the rating if the questions raised below are satisfactorily addressed.

1. Justify why the inability to generalize to unseen k during inference is not a severe limitation.
2. Please discuss (and compare with, unless there are obvious reasons not to) the missing baselines discussed above.
3. Include real-world datasets.

Claims and Evidence

The authors show the superiority of their algorithm, but it is not clear whether the learning-based baselines also used the pre-training and fine-tuning phases.

In addition, some direct baselines have been neither cited nor compared with.

Methods and Evaluation Criteria

  • Baselines: It appears [1], [2], and [3] are related and potential baselines, obtained by suitably changing the loss/reward function. [1], in particular, does not even need the optimization function to be differentiable. Why are they not discussed and compared with?

[1] Rishi Rajesh Shah, Krishnanshu Jain, Sahil Manchanda, Sourav Medya and Sayan Ranu, "NeuroCut: A Neural Approach for Robust Graph Partitioning", in KDD, 2024.

[2] Anton Tsitsulin, John Palowitch, Bryan Perozzi, and Emmanuel Müller. 2023. Graph clustering with graph neural networks. Journal of Machine Learning Research 24, 127 (2023), 1–21.

[3] Aritra Bhowmick, Mert Kosan, Zexi Huang, Ambuj Singh, and Sourav Medya. 2024. DGCLUSTER: A Neural Framework for Attributed Graph Clustering via Modularity Maximization. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 38. 11069–11077.

  • Datasets: None of the datasets are real. Please include real-world datasets containing at least a few thousand nodes (see [1] for an example).

  • Generalizability to k: It appears that the method needs to know the number of partitions (k) before inference. Specifically, it needs to be trained for each specific value of k, since it does not generalize to unseen k at inference time. This is evident from line 232, where the output embedding is in $\mathbb{R}^{k\times N}$.
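To illustrate the reviewer's point, the k-dependence of the output layer might look like the following generic sketch (ours, not the paper's actual architecture; the sizes are illustrative):

```python
import torch.nn as nn

hidden_dim, k = 64, 3
head = nn.Linear(hidden_dim, k)  # the output dimension is baked in at
                                 # construction time, so an unseen k
                                 # requires a new (retrained) output head
```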

Theoretical Claims

The claims seem intuitively correct, but I did not go through them deeply.

Experimental Designs or Analyses

  • Could you please clarify whether Fig. 2 represents training time or inference time? Could you also demonstrate scalability with respect to ground-truth generation, training time, and inference time explicitly, and compare with non-neural approaches? Non-neural approaches generalize to any value of k. Hence, it is important to look at scalability along all three dimensions to evaluate the practical value of this work.

Supplementary Material

I have gone through the ablation-study experiments and they seem fine to me.

Relation to Broader Scientific Literature

Earlier works try to solve the Max-k-Cut problem using graph learning approaches, but they seem limited to the unweighted setting, whereas the proposed approach solves weighted Max-k-Cut problems using a GNN.

Essential References Not Discussed

As mentioned above, important related works have not been discussed and compared with.

[1] Rishi Rajesh Shah, Krishnanshu Jain, Sahil Manchanda, Sourav Medya and Sayan Ranu, "NeuroCut: A Neural Approach for Robust Graph Partitioning", in KDD, 2024.

[2] Anton Tsitsulin, John Palowitch, Bryan Perozzi, and Emmanuel Müller. 2023. Graph clustering with graph neural networks. Journal of Machine Learning Research 24, 127 (2023), 1–21.

[3] Aritra Bhowmick, Mert Kosan, Zexi Huang, Ambuj Singh, and Sourav Medya. 2024. DGCLUSTER: A Neural Framework for Attributed Graph Clustering via Modularity Maximization. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 38. 11069–11077.

Other Strengths and Weaknesses

Strengths:

  • The paper presents a GNN-based framework for solving the weighted Max-k-Cut problem, converting a discrete optimization task into a continuous one for easier processing, which is interesting and novel.

  • Compared to the baselines, the results look good in both efficiency and quality.

Weaknesses:

  • The authors have not mentioned whether the learning-based baselines also use separate pre-training and fine-tuning steps.
  • Figure 2 appears to show inference time. Please report the time for pre-training, training/fine-tuning, etc.
  • The method does not generalize to unseen k. This appears to be a serious limitation.
  • The benchmark datasets do not include any real-world dataset.
  • Important works have not been discussed and compared with.
  • The code base is not shared, which hampers reproducibility.

Other Comments or Suggestions

In Tables 2, 3, and 4, you could directly include the results for ROS without fine-tuning. It would enhance readability.

Author Response

Claims And Evidence

Response to C1: Please refer to "Response to W2" for Reviewer bfPu.

Methods And Evaluation Criteria

Response to M1:

  • NeuroCUT [1] is a reinforcement learning-based partitioning method, while DGCLUSTER [2] and DMoN [3] employ graph neural networks to optimize clustering objectives. However, these methods are designed for graph clustering, which aims to minimize inter-cluster connections, whereas Max-k-Cut seeks to maximize inter-partition connections. As a result, they are not directly applicable to our problem. Additionally, while NeuroCUT claims to support arbitrary objective functions, its node selection heuristics are tailored only to graph clustering, making it unsuitable for Max-k-Cut.
  • Despite these differences, we evaluated NeuroCUT as a representative graph-clustering baseline. We trained it on 500 3-regular graphs, as for ROS, and tested it on Bitcoin-OTC, a real-world signed network with 5,881 nodes and 35,592 weighted edges (ranging from -10 to 10), which captures trust relationships among Bitcoin traders. The results are shown in "Response to R1" for Reviewer bfPu; ROS significantly outperforms NeuroCUT and the other baselines, further demonstrating its effectiveness for Max-k-Cut.

Response to M2: We include a real-world dataset, Bitcoin-OTC, in our evaluation, which contains 5,881 nodes and 35,592 weighted edges. The comparison results with baselines on this dataset are presented in "Response to R1" for Reviewer bfPu.

Response to M3: Please refer to "Response to W1" for Reviewer bfPu for details regarding the generalizability of our method to unseen k.

Experimental Designs Or Analyses:

Figure 2 represents the fine-tuning time for ROS. ROS does not require ground-truth generation, unlike supervised methods. Pre-training for a specific k is lightweight: training on 500 regular graphs for one epoch takes only 8.75 seconds, whereas L2O baselines like ECO-DQN and ANYCSP require hours or even days. The scalability of fine-tuning (inference) time is detailed in Table 1 of the manuscript, and ROS efficiently scales to instances with large N.

Essential References Not Discussed

Please see the response to M1.

Weakness

Response to W1: Please see "Response to W2" for Reviewer bfPu.

Response to W2: As stated in Section 4.1, the training dataset consists of 500 regular graphs, and the pre-training process runs for only one epoch, requiring just 8.75 seconds in total. In contrast, L2O baselines like ECO-DQN and ANYCSP require significantly longer training times, ranging from several hours to multiple days. This highlights the efficiency of ROS. The fine-tuning (inference) time is already reported in Section 4.

Response to W3: Please refer to "Response to W1" for Reviewer bfPu regarding the generalizability of our method to unseen k.

Response to W4: Please see Response to M2.

Response to W5: Please see Response to M1.

Response to W6: We have uploaded our code at https://anonymous.4open.science/r/ROS_anonymous-1C88/.

Other Comments Or Suggestions

Since fine-tuning directly solves test instances, we cannot remove this stage. However, to enhance readability, we now include results for ROS-vanilla (i.e., ROS without pre-training). The updated tables explicitly present ROS-vanilla results, improving clarity. Below are the updated rows in Tables 1, 2, and 3 of the manuscript (Tables 4 and 5 already included ROS-vanilla results):

Updated row in Table 1 of the manuscript:

Model       | N=100, k=2  | N=100, k=3  | N=1000, k=2  | N=1000, k=3  | N=10000, k=2   | N=10000, k=3
ROS-vanilla | 132.00±1.89 | 243.75±2.00 | 1322.95±6.57 | 2440.55±4.97 | 13191.25±20.73 | 24317.40±21.36

Updated row in Table 2 of the manuscript:

Model       | G70 (k=2) | G70 (k=3) | G72 (k=2) | G72 (k=3) | G77 (k=2) | G77 (k=3) | G81 (k=2) | G81 (k=3)
ROS-vanilla | 9004      | 9982      | 6066      | 7210      | 8678      | 10191     | 12260     | 14418

Updated row in Table 3 of the manuscript:

Model       | G70 (k=2) | G70 (k=3) | G72 (k=2) | G72 (k=3) | G77 (k=2) | G77 (k=3) | G81 (k=2) | G81 (k=3)
ROS-vanilla | 8989.38   | 9973.75   | 6140.50   | 7207.13   | 8744.47   | 10190.37  | 12278.70  | 14341.25

Questions For Authors

Response to Q1: Please refer to the "response to W1" for reviewer bfPu.

Response to Q2: Please see Response to M1.

Response to Q3: Please see Response to M2.

Reference

[1] Rishi Rajesh Shah et al. NeuroCUT: A Neural Approach for Robust Graph Partitioning. KDD 2024.
[2] Anton Tsitsulin et al. Graph Clustering with Graph Neural Networks. JMLR 2023.
[3] Aritra Bhowmick et al. DGCLUSTER: A Neural Framework for Attributed Graph Clustering via Modularity Maximization. AAAI 2024.

Reviewer Comment

The generalization to unseen k seems like a hack. I am happy with the other changes made and will increase the rating to 3.

Author Comment

Comment: The generalization to unseen k seems like a hack. I am happy with the other changes made and will increase the rating to 3.

Response: We appreciate your efforts in reviewing our paper and rebuttal. We also thank you for your feedback and for considering raising the rating. Regarding the generalization to unseen k, we provide two approaches based on the "pre-training + fine-tuning" framework of ROS:

  • ROS-vanilla: This method is directly fine-tuned on the test instance without pre-training, avoiding any dependency on predefined last-layer dimensions.

  • ROS-partial: To apply the pre-training technique and improve efficiency while still generalizing to unseen k, this variant is pre-trained on k=2 while saving all parameters except the last layer. Before fine-tuning, the pre-trained parameters are loaded, and the last layer is randomly initialized to accommodate the new k.

As shown in Table 2 in the response to Reviewer bfPu, both approaches demonstrate the flexibility and extensibility of the "pre-train + fine-tune" framework of ROS. Furthermore, while our framework supports generalization to unseen k, we acknowledge that exploring this aspect through model architecture design, as in [1], is an exciting direction. We appreciate your valuable comments once again.

[1] Rishi Shah, Krishnanshu Jain, Sahil Manchanda, Sourav Medya, Sayan Ranu. NeuroCUT: A Neural Approach for Robust Graph Partitioning. KDD 2024.

Review (Rating: 4)

This paper introduces ROS, a GNN-based framework for Max-k-Cut. The authors propose a solution that relaxes the problem to a continuous space, optimizes it with a neural network, and samples a discrete solution. They compare with existing neural and non-neural baselines and show improvements in both quality and running time.

Questions for Authors

See Weaknesses.

Claims and Evidence

  1. ROS has better quality: supported by evaluation on diverse datasets and values of k.
  2. Better running time: supported by running-time plots.

Methods and Evaluation Criteria

Yes

Theoretical Claims

I did not check the proofs in detail.

Experimental Designs or Analyses

Mostly it is clear. The only thing that is unclear is whether the baselines were fine-tuned.

Supplementary Material

Table 11. Sampling results.

Relation to Broader Scientific Literature

  1. Proposed method does not require any ground truth.
  2. Framework uses relaxation approach, which is effective in this setup.

Essential References Not Discussed

[A] Rishi Shah, Krishnanshu Jain, Sahil Manchanda, Sourav Medya, Sayan Ranu. NeuroCUT: A Neural Approach for Robust Graph Partitioning. KDD 2024.

[A] solves the graph partitioning problem for arbitrary partitioning objectives. The approach is inductive with respect to the number of partitions.

Other Strengths and Weaknesses

Weaknesses:

  1. Kindly clarify whether the model can do inference on unseen k (the number of partitions). Can the model be fine-tuned for different k? If yes, how? From line 233 it seems the output layer is fixed to k.

  2. Were the neural baselines also fine-tuned? Kindly clarify.

  3. Code is not shared.

Other Comments or Suggestions

None

Author Response

Essential References Not Discussed

Response to R1:

  • NeuroCUT [1] is a reinforcement learning-based partitioning method designed for graph clustering, which aims to minimize inter-cluster connections, whereas Max-k-Cut seeks to maximize inter-partition connections. Additionally, while NeuroCUT claims to support arbitrary objective functions, its node selection heuristics are specifically tailored for graph clustering. Due to this fundamental difference, NeuroCUT is not directly applicable to our problem.
  • Despite these differences, we compared our method with NeuroCUT. We trained NeuroCUT on 500 3-regular graphs, as for ROS, and tested it on Bitcoin-OTC [2], a real-world signed network with 5,881 nodes and 35,592 weighted edges (ranging from -10 to 10), which captures trust relationships among Bitcoin traders. As shown in Table 1, ROS significantly outperforms NeuroCUT and the other baselines, further demonstrating its effectiveness for Max-k-Cut.

Table 1: Evaluation results on the Bitcoin-OTC dataset.

Model    | Value (k=2) | Time (s, k=2) | Value (k=3) | Time (s, k=3) | Value (k=10) | Time (s, k=10)
NeuroCut | 14242       | 39.46         | 16672       | 42.65         | 13235        | 250.90
PIGNN    | 14587       | 62.31         | -           | -             | -            | -
MD       | 14989       | 37.15         | 18448       | 50.40         | 21182        | 105.92
ANYCSP   | 10678       | 180.20        | 14319       | 180.16        | 19359        | 180.24
ROS      | 15384       | 2.94          | 18585       | 2.44          | 21251        | 2.04

Weakness

Response to W1:

  • ROS-vanilla (without pre-training) can directly generalize to any value of k, since there is no pre-training.

  • ROS (with pre-training and fine-tuning) improves efficiency but does not generalize directly to unseen k due to the fixed output layer during pre-training. However, this limitation can be addressed through ROS-partial, a simple modification that enables adaptation to different k.

  • ROS-partial works by pre-training the model on k=2 while saving all parameters except the last layer. Before fine-tuning, the pre-trained parameters are loaded, and the last layer is randomly initialized to accommodate the new k (see the sketch after Table 2 below). This approach serves as a middle ground between ROS (fully pre-trained) and ROS-vanilla (no pre-training).

  • We evaluate ROS-partial, ROS, and ROS-vanilla on the Bitcoin-OTC dataset. The results in Table 2 show that ROS-partial effectively generalizes to different k while maintaining strong performance.

Table 2: Comparison between pre-training schemes on the Bitcoin-OTC dataset.

Model       | Value (k=2) | Time (s, k=2) | Value (k=3) | Time (s, k=3) | Value (k=10) | Time (s, k=10)
ROS         | 15384       | 2.94          | 18585       | 2.44          | 21251        | 2.04
ROS-vanilla | 15661       | 5.24          | 18977       | 4.77          | 21365        | 4.43
ROS-partial | 15102       | 4.24          | 18732       | 3.93          | 21308        | 2.92
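As a concrete illustration of the ROS-partial loading scheme described above, here is a minimal PyTorch sketch (our own; the function name and the "head" prefix are hypothetical, since the actual layer naming depends on the released code):

```python
import torch

def load_all_but_last(model, ckpt_path, last_layer_prefix="head"):
    # Restore every pre-trained (k=2) parameter except the final layer,
    # which keeps its fresh random initialization so that fine-tuning
    # can target a different, previously unseen k.
    state = torch.load(ckpt_path, map_location="cpu")
    kept = {name: p for name, p in state.items()
            if not name.startswith(last_layer_prefix)}
    model.load_state_dict(kept, strict=False)  # missing keys stay random
    return model
```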

Response to W2:

  • The pre-training and fine-tuning phases of ROS correspond to the training and inference phases of other L2O baselines. Specifically, ROS is pre-trained on a collected dataset, similar to how L2O baselines are trained. During fine-tuning, ROS further optimizes based on test instances, whereas standard L2O inference keeps parameters fixed. To ensure fairness, we include the full fine-tuning time in our reported results. Thus, the datasets used for pre-training (training) and fine-tuning (testing) in ROS align with those in other L2O methods.
  • To further address the concern, we also conduct experiments where we introduce fine-tuning to existing L2O baselines. After training, these models are further fine-tuned on test instances, and we plot the cut value against fine-tuning iterations in the anonymous link. The results confirm that even with fine-tuning, other baselines do not surpass ROS in solution quality.

Response to W3: We have uploaded our code at https://anonymous.4open.science/r/ROS_anonymous-1C88/.

Reference

[1] Rishi Rajesh Shah, Krishnanshu Jain, Sahil Manchanda, Sourav Medya and Sayan Ranu, "NeuroCut: A Neural Approach for Robust Graph Partitioning", in KDD, 2024.

[2] S. Kumar, F. Spezzano, V.S. Subrahmanian, C. Faloutsos. Edge Weight Prediction in Weighted Signed Networks. IEEE International Conference on Data Mining (ICDM), 2016.

Reviewer Comment

I thank the authors for additional experiments. Could you clarify what was the objective used for NeuroCUT in this experiment. Further, the node selection heuristic I believe could be kept random in this case.

I would expect clarification on how was NeuroCUT was integrated just to ensure comparison is fair.

I am happy to see running time results. ROS is significantly faster than all methods. Although generalization to k is hacky, but happy to see the results.

Author Comment

Comment: I thank the authors for the additional experiments. Could you clarify what objective was used for NeuroCUT in this experiment? Further, I believe the node selection heuristic could be kept random in this case. I would expect clarification on how NeuroCUT was integrated, just to ensure the comparison is fair. I am happy to see the running-time results. ROS is significantly faster than all other methods. Although the generalization to k is hacky, I am happy to see the results.

Reply: We appreciate your efforts in reviewing our paper and rebuttal, and thank you for your valuable feedback.

  • To implement NeuroCUT, we used the loss function defined in Problem (P) (line 131, right column), as the NeuroCUT source code minimizes the objective.
  • To ensure a fair comparison, we replaced the original score-based node selection heuristic with random selection, as suggested.
  • Additionally, we found that the original K-means initialization is designed for graph clustering, and applying it to Max-k-Cut often leads to suboptimal starting points, even worse than random initialization. We therefore replaced it with random initialization.

The updated results on Bitcoin-OTC are included in the following table.

Updated Table 1: Updated evaluation results on the Bitcoin-OTC dataset. Here, NeuroCut is equipped with random initialization and random node selection, unlike in the previous Table 1.

Model    | Value (k=2) | Time (s, k=2) | Value (k=3) | Time (s, k=3) | Value (k=10) | Time (s, k=10)
NeuroCut | 10260       | 240.98        | 10896       | 237.09        | 17768        | 249.99
PIGNN    | 14587       | 62.31         | -           | -             | -            | -
MD       | 14989       | 37.15         | 18448       | 50.40         | 21182        | 105.92
ANYCSP   | 10678       | 180.20        | 14319       | 180.16        | 19359        | 180.24
ROS      | 15384       | 2.94          | 18585       | 2.44          | 21251        | 2.04

Regarding the generalization to unseen k, we provide two approaches based on the "pre-training + fine-tuning" framework of ROS:

  • ROS-vanilla: This method is directly fine-tuned on the test instance without pre-training, avoiding any dependency on predefined last-layer dimensions.

  • ROS-partial: To apply the pre-training technique and improve efficiency while still generalizing to unseen k, this variant is pre-trained on k=2 while saving all parameters except the last layer. Before fine-tuning, the pre-trained parameters are loaded, and the last layer is randomly initialized to accommodate the new k.

As shown in Table 2 of the rebuttal, both approaches demonstrate the flexibility and extensibility of the "pre-train + fine-tune" framework of ROS. Furthermore, while our framework supports generalization to unseen k, we acknowledge that exploring this aspect through model architecture design, as in [1], is an exciting direction.

We appreciate your valuable comments once again.

[1] Rishi Shah, Krishnanshu Jain, Sahil Manchanda, Sourav Medya, Sayan Ranu. NeuroCUT: A Neural Approach for Robust Graph Partitioning. KDD 2024.

Review (Rating: 3)

The paper proposes a GNN-based solver for the Max-k-Cut problem.

Questions for Authors

  • Noticing that the initial embeddings h0 are assigned random values, I have doubts about the correctness of doing so.

Claims and Evidence

Yes, but I do have many questions.

Methods and Evaluation Criteria

Some points may not be clear enough. For example:

  • Do the other baselines use the same training data as the training + fine-tuning datasets of ROS? If not, can other methods be fit under the pretrain-finetune framework?

Theoretical Claims

I did not check them very carefully.

Experimental Designs or Analyses

See comments.

Supplementary Material

I have checked Appendix D. The proofs were not checked in detail.

Relation to Broader Scientific Literature

The work contributes to the combinatorial optimization community.

Essential References Not Discussed

n/a

Other Strengths and Weaknesses

Strengths

  • The Max-k-Cut problem, as a generalization of the Max-Cut problem, holds significant research value.

Weaknesses

  • The scope of the paper is relatively narrow. The proposed framework is only applicable to the Max-k-Cut problem. It would be more meaningful if the framework could be applied more widely to other combinatorial optimization problems.

  • The method appears to lack novelty. The pretrain-finetune framework employed is not new and has been widely used in other contexts.

  • The comparison with baseline methods may not be entirely fair. I question whether the training data used are consistent with those used by ROS. Given that ROS involves a two-stage process of pre-training and fine-tuning, whereas other methods include only a single training step, this discrepancy could affect the validity of the comparisons.

  • I find the experimental section somewhat challenging to follow. The descriptions of the experimental settings are somewhat disorganized and could benefit from clearer and more structured presentation.

  • The absence of a related works section is notable. While it is true that research on the max k cut problem may be relatively sparse, it is still unusual to omit this section entirely. Including a discussion of related works would provide valuable context and help situate the current research within the broader field.

Other Comments or Suggestions

  • Fig. 1 has two h6 entries in the initialization step of the grey box.

  • Notation is sometimes abused. For example, $\overline{X}$ shows up three times with different meanings: in Def. 3.1 it is a point, in Theorem 3.2 it is the globally optimal solution, and in Q2 it is a high-quality solution. This is confusing.

  • Statistics of the datasets are not given. For example, how many graphs are in the different datasets?

  • For the results in Figure 2, which part of the datasets is used for training, which for fine-tuning, and which for testing? It seems that Sec. 4.1 does not spell this out.

Author Response

Methods And Evaluation Criteria

Response to M1: Please see "Response to W2" for Reviewer bfPu.

Weakness

Response to W1: The Max-k-Cut problem is a fundamental NP-complete problem with applications in physics [1], power networks [2], and data clustering [3]. While ROS is tailored to Max-k-Cut, its core Relax-Optimize-and-Sample framework is generalizable to other combinatorial optimization problems by adjusting the objective functions. Investigating such extensions, particularly their theoretical guarantees, is a promising direction for future work.

Response to W2: The core novelty of our work lies in the Relax-Optimize-and-Sample (ROS) framework, where the "pre-train + fine-tune" approach is used solely for efficiency. The key contributions of ROS are:

  • The probability simplex relaxation (written out after this list) ensures that the optimal values of the relaxed and original Max-k-Cut problems are equivalent (Theorem 3.2).
  • A GNN parametrizes the decision variable, enhancing both representation power and computational efficiency.
  • The proposed sampling procedure maps the relaxed solution to a discrete Max-k-Cut solution while preserving the objective value (Theorem 3.3).
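For concreteness, the simplex relaxation referenced in the first bullet can be written out as follows. This is our transcription, assembled from the objective f(X) = Tr(XWX^T) stated in the response to another reviewer and the k×N decision variable noted in the reviews:

```latex
\min_{X \in \mathbb{R}^{k \times N}} \; \operatorname{Tr}\!\left( X W X^{\top} \right)
\quad \text{s.t.} \quad x_i \in \Delta_k, \; i = 1, \dots, N,
\qquad \Delta_k := \Big\{ p \in \mathbb{R}^{k}_{\ge 0} : \textstyle\sum_{j=1}^{k} p_j = 1 \Big\},
```

where x_i denotes the i-th column of X, i.e., the assignment distribution of node i over the k partitions.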

Response to W3: Please see "Response to W2" for Reviewer bfPu.

Response to W4: We will refine and reorganize the experimental section to improve clarity and readability. Specifically, we will integrate Tables 2 and 3 in the anonymous link into the manuscript to provide a clearer presentation of dataset statistics and enhance the overall structure of the experimental setup.

Response to W5: We will add a Related Work section to provide context. This will cover approximation algorithms (e.g., Goemans-Williamson (GW) [4], Frieze et al. [5]), non-convex relaxations (Rank-2 [6], QUBO [7]), and Lovász extensions [8]. We will clarify how ROS differs by ensuring objective value consistency and leveraging GNN-based optimization for high-quality solutions. We will also add Table 1 in the anonymous link to further clarify the distinctions between these methods.

Other Comments and Suggestions:

Response to C1 and C2: We will revise Fig. 1 to correct the typo and ensure that all notations are used consistently throughout the manuscript.

Response to C3: The statistics of the training and testing datasets are summarized in Tables 2 and 3 in the anonymous link, which we will add to the manuscript for clarity.

Response to C4: The training dataset consists of 500 3-regular graphs for k=2 and 500 5-regular graphs for k=3. The fine-tuning (testing) datasets correspond to different graph types: (a) random regular graphs, (b) Gset, and (c) weighted Gset, as detailed in Section 4.1 (Line 272, right column, Page 5).

Questions For Authors

Response to Q1:

  • The random initialization does not introduce instability, as shown by the low standard deviations in Table 1 of the manuscript, where the relative error across runs remains around 1%.
  • Additionally, node features and adjacency information can be incorporated as the initialization in ROS when available. For example, on the Cora dataset [9], we interpolated both the node features and the adjacency matrix to match the input dimension. Results in Table 4 in the anonymous link show that, when all other model parameters remain the same, the model yields identical outputs across different initialization methods, confirming the adequacy of random initialization.
  • Furthermore, random initialization facilitates distributed deployment, where each node is placed on a different device, avoiding the global operations required for feature interpolation.
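As a small illustration of these two initialization options (our sketch; the sizes and the interpolation call are assumptions, not the authors' exact implementation):

```python
import torch
import torch.nn.functional as F

N, d = 2708, 64                 # e.g. a Cora-sized graph; d is an assumed width
h0_random = torch.randn(N, d)   # ROS default: random initial node embeddings

# Feature-based alternative: resize raw node features to the input width d.
feats = torch.rand(N, 1433)     # placeholder for real node features
h0_feat = F.interpolate(feats.unsqueeze(1), size=d, mode="linear").squeeze(1)
```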

Reference

See the anonymous link.

Final Decision

This submission proposes the Relax-Optimize-and-Sample (ROS) framework to solve the Max-k-Cut problem. Unlike relaxation techniques that do not guarantee equivalence between relaxed solutions and solutions of the original problem, ROS involves a sampling-based construction algorithm that maps the relaxed solution (produced using a GNN) back to a high-quality solution in the original problem space. The authors provide theoretical guarantees on consistency. Experiments in settings with up to 20K nodes show fast computation times and reasonable performance (while ROS does not "dominate" in every experiment, its benefits are demonstrated with respect to generalizability across instances).

Reviewers initially raised concerns about the experimental setup (lack of real-world experiments) and the fairness of baseline comparisons, but these concerns were mitigated during the author-reviewer discussion, notably via the addition of a real-world Bitcoin-OTC dataset (with k=10). They also expressed some concerns about the clarity of the experiments section (which should be addressed in the revised version). However, overall, reviewers were convinced of the relevance and significance of the contribution.