Tree-of-Table: Unleashing the Power of LLMs for Enhanced Large-Scale Table Understanding
Abstract
Reviews and Discussion
The paper introduces the "Tree-of-Table" method, designed to improve the reasoning abilities of large language models (LLMs) when dealing with large and complex tabular data. The approach involves two main steps: Table Condensation and Decomposition, which simplifies and organizes the data, and Hierarchical Table-Tree Construction, creating a structured representation that benefits systematic reasoning. This method enhances the efficiency and generalization capabilities of LLMs, demonstrated through superior performance on datasets like WikiTQ, TableFact, FeTaQA, and BIRD. This study advances LLM methods for parsing and understanding extensive tabular datasets, setting new benchmarks in handling complex table-based information.
Strengths
- The idea of using a tree as a roadmap to guide the LLMs through table(s) is sound, but it seems like an adaptation of other work to table tasks. It is not very novel to upgrade LLM reasoning from chain-of-thought to tree-of-thought, which has already been verified in other domains, such as graphs.
Weaknesses
W: I'm unclear about the logic in the introduction from lines 94 to 103, specifically why Chain-of-Table is limited to smaller tables. What does 'smaller' refer to exactly? Is it the number of rows/columns, or the total number of tokens across all cells?
W: The performance gain of the proposed Tree-of-Table is limited, especially compared with Chain-of-Table in Tables 1 and 2. Additionally, it is not easy to differentiate the contributions of Tree-of-Table and Chain-of-Table. The authors should provide a critical analysis of their differences in the introduction or method section and show how the improvement accounts for the performance enhancement.
Several relevant papers should be considered in references:
- Large Language Models are few(1)-shot Table Reasoners
- StructGPT: A General Framework for Large Language Model to Reason over Structured Data
- TAP4LLM: Table Provider on Sampling, Augmenting, and Packing Semi-structured Data for Large Language Model Reasoning
- Table Meets LLM: Can Large Language Models Understand Structured Table Data? A Benchmark and Empirical Study
- TableRAG: Million-Token Table Understanding with Language Models
Questions
Q1: I'm unclear about the logic in the introduction from lines 94 to 103, specifically why Chain-of-Table is limited to smaller tables. What does 'smaller' refer to exactly? Is it the number of rows/columns, the total number of tokens across all cells, or, as in a database, the number of tables linked by keys?
Q2: What is the OP pool mentioned in Figure 1? I suggest the authors rephrase the description of the Chain-of-Table process, in case not all readers are familiar with this method. Additionally, the authors should explicitly describe the difference between Chain-of-Table and the proposed Tree-of-Table.
Details of Ethics Concerns
In reviewing the manuscript, I observed that lines 138-141, where the authors state "our proposed chain-of-table approach innovates by generalizing a broader set of ....", may violate the double-blind requirement of ICLR. The mentioned Chain-of-Table, which is another paper, might reveal the authors' identities.
I recommend that the ACs evaluate this situation to decide if the paper warrants a desk reject.
Reference:
Chain-of-table: Evolving tables in the reasoning chain for table understanding (https://arxiv.org/pdf/2401.04398)
We would like to quickly clarify the identity leakage concern before submitting the formal rebuttal. The issue you mentioned in Lines 138-141 is actually a typo; it should be “our proposed tree-of-table” rather than “chain-of-table”, indicating our proposed method in this paper. Therefore, it does NOT reveal any information about the authors’ identity. We apologize for the typo and any confusion it may have caused, and we will correct it in the next version.
Thank you for recognizing the significance of Tree-of-Table. Below, we provide explanations addressing the concerns you have raised.
For Weakness and Question 1
“why the Chain-of-table is limited to smaller tables”
Overall, Chain-of-Table processes table understanding based on the Chain-of-Thought LLM reasoning approach, which employs an intuitive linear thought chain to decompose the problem. At each step of decomposition, it selects operations from a predefined operations pool and generates intermediate results for table processing. However, when dealing with large tables accompanied by complex, multi-branch logic, this linear approach can produce lengthy and disorganized thought processes, making table reasoning chaotic and prone to errors. Therefore, this linear approach is generally suited for relatively simple problems and smaller tables. In contrast, the hierarchical tree structure of Tree-of-Table is naturally suited for handling complex logical problems [1, 2, 3] and provides a natural and scalable way of reasoning while operating within the context limits of LLMs.
“What does 'smaller' refer to exactly?”
In previous work [4,5,6,7], the size of a table typically refers to the number of rows. We follow the widely used setting in these works, considering tables with more than 30 rows as large-scale. Additionally, we also take into account the recently published BIRD dataset, which is very large, with 549K rows per database.
"Performance"
In the table below, we present a comparison of performance and computation between our proposed Tree-of-Table and the previous SOTA method Chain-of-Table. Compared to Chain-of-Table, our approach requires only about 37% more time (5.7 vs. 7.8), yet achieves a significant improvement of 3.58 BLEU on the large-scale BIRD table-based dataset, with even lower resource consumption (fewer generated samples). This demonstrates the significant advantages of our method.
| Method | BLEU | Time cost | Generated Samples |
|---|---|---|---|
| Chain-of-Table | 12.12 | 5.7 | 120 |
| Tree-of-Table | 15.70 | 7.8 | 90 |
“Several relevant papers should be considered in references”
Thank you for your suggestion. We have incorporated these references into the revised version.
For Weakness and Question 2
“what is the OP pool mentioned in Figure 1?”
As mentioned in Section 3.4.2, OP Pool refers to the operations pool, which represents the set of operations from which the LLM dynamically selects those relevant to table-based reasoning, such as select, group_by, sort_by, and others. In selecting the operations pool, we based our choices on existing work [4] and selected the most frequently used table operations from the resource at [8].
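To make this concrete, here is a minimal sketch of what such an operations pool could look like; the operation names and their pandas-based implementations are our own illustration of the idea, not the paper's exact prompt-level operations:

```python
import pandas as pd

# Hypothetical operations pool: each entry maps an operation name that the
# LLM may select to a callable transforming an intermediate DataFrame.
OP_POOL = {
    "select_columns": lambda df, cols: df[cols],
    "select_rows":    lambda df, pred: df[pred(df)],
    "group_by":       lambda df, col: df.groupby(col, as_index=False).size(),
    "sort_by":        lambda df, col: df.sort_values(col),
    "add_column":     lambda df, name, fn: df.assign(**{name: fn(df)}),
}

# Example: applying an LLM-chosen operation to an intermediate table.
table = pd.DataFrame({"team": ["A", "B", "A"], "score": [3, 1, 2]})
print(OP_POOL["sort_by"](table, "score"))
```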
“rephrase the process of chain-of-table” and “the difference between chain-of-table and proposed tree-of-table”
Following your suggestion, we have refined the formulation of Chain-of-Table and Tree-of-Table in Sec. 3.6 and Sec. A.4 of the revised paper to make their differences clearer. As summarized in our response to Question 1, Chain-of-Table follows the Chain-of-Thought reasoning approach: it decomposes the problem with a linear thought chain, selecting operations from a predefined pool and generating intermediate tables at each step. This linear approach suits relatively simple problems and smaller tables, but under complex, multi-branch logic it can produce lengthy, disorganized thought processes that make table reasoning chaotic and error-prone. In contrast, our proposed Tree-of-Table method is divided into three main parts:
- Table Condensation (Sec. 3.3): Given the input, which may contain many large-scale related tables connected via foreign keys, we first employ LLMs to condense the tables. This helps recall the sub-tables pertinent to the query and eliminate extraneous information.
- Table-Tree Construction (Sec. 3.4): Following the preprocessing steps, we instruct the LLM to dynamically generate child nodes in the tree. Each generation of child nodes relies solely on their multi-level parent nodes, without needing reference to uncle nodes. We then iteratively construct child nodes level by level until we reach the leaf nodes at the bottom of the tree.
- Table-Tree Execution (Sec. 3.5): Finally, after constructing the Table-Tree, we consider it a proxy task for the entire table understanding process. The traversal and execution of operations across this tree enable a seamless reasoning process.
In summary, the differences between the two methods are as follows:
- Planning Style: Tree-of-Table uses a hierarchical, tree-based thought decomposition approach, while Chain-of-Table uses a linear thought decomposition approach.
- Execution Strategy: Notably, Chain-of-Table processes the entire chain of history at each dynamic planning step and is shown to be effective for relatively small tables. However, as tables grow larger and questions become more complex, maintaining the complete thought chain becomes cumbersome, ultimately decreasing the efficiency of the model. Our Tree-of-Table method addresses this by embracing the tree's inherent "divide and conquer" philosophy to construct a Table-Tree. Each generation of child nodes relies exclusively on the information from its multi-level parent nodes, without needing the uncle nodes. In this way, we bound the length of the historical chain on which each node's dynamic planning relies by the depth of the tree, considerably simplifying generation at each level (a minimal sketch follows this list).
- Data Preprocessing: Chain-of-Table does not preprocess complex tables with rich foreign key connections, whereas Tree-of-Table includes a Table Condensation step to handle such complexities.
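To illustrate this "parent-path-only" planning, here is a minimal Python sketch under our own assumptions: llm_propose_children stands in for the paper's prompted LLM call, MAX_DEPTH/MAX_DEGREE mirror the MAXDepth/MAXDegree limits of Sec. 3.4.1, and how sibling results are merged during execution is our simplification.

```python
from dataclasses import dataclass, field

MAX_DEPTH, MAX_DEGREE = 8, 4  # growth limits, mirroring MAXDepth/MAXDegree

@dataclass
class Node:
    op: str          # table operation chosen at this step
    parents: tuple   # ancestor ops only, no sibling ("uncle") context
    children: list = field(default_factory=list)

def build_tree(root_op, llm_propose_children):
    """Grow the Table-Tree level by level. Each child is proposed from its
    ancestor path alone, so prompt length is bounded by MAX_DEPTH."""
    root = Node(root_op, parents=())
    frontier = [root]
    for _ in range(MAX_DEPTH):
        next_frontier = []
        for node in frontier:
            path = node.parents + (node.op,)
            for op in llm_propose_children(path)[:MAX_DEGREE]:
                child = Node(op, parents=path)
                node.children.append(child)
                next_frontier.append(child)
        if not next_frontier:  # all branches reached leaf operations
            break
        frontier = next_frontier
    return root

def execute(node, table, apply_op):
    """Depth-first execution: apply this node's op, then recurse into children."""
    table = apply_op(table, node.op)
    for child in node.children:
        table = execute(child, table, apply_op)
    return table

# Tiny stub so the sketch runs; a real call would prompt the LLM (hypothetical).
def llm_propose_children(path):
    return [] if len(path) >= 3 else [f"op_{len(path)}a", f"op_{len(path)}b"]

tree = build_tree("condense_table", llm_propose_children)
```

The key point the sketch captures is that the context passed to llm_propose_children is only the ancestor path, whose length is bounded by MAX_DEPTH, rather than the full operation history a linear chain would carry.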
Clarification on Ethics Concerns
Sorry for this! The issue you mentioned in Lines 138-141 is actually a typo; it should be “our proposed tree-of-table” rather than “chain-of-table”, indicating our proposed method in this paper. Therefore, it does NOT reveal any information about the authors’ identity. We apologize for the typo and any confusion it may have caused, and we have corrected it in the revised version.
[1] Yao, S., Yu, D., Zhao, J., et al. Tree of Thoughts: Deliberate problem solving with large language models. NeurIPS 2023.
[2] Zhou, D., Schärli, N., Hou, L., et al. Least-to-most prompting enables complex reasoning in large language models. arXiv:2205.10625, 2022.
[3] Bentley, J. L. Multidimensional binary search trees used for associative searching. Communications of the ACM, 18(9):509–517, 1975.
[4] Wang, Z., et al. Chain-of-Table: Evolving tables in the reasoning chain for table understanding. ICLR 2024.
[5] Nahid, M. M. H., Rafiei, D. TabSQLify: Enhancing reasoning capabilities of LLMs through table decomposition. NAACL 2024.
[6] Chen, W. Large language models are few(1)-shot table reasoners. EACL 2023.
[7] Ye, Y., Hui, B., Yang, M., et al. Large language models are versatile decomposers: Decomposing evidence and questions for table-based reasoning. SIGIR 2023.
[8] Bytescout. https://bytescout.com/blog/20-important-sql-queries.html
Thank you for your valuable comments. We hope the responses provided above sufficiently address your concerns. Should you have any further questions, we are more than happy to address them.
Thanks for the detailed response. The idea of using a tree as a roadmap to guide the LLMs is sound. However, the new results concerning the effectiveness and efficiency of Chain-of-Table and Tree-of-Table raise some concerns. It seems that Chain-of-Table might not perform as badly on larger tables as initially claimed in both the manuscript and the authors' responses; instead, it still achieves quite similar performance to the proposed method while requiring less time. This raises the further concern that the new method might overcomplicate the system and that the gained improvement is limited. Given this fundamental issue, it's difficult for me to provide a positive rating. Nevertheless, I appreciate the additional information from the authors. I will maintain my current rating and look forward to future discussions with other reviewers and ACs.
Thank you for your feedback. In our work, regarding the issue of efficiency, we evaluate it from two aspects:
- Resource Consumption: Following the evaluation metric of Chain-of-Table, we use the number of generated samples in Table 5 to evaluate resource consumption, on which Tree-of-Table has a smaller value than Chain-of-Table. This demonstrates that Tree-of-Table is more resource-friendly on a large-scale dataset like BIRD (549K rows per database). This is because, during the construction of the Table-Tree, we limit the tree depth and breadth, which helps prevent the LLM from engaging in meaningless, excessive thoughts that could lead to repetitive or erroneous reasoning processes.
- Latency/Time: Tree-of-Table does have higher latency than Chain-of-Table. This is due to the depth-first search execution method employed by Tree-of-Table, which introduces some time overhead for searching and backtracking.
In summary, Tree-of-Table has relatively lower resource consumption and only a reasonable additional latency compared to Chain-of-Table. Considering that we achieve a 3.58 BLEU improvement on such a large-scale dataset as BIRD (549K rows per database), we believe Tree-of-Table is a very practical method. Additionally, as shown in Figure 4(a), Tree-of-Table also demonstrates strong generalization across datasets of different scales. Based on the above analysis, we have refined the experimental results reported in our previous response.
We hope the above response addresses your concerns.
Best wishes to you
Paper Authors
This paper introduces "Tree-of-Table" to enhance LLMs' ability to understand and reason over large-scale tabular data. Key contributions include a framework with table condensation, which distills relevant information from large tables; table-tree construction, which organizes reasoning steps into a hierarchical tree structure; and table-tree execution, which systematically processes the tree through DFS. The structure breaks down complex table-understanding tasks into manageable sub-problems, allowing more efficient processing than linear chain approaches. Their experiments demonstrate enhanced performance over existing methods on large-scale tables across multiple datasets, including WikiTQ, TableFact, FeTaQA, and BIRD.
Strengths
This paper introduced a tree structure for handling tabular data, which differs from the traditional linear chain-of-thought and more recent chain-of-table methods. What I like in particular is how they combined table condensation with tree decomposition - the authors seem to have thought carefully about how humans break down complex problems and have built these insights into their approach. The experimental work is solid. They tested their method on several different datasets (WikiTQ, TableFact, FeTaQA, and BIRD), which gives us confidence in the results. The numbers are impressive - they're getting better performance than existing methods, especially on BIRD, which has those really large tables that are typically hard to handle. I was particularly convinced by their ablation studies.
The paper is easy to follow. The figures really help explain what's going on - Figure 1 does a great job showing how their approach differs from previous methods. They've managed to explain some pretty complex technical stuff without making it too dense. That said, they could have made some of the implementation details clearer. In terms of impact, this work matters because large-scale table understanding is a real problem that comes up all the time in practical applications. Their method shows promise for handling tables in finance, healthcare, and other fields where you often deal with complex tabular data. The performance improvements they're showing are consistent across different LLMs, and across various table understanding datasets. What stands out most to me is that they've taken a practical problem (with demonstrated efficiency improvement against other methods) that lots of people struggle with and come up with a solution that actually works better than what we had before. The evidence is there in their results, and they've explained their approach well enough that others could build on it.
Weaknesses
- The theoretical foundation needs more work. While the tree-based approach shows good empirical results, there's limited analysis of why it works better than linear chains.
- Some key experimental details are missing or unclear: they don't specify how they chose parameters like MAXDegree and MAXDepth for the Table-Tree, which seem pretty important for the method's performance. Also, the computational overhead of building and traversing the tree structure wasn't properly analyzed, for example the memory requirements for storing intermediate results at tree nodes and the overall computational complexity compared to simpler approaches.
- The ablation studies could go deeper. There's no clear analysis of how the tree structure's depth affects accuracy. The comparison with Chain-of-Table focuses mainly on final accuracy but doesn't explore cases where their method might perform worse.
Questions
- The authors should explain the theoretical advantages of their hierarchical decomposition - when does it work better and why? This would help us understand the method's limitations and where it might fail.
- There's no discussion of error propagation through the tree structure. In a tree structure, errors at higher levels will propagate down through all child nodes. For example, if the table condensation step (at the root) removes important information, or if an early operation in the tree is incorrect, how does this affect the final result? The paper shows good overall accuracy but doesn't analyze these failure cases.
- Related to above, how sensitive is your method to the quality of the initial table condensation step? What happens if crucial information is accidentally filtered out?
- What is the complete set of operations in the operation pool? How were these operations selected and validated?
- Have you analyzed cases where Tree-of-Table performs worse than Chain-of-Table? This would be valuable for understanding the method's limitations.
For Question 4
In selecting the operation pool, we follow the method of existing work [9], choosing the most frequently used table operations from the resource at [10]. This selection proves effective in practice, even on a large-scale dataset like BIRD.
For Question 5
Indeed, we analyze the worse cases in our experiments; they typically stem from an unreasonable tree depth. An abnormally large depth can reduce efficiency and lead to chaotic thoughts in Tree-of-Table, while an abnormally small depth severely limits the LLM's reasoning potential. Both extreme settings can result in a certain number of worse cases.
[1] Yao, S., Yu, D., Zhao, J., et al. Tree of Thoughts: Deliberate problem solving with large language models. NeurIPS 2023.
[2] Zhou, D., Schärli, N., Hou, L., et al. Least-to-most prompting enables complex reasoning in large language models. arXiv:2205.10625, 2022.
[3] Bentley, J. L. Multidimensional binary search trees used for associative searching. Communications of the ACM, 18(9):509–517, 1975.
[4] Blumofe, R. D., Leiserson, C. E. Scheduling multithreaded computations by work stealing. Journal of the ACM, 46(5):720–748, 1999.
[5] Loh, W.-Y. Classification and regression trees. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 1(1):14–23, 2011.
[6] Newell, A. Human Problem Solving. Prentice-Hall, 1972.
[7] Newell, A., Shaw, J. C., Simon, H. A. Report on a general problem solving program. In IFIP Congress, volume 256, pp. 64, 1959.
[8] Miller, G. A. The magical number seven, plus or minus two: Some limits on our capacity for processing information. Psychological Review, 63(2):81, 1956.
[9] Wang, Z., et al. Chain-of-Table: Evolving tables in the reasoning chain for table understanding. ICLR 2024.
[10] Bytescout. https://bytescout.com/blog/20-important-sql-queries.html
Thank you for your appreciation of Tree-of-Table. We have improved our paper based on your insightful comments.
Thank you for your praise and appreciation of our work, including your remarks that "the experimental work is solid", "the numbers are impressive", "easy to follow", "practical" and so on. Encouraged by your comments, we are pleased to respond to your valuable suggestions to further improve our work.
For Weakness 1 and Question 1
Previous work has shown that tree-based structures exhibit significant advantages in LLM reasoning, as demonstrated in [1,2]. In addition, earlier works have demonstrated from multiple perspectives that hierarchical tree-based architectures are superior to linear ones, including but not limited to the following points:
- Divide and Conquer Strategy [3]: Tree-based architectures leverage the divide-and-conquer approach, a well-established algorithmic paradigm. This strategy breaks a problem into smaller subproblems, solves each subproblem independently, and combines their results, which can lead to more efficient processing, especially for complex, large-scale problems.
- Scalability [4]: Trees can handle larger and more complex datasets more efficiently than linear structures. As data grows, the depth of a tree increases logarithmically rather than linearly, allowing for more scalable processing (see the short derivation after this list).
- Improved Decision Making [5]: In decision-making processes, tree structures can better model decision paths and outcomes, providing clearer insights into the reasoning behind decisions.
- Cognitive Alignment [6,7,8]: Human reasoning often aligns more closely with hierarchical structures, which can make tree-based models more intuitive and easier to interpret.
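To quantify the Scalability point: if a task decomposes into N leaf sub-problems and each node has at most b children (MAXDegree), a balanced Table-Tree reaches all leaves within a logarithmic depth, so the ancestor context any single planning step carries is logarithmic rather than linear in the number of reasoning steps. This back-of-the-envelope bound is our illustration, not a formal result from the paper:

```latex
\mathrm{depth} \approx \lceil \log_b N \rceil
\qquad\Longrightarrow\qquad
\underbrace{O(N)}_{\text{context per step, linear chain}}
\;\;\text{vs.}\;\;
\underbrace{O(\log_b N)}_{\text{context per step, Table-Tree}}
```

For example, with b = 4 and N = 256 sub-problems, the tree depth is 4, while a linear chain would carry up to 256 prior steps in its history.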
We have also included the above discussions in the appendix (Sec. A.3) of the revised paper version.
For Weakness 2 and Weakness 3
Following your suggestion, we have added the following experiments:
- Selection of MAXDegree and MAXDepth for the Table-Tree: In our experiments, we first conduct preliminary experiments to roughly determine their value ranges, and then perform detailed hyperparameter experiments to identify the optimal values (a sketch of this two-stage sweep follows the tables). The ablation study results on WikiTQ are listed in the tables below, which show that our proposed method is relatively robust to these two hyperparameters.
| MAXDepth | 6 | 8 | 10 |
|---|---|---|---|
| Accuracy (%) | 59.96 | 61.11 | 60.47 |

| MAXDegree | 3 | 4 | 5 |
|---|---|---|---|
| Accuracy (%) | 60.03 | 61.11 | 60.91 |
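A minimal sketch of how such a two-stage sweep might look; evaluate_accuracy is a hypothetical stand-in for running the full Tree-of-Table pipeline on the dev set (the toy scorer below only makes the sketch runnable and carries no real data):

```python
import itertools

def evaluate_accuracy(max_depth, max_degree):
    # Toy stand-in so the sketch runs; the real call would execute the full
    # Tree-of-Table pipeline with these limits and return dev-set accuracy.
    return -(abs(max_depth - 8) + abs(max_degree - 4))

# Stage 1: coarse sweep to find a promising region of the hyperparameter space.
coarse_grid = list(itertools.product((4, 8, 12, 16), (2, 4, 6, 8)))
coarse_best = max(coarse_grid, key=lambda cfg: evaluate_accuracy(*cfg))

# Stage 2: fine sweep around the coarse optimum (values matching the tables).
fine_grid = list(itertools.product((6, 8, 10), (3, 4, 5)))
best_depth, best_degree = max(fine_grid, key=lambda cfg: evaluate_accuracy(*cfg))
print(best_depth, best_degree)  # -> 8 4 with the toy scorer
```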
- Memory Efficiency of Tree-of-Table: In fact, Tree-of-Table is a training-free, prompt-based method that requires no additional data, making it very memory-efficient. In our analysis, we find that the Tree-of-Table structure increases memory cost by less than 5% compared to the Chain-of-Table structure.
For Question 2 and Question 3
To address your concern regarding error propagation, we have added the following study. Specifically, we randomly remove important information during the table condensation step (at the root) and evaluate the resulting performance (a sketch of this perturbation protocol follows the table). The experimental results on WikiTQ below show that even when a certain amount of important information is removed, the performance of Tree-of-Table does not significantly degrade and it still produces largely correct results, indicating the robustness of our method.
| Remove Percent (%) | 5 | 8 | 10 | 12 | 15 |
|---|---|---|---|---|---|
| Accuracy (%) | 61.09 | 61.00 | 60.87 | 60.22 | 59.45 |
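For concreteness, the perturbation protocol could be implemented as follows; this is a minimal sketch under our own assumptions (the response does not specify the exact removal mechanism), and ablate_condensation plus the toy rows are hypothetical:

```python
import random

def ablate_condensation(condensed_rows, remove_pct, seed=0):
    """Randomly drop remove_pct percent of the rows kept by table
    condensation, simulating loss of important information at the root."""
    rng = random.Random(seed)
    return [r for r in condensed_rows if rng.random() >= remove_pct / 100.0]

# Toy usage: in the real study, each perturbed table would be fed through the
# full Tree-of-Table pipeline and accuracy recorded, as in the table above.
condensed_rows = [{"id": i, "value": i * 10} for i in range(100)]
for pct in (5, 8, 10, 12, 15):
    print(pct, len(ablate_condensation(condensed_rows, pct)))
```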
I thank the authors for their response, which addressed most of my questions. I will keep my score since it was already positive.
Thanks very much for your feedback! We are glad that our discussions addressed your concerns.
Best wishes to you
Paper Authors
This paper proposes a Tree-of-Table method, which generates tree-structured thoughts to improve LLMs' reasoning ability on large tables. Experiments are conducted on multiple datasets, including WikiTQ, TableFact, FeTaQA, and BIRD, and show better performance than baselines.
Strengths
- The experimental results show that the proposed Tree-of-Table method leads to better performance than baselines.
- The experiments are conducted on multiple table-based datasets and show the effectiveness of the proposed method.
Weaknesses
- Deriving such a huge tree for table QA raises efficiency concerns.
- It would be better to show some cases in which tree-of-table can handle better than chain-of-table.
Questions
- What types of queries and tables can mainly benefit from tree-of-table rather than chain-of-table?
- How easily can the derivation of the trees fall into a dead loop?
We sincerely thank you for your valuable comments and for acknowledging the effectiveness of our method on multiple table-based datasets. In response to the questions and suggestions you raised, we offer the following replies.
For Weakness 1
Yes, this is indeed a worthwhile consideration! In the previously submitted paper, we analyzed the efficiency of Tree-of-Table from multiple perspectives, including the number of generated samples and overall time cost, consistent with the efficiency evaluation used in Chain-of-Table. These analyses can be found in Section 4.3 "Efficiency Analysis", Table 5, Section A.1.1 "Ablation Study on Time Cost", and Table 6 of the main paper. For your convenience, the table below compares the performance and computation of our proposed Tree-of-Table against the previous SOTA method Chain-of-Table. Compared to Chain-of-Table, our approach requires only about 37% more latency (5.7 vs. 7.8), yet achieves a significant improvement of 3.58 BLEU on the large-scale BIRD table-based dataset, with even lower resource consumption (fewer generated samples). This demonstrates the significant advantages of our method.
| Method | BLEU | Time cost | Generated Samples |
|---|---|---|---|
| Chain-of-Table | 12.12 | 5.7 | 120 |
| Tree-of-Table | 15.70 | 7.8 | 90 |
For Weakness 2 and Question 1:
Overall, our Tree-of-Table has advantages over Chain-of-Table in the following types of queries and tables:
- Complex Query Logic: Tree-of-Table excels when queries involve complex logic, typically requiring multi-layered nested table tracing, filtering, comparison, and computation. As we discussed in Section 3.4.2 (previously submitted version) and Sec. 3.6 (revised version), the hierarchical construction and execution of Tree-of-Table are particularly advantageous here.
- Large-Scale Tables: Tree-of-Table demonstrates superior performance on large-scale tables, especially those with many rows. As mentioned in Section 4.3 "Generalization Ability under Different Table Sizes" and Figure 4a, Tree-of-Table exhibits stronger generalization ability with respect to table size.
Following your suggestion, we have included the case studies in appendix of the revised paper version (Sec. A.1.5) that demonstrate the superior performance of our proposed Tree-of-Table compared to Chain-of-Table.
For Question 2
Nice insight! We indeed encountered the situation you mention during our initial design. To address it, our method incorporates prompts specifically instructing the LLM to keep the decomposition within the limits of the input context and prevent dead loops. This helps prevent the LLM from engaging in meaningless, excessive thoughts that could lead to repetitive or erroneous reasoning processes. Specifically, as mentioned in Section 3.4.1, we set maximum values for the depth and degree, denoted "MAXDepth" and "MAXDegree". Therefore, in our experiments, the derivation of the trees does not result in the dead loop you were concerned about.
Thank you for your valuable comments. We hope that the above responses address your concerns. Should you have any further questions, we are more than happy to address them.
This paper presents "Tree-of-Table", a method to improve LLMs' reasoning capabilities over complex tabular data. The proposed method decomposes a reasoning task into several hierarchical steps with a tree structure. Experiments on four datasets demonstrate the effectiveness of the proposed approach.
As raised by Reviewer MiAo (link), the major issue is that the advantages of the proposed method might be limited compared to the existing Chain-of-Table approach, as the latter "achieves a quite similar performance compared with the proposed method even requires less time". Given this fundamental issue, the decision is rejection.
Additional Comments from Reviewer Discussion
There are additional minor issues on clarity and lack of certain ablations, which were addressed during the rebuttal period.
Reject