Thank you for your valuable feedback, please find below response to some of the points raised in your review.

D1, D2: Please see our answer to point (B) in the response to all reviewers. If you have suggestions for additional statistics we would be happy to extend Table 1 and 4. The 3 training sets we created (Single Table, Binary Join, Multi-Join) progressively increase the difficulty of the prediction task and are modeled after common patterns seen in real database products (In [1] Amazon reports the majority of production database queries have 0-2 joins).

D3, D5: We have updated the paper to include training and inference times. The GNN model has an inference time of 35ms, while the transformer model takes 97ms. Furthermore, we now compare our approach to PostgreSQL's cardinality estimation (Figures 3, 4). PostgreSQL, a widely used database, serves as a baseline in most relevant papers [2-5]. Our methods outperform its traditional summary-based techniques and show greater robustness, especially at the tail. In general, the research on instance-based learned cardinality estimation has shown they outperform training-free (traditional) methods [2-4, 8-16]. The cost to train and the investment needed to build training pipelines has been a major blocker for wide adoption of such techniques despite their high accuracy. Thus if pre-trained or zero-shot models prove to be equally as performant as instance-based methods then we could use them in practice.

D3, D4: The scientific problem we targeted in this paper is if cardinality estimation can be done in a zero-shot / pre-trained way. A positive answer motivates a new benchmark that will allow the training and testing of such models. Using GNN and transformer models, we show that the problem is not zero-shot but needs fine-tuning. Potentially there are other model architectures that could work better for this problem. The contribution of this paper is showing that cardinality estimation can be solved with pre-trained models, coupled with a smaller number of training data used for fine-tuning and introduce a new benchmark that can enable research in this area. We hope to motivate the research community to focus on zero-shot/pre-trained models for cardinality estimation and use CardBench to do so.

D6: Q-error has become the standard metric for evaluating cardinality estimation [2-5, 8-16]. Deploying models in a database system can indeed provide useful insights but requires query execution performance analysis to correctly attribute performance wins or shortcomings. It is well accepted in the database community that accurate cardinality estimation is beneficial [6]. Cardinality estimation is a major building block in many database problems, including all recommenders (index, materialized views, partitioning), query optimization, as well as workload management and scheduling. As such accurate cardinality estimation is critical for high-performance databases.

[1] Alexander van Renen et al 2024. Why TPC is Not Enough: An Analysis of the Amazon Redshift Fleet VLDB 2024

[2] Zongheng Yang et al. Balsa: Learning a Query Optimizer Without Expert Demonstrations SIGMOD '22

[3] Ryan Marcus, et al VLD 2019. Neo: a learned query optimizer.

[4] Kipf, Andreas, et al. "Learned cardinalities: Estimating correlated joins with deep learning." arXiV 2018

[5] Benjamin Hilprecht and Carsten Binnig. VLDB 2022. Zero-shot cost models for out-of-the-box learned cost prediction.

[6]: Leis, Viktor, et al. "How good are query optimizers, really?." Proceedings of the VLDB Endowment 9.3 (2015): 204-215.

[7] Unnesting Arbitrary Queries Thomas Neumann and Alfons Kemper https://cs.emis.de/LNI/Proceedings/Proceedings241/383.pdf

[8]: Benjamin Hilprecht, et al DeepDB: learn from data, not from queries! Proc. VLDB 2020

[9]: Getoor, Lise, Benjamin Taskar, and Daphne Koller. "Selectivity estimation using probabilistic models." SIGMOD 2001.

[10]: Liu, Henry, et al. "Cardinality estimation using neural networks." Proceedings of the 25th Annual International Conference on Computer Science and Software Engineering. 2015.

[11]: Dutt, Anshuman, et al. "Selectivity estimation for range predicates using lightweight models." Proceedings of the VLDB Endowment 12.9 (2019): 1044-1057.

[12] Kipf, Andreas, et al. "Learned cardinalities: Estimating correlated joins with deep learning." arXiv preprint arXiv:1809.00677 (2018).

[13] Sun, Ji, and Guoliang Li. "An end-to-end learning-based cost estimator." arXiv preprint arXiv:1906.02560 (2019).

[14] Woltmann, Lucas, et al. "Cardinality estimation with local deep learning models." Proceedings of the second international workshop on exploiting artificial intelligence techniques for data management. 2019.

[15] Parimarjan Negi et al Robust Query Driven Cardinality Estimation under Changing Workloads. VLDB 2016

[16] Negi, Parimarjan, et al. "Flow-loss: Learning cardinality estimates that matter." arXiv preprint arXiv:2101.04964 (2021).