Thank you for reviewing our work! Here are some detailed comments which we believe resolve all of your main concerns.

Complexity

"The proposed AT algorithm has higher time complexity compared to zero-sum AT algorithms such as PGD, FGSM and TRADES."

This is true. In some sense, the benefits of BETA-AT (e.g., eliminating robust overfitting, matching the performance of AutoAttack without heuristics, etc.) all come at the cost of increased computation cost. In Appendix B, we examine this tradeoff closely; we show that while is slower than PGD and TRADES, BETA is 5.11 times faster than AutoAttack.

Questions

"In Table 1. Why does BETA-AT have lower test accuracy than BETA-AT for clean data?"

This is the expected result and consistent with the vast majority of the literature in the field of adversarial robustness. It is well known (see, e.g., [A,B,C]) that adversarial robustness is at odds with clean accuracy. As an adversary gets stronger (i.e., it uses more steps of gradient ascent), the trade-off in clean accuracy becomes more pronounced. In Table 1, as we use more steps of gradient ascent (i.e., we replace BETA-AT with BETA-AT), the model becomes more robust, but clean accuracy falls.

[A] Tsipras, Dimitris, et al. "Robustness may be at odds with accuracy." arXiv preprint arXiv:1805.12152 (2018).

[B] Zhang, Hongyang, et al. "Theoretically principled trade-off between robustness and accuracy." International conference on machine learning. PMLR, 2019.

[C] Dobriban, Edgar, et al. "Provable tradeoffs in adversarially robust classification." IEEE Transactions on Information Theory (2023).