We thank the reviewer for their time and constructive comments. We have updated the manuscript to improve organization, results interpretation. We thoroughly address your concerns below. We hope the reviewer’s score can be updated to reflect the significance, novelty, and timeliness of our study.

Authors should discuss difference to existing works, epsecially to works in similar topics.

Thank you for the constructive feedback! We have made substantial revision. To clarify, our novel model design is partially based on the physical constraint loss used by PINN, but also expands and improves the PINN to address the problem of weather downscaling and forecasting, collaborating with meteorology experts and physicists. The weather prediction task is highly complicated, with many weather factors and physical processes in the play, easily influenced by local variations like change of boundary conditions, small-scale phenomena like microclimates and external forces like heat from the sun. Many of these vital factors, which have huge impacts in the first-principle equations, are missing in the data due to difficulties to measure and quantify. For example, as Eq. 14 in Page 9 shows, the effect of friction characterizing air viscous resistance cannot be represented by any of the existing data elements. The nuances make the physical equation incomplete. Therefore, it is intractable to rely on a fixed PDE, while PINN directly uses a known PDE to guide optimization under all circumstances, which could be hard to adapt to the real world.

To address this concern, we do not rely on any known physical equations with PINN to guide prediction, but propose a data-driven approach to adaptively consummate the first-principle physical equation to explain the physical mechanism that drives the weather prediction. Our approach has the potential of discovering the intricate interplay between various weather factors that is previously ignored, as an adaption to various conditions in different areas. We not only complete the tasks of weather downscaling and forecasting, but also provide insights of the nuances between different climates at any continuous spacetime. Our model is of great value to the meteorology community, as verified by the physicists we are collaborating with.

Furthermore, we propose a latent force term as a parametrization term in the equation, following the parametrization strategy [1] widely adopted by meteorology experts to supplement the forces that cannot be represented by the selected explicit PDE terms. All of these novel model designs differ from existing works in the PINN family. There is almost no work that successfully applies PINN for weather prediction. We believe that our approach fills this gap in time and provides a feasible way to improve our understanding of the physical mechanism of climate and improve the deep learning models’ performances, which is well-supported by our promising experimental results. We wish to highlight that our PhyDL-NWP framework only contains 55 thousand parameters, as shown in Table 7 in the appendix, which is about 1000 times lighter than some large models and extremely efficient to train. In our experiments, the training of deep learning model usually takes 20~50x more time than obtaining the PDE we need. Our contribution is not trivial and is of great value to not only the meteorology community but also the representation learning community. When tackling similar scientific tasks that also involve complex interplay between variables and insufficient data measurement of the nuances, our work will provide a valuable reference.

Authors should discuss difference to existing works, especially to the downscaling problem.

Our work is inherently different from most of the existing downscaling works, as our approach directly models the continuous dynamics, instead of performing discrete superresolution. Our approach is much more elegant, adaptive and accurate. In our paper, we use a paragraph to highlight this difference in Sec. 3.2, "With the trained , we can easily obtain the weather factors at any given input coordinates , which can be continuous over the spacetime. This property of modeling the meteorology dynamics is naturally suitable for weather downscaling: unlike the previous methods powered by the discrete encoding and decoding of neural networks on finer-grained data as labels, the in PhyDL-NWP can perform weather downscaling with unlimited granularity without labels given a continuous coordinate, as described by Fig. 1. Therefore, as long as we obtain the dynamics of meteorology and the solver of weather factors through updating and , the downscaling task is solved automatically.".

[1] Warner, Thomas Tomkins. Numerical weather and climate prediction. Cambridge University Press, 2010.