1. Too much important information is deferred to supplementary material

We apologize for placing substantial content in the supplementary materials due to the NeurIPS page limitations. In the final version, we will integrate key derivations from the supplementary materials into the main text. Here are the specific improvements we will merge Section 1.1 from the supplementary material into Section 3.1 in the main text, as you suggested. This section contains crucial explanations of:

Basic definitions of spatial frequency and sampling rates from line 43 to line 77;
The relationship between pixel-aligned primitives and Nyquist constraints from line 79 to line 83.

If you have any question and recommendation, we welcome discussion during the discussion period.

2. The use of maximum sampling frequency:

We sincerely appreciate the reviewer's insightful critique regarding our Nyquist-based adaptation. Our choice of using the maximum sampling frequency (Equation 3) is motivated by the design of Mip-Splatting [1] (Equation 7).

The key insight of Mip-Splatting is that for accurate reconstruction, we need to ensure that each 3D Gaussian primitive satisfies the Nyquist sampling criterion for at least one camera view where it is visible. This is because if a primitive can be accurately reconstructed from at least one view, we have captured its essential geometric information. Using max ensures we respect the highest sampling rate available, preventing aliasing in the view with the finest sampling. By selecting the maximum frequency as our sampling frequency, our surfel adaptation module guarantees that the network captures sufficient geometric information from at least one input view, which is sufficient for recovering Gaussian surfels with fine geometric details.

Alternative design choices and their limitations: We considered two alternative additive aggregation strategies: Sum of sampling frequencies and Root mean square of sampling frequencies. They would overestimate the effective sampling frequency, making it unlikely to satisfy the Nyquist criterion from any single perspective. To validate our design choice, we conducted experiments with different frequency aggregation strategies. As shown in Table A, we evaluate surface reconstruction results on DTU benchmarks, and the result shows that our method can present better reconstruction quality.

Table A Experiments on different final frequency settings

	Mean CD
	1.28
	1.25
	1.12

We will enhance our manuscript by:

(1) Acknowledging Mip-Splatting [1] as the motivation for our maximum frequency selection;

(2) Providing detailed theoretical justification in Section 3.1.2 explaining why the maximum frequency is both sufficient and theoretically sound for accurate surfel prediction.

3. Occlusion is not taken into consideration in this approximation

We appreciate the reviewer's attention to occlusion handling. In our current implementation, the visibility function in Equation 3 accounts for basic visibility by checking if the Gaussian center falls within the view frustum. Similar to Mip-Splatting [1], we found the current approach sufficient for our experiments.

4. Clarification on the "real signal" to recover

What is the "real signal" we want to recover?

The real signal we aim to recover is the 3D surface geometry of the scene. However, we only have access to discrete 2D image observations of this continuous 3D signal. And 2D Gaussian surfels are our chosen representation to approximate this surface.

Specifically, we model the real signal using a collection of 2D Gaussian primitives (following the foundation established by 2DGS [2] and Gaussian Surfels [3]). The problem of reconstructing the surface from discrete 2D sampling is thus reformulated as reconstructing the Gaussian primitives from the 2D image data.

Why apply Nyquist theorem to 2D Gaussian primitives?

To reconstruct Gaussian primitives accurately, we need to analyze three key components:

Spatial frequency of representation elements: Each 2D Gaussian surfel has an inherent spatial frequency (Equation 4).
Spatial sampling frequency from 2D image: Given the sampling density in the image plane, we can compute the sampling frequency (Equation 2)
Nyquist Sampling theorem constraint: To accurately represent a signal component with frequency , we need sampling rate .

By applying the Nyquist theorem to 2D Gaussian primitives, we can identify which Gaussian elements violate the Nyquist sampling criterion. Our network design then specifically addresses these under-sampled primitives to ensure complete recovery of all Gaussian elements, thereby achieving a high-fidelity approximation of the 3D surface geometry.

5. Experimental fairness concerns

Thank you for raising this important point about experimental fairness. Let us clarify our experimental setup and provide additional results to address your concern.

Experimental setup clarification: For the methods compared in our experiments:

Methods that don't require task-specific training: Some approaches like 2DGS [2] and FatesGS [4] are optimization-based methods that directly optimize on the given input views without requiring pre-training. These methods were run directly on the 2-view inputs.
Methods with pre-trained models: For learning-based methods (VolRecon [5], UFORecon [6]), we acknowledge that we initially used their publicly available pre-trained models, which were trained on 3-view datasets. We recognize this could introduce bias in the comparison.

Additional experiments with re-trained models: To address this fairness concern, we have conducted additional experiments where we re-trained the top-performing learning-based methods specifically for 2-view reconstruction. As shown in Table B, even with task-specific training, our method maintains superior performance while offering higher efficiency. We will include these results and clarify the experimental setup in the revised manuscript to ensure transparency.

Table B Additional 2-view training on VolRecon and UFORecon.

ID	24	37	40	55	63	65	69	83	97	105	106	110	114	118	122	Mean
VolRecon [5]	1.32	3.03	1.66	1.42	1.64	2.11	1.40	1.74	1.49	1.25	1.50	1.52	0.95	1.34	1.63	1.60
UFORecon [6]	1.12	2.22	1.61	2.53	1.72	2.40	1.46	1.40	2.02	0.93	2.10	1.87	1.34	1.98	1.55	1.75
Ours	1.23	2.64	1.63	0.90	1.24	1.14	1.12	1.18	1.13	0.79	0.84	0.54	0.51	0.84	1.04	1.12

References:

[1] Yu Z, Chen A, Huang B, et al. Mip-splatting: Alias-free 3d gaussian splatting[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2024: 19447-19456.
[2] Huang B, Yu Z, Chen A, et al. 2d gaussian splatting for geometrically accurate radiance fields[C]//ACM SIGGRAPH 2024 conference papers. 2024: 1-11.
[3] Dai P, Xu J, **e W, et al. High-quality surface reconstruction using gaussian surfels[C]//ACM SIGGRAPH 2024 conference papers. 2024: 1-11.
[4] Huang H, Wu Y, Deng C, et al. FatesGS: Fast and accurate sparse-view surface reconstruction using gaussian splatting with depth-feature consistency[C]//Proceedings of the AAAI Conference on Artificial Intelligence. 2025, 39(4): 3644-3652.
[5] Ren Y, Wang F, Zhang T, et al. Volrecon: Volume rendering of signed ray distance functions for generalizable multi-view reconstruction[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2023: 16685-16695.
[6] Na Y, Kim W J, Han K B, et al. Uforecon: Generalizable sparse-view surface reconstruction from arbitrary and unfavorable sets[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2024: 5094-5104.