PaperHub
6.3
/10
Poster3 位审稿人
最低3最高4标准差0.5
4
3
3
ICML 2025

Deep Electromagnetic Structure Design Under Limited Evaluation Budgets

OpenReviewPDF
提交: 2025-01-22更新: 2025-07-24
TL;DR

We introduce a method to efficiently design electromagnetic structures under limited simulation budgets by employing Progressive Quadtree-based Search method and a consistency-based sample selection strategy.

摘要

关键词
Electromagnetic StructureSurrogate ModelTree Search

评审与讨论

审稿意见
4

The authors present Progressive Quadtree-based Search (PQS), a novel method for electromagnetic structure (EMS) design under limited evaluation budgets. PQS leverages a quadtree-based hierarchical representation to mitigate the curse of dimensionality inherent in conventional pixel-wise optimization. In addition, it introduces a Consistency-based Sample Selection (CSS) module that leverages model prediction consistency to dynamically allocate scarce evaluation resources. PQS is benchmarked on two real-world tasks, where it substantially outperforms baseline methods in both efficiency and robustness. Comprehensive ablation studies further validate the important role of the hierarchical representation and the adaptive sample selection mechanism.

给作者的问题

n/a

论据与证据

This work provides a successful demonstration of deep learning’s application in electromagnetic structure design. By directly addressing the often-overlooked challenge of high data collection costs in real-world scenarios, it shows the potential to shorten product design cycles in industrial settings, making it well-suited for practical, large-scale applications. The claims of the paper are well-supported by evidence from experiments.

  1. Experiments on two real-world tasks demonstrate that PQS yields designs with enhanced performance, with limited evaluation budgets.
  2. Ablation studies confirm the importance of the QSS and CSS strategy.
  3. Robustness tests indicate that PQS consistently delivers reliable performance.

方法与评估标准

The paper presents effective methods for addressing EMS design challenges. Its innovative quadtree-based representation reduces the problem’s dimensionality, and the CSS mechanism ensures efficient utilization of computational resources under limited evaluation budgets. Additionally, the evaluation tasks (DualFSS and HGA) are highly relevant to real-world electromagnetic structure design applications. However, some detailed issues still remain.

  1. Regarding the CSS mechanism, it relies on assessing the consistency of historical predictors, but at t=0 there is only one predictor available, so it is unclear how consistency is computed. The authors should provide additional details on this aspect.
  2. Although the predictor itself is not the main contribution of the methods, I noticed that the authors consistently used ResNet-50 as the architecture. Therefore, it would be interesting to discuss whether using different predictor architectures could lead to significant differences in prediction accuracy.
  3. As noted in the paper, there is currently a lack of publicly available real-world datasets and benchmarks in the EMS design field. Therefore, I recommend that the authors release the annotated EMS design data, as such a resource would greatly assist future researchers in improving and advancing related methodologies.
  4. The stopping condition in Algorithm 1 is inconsistent with that in Figure 2. In the former, the algorithm stops once the simulation budget is exhausted, whereas in the latter, it stops when a satisfactory solution is reached. Furthermore, Algorithm 1 should output the optimal solution found by the search, not a satisfactory solution.

理论论述

This paper demonstrates strong practical applicability and innovation, especially with the introduction of the PQS and the CSS mechanism, both of which have been experimentally validated for their effectiveness. However, the theoretical background could be expanded further. For example, it would be beneficial to explore more thoroughly how much the quadtree-based representation reduces the search space size. What is the specific relationship between the search space size and the parameter NmaxN_{max}?

实验设计与分析

The paper compares the proposed method with various baseline approaches and supports its claims with comparative evaluations, robustness tests, and ablation studies, providing solid evidence of its effectiveness. Nevertheless, some improvements are possible:

  1. While the authors note that simulation time represents the primary cost within the overall optimization, it would still be beneficial to provide a more detailed computational time including simulations and model-updating—especially since PQS continuously retrains its predictor.
  2. Although the authors explain in the appendix why Bayesian optimization cannot be successfully applied, the paper lacks a more detailed analysis. Since Bayesian optimization is commonly used for optimizing expensive evaluations, I recommend that the authors include a more thorough discussion on this topic.

补充材料

I reviewed the supplementary material, which provides additional details about the implementation, experimental setups, and further experimental results.

与现有文献的关系

The key contributions of this paper are well-grounded in the existing literature, addressing key gaps.

遗漏的重要参考文献

The authors have done a thorough job of addressing the essential references relevant to their contributions.

  1. The paper clearly discusses key methodologies in the field of electromagnetic structure design, including both surrogate-based optimization and generative approaches.
  2. The authors have made meaningful comparisons with analogous fields, highlighting the unique challenges of this task.

其他优缺点

  1. The images in the appendix are excessively large. Please adjust their sizes to improve readability.
  2. The citation for the Genco baseline is missing the publication year.

其他意见或建议

  1. Figure 1 can benefit from a longer caption.
  2. In the "Depth-wise Importance Assignment" section, there appear to be repetitive sentences.
  3. In the "Effectiveness of Consistency-based Sample Selection" section, "Top-M" should be corrected to "Top-K".
  4. In the "Quadtree-based Representation" section, "so taht" should be corrected to "so that".
作者回复

Q1. The CSS mechanism relies on assessing the consistency of historical predictors. However, there is only one predictor available at t=0.

A1. Thank you for your insightful comment. We actually train multiple predictors at t=0 rather than relying on a single one. These predictors are then used to compute and assess consistency. We will clarify this detail in the revised manuscript.

Q2. It would be interesting to discuss whether using different predictor architectures could lead to significant differences in prediction accuracy.

A2. We have conducted further experiments to validate this. Please refer to A2 in our response to Reviewer 63Rq for more details.

Q3. I recommend that the authors release the annotated EMS design data.

A3. We will release the annotated EMS design dataset upon acceptance.

Q4. The stopping condition in Algorithm 1 is inconsistent with that in Figure 2 ... Furthermore, Algorithm 1 should output the optimal solution found by the search, not a satisfactory solution.

A4. Thank you for your suggestion. We will correct this in our revised version.

Q5. For example, it would be beneficial to explore more thoroughly how much the quadtree-based representation reduces the search space size. What is the specific relationship between the search space size and the parameter NmaxN_{max}?

A5. The quadtree representation is used to reduce the complexity of the design space. Each leaf node in the quadtree represents a binary decision (either 0 or 1) for its corresponding subregion. Thus, if the current set of leaf nodes is LL, the total size of the design space is 2L2^{|L|}. The parameter NmaxN_{{max}} limits the maximum number of leaf nodes, i.e., LNmax|L| \leq N_{{max}}, so the upper bound of the search space is 2Nmax2^{N_{{max}}}. In other words, a smaller NmaxN_{{max}} effectively reduces the search space size.

Q6. It would still be beneficial to provide a more detailed computational time including simulations and model-updating—especially since PQS continuously retrains its predictor.

A6. Below, we provide a detailed breakdown of the average computational time for simulation in Table 2 and model-updating in Table 3. Specifically, we measured the model-updating time under both initial and final dataset conditions for each task:

  • HGA: 300 samples → 1,000 samples.
  • DualFSS: 500 samples → 1,000 samples.

All model updates were consistently performed for 200 epochs.

Table 2: Simulation cost per iteration (seconds).

TaskAvg.Time
HGA5588
DualFSS5837

Table 3: Model updating cost per iteration (seconds).

TaskUpdating SamplesAvg.Time±Std
HGA30018.8716±0.8278
HGA100043.8452±0.2518
DualFSS50026.9258±0.5701
DualFSS100044.5284±0.2120

In both tasks, simulation is the dominant cost. Model updating remains lightweight and efficient throughout the search.

Q7. Since Bayesian optimization is commonly used for optimizing expensive evaluations, I recommend that the authors include a more thorough discussion on this topic.

A7. We will include a more comprehensive discussion of Bayesian optimization in our revised manuscript.

审稿意见
3

I have no experience in this field so I am not capable of completing a sound review. I acknowledge that some of the comment are generated with the assistance of LLM, and some parts are left blank.

The paper proposes Progressive Quadtree-based Search (PQS) for electromagnetic structure (EMS) design under limited budgets, tackling high-dimensional spaces (e.g., 10^86 configurations) and costly simulations (660-42,780 seconds each). PQS uses Quadtree-based Search Strategy (QSS) for hierarchical exploration and Consistency-based Sample Selection (CSS) to balance exploitation/exploration. Tested on DualFSS and HGA, PQS outperforms baselines (e.g., 15.20 dB vs. 7.28 dB for DualFSS) with 1000 simulations, cutting costs by 75-85% and saving 20-48 days. Contributions include PQS, QSS, and CSS for efficient EMS design.

给作者的问题

N/A

论据与证据

Claims (PQS outperforms baselines, QSS reduces dimensionality, CSS boosts efficiency, 75-85% cost reduction) are supported by experiments (Tables 3, 6; Figures 3, 4) and metrics (Agg Obj, Kendall’s tau). Time-saving estimate (20-48 days) lacks detailed derivation but aligns with simulation reductions. Evidence is clear and convincing overall.

方法与评估标准

N/A (not familiar)

理论论述

No formal proofs

实验设计与分析

Experiments (Section 5, Appendix D) compare PQS vs. baselines (1000 vs. 7000/4000 simulations) and ablate QSS/CSS (Tables 5, 6; Figures 3, 7).

limited to two tasks, but conclusions hold.

补充材料

N/A

与现有文献的关系

Not familiar with this field

遗漏的重要参考文献

Not familiar with this field

其他优缺点

Strengths: Original QSS+CSS combo, significant cost savings, clear presentation (e.g., Figure 2).

其他意见或建议

N/A

作者回复

We sincerely appreciate your time and valuable feedback. We acknowledge your concerns and would like to clarify our motivations and technical contributions.

1. Significance of the Task & Motivation

Electromagnetic structure (EMS) design is fundamental to modern antenna/material development, yet faces two critical challenges:

  • Vast Design Space: Conventional grid-based representation creates exponentially growing search spaces (e.g., 16×16 layout can contain over 107710^{77} potential design).
  • Expensive Evaluations: Each design must be evaluated via time-consuming electromagnetic simulations (often taking minutes to hours), making large-scale evaluations impractical for industrial timelines.

Existing approaches primarily revolve around two strategies:

  • Predictor-Based Methods: Use predictors to approximate the costly electromagnetic simulator. However, such predictors need large amounts of high-quality data to achieve sufficient accuracy, which is infeasible when budgets are tight.
  • Generative Approaches: Use generative models to produce promising designs directly. They also require tens of thousands of labeled samples for training.

In short, our motivation is to reduce the reliance on large evaluation costs by introducing a more efficient search mechanism that narrows down the design space in a structured manner.

2. Methodological Innovation

Our proposed Progressive Quadtree-based Search (PQS) addresses these challenges through:

  • Quadtree-based Search Strategy:
    • Quadtree-based Representation: To reduce the dimensionality, we encode the layout in a quadtree that recursively subdivides the structure from a coarse resolution to finer details.
    • Progressive Tree Search: To focus the limited budget on the most impactful design decisions, we decompose the high-dimensional search into a tree search from global pattern to local refinement.
  • Consistency-based Sample Selection: Rather than requiring a highly accurate predictor, we adopt an adaptive sample selection. Specifically:
    • Ranking-based Reliability: Kendall's Tau is used to measure how consistently the predictors ranks candidate designs across consecutive iterations, which reveals the model's reliability to guide the search effectively.
    • Dynamic Exploitation & Exploration: Based on the consistency, our method dynamically allocates evaluations: high consistency leads to evaluate top-ranked designs, while low consistency leads to an additional exploration to uncover overlooked candidates.

3. Practical Validation

Our proposed approach has already been successfully validated in real-world engineering tasks:

  • High-Quality Outcomes: Compared to baseline methods, PQS yields designs with approximately 70% greater improvement in DualFSS design objectives.
  • Cost Savings: Achieved 75%-85% cost reduction vs. generative approaches.
  • Robust Performance: Repeated experiments showed that PQS exhibited 29%-92% lower variance compared to baseline methods.

4. Broader Impact to ICML Community

By bridging the gap between theoretical optimization and industrial constraints, PQS provides:

  • A scalable framework for high-dimensional and expensive structural design problems.
  • A low-budget solution for EMS design with limited computational resources.

We will revise the manuscript to better emphasize these aspects and would be happy to provide supplementary materials detailing industrial implementation.

Q1. Time-saving estimate (20-48 days) lacks detailed derivations.

A1. Thank you for your valuable feedback. We acknowledge that the original time-saving estimate (20–48 days) lacked detailed derivation. Upon re-evaluating the calculations, we identified an error in rounding and task-specific simulation reductions. Here is the correct derivation:

  • DualFSS Task: The average simulation time is 583.7 seconds; The reduction in simulations is 3,000 times; The total time saved is 583.7×3,0003,600×2420.27\frac{583.7 \times 3,000}{3,600 \times 24} \approx 20.27 days.
  • HGA Task: The average simulation time is 558.8 seconds; The reduction in simulations is 6,000 times; The total time saved is 558.8×6,0003,600×2438.80\frac{558.8 \times 6,000}{3,600 \times 24} \approx 38.80 days.

Thus, the refined time-saving range is 20.27–38.80 days. We apologize for the oversight and have updated the manuscript to reflect this correction. Importantly, the revised values still robustly demonstrate that our method achieves significant efficiency gains compared to generative approaches.

审稿意见
3

This paper proposes a novel method for electromagnetic structure (EMS) design under limited computational budgets, called Progressive Quadtree-based Search (PQS). PQS employs a Quadtree-based Search Strategy (QSS) to progressively explore the high-dimensional EMS design space and incorporates a Consistency-based Sample Selection (CSS) mechanism to optimize search efficiency under constrained evaluation resources. The method is evaluated on two real-world engineering tasks: Dual-layer Frequency Selective Surface (DualFSS) and High-gain Antenna (HGA). Experimental results demonstrate that PQS can efficiently identify high-performance designs while reducing computational costs by 75%–85% compared to baseline methods, significantly shortening the product design cycle.

给作者的问题

I have no more questions.

论据与证据

  • The paper employs Surrogate-GA, which is no longer representative of state-of-the-art (SOTA) surrogate-assisted evolutionary algorithms (SAEAs). Recent survey [1] indicate that SAEAs have become the dominant approach for high-dimensional expensive optimization problems. The paper should compare PQS with SOTA SAEAs rather than a 2020-era Surrogate-GA.

  • The paper utilizes ResNet50 as the predictor but lacks details on training data sources and training methodology. The dataset size is limited to 300 samples, which may be insufficient for complex EMS tasks. A larger dataset should be used, and additional experiments should evaluate the impact of dataset size on performance.

  • The observation that random sampling (RS) and Surrogate-RS outperform Surrogate-GA and Surrogate-GW is unexpected, as it contradicts typical evolutionary algorithm performance trends. The paper does not provide a satisfactory explanation for this result.

[1] M. Zhou, M. Cui, D. Xu, S. Zhu, Z. Zhao and A. Abusorrah, “Evolutionary Optimization Methods for High-Dimensional Expensive Problems: A Survey,” in IEEE/CAA Journal of Automatica Sinica, vol. 11, no. 5, pp. 1092-1105, May 2024, doi: 10.1109/JAS.2024.124320.

方法与评估标准

  • The dataset contains only 300 samples, which may hinder generalization. The study should explore the impact of dataset size by conducting experiments with larger datasets.

  • The baseline methods do not include the most recent SAEA algorithms, affecting the fairness of the evaluation.

  • Only ResNet50 is used as the predictor. The study should compare different classifiers to assess their impact on performance.

理论论述

The paper does not provide theoretical justification for why QSS is superior to other dimensionality reduction approaches, such as PCA or AutoML-based search space simplification.

实验设计与分析

  • The paper relies on older methods (e.g., Surrogate-GA, Surrogate-RS), omitting SOTA SAEAs, which undermines the validity of the comparisons.

  • The predictor is trained with only 300 samples, which may lead to poor generalization. The study should analyze the predictor’s accuracy across different dataset sizes.

  • The superior performance of RS and Surrogate-RS over Surrogate-GA contradicts standard evolutionary optimization trends, and the paper does not provide a satisfactory explanation.

补充材料

Important predictor training details are missing, particularly regarding ResNet50 training data sources, hyperparameter choices, and training iterations. This should be addressed.

与现有文献的关系

The paper references some relevant works but fails to cite the latest SAEA methods. Additionally, it does not cover recent advances in EMS optimization, such as deep reinforcement learning (DRL) or Bayesian optimization (BO).

遗漏的重要参考文献

I do not have more suggestions.

其他优缺点

  • The paper does not compare with SOTA surrogate-assisted evolutionary algorithms.

  • Only 300 samples are used, which may not be sufficient to train a robust predictor.

  • The better performance of RS over Surrogate-GA is counterintuitive and requires further justification.

其他意见或建议

  • Expand the dataset size and analyze its impact on the predictor’s accuracy.

  • Include SOTA SAEAs for fair comparisons.

  • Explain the unexpected experimental results (why RS outperforms Surrogate-GA).

作者回复

Q1. PQS should be compared with SOTA surrogate-assisted evolutionary algorithms (SAEAs).

A1. Thank you for your constructive suggestion. We compared our method with TS-DDEO[1] and SAHSO[2] in Tables 1-3. PQS outperforms them by 3.14-3.60 and 9.63-10.99 and achieves significantly higher robustness in the aggregation value of objectives, with 15%-19% lower variance. This stems from PQS’s ability to hierarchically explore the design space and adaptively allocate the tight evaluation budget via CSS. This efficiency could significantly benefit real-world applications where evaluation budgets are tight.

Table 1: Comparisons on High-gain Antenna (HGA).

MethodAgg Obj↑Obj1↑Obj2↑
TS-DDEO[1]0.52011.04410.5201
SAHSO[2]0.05890.05892.2410
PQS (Ours)3.65953.65956.4820

Table 2: Comparisons on Dual-layer Frequency Selective Surface (DualFSS).

MethodAgg Obj↑Obj1↑Obj2↑
TS-DDEO[1]5.56275.56279.0000
SAHSO[2]4.20664.20668.0576
PQS (Ours)15.196415.196431.0443

Table 3: Robustness comparison results on High-gain Antenna (HGA).

MethodAgg Obj↑Obj1↑Obj2↑
TS-DDEO[1]0.03±0.420.62±0.870.27±0.36
SAHSO[2]-0.29±0.400.20±0.750.06±0.61
PQS (Ours)4.34±0.344.45±0.304.53±0.46

[1] A two-stage surrogate-assisted metaheuristic algorithm..., Soft Comput. 2023.

[2] A surrogate-assisted hybrid swarm optimization..., Swarm Evol. Comput. 2022.

Q2. Only 300 samples may hinder generalization. Authors should examine dataset sizes and classifiers' impact on performance.

A2. We acknowledge dataset size and model choice's importance. However, in EMS design, simulation takes hours to days (e.g., 559-583 seconds per evaluation). To address this, we begin with a small initial dataset (300 samples) and use the remaining evaluation budget iteratively during optimization. This ensures that computational resources are prioritized for optimization rather than data collection.

In Table 4, we evaluated Kendall's Tau (KTau) between predicted and ground truth across dataset sizes (300-1,100) on a fixed 5,800-sample test set (10 trials). We clarify two key observations:

  • Expanding the dataset from 300 to 1,100 samples maintains weak KTau correlation (<0.3), showing larger datasets don't proportionally enhance accuracy in our task.
  • All models perform poorly, implying that model capacity is not the limiting factor.

These observations suggest that while gathering more data can be helpful, it is not always cost-effective in EMS design. Our PQS is designed to achieve data-efficient optimization, attaining superior performance even with a limited budget.

Table 4: Impact of training sample size and predictor on model performance.

ModelKTau @300 SamplesKTau @700 SamplesKTau @1100 Samples
GoogLeNet0.2148±0.00880.2610±0.00840.2718±0.0071
ResNet500.2175±0.00650.2600±0.01260.2896±0.0084
ResNet1010.2146±0.01350.2605±0.00920.2882±0.0065

Q3. The observation that RS and Surrogate-RS outperform Surrogate-GA and Surrogate-GW is unexpected.

A3. One possible factor is each method's reliance on the predictor for solutions. Specifically:

  • Surrogate-GA and Surrogate-GW rely on the predictor to guide the evolution of new populations. With a limited training set (300 samples), the predictor's accuracy suffers, misleading GA and GW optimizers to suboptimal regions.
  • By contrast, RS and Surrogate-RS avoid predictor dependence: they generate candidates via uniform random sampling. This is like our QSS, which also avoids heavy predictor reliance by hierarchical random exploration.

Q4. Why QSS is superior to other dimensionality reduction approaches, such as PCA or AutoML-based search space simplification?

A4. Our QSS excels in search space reduction and dynamic adaptation:

  • Explicit space shrinking: Unlike PCA that projects the design space into a continuous latent space, our QSS preserves the discrete, grid-like nature of EMS through its quadtree-based hierarchical representation, ensuring clear shrinking.
  • Dynamic adaptation during search: PCA or feature selection methods perform static, global dimensionality reduction. In contrast, our QSS dynamically adjusts the search granularity during optimization. Initially, it explores coarse global patterns and later focuses on locally promising regions, enabling efficient exploration under tight budgets.

Q5. It does not cover recent advances such as deep reinforcement learning or Bayesian optimization.

A5. We will carefully discuss and cite these works in our revised paper.

Q6. Important training details are missing.

A6. We clarify training details: Batch Size: 256, Epochs: 200, Cosine Annealing LR (lr=0.01, T_max=200, η_min=1e-6), Gradient Clipping (Global L2 norm ≤1.0/batch), Optimizer: Adam, resnet_lr=0.01, fc_lr=0.01. These will be added to the manuscript.

最终决定

The paper proposes a method named Progressive Quadtree‑based Search (PQS) that tackles electromagnetic‑structure (EMS) design when every simulator call is painfully slow and budgets are tight. By encoding the 2‑D layout as a quadtree, the method searches from coarse global patterns down to fine local details. The paper introduces a practically motivated strategy for budget‑constrained EMS optimisation. Empirical gains seem large (3–10× objective improvement, 75–85 % cost cut) and now benchmarked against two recent SAEA algorithms after the rebuttal. While theoretical framing is light and some baselines (e.g. Bayesian optimisation, DRL) are still absent, the added experiments strengthened the results in the paper.

Given the current scores and the rebuttal, I will recommend the paper for acceptance.