Conformal Mixed-Integer Constraint Learning with Feasibility Guarantees
We propose Conformal Mixed-Integer Constraint Learning, a novel framework that provides probabilistic feasibility guarantees for data-driven constraints in optimization problems.
Abstract
Reviews and Discussion
In this paper, the authors propose a new method to address challenges in constraint learning (CL) for optimization problems. The authors introduce a framework that integrates conformal prediction into CL problems to ensure that solutions to optimization problems with learned constraints are practically implementable.
Strengths and Weaknesses
Strengths:
- The paper is well-written and well-organized.
- The authors clearly define the problem and the procedure to implement the proposed framework.
- The authors validate their approach on both synthetic and real datasets with real-world applications.
Weakness:
- [Conditional independence assumption] The authors discuss the potential limitations of the assumption. While acknowledged as a necessary "theoretical compromise", the assumption is a notable one, and its potential impact warrants careful consideration by practitioners.
Questions
[Influence of the specific conformal scores] In Section 4.1 and Section 4.2, specific conformal scores are chosen for the regression and classification problems, respectively. A more detailed analysis of how the choice of a conformal score function influences the results would be a valuable addition. Furthermore, how should one evaluate or select between different valid score functions for a given application?
Limitations
N/A
Final Justification
The paper is interesting and the authors solve my concerns. I've changed my score accordingly.
Formatting Issues
N/A
Thank you for the thoughtful review and helpful feedback. Below we address weaknesses and questions.
- W1: We thank the reviewer for highlighting the importance of Assumption 4.1 and for prompting a more detailed discussion of its role, interpretation, and limitations. We agree that this assumption is central to our theoretical guarantees and warrants careful justification. We believe it is reasonable in our setting and necessary to extend conformal guarantees to constrained optimization problems.
Assumption 4.1 states that, conditional on ground-truth feasibility, the event of C-MICL feasibility is independent of whether the conformal set contains the true function value.
To motivate Assumption 4.1 and clarify when it is plausible, note first that the ground-truth feasibility (GTF) conditional coverage guarantee from Lemma 3.1 can be achieved in a fully data-driven way (e.g., using Mondrian conformal prediction or other label-conditional conformal methods).
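In generic notation (our own symbols here, which may differ from the manuscript's), such a label-conditional guarantee can be written as:

```latex
% Illustrative form of the GTF-conditional coverage guarantee (notation assumed):
% \mathcal{C}(X) is the conformal set, f^{*} the unknown ground-truth function, and
% the conditioning event is that the pair (X, Y) is ground-truth feasible.
\mathbb{P}\bigl( f^{*}(X) \in \mathcal{C}(X) \,\bigm|\, (X, Y)\ \text{ground-truth feasible} \bigr) \;\ge\; 1 - \alpha
```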
However, in C-MICL we aim to guarantee coverage over the feasible region of the optimization problem itself.
If the predictive model were perfect, this region and the ground-truth feasible region would coincide, and the coverage guarantee would transfer directly. However, since we are interested in the more realistic case where the model is imperfect, the two regions differ in a data-dependent way.
In this case, since the feasible region is implicitly shaped by the calibration set (via the conformal sets), there is a natural dependency between the feasible solutions of the C-MICL problem and the calibration data, which invalidates standard conformal guarantees relying on i.i.d. calibration and test data. Assumption 4.1 precisely seeks to decouple this dependency: it allows us to approximate conformal coverage within the C-MICL feasible region by assuming that feasibility does not systematically bias conformal validity, once conditioned on ground-truth feasibility.
To build intuition, consider partitioning the C-MICL feasible region into two disjoint subsets: the points that are ground-truth feasible and those that are not. Then, Assumption 4.1 implies that neither subset is systematically biased towards a region of the input space that is miscalibrated. This enables us to translate the conformal coverage guarantees from the ground-truth feasible region to the feasible set used in the optimization.
Assumption 4.1 is therefore reasonable when the calibration data adequately covers the parts of the input space that intersect the feasible region, both within the ground-truth feasible region and within its complement. In this sense, it aligns with standard generalization assumptions that require the training and calibration data to be representative of the regions where predictions are deployed. Alternatively, Assumption 4.1 can be approximated using more granular conditional conformal methods, by partitioning the optimization region into finer subregions and enforcing local coverage guarantees within each, which can then be translated to the feasible set.
We will revise the paper to better explain this interpretation and expand the discussion on when the assumption might fail. For instance, the assumption may break down if the feasible region is heavily concentrated in areas where the calibration set is sparse or systematically miscalibrated. In our experimental settings, we observe good empirical alignment between target and achieved coverage (Appendix E), suggesting that Assumption 4.1 holds reasonably well in practice under realistic data scenarios.
- Q1: Thanks for the opportunity to provide more details on the influence of the specific conformal scores used in our C-MICL framework. In the current work, conformal score functions were selected to maintain general applicability across a wide range of regression and classification models, in line with the model-agnostic nature of the proposed C-MICL framework. Following standard practice in the conformal prediction literature, we employed score functions that are symmetric and proportional to the model's estimated uncertainty (see Angelopoulos et al. [52]). By aligning with widely accepted strategies in the literature, we aimed to provide a robust yet general approach that performs well across various learning settings without relying on model-specific scores.
Specifically, in our experiments we use a residual-based score function for the regression setting, as it relies solely on point predictions and can therefore be applied uniformly across all regression models. For classification, the score function was chosen to preserve linearity in the decision space, enabling efficient integration into the underlying optimization problem.
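For concreteness, a minimal split-conformal sketch with the absolute-residual score is shown below; the data, split, and least-squares point predictor are our own toy placeholders, not the authors' code:

```python
# Minimal split-conformal sketch with the absolute-residual score (illustrative only;
# the data, model, and split below are hypothetical placeholders).
import numpy as np

rng = np.random.default_rng(0)
X = rng.uniform(-1.0, 1.0, size=(1000, 2))
y = X[:, 0] - 2.0 * X[:, 1] + 0.1 * rng.standard_normal(1000)   # toy data

# Hypothetical point predictor: least squares fit on the training split.
X_train, y_train = X[:600], y[:600]
X_cal, y_cal = X[600:800], y[600:800]
w, *_ = np.linalg.lstsq(X_train, y_train, rcond=None)

def predict(A):
    return A @ w

# Calibration: absolute residuals and the finite-sample-corrected quantile level.
alpha = 0.1
scores = np.abs(y_cal - predict(X_cal))
n = len(scores)
level = min(1.0, np.ceil((n + 1) * (1 - alpha)) / n)
q_hat = np.quantile(scores, level, method="higher")

# Conformal interval for a new point: [f(x) - q_hat, f(x) + q_hat].
x_new = np.array([0.3, -0.4])
lo, hi = predict(x_new) - q_hat, predict(x_new) + q_hat
print(f"interval = [{lo:.3f}, {hi:.3f}]")
```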
We agree that exploring the effect of different valid conformal scores and providing guidance for their selection is a valuable direction for future work. There is already a substantial body of literature that investigates alternative score functions and their impact on the resulting conformal sets, which we will add to the revised manuscript. These works highlight the importance of constructing informative conformal sets to ensure both the feasibility and the quality of the downstream optimization problems. We appreciate the opportunity to discuss this further and will incorporate this perspective in the updated manuscript.
Thank you for addressing my concern. I will increase my rating accordingly.
This paper introduces Conformal Mixed-Integer Constraint Learning (C-MICL), a novel framework that integrates conformal prediction into mixed-integer constraint learning (MICL) to ensure probabilistic feasibility guarantees. The core idea is to replace heuristic approaches (e.g., ensemble-based W-MICL) with a principled conformal prediction-based mechanism that enables certified feasibility of solutions to learned-constraint optimization problems, under a mild conditional independence assumption. The method is model-agnostic and MIP-compatible, requiring only that the learned constraint model is representable as a mixed-integer program. It supports both regression and classification settings. Theoretical results are provided to justify the probabilistic guarantees. The empirical evaluation spans real-world-inspired MILP problems, including a chemical reactor optimization and a food basket design task. C-MICL consistently achieves the desired feasibility level, offers competitive objective performance, and exhibits significant speedups compared to ensemble methods.
Strengths and Weaknesses
Strengths:
- The proposed C-MICL framework provides formal probabilistic guarantees on the feasibility of solutions with respect to the unknown true constraints, which is a major advance over existing heuristic approaches.
- C-MICL avoids ensemble-based methods and scales efficiently, requiring only one or two models, while achieving significant computational speedups without sacrificing objective performance.
- Across diverse case studies, C-MICL consistently achieves target feasibility levels, while maintaining competitive objective values.
- The paper is well-organized and clearly written, with intuitive explanations, theoretical results, and implementation details presented in a logical and accessible manner.
Weaknesses:
- While plausible, the assumption that C-MICL feasibility and conformal coverage are conditionally independent (Assumption 4.1) is strong and difficult to validate in practice. This is the linchpin of the main guarantee, and its empirical verification is only approximate.
- In cases of poorly performing base predictors or high noise, conformal sets may become overly conservative, potentially leading to empty or trivial feasible sets. While discussed in the limitations, this could be a practical barrier.
Questions
- As stated in the Limitation, in cases where the base model is inaccurate or uncertainty is large, the conformal set may become too wide, potentially making the C-MICL problem infeasible. Do the authors have strategies for detecting or mitigating this issue (e.g., fallback mechanisms, feasibility repair, or relaxed constraints)?
- While the paper claims generality beyond MILPs, experiments focus exclusively on linear settings. Have the authors tested C-MICL in mixed-integer nonlinear problems (MINLPs)? If so, what challenges arise?
Limitations
The authors have provided a thoughtful discussion of the limitations in Section 6, notably including the reliance on the conditional independence assumption, and the potential for conformal sets to become overly conservative when the predictive model is poorly calibrated. These are critical aspects and are appropriately acknowledged.
Final Justification
The paper proposes a novel conformal mixed-integer constraint learning. During the author-reviewer discussion, the authors provided detailed response, which addressed my concerns. So I will maintain my score that is inclined to acceptance.
Formatting Issues
None
Thank you for the thoughtful review and helpful feedback. Below we address weaknesses and questions.
- W1: We thank the reviewer for highlighting the importance of Assumption 4.1 and for prompting a more detailed discussion of its role, interpretation, and limitations. We agree that this assumption is central to our theoretical guarantees and warrants careful justification. We believe it is reasonable in our setting and necessary to extend conformal guarantees to constrained optimization problems.
Assumption 4.1 states that, conditional on ground-truth feasibility, the event of C-MICL feasibility is independent of whether the conformal set contains the true function value.
To motivate Assumption 4.1 and clarify when it is plausible, note first that the ground-truth feasibility (GTF) conditional coverage guarantee from Lemma 3.1 can be achieved in a fully data-driven way (e.g., using Mondrian conformal prediction or other label-conditional conformal methods).
However, in C-MICL we aim to guarantee coverage over the feasible region of the optimization problem itself.
If the predictive model were perfect, this region and the ground-truth feasible region would coincide, and the coverage guarantee would transfer directly. However, since we are interested in the more realistic case where the model is imperfect, the two regions differ in a data-dependent way.
In this case, since the feasible region is implicitly shaped by the calibration set (via the conformal sets), there is a natural dependency between the feasible solutions of the C-MICL problem and the calibration data, which invalidates standard conformal guarantees relying on i.i.d. calibration and test data. Assumption 4.1 precisely seeks to decouple this dependency: it allows us to approximate conformal coverage within the C-MICL feasible region by assuming that feasibility does not systematically bias conformal validity, once conditioned on ground-truth feasibility.
To build intuition, consider partitioning the C-MICL feasible region into two disjoint subsets: the points that are ground-truth feasible and those that are not. Then, Assumption 4.1 implies that neither subset is systematically biased towards a region of the input space that is miscalibrated. This enables us to translate the conformal coverage guarantees from the ground-truth feasible region to the feasible set used in the optimization.
Assumption 4.1 is therefore reasonable when the calibration data adequately covers the parts of the input space that intersect the feasible region, both within the ground-truth feasible region and within its complement. In this sense, it aligns with standard generalization assumptions that require the training and calibration data to be representative of the regions where predictions are deployed. Alternatively, Assumption 4.1 can be approximated using more granular conditional conformal methods, by partitioning the optimization region into finer subregions and enforcing local coverage guarantees within each, which can then be translated to the feasible set.
We will revise the paper to better explain this interpretation and expand the discussion on when the assumption might fail. For instance, the assumption may break down if the feasible region is heavily concentrated in areas where the calibration set is sparse or systematically miscalibrated. In our experimental settings, we observe good empirical alignment between target and achieved coverage (Appendix E), suggesting that Assumption 4.1 holds reasonably well in practice under realistic data scenarios.
- W2 + Q1: We thank the reviewer for raising this important question and for the opportunity to elaborate on this limitation of the C-MICL framework. As noted in the manuscript, the optimization problem may become infeasible when the conformal sets are overly conservative (e.g., too wide or trivial), which can occur due to a poorly performing base predictor or high intrinsic noise. In practice, this issue can be diagnosed by evaluating the predictive performance of the base predictor, analyzing the distribution of conformal scores on calibration data, and inspecting the conformal sets on a hold-out test set.
A natural way to mitigate this issue is to improve the underlying predictive model using standard machine learning techniques or to refine the choice of conformal score functions to better capture uncertainty. However, a less obvious but effective approach is to reduce the statistical confidence level used to construct the conformal sets. This leads to smaller conformal sets and can restore feasibility, while still providing explicit and interpretable probabilistic guarantees at a potentially lower coverage level.
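One possible way to operationalize this fallback, sketched below as a hypothetical heuristic (not described in the paper), is to relax the coverage level in small steps until the conformal interval fits inside an assumed feasible band:

```python
# Hypothetical feasibility-repair heuristic (not from the paper): relax the coverage
# level step by step until the conformal interval [f(x) - q, f(x) + q] fits inside an
# assumed feasible band [lb, ub] for the learned output.
import numpy as np

def smallest_feasible_alpha(scores, f_x, lb, ub, alphas=(0.05, 0.10, 0.15, 0.20)):
    """Return the smallest miscoverage level (strongest guarantee) whose interval fits."""
    n = len(scores)
    for alpha in alphas:                        # ordered from strongest to weakest guarantee
        level = min(1.0, np.ceil((n + 1) * (1 - alpha)) / n)
        q = np.quantile(scores, level, method="higher")
        if lb <= f_x - q and f_x + q <= ub:     # interval containment check
            return alpha, q
    return None, None                           # no tested level restores feasibility

cal_scores = np.abs(np.random.default_rng(1).standard_normal(200))  # stand-in scores
print(smallest_feasible_alpha(cal_scores, f_x=2.0, lb=0.0, ub=4.5))
```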
In our experiments, we find that with reasonably well-trained models the conformal sets remain informative and the C-MICL problem is typically feasible in practice. Finally, we also note that this limitation is not unique to C-MICL. For instance, MICL and W-MICL may be more sensitive to poor model quality, as they rely solely on point predictions without accounting for uncertainty, which can result in solutions that are not practically implementable. We will clarify this point further in the revised version and thank the reviewer again for highlighting this issue.
- Q2: We appreciate the reviewer’s attention to the scope of our C-MICL method. As correctly noted, our theoretical results do not assume linearity in the underlying optimization problems and extend naturally to mixed-integer nonlinear programs. We focused on MILPs in our experiments so that all methods (MICL, W-MICL, C-MICL) could be tractably solved to global optimality, allowing for fair comparisons and avoiding confounding due to local optima. Our extensive evaluation setup involved solving each method-model pair across hundreds of independent instances, resulting in several hundred globally solved problems per approach. This level of evaluation is considerably more tractable in the MILP setting.
That said, our C-MICL approach applies directly to MINLPs, and our theory continues to provide probabilistic ground-truth feasibility guarantees. The primary challenge in extending the experimental evaluation to MINLPs is the significant increase in computational effort required to solve these problems to global optimality. We will include this discussion in the updated version of the article, along with a discussion of the specific computational challenges that arise when applying C-MICL to MINLPs.
Thanks for the authors' rebuttal. I will maintain my original score.
The authors discuss mathematical optimization problems for which some of the constraint functions are not explicitly known. These unknown functions are replaced with predictive models using data. Although this idea has been studied in the literature before, the authors address the concern about the feasibility of the obtained optimal solutions, since these solutions may only satisfy the predictive model but not necessarily the true (unknown) mapping.
Using the core results from conformal prediction theory, the authors propose what they call the Conformal Mixed-Integer Constraint Learning (C-MICL) method, which gives probabilistic guarantees on the learned constraint function and works with any model that can be represented as a mixed-integer program. To this end, the authors integrate conformal predictions in both a regression and a classification setting to ensure ground-truth feasibility with a certain probability. The authors compare their method on different metrics against multiple benchmarks on regression and classification datasets.
Strengths and Weaknesses
Strengths:
- The paper is well-written. The overall message is clear, the mathematics is sound and well-defined.
- Results look promising: Although other methods tend to find better optimal values, there is a clear improvement in ground-truth feasibility rate and runtime when using the C-MICL method.
Weaknesses:
- Discussion on the conditional independence assumption is brief. The authors mention in the discussion that this assumption is reasonable. I would like to know earlier why this is a reasonable assumption and in what situations this assumption does not hold.
- All of a sudden, a secondary regression model comes into the picture in Section 4.1. Its role, and why it is needed, is not clear.
- The authors report only the rate of ground-truth feasibility (I guess this is the number of feasible solutions among 100 instances). However, there is no evaluation of the magnitude of infeasibility. There could be many solutions obtained with other approaches that are very close to the feasible region, whereas C-MICL may result in a few, but grossly infeasible, solutions.
Questions
- What happens when the feasible set for the outcome Y is more complicated, e.g., defined using integer variables and constraints?
- Related to the previous question, would the proposed approach still work if the function g in the (MICL) model also involves y?
- In several places, the authors call the proposed approach model-agnostic. However, it does require predictive models to be MIP-representable, doesn't it?
- In the experiments section, there are error bars, but it is not mentioned in the section what these bars represent exactly. Could the authors elaborate on this when discussing the plots in the text? Moreover, for the C-MICL method, the reported ground-truth feasibility rate does not always meet the target coverage level, as the error bars suggest. Could you elaborate on why this happens, since the theory would suggest the target coverage level should be met? Is it because you are talking about empirical errors? Can the authors elaborate?
- The authors claim Scalability and Efficiency concerning the size of the dataset. However, in your experiments, there seems to be a comparison with benchmarks for a fixed size of the dataset. For this dataset, the C-MICL method seems more efficient, and as it needs fewer models to train, the claim of scalability seems valid. However, there is no experimental support that this will hold for different dataset sizes, as with a fixed split for training and conformal calibration, the size of the dataset would affect all methods. Is this observation correct? Did the authors make comparisons for different dataset sizes? If so, could the authors elaborate on their claim regarding scalability?
- What is the effect of the size of the data set used for conformal calibration? In one experiment, the authors use 20% of the data, and in the other, the authors use 8%. What is common practice here? How did you derive these different split ratios? And what effect would the size of the data set used for conformal calibration have on the discussed measures?
Limitations
Yes.
Final Justification
After reading through other referees' comments and the authors' responses, I have decided to maintain my score.
Formatting Issues
No concerns.
Thank you for the thoughtful review and helpful feedback. Below we address weaknesses and questions.
- W1: We thank the reviewer for highlighting the importance of Assumption 4.1. We agree that this assumption is central to our theoretical guarantees and warrants careful justification.
Intuitively, Assumption 4.1 is plausible when the calibration data adequately covers the parts of the input space that intersect the feasible region, both within the ground-truth feasible region and within its complement. In this sense, it aligns with standard generalization assumptions that require the training and calibration data to be representative of the regions where predictions are deployed. Alternatively, Assumption 4.1 can be approximated in a data-driven way using more granular conditional conformal methods, by partitioning the optimization region into finer subregions and enforcing local coverage guarantees within each, which can then be translated to the feasible set.
We will revise the paper to better explain this interpretation and expand the discussion on when the assumption might fail. For instance, the assumption may break down if the feasible region is heavily concentrated in areas where the calibration set is sparse or systematically miscalibrated. In our experimental settings, we observe good empirical alignment between target and achieved coverage (Appendix E), suggesting that Assumption 4.1 holds reasonably well in practice under realistic data scenarios.
- W2: The secondary regression model introduced in Section 4.1 is used to quantify the uncertainty around the primary predictive model. This secondary model is essential for constructing adaptive conformal sets whose size depends on the input. As is standard in the conformal prediction literature, such adaptive sets allow us to maintain valid coverage while tailoring the prediction interval to the local uncertainty of the model (see Angelopoulos et al. [52] and Papadopoulos et al. [56]). The use of a separate model for estimating uncertainty is a common and well-established practice in conformal regression, and we will revise the manuscript to better introduce and motivate the role of the secondary model when it is first presented.
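As a toy illustration of this construction (with both the point predictor and the secondary model stubbed out by hand-picked functions, so purely for intuition and not the paper's implementation):

```python
# Toy sketch of locally adaptive (normalized) conformal intervals: the secondary model
# sigma(x) estimates residual magnitude, the score is |y - f(x)| / sigma(x), and the
# interval width adapts to the local uncertainty. Both models below are stand-ins.
import numpy as np

def f(x):       # hypothetical primary point predictor
    return 2.0 * x

def sigma(x):   # hypothetical secondary model of residual magnitude (kept positive)
    return 0.5 + np.abs(x)

rng = np.random.default_rng(0)
x_cal = rng.uniform(-1.0, 1.0, 200)
y_cal = f(x_cal) + sigma(x_cal) * rng.standard_normal(200)

alpha = 0.1
scores = np.abs(y_cal - f(x_cal)) / sigma(x_cal)          # normalized residual scores
n = len(scores)
level = min(1.0, np.ceil((n + 1) * (1 - alpha)) / n)
q_hat = np.quantile(scores, level, method="higher")

x_new = 0.8
lo, hi = f(x_new) - q_hat * sigma(x_new), f(x_new) + q_hat * sigma(x_new)
print(f"adaptive interval at x={x_new}: [{lo:.2f}, {hi:.2f}]")
```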
- W3: Thank you for raising this valid and insightful concern about violation magnitude. We evaluated the magnitude of feasibility violations to check whether C-MICL produces grossly infeasible solutions. Specifically, we inspected the actual values of the constraint violations and confirmed that, even when violations occurred, they were relatively minor. For example, in the NN classification case, which was the scenario with the highest number of violations for C-MICL (11 out of 100), the true values were: [0.395, 0.424, 0.455, 0.462, 0.479, 0.482, 0.487, 0.489, 0.489, 0.493, 0.495], under a feasibility threshold of 0.5. These values show that C-MICL tends to stay close to the feasible region even when feasibility is not strictly satisfied. We appreciate the suggestion and will incorporate violin plots into the Appendix of the revised manuscript to visualize the distribution of constraint violations across methods and scenarios.
- Q1: The C-MICL approach remains applicable when the feasible set for the outcome is more complicated, for instance, involving integer variables and constraints. While feasibility checking may appear more challenging when this set is more complex, the modeling effort lies in formulating the feasible set as a MIP, which is inherent to any constraint-learning method that must verify feasibility with respect to it, and is not specific to our proposed conformal approach. These MIP-representable constraint sets include finite unions of intervals, discrete sets, and sets defined by (non)linear inequalities, piecewise (non)linear constraints, and logical operations (e.g., disjunctions), capturing a wide class of practical applications in operations research and optimization (see "50 Years of Mixed-Integer Nonlinear and Disjunctive Programming" by Kronqvist et al.).
In such cases, C-MICL still enforces feasibility by verifying that the conformal prediction set lies entirely within the feasible set. This condition remains consistent with our formulations in Equations (4) and (6) for regression and classification, respectively. We appreciate the opportunity to discuss this point and will clarify the generality of our method subject to being MIP-representable in the revised manuscript.
- Q2: Thank you for this interesting question. When the function g involves the outcome variable y, the conformalization procedure becomes more complex. Since we have a conformal prediction set for y, evaluating g requires considering all possible values in this set, yielding a corresponding prediction set for g. The resulting constraint in the MICL formulation remains MIP-representable and can be incorporated in our framework.
However, extending the coverage guarantees from the conformal set for y to the induced prediction set for g is non-trivial for arbitrary functions g. The challenge lies in preserving the coverage properties when the conformal sets are transformed through potentially complex nonlinear functions. While this extension is theoretically possible under certain regularity conditions on g, it requires careful analysis of how prediction uncertainties propagate through the constraint function. We agree this is a promising direction for future research and will add this discussion to the updated manuscript.
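As a toy illustration of the propagation issue (an assumed example, not from the paper): if g happens to be monotone in y over the conformal interval, the induced set is simply the interval between the endpoint values; for general g, one would instead need bounds on its range over the interval.

```python
# Toy example: propagating a conformal interval for y through a constraint function g(x, y).
# If g is monotone in y over [lo, hi], the induced set is the interval between the endpoint
# values; for general g, one would need bounds on its range over [lo, hi].
def induced_interval_monotone(g, x, lo, hi):
    a, b = g(x, lo), g(x, hi)
    return (min(a, b), max(a, b))

g = lambda x, y: 3.0 * x + y ** 3          # hypothetical g, monotone increasing in y
print(induced_interval_monotone(g, x=1.0, lo=-0.5, hi=0.5))   # -> (2.875, 3.125)
```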
- Q3: Thank you for this valuable observation. You are correct that our framework requires the predictive model to be MIP-representable. When we describe the approach as "model-agnostic", we mean that it can accommodate any predictive model for which a MIP formulation exists. In particular, many widely used machine learning models can be formulated as mixed-integer programs in practice. Examples include linear, polynomial, and symbolic regression models (Wilson et al.), support vector machines (Maragno et al. [7]), and tree ensembles (Mistry et al. [11]). Moreover, several neural network architectures also admit MIP representations, including feedforward networks (Fischetti et al. [8]), graph neural networks (see Hojny et al.), and Kolmogorov–Arnold Networks (Karia et al.). Thus, although we impose the MIP-representability requirement, the framework remains broadly applicable in real-world settings. This requirement is stated in Remark 3.1 on page 4. Nonetheless, we are happy to clarify the intended meaning of "model-agnostic" in the revised manuscript.
- Q4: The error bars correspond to 95% confidence intervals for: (i) estimated ground-truth feasibility rates in Figures 1 and 4, (ii) average relative differences in objective values in Figures 2 and 5, and (iii) average computational times in Figures 3 and 6. All estimates are computed over 100 randomly generated optimization instances. Specific formulas for these confidence intervals are provided in Appendix E, and we will add this clarification to the figure captions and when discussing the plots in the text.
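For concreteness, an illustrative normal-approximation version of such an interval for a feasibility rate over 100 instances is sketched below; the exact formulas used in the paper are those given in Appendix E, and the counts here are hypothetical.

```python
# Assumed illustration: a 95% normal-approximation confidence interval for a ground-truth
# feasibility rate estimated over n independent instances (the paper's exact formulas
# are those given in its Appendix E; the counts below are hypothetical).
import math

n, n_feasible = 100, 93
p_hat = n_feasible / n
half_width = 1.96 * math.sqrt(p_hat * (1.0 - p_hat) / n)
print(f"feasibility rate = {p_hat:.2f} +/- {half_width:.2f}")   # 0.93 +/- 0.05
```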
Regarding the reported ground-truth feasibility of the C-MICL method, the observed variations around the target coverage level stem from finite-sample effects rather than theoretical violations. Our estimates use 100 independent optimization instances, introducing natural estimation uncertainty. Crucially, for C-MICL the nominal level is contained within the confidence intervals across all experiments, indicating no statistically significant deviation (miscoverage) from the theoretical guarantees.
- Q5: As correctly noted, our claim of efficiency primarily refers to the fact that C-MICL requires training at most two predictive models, in contrast to conformal ensemble-based methods, which often involve training many models, leading to optimization times that are orders of magnitude longer.
With respect to scalability in dataset size, we clarify that our claim refers specifically to the optimization phase. Existing methods, such as that of Zhao et al. [47], embed the entire training dataset as explicit constraints in the optimization model. This results in a formulation whose size grows linearly with the number of training samples, substantially affecting scalability and tractability of the optimization problem. In contrast, our framework trains predictive models offline, and once trained, the optimization problem is independent of the training dataset size. This means that increasing the size of the training data does not impact the tractability or runtime of the optimization phase. For this reason, we did not test different dataset sizes to assess optimization time, as it remains fixed once the models are trained. We appreciate the reviewer’s comment and will revise the manuscript to clarify this distinction more explicitly.
- Q6: Thank you for the thoughtful question. We chose the calibration split ratios to ensure approximately 200 data points in the conformal calibration set, following common practice in the literature, which typically recommends using between 100 and 500 points (see Angelopoulos et al. [52]). Naturally, there is a trade-off involved: allocating more data to the calibration set improves the accuracy of the conformal quantile estimation, while reducing the amount of training data may affect the quality of the prediction model. Our choice aims to reflect realistic data limitations while demonstrating that the C-MICL approach remains effective in such settings. We will clarify this rationale and discuss the trade-off more explicitly in the revised manuscript.
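For context, the calibration-set size n enters split conformal prediction through the finite-sample quantile level; in standard notation (see, e.g., Angelopoulos et al. [52]), with calibration scores s_1, ..., s_n:

```latex
% Standard split-conformal quantile: \hat{q} is the empirical quantile of the n
% calibration scores at the finite-sample-corrected level below, so larger n yields a
% more stable \hat{q}, and the level requires n to be at least on the order of 1/\alpha.
\hat{q} \;=\; \mathrm{Quantile}\!\left( \{ s_i \}_{i=1}^{n};\ \tfrac{\lceil (n+1)(1-\alpha) \rceil}{n} \right)
```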
The response of the authors is clear, and they made their case well. I thank them for their efforts. After careful consideration and taking into account the views of other reviewers, I am leaning toward keeping the score as it is.
This paper proposes a methodology, called Conformal Mixed-Integer Constraint Learning (C-MICL), whereby conformal prediction is used to obtain probabilistic guarantees of feasibility on the solution to optimization problems with unknown constraints (where data can be used to learn the constraint). In particular, when the decision variables of an optimization problem must satisfy a constraint involving an unknown function, the authors propose to construct a conformal prediction set which, under standard exchangeability assumptions, contains the true function value with a user-chosen probability. Then, the set containment constraint can be used as a (conservative, but valid with high probability) replacement for the original unknown constraint, and the feasible set of the new problem will be feasible for the original constraint with that probability. The authors describe how this framework can be used in both the regression and classification settings, and run extensive experiments demonstrating that their approach yields better feasibility rates when compared to previous methods, while remaining computationally tractable.
Strengths and Weaknesses
Strengths:
- This is a very interesting problem setting, and the application of conformal prediction to this problem is a valuable insight.
- The paper is clearly written and easy to follow.
- The experiments are extensive and well-documented.
Weaknesses:
- There are a number of papers in the recent literature that have looked at integrating conformal prediction into various forms of optimization problems to deal with robust constraints and/or objectives. In the related work, the authors cite references [46] and [47], but it might be worth discussing the current work in the context of the broader set of papers at this intersection, including, e.g.:
- Johnstone and Cox, Conformal Uncertainty Sets for Robust Optimization
- Kiyani et al., Decision Theoretic Foundations for Conformal Prediction: Optimal Uncertainty Quantification for Risk-Averse Agents
- Patel et al., Conformal Contextual Robust Optimization
- Yeh et al., End-to-End Conformal Calibration for Optimization Under Uncertainty
- Assumption 4.1, on the conditional independence of the feasibility of C-MICL and conformal coverage, seems strong, and not obviously true.
- In cases where the function itself is not random, replacing the constraint with the conformal set containment seems like it would be conservative in general.
Questions
- The conformal set containment constraint seems like a challenging constraint to enforce in general, depending on the structure of the conformal set. In Sections 4.1 and 4.2 of the paper, you show how various regression and classification settings can yield conformal sets whose containment problems are MIP-representable, but are there other settings where such a formulation will not be easily obtained, and where this constraint will be hard to enforce in a MIP?
- If I understand correctly, Assumption 4.1 is saying that, conditional on ground-truth feasibility, the event of C-MICL feasibility is assumed to be independent of the event that the conformal set covers the true function value. Whether or not this assumption holds seems critical for the validity of the probabilistic feasibility guarantees. Is this a reasonable assumption in practice? If so, can you give formal, mathematical intuition for why this is an assumption that is likely to hold? I am willing to raise my score if provided a convincing answer to this question.
- The statement in lines 199-200 that "we assume that whether a point satisfies the constraint does not affect the probability that the conformal set contains the true function value" seems imprecise, given that the exact place where this is invoked in the proof includes the entire feasible set of the C-MICL problem, not just the set containment condition.
- It seems that replacing the constraint with the conformal set containment is conservative, as it results in larger objective values (seen in Figures 2 and 5). How do these values compare with the ground truth problem's optimal objective value?
Limitations
Yes
Final Justification
The authors have sufficiently addressed my concerns about Assumption 4.1.
Formatting Issues
Line 110 - "does" should be "do" to agree with the plural "are" earlier in the sentence.
Thank you for the thoughtful review and helpful feedback. Below we address weaknesses and questions.
- W1: We thank the reviewer for highlighting these valuable contributions at the intersection of conformal prediction and optimization. We agree they provide important context and will include them in the revised manuscript. While Johnstone & Cox (2021) construct conformal ellipsoids for robust optimization, and Yeh et al. (2023) learn convex uncertainty sets for robust objectives using differentiable conformal layers, C-MICL focuses on conformalizing feasibility regions rather than robustifying objective values. Kiyani et al. (2023) and Patel et al. (2022) use conformal sets for utility-aware or risk-sensitive decision-making, optimizing value-at-risk or ensuring robust output quality. In contrast, C-MICL targets constraint satisfaction under model misspecification, supporting general nonconvex feasible regions and offering distribution-free feasibility guarantees. This leads to a distinct formulation centered on feasibility-aware decision-making rather than robustifying objectives/actions for downstream optimization problems. We thank the reviewer again for pointing out this additional work at the intersection of conformal prediction and optimization. We will include and discuss these additional references in the related work section in the revised manuscript.
- Q1: We thank the reviewer for this important question. The key insight is that the conformal prediction sets used in C-MICL have predictable, well-structured forms that are always MIP-representable, making the containment constraint tractable regardless of the underlying prediction model.
In the regression setting, the conformal set is always an interval centered at the point prediction. Therefore, the containment constraint simplifies to checking whether this interval lies entirely within the feasible set for the output. When that feasible set is described by lower and upper bounds, this reduces to two linear inequalities, which are directly representable in a MIP (see the toy sketch at the end of this response).
In the classification setting, the conformal set is always a finite subset of the label space and can be captured using binary indicator variables, one for each class. The constraint then becomes a set of linear conditions over these binary variables.
Since conformal sets in C-MICL preserve these structured forms across different prediction models and conformity scores (intervals for regression, subsets for classification), the containment constraint remains computationally tractable, with detailed MIP formulations for both settings in Appendix C. We appreciate the opportunity to clarify this point and will emphasize in the revised manuscript the structured nature of conformal sets.
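To make the regression case concrete, a small toy optimization sketch is given below, using Gurobi's Python API with a hypothetical linear predictor, conformal radius, bounds, and objective; it is not the paper's formulation.

```python
# Toy sketch (not the paper's formulation): embedding the containment of the conformal
# interval [f(x) - q_hat, f(x) + q_hat] inside an output band [lb, ub] as two linear
# constraints. The linear predictor, radius, bounds, and objective are all hypothetical.
import gurobipy as gp
from gurobipy import GRB

w, c = [2.0, -1.0], 0.5      # assumed linear learned predictor f(x) = w[0]*x0 + w[1]*x1 + c
q_hat = 0.3                  # conformal quantile from calibration
lb, ub = 0.0, 4.0            # assumed feasible band for the learned output

m = gp.Model("c_micl_toy")
x = m.addVars(2, lb=0.0, ub=5.0, name="x")
f_x = gp.quicksum(w[i] * x[i] for i in range(2)) + c

m.addConstr(f_x - q_hat >= lb, name="conformal_lower")   # interval containment becomes
m.addConstr(f_x + q_hat <= ub, name="conformal_upper")   # two linear inequalities

m.setObjective(x[0] + x[1], GRB.MAXIMIZE)                # placeholder objective
m.optimize()
print([x[i].X for i in range(2)], m.ObjVal)
```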
- W2 + Q2: We thank the reviewer for highlighting the importance of Assumption 4.1 and for prompting a more detailed discussion of its role, interpretation, and limitations. We agree that this assumption is central to our theoretical guarantees and warrants careful justification. We believe it is reasonable in our setting and necessary to extend conformal guarantees to constrained optimization problems.
Assumption 4.1 states that, conditional on ground-truth feasibility, the event of C-MICL feasibility is independent of whether the conformal set contains the true function value.
To motivate Assumption 4.1 and clarify when it is plausible, note first that the ground-truth feasibility (GTF) conditional coverage guarantee from Lemma 3.1 can be achieved in a fully data-driven way (e.g., using Mondrian conformal prediction or other label-conditional conformal methods).
However, in C-MICL we aim to guarantee coverage over the feasible region of the optimization problem itself.
If the predictive model were perfect, this region and the ground-truth feasible region would coincide, and the coverage guarantee would transfer directly. However, since we are interested in the more realistic case where the model is imperfect, the two regions differ in a data-dependent way.
In this case, since the feasible region is implicitly shaped by the calibration set (via the conformal sets), there is a natural dependency between the feasible solutions of the C-MICL problem and the calibration data, which invalidates standard conformal guarantees relying on i.i.d. calibration and test data. Assumption 4.1 precisely seeks to decouple this dependency: it allows us to approximate conformal coverage within the C-MICL feasible region by assuming that feasibility does not systematically bias conformal validity, once conditioned on ground-truth feasibility.
To build intuition, consider partitioning the C-MICL feasible region into two disjoint subsets: the points that are ground-truth feasible and those that are not. Then, Assumption 4.1 implies that neither subset is systematically biased towards a region of the input space that is miscalibrated. This enables us to translate the conformal coverage guarantees from the ground-truth feasible region to the feasible set used in the optimization.
Assumption 4.1 is therefore reasonable when the calibration data adequately covers the parts of the input space that intersect the feasible region, both within the ground-truth feasible region and within its complement. In this sense, it aligns with standard generalization assumptions that require the training and calibration data to be representative of the regions where predictions are deployed. Alternatively, Assumption 4.1 can be approximated using more granular conditional conformal methods, by partitioning the optimization region into finer subregions and enforcing local coverage guarantees within each, which can then be translated to the feasible set.
We will revise the paper to better explain this interpretation and expand the discussion on when the assumption might fail. For instance, the assumption may break down if the feasible region is heavily concentrated in areas where the calibration set is sparse or systematically miscalibrated. In our experimental settings, we observe good empirical alignment between target and achieved coverage (Appendix E), suggesting that Assumption 4.1 holds reasonably well in practice under realistic data scenarios.
-
Q3: Yes, thank you for raising this point. We agree that our original wording is imprecise. Assumption 4.1 is meant to apply to the full feasible set of the C-MICL problem, , rather than only to the containment condition . We will revise the statement to clarify this and more accurately state the meaning of our assumption as used in the proof. Thanks again for pointing this out.
- W3 + Q4: We thank the reviewer for this important question. However, since the conformal set is defined using the learned model rather than the true function, the feasible region defined by the oracle constraint and the one defined by the conformal set containment do not differ in a systematic way. In particular, neither region necessarily contains the other, and therefore one is not more conservative, nor does it yield larger or smaller optimal values, in general.
On the other hand, relative to the learned function, our conformal constraint is indeed conservative: the point prediction lies in the conformal set by construction, making the containment constraint stricter than simply requiring the point prediction to be feasible. The latter corresponds exactly to the MICL approach, where point predictions are used directly in the optimization problem without uncertainty quantification. We compare C-MICL to this naive method in Figures 2 and 5, showing optimal values 15% and 1% smaller for our regression and classification settings, respectively. Crucially, while MICL achieves better optimal values, it systematically violates ground-truth feasibility constraints, as shown in Figures 1 and 4. Our method provides probabilistic ground-truth feasibility guarantees while achieving comparable optimal values.
Nevertheless, we agree that comparing to the ground truth problem's optimal objective value (i.e., solving the optimization problem using the true constraint ) is a relevant baseline. We will add this to the updated version and discuss the corresponding results.
Thanks for your rebuttal. I appreciate your response to my question about Assumption 4.1, and I will raise my score accordingly.
I would like to thank the authors for submitting an interesting paper on learning constraints from data using a new framework that they call Conformal Mixed-Integer Constraint Learning (C-MICL). The reviewers are unanimously positive about the paper, noting that it provides a new approach to constraint learning from data with probabilistic feasibility guarantees. The paper is well written and provides detailed, convincing experiments. Reviewers raised various questions that were overall comprehensively and satisfactorily addressed in the rebuttal.
I would like to recommend acceptance of the paper for NeurIPS, encouraging the authors to include brief summaries of the key discussions with reviewers -- especially additional scholarship putting the paper in the context of prior work, and discussing the conditional independence assumption.
Thank you, AC.