PaperHub
5.8
/10
Rejected5 位审稿人
最低5最高8标准差1.2
8
5
5
5
6
2.6
置信度
ICLR 2024

The Phase Transition Phenomenon of Shuffled Regression

OpenReviewPDF
提交: 2023-09-22更新: 2024-02-11

摘要

This paper studies the phase transition phenomenon inherent within the shuffled (permuted) regression problem, which has found applications in databases, privacy, and data analysis, etc. For the permuted regression task: $\mathbf{Y} = \mathbf{\Pi}^{\natural}\mathbf{X}\mathbf{B}^{\natural}$, the goal is to recover the permutation matrix $\mathbf{\Pi}^{\natural}$ as well as the coefficient matrix $\mathbf{B}^{\natural}$. It has been empirically observed in prior studies that, when recovering $\mathbf{\Pi}^{\natural}$, there exists a phase transition phenomenon: the error rate drops to zero rapidly once the parameters reach certain thresholds. In this study, we aim to precisely identify the locations of the phase transition points by leveraging techniques from {\em message passing} (MP). \noindent In our analysis, we first transform the permutation recovery problem into a probabilistic graphical model. Then we leverage the analytical tools rooted in the message passing (MP) algorithm and derive an equation to track the convergence of the MP algorithm. By linking this equation to the branching random walk process, we are able to characterize the impact of the signal-to-noise-ratio (SNR) on the permutation recovery. Depending on whether the signal is given or not, we separately investigate the oracle case and the non-oracle case. The bottleneck in identifying the phase transition regimes lies in deriving closed-form formulas for the corresponding critical points, but only in rare scenarios can one obtain such precise expressions. To tackle this challenge, we propose the Gaussian approximation method, which allows us to obtain the closed-form formulas in almost all scenarios. In the oracle case, our method can fairly accurately predict the phase transition SNR. In the non-oracle case, our proposed algorithm can predict the maximum allowed number of permuted rows and uncover its dependency on the sample number. Numerical experiments suggest that the observed phase transition points are well aligned with our theoretical predictions. This is an exciting exploration. We hope that our study will motivate exploiting message passing algorithms (and related techniques) as a new effective tool for studying permuted regression problems. For example, identifying the phase transition locations for sparse permuted recovery~\citep{zhang2021sparse} would be a highly interesting (and challenging) application. In this paper, we also briefly illustrate the use of the message passing algorithm for {\em partial correspondence recovery}, an important problem that deserves its own thorough study.
关键词
message passingphase transitionpermuted linear regression

评审与讨论

审稿意见
8

The paper studies the phase transition phenomenon in the shuffled (permuted) regression problem: the error rate drops to zero rapidly once the parameters reach certain thresholds, which is empirically observed in previous works.

This paper aims to theoretically characterize and precisely identify the locations of phase transition thresholds using message passing techniques: For the oracle case (regression coefficients known), the paper derives an analytical formula to predict the phase transition SNR; For the non-oracle case (unknown coefficients), the paper proposes approximations to predict the maximum number of shuffled rows and its dependence on problem parameters.

Numerical experiments validate the theoretical predictions and show close alignment with the observed phase transitions.

Overall, the paper provides new theoretical understanding of phase transitions in shuffled regression

优点

  1. This paper is well written and the main stream easy to follow even though I am not quite familar with this area.

  2. This paper gives the first theoretic result to identify the precise locations of phase transition thresholds associated with permuted linear regression, showing the precise positions of the phase transition points in the large-system limit. The technical contribution seems to be significant.

  3. Numerical experiments in this paper align well with theoretical predictions, even when problem sizes are not very large. This suggests the theory accurately captures the phase transition phenomenology.

缺点

  1. The theoretic analysis in this paper uses a performing Gaussian approximation from statistical physics, so the analysis here is not exactly rigorous.

  2. The technical analysis and derivations in this paper are quite complex. This could make it difficult for a broad audience to digest the theory. Although it is indeed a theory paper, a more gentle treatment on the mathematical details in the main text is still favorable.

问题

  1. I would like the authors to discuss the connections to phase transitions studied in other problems, which could provide more insight on this phenomenon.
评论

Thanks for your positive comments and support of our work. We are glad that you like our paper and find our paper well-written and accessible to audiences not familiar with this area. We really like your comment that our paper makes significant technical contributions and our theory's accurate capture of the phase transition phenomenon.

In addition, we would like to highlight that our main results obtained with complex derivations, especially those concerning the higher moments of Gaussian random variables, can be reused and serve as independent technical interests.

Moreover, let us answer your question concerning the connections to phase transitions studied in other problems. The most related work are (Montanari & Mezard, 2009) and (Semerjian et al, 2020). In (Montanari & Mezard, 2009), all edge weights are i.i.d. samples and there is no phase transition behavior. Whereas, (Semerjian et al, 2020) observe the phase transition behavior when switching the independent identical distribution (i.i.d.) assumption to independent non-identical distributions. Hence, we conjecture the phase transition is caused by the distribution discrepancy among different edges.

Again, thanks for your review of our work.

审稿意见
5

This paper studies the shuffled linear regression problem: linear measurements of a signal are perturbed via an additive noise and a permutation of the measured observables. The objective is then to recover the signal and the permutation (or only the permutation in the oracle version of the problem in which the signal is also observed). The manuscript relies on a previous study of the planted matching problem to propose expressions of phase transition points, as well as Gaussian approximations.

优点

The shuffled linear regression problem is an interesting one, and it is a desirable objective to understand the phase transition it undergoes. In this respect the connection to the planted matching problem appears original and promising.

缺点

The clarity and pedagogy of the presentation of the paper seems rather insufficient to me. Section 2 is a review of the message-passing approach to the linear assignment problem, but the relation to the present problem is not motivated beforehand. Moreover the relation between the purely random and the planted version is not clear, and I don't think it is possible to understand the reasoning in Sec. 2.3 that leads to theorem 1, on which the rest of the results of the paper relies, without reading the original paper [Semerjian et al (2020)]. The Appendix A which seems to be aimed at clarifying this derivation remains quite obscure.

One of my main concerns is the use of theorem 1 in a setting where it does not apply: in [Semerjian et al (2020)] the weights on the edges of the graph are independent by construction, which allowed to write the recursive distributional equations (7). But here the weight matrix E=X B has strong correlations in the edge weights, it is not clear why these should be neglected. The manuscript states on page 8 that in the "non-oracle case", "Unlike the oracle case, we notice the edge weights Eij are strongly correlated", but isn't it already the case that in the oracle case these are correlated ?

There are also imprecisions in the use of the "phase transition" phrasing, since there is not a clear description of the qualitative differences between the two phases that the transition should separate, and since a study of the effect of the system size on the sharpness of the transition is not presented in the main part of the text.

问题

  • in the first line of equation (4) you express the m messages in terms of the hatted m messages, whereas the second line express the hatted m in terms of the hatted m, wouldn't it be more logical that the second line express the m hatted in terms of the m, or to suppress the first line since the second one is closed on one type of messages?

  • in the upper panels of Fig. 2 the SNR axis extends to negative values, what is the meaning of the part of the curve at negative SNR?

  • on page 6, "As a mitigation, we resort to approximating ... by its lower-bound", could you explain which inequality you use to derive this expression?

  • on page 8 one can read that "this estimator can reach the statistical optimality in a broad range of parameters", could you give evidence for this statement? It is indeed quite surprising: from the beginning of the Section one understands that the "statistically optimal" estimator would involve solving a QAP, which is not doable efficiently, so how can one asserts that the LAP reaches statistical optimality if the latter is not known?

  • on page 14, in the description of the numerical procedure, I'm not sure that it is the dichotomy search that should be expanded upon, but rather what corresponds to the line 5 and 6 of algorithm 1. What is meant by "run experiments", the resolution of the LAP? With an exact algorithm or via message passing? Could you also specify the definition of the error rate, is it the fraction of samples on which the inferred permutation fails to coincide with the planted one, or the average fraction of the permutation that is correctly inferred? This last question applies as well to the y axis on the lower panels of figure 2.

评论

Next, let us address your other questions.

1. Justifications of weak correlations of matrix EE for the oracle case.

First, we can calculate the correlations between entries in EE. We can find most of these correlations are zero. There are only two scenarios with non-zero correlations, (i) Eπ(i),π(i)E_{\pi(i), \pi(i)} Eπ(j)π(j)E_{\pi(j)\pi(j)} and (ii) Eiπ(j)E_{i \pi(j)} Ejπ(i)E_{j \pi(i)}, which is with O(n)O(n) number while the total edge number is O(n2)O(n^2). In this sense, we can say EE is only weakly correlated. In addition, the numerical evidence in Table 1 and Table 2 also suggests that we can safely ignore these weak correlations and predict the phase transition points to a good extent.

2. Meaning of phase transition.

We use the term phase transition to denote the phenomenon such that a small change in the parameter will lead to a sudden change in the reconstruction performance. That is, the error rate drops from one to zero drastically when SNR exceeds the threshold. The meaning of phase transition can be observed in Table 1, Table 2, and Figure 2, where we specifically study the change in error rates.

评论

Dear Reviewer, we would like to first thank you for recognizing our method to be original and promising. Please allow us to clarify the statistical optimality of LAP/QAP in the non-oracle case, where there seems to be a misunderstanding.

First, whether the QAP will reach the statistical optimality still remains an open problem. To the best of our knowledge, the only related work is (Zhang, Slawski & Li, 2022) but they only prove the optimality holds for a restricted region. More specifically, they require the rank(B)clogn/n\textup{rank}(B) \leq c\log n/n.

Second, the optimality of LAP over a wide range is answered by the cited work (Zhang & Li, 2020) (c.f. one line above Equation (19)). Work (Zhang & Li, 2020) show that the mini-max lower bound of SNR requirement, i.e., no estimator can obtain the correct permutation matrix provided that logSNRc0logn/stable-rank(B)\log SNR \leq c_0 \log n/\textup{stable-rank}(B); and the LAP in Equation (19) can obtain the correct permutation matrix when logSNRc1logn/stable-rank(B)\log SNR \geq c_1 \log n/\textup{stable-rank}(B) (stable-rank(B)=BF2/B2\textup{stable-rank}(B)=|B|_F^2/|B|^2 is the stable rank).

Interestingly, based on the current literature, the LAP estimator in Equation (19) can actually reach statistical optimality in a wider range than the QAP estimator.

评论

Your detailed comments are answered in the following.

1. Explanation of the reasoning in Section 2.3.

First, we explain the reasoning in Section 2.3 and the role of Theorem 1. Section 2.3 basically is to analyze the impact of SNR on the convergence behaviors of the message passing (c.f. Equation (6)). As it is very difficult to track all message flows, we view the message flows to be i.i.d. samples. Here, we regard the message flow hi(i,π(i))h_{i\rightarrow (i, \pi^{\natural}(i))} and hj(i,π(i))h_{j\rightarrow (i, \pi^{\natural}(i))} along the correct correspondence edge to be i.i.d. samples (represented by the random variable HH). Similarly, we regard the message flows along the rest edges as i.i.d. samples from a distribution denoted by H^\hat{H}. In this way, we only need to track two scalars (HH and H^\hat{H}) instead of all message flows. The iterative equation is Equation (6), which can be further simplified to Equation (8) and can be analyzed by Theorem 1. When infθ1/θlog(i=1nEeθΞi)>0\inf_{\theta} 1/\theta \log\left(\sum_{i=1}^n \mathbb{E} e^{-\theta \Xi_i}\right) > 0, we have HtH^{t} converge to a large positive value; whereas infθ1/θlog(i=1nEeθΞi)<0\inf_{\theta} 1/\theta \log\left(\sum_{i=1}^n \mathbb{E} e^{-\theta \Xi_i}\right) < 0 leads to a large negative value. Hence, we conclude there is a sudden behavior change, namely, phase transition happens when infθ1/θlog(i=1nEeθΞi)=0\inf_{\theta} 1/\theta \log\left(\sum_{i=1}^n \mathbb{E} e^{-\theta \Xi_i}\right) = 0.

2. Meaning of negative SNR. The negative SNR denotes the place of the potential phase transition points. As SNR is non-negative by nature, we can conclude there is no phase transition in this region, which means the permutation reconstruction cannot be ever reconstructed.

3. Lower Approximation of Page 6. We use the standard inequality log(1+x)x\log(1+x)\leq x.

In the end, we would like to thank you again for your detailed comments. Please feel free to let us know if you have any other questions. We will be happy to answer them. Thank you.

4. Presentation of Algorithm 1 and detailed explanation of Lines 5 and 6

Thanks for the suggestion in presenting Algorithm 1 that we may need to give more focus on Lines 5 and 6 rather than the binary search. It is interesting that some other reviewers suggested that the binary search method needs more clarification since many readers might be unfamiliar with it. We will try to balance these suggestions.

Regarding Line 5, it corresponds to the estimators to solve the permutation matrix, that is equation (9) for the oracle case and equation (19) for the non-oracle estimator.

As for Line 6, the error rate refers to the fraction of samples on which the inferred permutation fails to coincide with the planted one, as we state in the second paragraph in Appendix C "we calculate the error rate of the permutation matrix (full recovery)". Its mathematical definition is L1i=1L1(ΠΠ^)L^{-1}\sum_{i=1}^L 1(\Pi \neq \hat{\Pi}), where 1()1(\cdot) is the indicator function, LL is the number of experiments and is set as 100100. We explained this in the main text. Perhaps we can make it more explicit. Thanks for the suggestion.

Again, thanks for your review of our work. Please kindly let us know if there are further questions. Thank you.

审稿意见
5

In this paper, the authors characterize the phase transition threshold (in the signal-to-noise ratio, SNR) in the permuted linear regression problem. By leveraging the tool of message passing and connections to branching random walk process, closed-form formulas for the critical points, as well as the phase transition SNR are given, for both oracle (with known signal) and non-oracle (with unknown signal) settings.

Some numerical results are provided to support the proposed theoretical characterizations.

优点

This paper focuses on the important problem of the statistical characterization of the phase transition behavior of the fundamental permuted linear regression problem.

There are many interesting (in fact shining!) ideas in the derivation of the phase transition SNR. And the numerical results seem to support the derived results.

缺点

The theoretical results in this paper are presented in a somewhat vague fashion. If I understand correctly, some steps in the derivation are mathematically rigorous, while others are approximations and/or intuitions without strong theoretical evidence. It remains unclear, at least to me, which part of the results can be confidently called for future use, and which part cannot.

问题

My major concern is that (it seems to me) the following approximations/heuristics are used to derive the phase transition threshold:

  • approximation log(E[eθΞ])\log( \mathbb{E}[e^{-\theta \Xi}] ) by its lower bound;
  • Gaussian approximation of the random variable Ξ\Xi. It would be great if some further theoretical arguments and/or discussions can be provided to support these approximation. Otherwise, I do see why the derived phase transition is a reasonable approximate solution to the original problem, and the main results in this paper should be stated as "conjectures".

Some additional comments:

  1. is Proposition 1 proven somewhere?
  2. "We defer the technical details to Appendix": please refer to the specific section in the appendix.
评论

Dear Reviewer, thanks for your appreciation of our derivation of the phase transition SNR. It's very encouraging to see your positive comments such as our ideas are shining.

We feel your major concern is on whether our results can be reused or not.

First, we can assure you that all theorems/propositions are rigorously derivated and can be confidently reused. Regarding the approximations, they mainly exist in establishing the connections between the phase transition point and the SNR. To be more specific, it primarily involves two aspects (i) the process of linking the phase transition point and Equation (11) and (ii) approximating the solution of (11). The conclusions in Theorem 1, Proposition 1, and Theorem 2, are rock solid.

Next, we address your detailed questions as follow.

1. Approximation of logEeθΞ\log \mathbb{E}e^{-\theta \Xi}.

This is due to the relation log(1+x)x\log(1+x)\leq x.

2. Justification of Gaussian approximation.

The technique of Gaussian approximation is widely applied in analyzing the message-passing algorithm. One of the most successful its applications is in designing the LDPC code (c.f. 'Analysis of sum-product decoding of low-density parity-check codes using a Gaussian approximation' by Sae-Young Chung, T.J. Richard, and R.L. Urbanke). Its success is mostly verified by empirical experiments and the theoretical justifications are still awaiting to be given.

Again, we would like to mention that all theorems/propositions (Theorem 1, Proposition 1, and Theorem 2) are rigorously derived. The non-rigorous part is mostly about the dependence between the phase transition points and the expectation Eexp(θΞ)\mathbb{E} \exp(-\theta\Xi) and its calculations.

3. Position of the Proof of Proposition 1.

It can be found in Appendix B.

Again, thanks for your review of our work. Please kindly let us know if there are further questions. Thank you.

审稿意见
5

The paper delves into identifying the precise location of phase transition thresholds in permuted linear regressions. While prior work could ascertain this for the oracle case where the signal matrix BB^* is given, this study innovates by extending the analysis to non-oracle scenarios, employing techniques from quadratic assignment problems. Through mathematical formulations, the paper showcases a phase transition phenomenon that is consistent with the oracle case. Validation is presented using numerical experiments, demonstrating the model's ability to reconstruct permutation matrices in the noiseless context. The paper concludes by emphasizing its novel approach to pinpointing phase transition thresholds in the non-oracle realm, thus marking a significant advancement in understanding permuted linear regressions.

优点

  • Originality: Introduces a novel approach by tackling non-oracle scenarios in phase transition thresholds of permuted linear regressions, uniquely combining techniques from quadratic assignment problems.

  • Quality: Demonstrates strong mathematical rigor, with theorems well-supported by numerical experiments, ensuring both theoretical and practical robustness.

  • Significance: Offers deeper insights into phase transition thresholds in permuted linear regressions, bridging gaps between oracle and non-oracle scenarios, with potential for broader applications.

缺点

  • Notation Issues: The notation throughout the paper can be perplexing, especially for those not familiar with Mezard & Montanari (2009). Consistent and universally recognized notation would improve readability and understanding.

  • Specific Errors: The representation in equation (3) on page 3 seems incorrect. A direct comparison to the original definition from Mezard & Montanari (2009) indicates a mistake. Specifically, the equation should be 1=jΠi,j1 = \sum_j \Pi_{i,j} or 1jΠi,j1 \leq \sum_j \Pi_{i,j} instead of 1jΠi,j1 - \sum_j \Pi_{i,j}. Such inaccuracies can mislead readers and detract from the paper's overall quality.

  • Assumed Prior Knowledge: The paper heavily relies on readers having read Mezard & Montanari (2009). For wider appeal and comprehension, summarizing or introducing key concepts from the referenced work would be beneficial.

  • Constructive Feedback: A deeper review of the paper's mathematical foundations, specifically in terms of its equations and their derivations, is necessary. This will ensure the elimination of such errors in future revisions.

It would be beneficial for the authors to address these weaknesses explicitly in their revisions, ensuring both clarity for the readers and robustness in their mathematical representations.

问题

  • In the methodology section, there seems to be a gap between the presented model and its practical application. Could you elaborate on how the proposed model can be applied to real-world scenarios?

  • How do the findings of this paper differ or expand upon the conclusions drawn in Mezard & Montanari (2009)? A comparative analysis would be useful for readers familiar with the referenced work.

  • Were there any limitations or constraints in the datasets or simulations used in the experiments? Understanding the scope and boundary conditions would be beneficial.

评论

Dear Reviewer, thanks for recognizing our work to be novel, of high quality, and with great significance.

First, there seems to be a misunderstanding on Equation (3), please allow us to clarify. Our notations with (Montanari & Mezard, 2009) are in fact equivalent, so Equation (3) is correct. In both our work and (Montanari & Mezard, 2009), the terms with the indicator function are one when jΠij=1\sum_{j}\Pi_{ij} = 1 (and iΠij=1\sum_i \Pi_{ij} = 1), and zero otherwise. They are both to enforce the constraint such that Π\Pi is a permutation matrix.

Next, we answer your other questions.

1. Difference with Mezard & Montanari (2009).

We consider a much harder problem than that in Mezard & Montanari (2009). In our work, the distributions of edge weights are neither identical nor independent. Moreover, we discover and analyze a phase transition phenomenon, which is the main contribution of our paper. Mezard & Montanari assume each edge weight is i.i.d. distributed and there is no phase transition phenomenon present.

2. Constraints on the simulations.

There is hardly any constraint on the numerical experiments. In our experiments, we only assume the entries in the sensing matrix XX to be i.i.d. sub-gaussian with zero mean. For example, we consider Gaussian distributed XX in Table 1 and Figure 2, and uniformly [1,1][-1, 1] distributed XX in Table 2. The sensing noise matrix WW is a Gaussian random matrix with each entry being a standard normal random variable. As for the signal matrix BB, it can be an arbitrarily fixed matrix in the oracle case; and is required to have all eigenvalues reside within approximately the same region in the non-oracle case.

3. Practical applications of permuted linear regression.

For the permuted linear regression, its application widely ranges from database merging, to privacy, to communications, to computer vision, to robotics, to sensor networks, etc. Two of the most important practical applications of the permuted linear regression include (i) linkage record, where we would like to align data from different sources and merge them into one comprehensive database; (ii) data de-anonymization, where we want to use public information to infer and uncover hidden identities or connections within private datasets.

Since our work is a theoretical paper, our focus is on the theoretical analysis and prefer to leave the applications to the references, which are (Unnikrishnan et al., 2015; Pananjady et al., 2018; Slawski & Ben-David, 2019; Pananjady et al., 2017; Slawski et al., 2020; Zhang & Li, 2020). We can include more content on the practical applications of permuted linear regression if the readers feel helpful. Thanks for the suggestion.

Again, we appreciate your valuable feedback. Please kindly let us know if there are further questions. Thank you.

审稿意见
6

The paper's primary focus is the analysis of phase transition thresholds within the context of recovering the permutation matrix in shuffled regression, as described by the model Y=ΠXB+σWY = \Pi X B + \sigma W. To achieve this, the paper employs techniques derived from message passing algorithms. The central goal in the paper is to establish phase transition thresholds by assessing the signal-to-noise ratio (snr\text{snr}) in two distinct scenarios. In the oracle case, the matrix BB is assumed to be known, while in the non-oracle case, BB is considered unknown. The derived thresholds serve as critical points where the error rates for permutation matrix recovery sharply drop to zero when the signal to noise ratio exceeds a certain threshold. Numerical results confirms the accuracy and reliability of the theoretically derived phase transition points in both the oracle and non-oracle cases.

优点

The paper showcases notable strengths, particularly in terms of its technical contributions. It is the first work that leverages the framework of message passing algorithms to derive phase transition thresholds for shuffled regression. Furthermore, the section dedicated to related work is thoughtfully presented and offers a clear and comprehensive overview of the existing literature.

缺点

  • The paper's motivation could benefit from a more explicit and clear description. While the challenges of the problem are well-defined, a paragraph highlighting the practical applications of shuffled regression would greatly enhance the paper's appeal. Providing real-world scenarios where this problem arises would place it within a broader context of applicability and increase its relevance.

  • Additionally, I would suggest reorganizing the notation-heavy equations, starting with equation (4) and its subsequent equations regarding message flows right until message passing update equation. Improved organization and perhaps some explanatory text would enhance the overall readability of the paper.

问题

  • The paper could benefit from a clearer explanation of the relationship between the permutation recovery problem and the linear assignment problem as defined in Equation (2). Providing more context on how the linear assignment problem is connected to the ML estimator for permutation recovery would be beneficial. The sudden introduction of the linear assignment problem might be confusing to readers initially. Additionally, it would be helpful to include a note regarding the operational significance of the matrix EE and why it plays a crucial role in this context.

  • The derivation of the lower bound for logEeθΞ\log \mathbb{E}e^{-\theta\Xi} on page 6 is not clearly explained. Providing a more detailed and comprehensible explanation of how this lower bound is derived would greatly enhance the paper's clarity.

  • Given the significance of the Gaussian approximation in obtaining closed-form expressions for snr\text{snr}, it would be valuable to provide an intuitive explanation of why this approximation is effective.

  • The paper imposes a condition on the ratio of the singular values for matrix BB when computing snrnon-oracle\text{snr}_{\text{non-oracle}}. An explanation for the rationale behind this condition would be beneficial.

评论

Dear Reviewer, thanks for your feedback and recognizing that our work is the first work that leverages the framework of message-passing algorithms to derive phase transition thresholds. In addition, we are pleased that you find our literature review comprehensive and informative.

Let us address your questions.

1. Lower bound on Equation (6). The lower bound is obtained with the relation log(1+x)x\log(1 + x) \leq x.

2. Intuition of Gaussian approximation. The choice of Gaussian approximation in the message passing (MP) is mainly due to its past success in analyzing the MP algorithm. One of the most successful examples is (c.f. 'Analysis of sum-product decoding of low-density parity-check codes using a Gaussian approximation' by Sae-Young Chung, T.J. Richard, and R.L. Urbanke), a famous paper in coding theory with more than 1600+ citations.

3. Role of the assumption such that all singular values of the matrix BB are of the same range in computing snrnon-oraclesnr_{\textup{non-oracle}}.

First, this assumption can greatly simplify the expression of snrnon-oraclesnr_{\textup{non-oracle}}. Otherwise, snrnon-oracle\textup{snr}_{\textup{non-oracle}} will be the solution of a quartic equation (fourth order) rather than a quadratic equation (second order). Second, this assumption can preserve the generality of our theorem since this assumption can cover a wide family of matrices, say sub-Gaussian random matrices. It is well known that all their eigenvalues are within the region [1c0m/p, 1+c0m/p][1-c_0\sqrt{m/p},~1 + c_0 \sqrt{m/p}] (after column normalization), and satisfy our assumption that all eigenvalues are of the same range.

In addition, we appreciate your suggestions on the paper organization such as adding more discussion on the practical applications. As our work a theoretical paper focusing on bridging the message passing with the analysis of permuted linear regression, we limit the content to avoid distraction. The problem of permuted regression has many useful applications, for example:

(A) Record linkage.
Given multiple databases containing information about the same entities, the objective is to merge them into one comprehensive database. However, these data may not be well-aligned due to the data formatting or data quality issues. Modeling the mismatches as a permutation is investigated as a mitigation strategy.

(B) Data de-anonymization. This task can be regarded as the opposite side of privacy protection. The intruders aim to infer the hidden identities/labels in certain private networks with public information. One commonly used method is to compare the correlation between this information and to find matching pairs with maximum correlation sum. Here, the permuted linear regression arises as a natural generalization.

For more applications, we recommend (Unnikrishnan et al., 2015; Pananjady et al., 2018; Slawski & Ben-David, 2019; Pananjady et al., 2017; Slawski et al., 2020; Zhang & Li, 2020), which are a good starting point. We are happy to add more references on the applications if the reviewer suggests.

Thanks again for your review of our work. We will carefully revise our paper according to these suggestions. Please feel free to let us know if you have more questions. Thank you.

评论

First, we would like to thank all reviewers for their time and effort! Many good things were said about our paper, including:

  • The paper showcases notable strengths, particularly in terms of its technical contributions (Reviewer jCjs).

  • It is the first work that leverages the framework of message passing algorithms to derive phase transition thresholds for shuffled regression (Reviewer jCjs).

  • the section dedicated to related work is thoughtfully presented and offers a clear and comprehensive overview of the existing literature (Reviewer jCjs).

  • Original: Introduces a novel approach by tackling non-oracle scenarios in phase transition thresholds of permuted linear regressions, uniquely combining techniques from quadratic assignment problems (Reviewer QZh8).

  • High Quality: theorems well-supported by numerical experiments, ensuring both theoretical and practical robustness. (Reviewer QZh8)

  • Significance: Offers deeper insights into phase transition thresholds in permuted linear regressions, bridging gaps between oracle and non-oracle scenarios, with potential for broader applications (Reviewer QZh8)

  • There are many interesting (in fact shining!) ideas in the derivation of the phase transition SNR. And the numerical results seem to support the derived results. (Reviewer gR3L)

  • Important Question: This paper focuses on the important problem of the statistical characterization of the phase transition behavior of the fundamental permuted linear regression problem. (Reviewer gR3L)

  • connection to the planted matching problem appears original and promising (Reviewer zFq3)

  • this paper well written and the main stream easy to follow even though I am not quite familar with this area (Reviewer HokC)

  • gives the first theoretic result to identify the precise locations of phase transition thresholds associated with permuted linear regression (Reviewer HokC)

  • showing the precise positions of the phase transition points in the large-system limit. The technical contribution seems to be significant (Reviewer HokC)

  • theory accurately captures the phase transition phenomenology (Reviewer HokC).

The following context will address your comments/questions in detail. Please let us know if you have any more questions. We will be happy to address them. Thank you.

AC 元评审

This paper presents a study of phase transitions in the shuffled linear regression. The focus is a theoretical determination of a transition in the ability to reconstruct the signal. Much of the paper is concerned with the oracle setting where the matrix of regressors is actually given. This setting has, in my opinion, a rather limited interest. Moreover, there are doubts about the rigour of the analysis, even in this oracle setting, due to correlations that are neglected. This is addressed by reviewer zFq3 and the authors answer does not convince me, the correlations that are present could still destroy the phase transition make it just a cross-over or shift its position. The authors justify their usage of Theorem 1 partly based on simulations. But only one value for n is taken in Table 1, whereas a careful numerical investigation of phase transitions should investigate several system sizes and study the sharpness of the change as the system size grows. The overall contribution does not seem significant enough.

为何不给更高分

This paper could be accepted with its flaws if one considers that approximate results are ok. In that case they should not be presented as theorems ..... but the usage of the concept of Theorem seems very broad in machine learning papers, so this is perhaps acceptable.

为何不给更低分

N/A

最终决定

Reject