Accurate Identification of Communication Between Multiple Interacting Neural Populations
摘要
评审与讨论
The paper introduces Multi-Region Latent Factor Analysis via Dynamical Systems (MR-LFADS), a novel model for accurately identifying inter-region neural communication in multi-region recordings. MR-LFADS extends the existing LFADS framework by jointly inferring region-specific inputs from unrecorded brain areas and explicitly constraining inter-region communication signals based on observed neural activity, effectively mitigating biases from incomplete recordings and under-constrained latent representations. The key conceptual contribution is the explicit inference of region-specific inputs and rate-constrained inter-region communication messages, reducing ambiguity and misidentification inherent in previous methods. By better constraining inferred communication signals through observable neural activity, MR-LFADS provides a robust tool for analyzing multi-region neural recordings and elucidating principles of brain-wide information processing.
给作者的问题
N/A
论据与证据
Yes, the claims made in the paper are generally supported by clear and convincing evidence.
方法与评估标准
Yes, the methods and evaluation criteria chosen by the authors are sensible and well-aligned with their stated goals. The synthetic datasets are thoughtfully designed to capture specific challenges faced by communication models, such as the presence of unobserved inputs, nonlinear dynamics, and temporal structure in neural interactions.
理论论述
Yes. This paper does not contain explicit theoretical proofs or formal theoretical claims requiring validation.
实验设计与分析
While the authors present extensive validation using synthetic datasets, which clearly demonstrate the model's capabilities in controlled scenarios, but the evaluation is solely based on synthetic data. Although synthetic datasets have the advantage of known ground truths, allowing precise assessment of the model’s accuracy, they inherently lack the full complexity and variability present in actual neural recordings. Evaluating MR-LFADS on neural recordings may scientifically get meaningful insights into brain-wide neural communication that are currently unexplored or poorly understood.
补充材料
Yes. It includes detailed descriptions of MR-LFADS's architecture, inference, training methods, hyperparameter selection, and synthetic experimental setups.
与现有文献的关系
The paper introduces MR-LFADS, extending LFADS to infer communication across multiple neural regions by explicitly modeling unobserved inputs and constraining inter-region messages through observed neural activity. It situates itself relative to existing methods and demonstrates improved performance on synthetic benchmarks that emulate cognitive neuroscience tasks
遗漏的重要参考文献
No
其他优缺点
Strengths:
- Creatively adapts the established LFADS framework to explicitly model communication among multiple interacting neural regions.
Weakness:
- The lack of validation on actual neural recordings somewhat limits the impact. Demonstrating applicability to neural data could substantially strengthen claims regarding scientific insight.
其他意见或建议
N/A
We thank the reviewer for their thoughtful and constructive feedback. We’re encouraged by the reviewer’s comments highlighting MR-LFADS as a novel extension of LFADS, and by the positive assessments on our synthetic dataset. Below, we respond to the reviewer's concerns on the lack of comparison to experimental data. Rebuttal figures referenced throughout our response can be accessed at the following link: https://drive.google.com/drive/folders/1R94up1vl04bkkE12tpT80YeHo39vEbqw?usp=share_link
We agree that evaluation on experimental data is essential. We will update our paper to include demonstrations of MR-LFADS on multi-region electrophysiology from 5 simultaneously recorded Neuropixel probes in mice (Chen et al., Cell, 2024) performing a decision making task (Fig. 2a in our rebuttal figures PDF). We show that MR-LFADS predicts experimentally perturbed firing rates across multiple brain regions following photoinhibition of the anterior lateral motor (ALM) cortex (Fig. c-e). Critically, MR-LFADS was not trained on photoinhibition trials. Thus, MR-LFADS' ability to predict photoinhibition effects suggests identification of an accurate model of inter-region communication. Additionally, we show that MR-LFADS(R) yields more consistent solutions across random initializations (seeds) on these experimental data, as compared to the under-constrained MR-LFADS(G) (Fig. 2f,h).
-
Photoinhibition study: We applied MR-LFADS-R to a -region network spanning ALM, thalamus(A), thalamus(O), midbrain reticular nucleus (MRN), and superior colliculus (SC). Thalamus(A) includes the ventral medial (VM) and ventral anterior lateral (VAL) nuclei, which are strongly reciprocally connected with ALM (Guo et al., Nature, 2017), while Thalamus(O) includes other thalamic sub-areas, such as anterior ventral (AV) and lateral dorsal (LD) nuclei. In a subset of trials, ALM is briefly photoinhibited (Fig. 2a,c). We only fit MR-LFADS to unperturbed "control" trials and reserve the photoinhibition trials for post-training validation.
We first confirm that MR-LFADS fits held-out control trials well (Fig. 2b). We then simulate photoinhibition in MR-LFADS by setting the MR-LFADS ALM-region rates to be the trial-average time-varying firing rates from the photoinhibition trials from the experiment (Fig. 2d), denoted .
These perturbed firing rates then propagate through MR-LFADS via inferred messages, ultimately predicting photoinhibited firing rates . We find that the predicted ordering of ALM photoinhibition influence aligns with the experimental data (Fig. 2e).
-
Consistency across seeds: We fit MR-LFADS-R and MR-LFADS-G to a -region network (ALM, Thalamus, MRN). Both models produced consistent effectomes across different random seeds (Fig. 2f), but MR-LFADS-R exhibited significantly greater consistency in message content (Fig. 2g,h).
Thank you for providing the additional results on the experimental neural data, which address my concern. I’ll update my score to 3.
We thank the reviewer for their follow-up and are glad that the additional experimental results helped to address their concern about real-data applicability.
This paper addresses the challenge of identifying connectivity patterns between brain regions from neural recordings. It introduces Multi-Region LFADS (MR-LFADS), an extension of the LFADS framework (Latent Factor Analysis via Dynamical Systems) for modeling multiple interacting neural populations. The key innovation of MR-LFADS is the introduction of constrained communication channels, ensuring that each region is driven not only by its own inferred inputs but also by message signals from other recorded regions. The model is rigorously evaluated using synthetic data across three experiments, assessing the impact of unobserved inputs, the effectiveness of communication learning, and the ability to infer random connectivity patterns. MR-LFADS demonstrates superior performance compared to a static linear model and two recent dynamic state-space models.
给作者的问题
I am curious in Experiment 1, if the ground truth follows the pattern in Figure (a.ii), where there is no actual communication between nodes, can the model correctly identify this scenario, or it will still predict some spurious connections?
论据与证据
In general, the results are well supported by the experiments. The key claims made in the paper are as follows:
- Inferring inputs, rather than assuming all stimuli are known to all regions, is critical for accurate communication mapping. This is primarily demonstrated by Experiment 1, which shows that when stimuli are provided to MR-LFADS (S), the model fails to recover the true communication between nodes.
- Constraining messages to be derived from observable rates (MR-LFADS-R) improves identification accuracy compared to unconstrained latent communication schemes. Experiment 2 supports this claim by demonstrating that the rate-based (R) version outperforms other versions. It correctly encodes the stimulus while minimizing correlation with the decision variable, ensuring more reliable communication inference.
- MR-LFADS enhances the identification of true communication signals compared to prior models. This claim is supported by Experiment 3, which shows that MR-LFADS outperforms benchmark models across more than 30 randomly generated connectivity patterns. However, it is important to note that its applicability to real neuroscience studies may be limited, as the number of nodes tested (<5) is relatively small for a neuroscience application.
方法与评估标准
The benchmark and metric selections are reasonable. The innovation of inferring unknown inputs and constraining message pathways has proven effective in improving accuracy under various conditions.
However, incorporating Granger causality as a benchmark could help broaden the paper’s appeal to a wider neuroscience audience. Additionally, it would be interesting to analyze whether the model tends to miss edges or detect spurious ones.
理论论述
There are no formal mathematical theorems in this paper; all claims are supported empirically.
实验设计与分析
The experimental design is carefully controlled, with each experiment focusing on a specific aspect of the problem. The study considers different time lags, varying connectivity weights, and different numbers of regions. However, my main concern is the real-world applicability of the method, as all testing is performed on synthetic datasets.
补充材料
I quickly went through all the supplementary
与现有文献的关系
The paper builds on prior models, particularly LFADS, and demonstrates superior performance compared to existing methods, making it a valuable contribution to the literature.
遗漏的重要参考文献
The references cover the necessary background, but it would be beneficial to discuss classic connectivity methods such as Granger causality analysis and Dynamic Causal Modeling (DCM) to enhance the paper’s relevance to a broader neuroscience community.
其他优缺点
Although all claims are well supported, as noted by the authors in the conclusion, these problems are inherently ill-posed, and the experiments rely on strong prior assumptions. As a result, some claims may not fully generalize to real-world applications.
其他意见或建议
NA
We thank the reviewer for the thoughtful and insightful comments. We are encouraged that our model was recognized as a valuable contribution and that the key claims were seen as well supported. Below, we respond to each point raised. Rebuttal figures referenced throughout our response can be accessed at the following link: https://drive.google.com/drive/folders/1R94up1vl04bkkE12tpT80YeHo39vEbqw?usp=share_link
-
On number of regions: We agree that evaluation on larger networks is important for assessing relevance to neuroscience applications. We will invest in improving MR-LFADS infrastructure to support larger networks, and we will provide these ongoing improvements to the community by releasing and continually updating an MR-LFADS code package. To provide immediate neuroscience relevance, see point 2 below.
-
On real-world applicability: We agree that evaluation on experimental data is essential. We will update our paper to include demonstrations of MR-LFADS on multi-region electrophysiology from 5 simultaneously recorded Neuropixel probes in mice (Chen et al., Cell, 2024) performing a decision making task (Fig. 2a in our rebuttal figures PDF). We show that MR-LFADS predicts experimentally perturbed firing rates across multiple brain regions following photoinhibition of the anterior lateral motor (ALM) cortex (Fig. c-e). Critically, MR-LFADS was not trained on photoinhibition trials. Thus, MR-LFADS' ability to predict photoinhibition effects suggests identification of an accurate model of inter-region communication. Additionally, we show that MR-LFADS(R) yields more consistent solutions across random initializations (seeds) on these experimental data, as compared to the under-constrained MR-LFADS(G) (Fig. 2f,h).
-
Photoinhibition study: We applied MR-LFADS-R to a -region network spanning ALM, thalamus(A), thalamus(O), midbrain reticular nucleus (MRN), and superior colliculus (SC). Thalamus(A) includes the ventral medial (VM) and ventral anterior lateral (VAL) nuclei, which are strongly reciprocally connected with ALM (Guo et al., Nature, 2017), while Thalamus(O) includes other thalamic sub-areas, such as anterior ventral (AV) and lateral dorsal (LD) nuclei. In a subset of trials, ALM is briefly photoinhibited (Fig. 2a,c). We only fit MR-LFADS to unperturbed "control" trials and reserve the photoinhibition trials for post-training validation.
We first confirm that MR-LFADS fits held-out control trials well (Fig. 2b). We then simulate photoinhibition in MR-LFADS by setting the MR-LFADS ALM-region rates to be the trial-average time-varying firing rates from the photoinhibition trials from the experiment (Fig. 2d), denoted .
These perturbed firing rates then propagate through MR-LFADS via inferred messages, ultimately predicting photoinhibited firing rates . We find that the predicted ordering of ALM photoinhibition influence aligns with the experimental data (Fig. 2e).
-
Consistency across seeds: We fit MR-LFADS-R and MR-LFADS-G to a -region network (ALM, Thalamus, MRN). Both models produced consistent effectomes across different random seeds (Fig. 2f), but MR-LFADS-R exhibited significantly greater consistency in message content (Fig. 2g,h).
-
On inclusion of Granger causality (GC) and Dynamic Causal Modeling (DCM): We agree that both GC and DCM offer valuable approaches for modeling inter-region interactions. While we were very interested in including these methods, a careful and fair implementation -- particularly of the many nonlinear variants of GC and the more involved generative modeling in DCM -- was beyond the time frame of this rebuttal. Rather than risk an incomplete comparison, we will update our Related Work section to highlight these approaches, and we will pursue them in our ongoing work in this space.
-
On spurious connections: We agree that it would be helpful to further analyze MR-LFADS’s behavior -- specifically, whether the model tends to miss true connections or infer spurious ones, as the reviewer suggests. We have added the following to our paper: "It would also be interesting, from an interpretability perspective, to assess whether MR-LFADS tends to under- or over-estimate connectivity -- that is, whether it is more likely to miss true connections or to infer spurious ones -- and if these mismatches are driven by particular statistical properties of the signals or the dynamics of the downstream areas.”
Regarding the reviewer’s question about Exp.1: yes, the model does infer weak connections even when no ground truth connectivity is present. However, these inferred message norms are two orders of magnitude smaller than those associated with true connections. To illustrate this, we have added heatmap visualizations showing the full distribution of inferred message norms (Fig. 3).
I would like to thank the authors for conducting the additional experiments. This work would be a valuable contribution to the literature.
A minor point regarding the rebuttal: my original question was about what pattern the model would infer if the ground truth followed the structure shown in Figure (a.ii), rather than the connections depicted for Figure (a.i). I'm curious whether the model tends to explain the data primarily through node connectivity, and Figure (a.ii) would serve as an extreme test case for that. That said, this is a minor question and does not affect my overall opinion of the work.
We thank the reviewer for their assessment and are glad to hear that they view this work as a valuable contribution to the literature. We also appreciate the clarification regarding the original question. Indeed, as described, MR-LFADS is encouraged to explain data via communication rather than inferred (unobserved) inputs whenever possible. There may be recording scenarios for which this inductive bias is not appropriate. We will expand our Conclusion section to address such potential limitations along with techniques to detect and manage such scenarios.
In the paper "Accurate Identification of Communication Across Multiple Interacting Neural Populations" the authors propose an extension of LFADS to multiple regions to model region-specific inputs and cross-region interactions. The authors test their model on synthetic data with ground truth network connectivity, external inputs and cross-region interactions. They show that their model outperforms existing approaches in identifying region-specific inputs and inter-region communication.
给作者的问题
- Line 81, right column: You claim that existing methods risk incorrect inferences about connectivity and communication due to incomplete recordings. Could you elaborate more on the functional aspects of such incorrect inferences due to incomplete recordings? For models providing purely functional descriptions of neural activity, there is usually no claim about the physiological connectivity of the regions, nor about causality. What does it even mean for an inference to be incorrect in the context of the functional model? I suppose you mean that if you were to add more complete recordings, the inferred connectivity would change. Could you clarify this point?
- Line 211, left column: You said that you set the KL penalty weight for messages higher than for inferred inputs to encourage the model to use messages. But shouldn't it be the other way around, i.e. the KL penalty weight for messages should be lower? This is also suggested by the more detailed description in the appendix.
论据与证据
The authors claim that their model infers region-specific inputs from unobserved areas, outperforming approaches that rely on manual input specification. They also claim that their model constrains inferred inputs and inter-region communication more accurately than existing approaches. The authors evaluate their model on synthetic data only and show that it outperforms previous communication models across various neuroscience related synthetic tasks. While they explore a wide range of synthetic tasks, there is no real data analysis to support their claims. Moreover, while they propose a model variant for Poisson output, they only consider Gaussian output in their experiments.
方法与评估标准
The proposed MR-LFADS model extends the LFADS model by replicating a version of the LFADS model for each region and adding constrained communication channels between the regions. They propose variants of the single-region LFADS models for Poisson outputs and for Gaussian outputs. Communication messages are multivariate Gaussians variables with mean and covariance depending on the time varying average activities of other regions. The inference scheme follows the typical VAE variational inference. The evaluation metrics include the similarity between inferred pairwise message norms and ground truth quantified using Jenson-Shannon divergence, and the correspondence between inferred messages and ground truth quantified using linear regression to predict ground truth messages from inferred messages. These are reasonable evaluation metrics for the proposed model.
理论论述
There are no theoretical claims in the paper.
实验设计与分析
The authors evaluate their model on three experiments with synthetic data with known ground truth network connectivity, external inputs and cross-region interactions. They compare their model against reduced-rank regression, multi-population sticky recurrent SLDS and multi-region switching dynamical systems. The first experiments involves a synthetic memory network trained to recall private stimuli and received messages to assess the impact of unobserved inputs. The second experiment involves a synthetic pass decision network to assess the effect of direct constraint of messages via observed data. The third experiment involves synthetic randomized networks trained on randomly selected tasks to assess flexibility in coordination. The experiments are designed to highlight common failure modes of identifying region-specific inputs and inter-region communication. Overall, the authors succeed in showing that their model outperforms existing approaches in the synthetic tasks they consider. However, for the third experiment, the results are not as clear cut as in the previous experiments. While they do not present a systematic ablation study, they consider different variants of the model in the experiments to assess the impact of e.g. the KL penalty weight for messages.
补充材料
The supplementary material provides additional details on training the proposed model. It also contain sections on the assessed competing models, including the reduced-rank regression and multi-region switching dynamical systems. Moreover, there are additional details on the evaluation metrics and on the synthetic data experiments.
与现有文献的关系
The LFADS model is a key ingredient in the proposed MR-LFADS model. The authors go beyond the LFADS model by considering multiple regions and communication between them. This generalizes the original framework and makes it useful for learning communication across multiple interacting neural populations. The authors show that this approach has advantages over existing state-of-the-art methods for communication analysis in neuroscience.
遗漏的重要参考文献
To the best of my knowledge, the authors have discussed all essential references.
其他优缺点
The paper is very well written and easy to follow. The authors provide a detailed description of the model and the inference scheme. The experiments are well designed and the results are clearly presented. In my view, the main weaknesses are two-fold: the lack of real data analysis and in the evaluation the focus on Gaussian output only. I don't think these are fatal flaws, but the provided analysis did not fully overcome concerns about the validity of the claims in more realistic settings.
其他意见或建议
Line 28, right column: "should should" -> "should" Fig 1: g_0 and u hat are multivariate but the sigmas in the Gaussians suggest univariate variables. The priors on the other hand use the identity matrix in the covariances. This looks inconsistent. Line 633, left column: "doen" -> "done"
We thank the reviewer for their thorough and constructive feedback. We appreciate the positive assessment of the manuscript’s contribution, clarity, and experimental design. Below, we respond to each of the points raised. Rebuttal figures referenced throughout our response can be accessed at the following link: https://drive.google.com/drive/folders/1R94up1vl04bkkE12tpT80YeHo39vEbqw?usp=share_link
-
On Poisson outputs: We thank the reviewer for this helpful suggestion. We agree that it is important to demonstrate that the model can operate on spike train data, not just continuous activity. To this end, we applied MR-LFADS to large-scale electrophysiology recordings from Chen et al. (Cell, 2024) (Fig. 2a in our rebuttal figures PDF), which consist of spiking data and were modeled using a Poisson output distribution (Fig. 2b). We elaborate on these results below.
-
On real-world applicability: We agree that evaluation on experimental data is essential. We will update our paper to include demonstrations of MR-LFADS on multi-region electrophysiology from 5 simultaneously recorded Neuropixel probes in mice (Chen et al., Cell, 2024) performing a decision making task (Fig. 2a). We show that MR-LFADS predicts experimentally perturbed firing rates across multiple brain regions following photoinhibition of the anterior lateral motor (ALM) cortex (Fig. c-e). Critically, MR-LFADS was not trained on photoinhibition trials. Thus, MR-LFADS' ability to predict photoinhibition effects suggests identification of an accurate model of inter-region communication. Additionally, we show that MR-LFADS(R) yields more consistent solutions across random initializations (seeds) on these experimental data, as compared to the under-constrained MR-LFADS(G) (Fig. 2f,h).
-
Photoinhibition study: We applied MR-LFADS-R to a -region network spanning ALM, thalamus(A), thalamus(O), midbrain reticular nucleus (MRN), and superior colliculus (SC). Thalamus(A) includes the ventral medial (VM) and ventral anterior lateral (VAL) nuclei, which are strongly reciprocally connected with ALM (Guo et al., Nature, 2017), while Thalamus(O) includes other thalamic sub-areas, such as anterior ventral (AV) and lateral dorsal (LD) nuclei. In a subset of trials, ALM is briefly photoinhibited (Fig. 2a,c). We only fit MR-LFADS to unperturbed "control" trials and reserve the photoinhibition trials for post-training validation.
We first confirm that MR-LFADS fits held-out control trials well (Fig. 2b). We then simulate photoinhibition in MR-LFADS by setting the MR-LFADS ALM-region rates to be the trial-average time-varying firing rates from the photoinhibition trials from the experiment (Fig. 2d), denoted .
These perturbed firing rates then propagate through MR-LFADS via inferred messages, ultimately predicting photoinhibited firing rates . We find that the predicted ordering of ALM photoinhibition influence aligns with the experimental data (Fig. 2e).
-
Consistency across seeds: We fit MR-LFADS-R and MR-LFADS-G to a -region network (ALM, Thalamus, MRN). Both models produced consistent effectomes across different random seeds (Fig. 2f), but MR-LFADS-R exhibited significantly greater consistency in message content (Fig. 2g,h).
-
On notational issues and typos: We greatly appreciate the reviewer’s attention to detail. We have corrected the identified issues and revised the notation for the priors over inferred inputs and initial conditions in the updated manuscript (Fig. 4). Typographical errors on lines 28 and 633 have also been fixed. Additionally, the reviewer was correct to point out the inconsistency in the KL divergence description on line 211 -- we have updated this to reflect the correct direction of the penalty: "Therefore, we carefully set the KL penalty weight for inferred inputs () to be higher than that for messages ()."
-
On clarifications for incorrect inferences due to incomplete recordings: We appreciate this thoughtful question from the reviewer. The interpretation offered — “I suppose you mean that if you were to add more complete recordings, the inferred connectivity would change” — captures our intended meaning precisely. In practice, we often cannot guarantee that all relevant areas are recorded. As a result, inputs from unobserved regions must be accounted for (as latent variable inferred inputs) to minimize the deleterious effects of unobserved nuisance variables. We will update our paper to clarify this point.
With an application to experimental data, demonstration on spiking data and further clarifications, the rebuttal strengthens an already good paper, making it outstanding in my opinion. I will adjust my score to 5.
We thank the reviewer for their kind words and are appreciative of the updated assessment.
The paper presents a modeling approach for inferring dynamics and communication between brain areas in multi-region neuroscience recordings. The approach extends the LFADS model to multiple regions via the introduction of messages between brain regions that are functions of the inferred firing rates. This has an effective low-rank communication structure, and differs from previous dynamics models that have modeled communication between brain areas in the latent space. The authors nicely point out potential limitations of that approach. The model is demonstrated in three synthetic settings, where the proposed approach performs well compared to others in terms of recovery of the underlying communication structure.
给作者的问题
Eg. (8) is not very clear. The text refers to vectors of message norms, however the JS divergence is defined in terms of distributions. If distributions are being used to compute the KL divergences, which ones are they? From my understanding, P, Q, and M are vectorized norms and it is not clear whether this is a reasonable equation.
论据与证据
Many of the claims are supported. Overall, this is important work. However, there are some issues that should be addressed before I can recommend acceptance of this paper. I have detailed these in other comments, but specifically I am referring to: issues with the generative model and questions or limitations of the evaluation techniques.
Separately, in the motivation the submission states that existing models have not incorporated unobserved inputs. However, I think this is incorrect and overinterprets the inferred inputs . Statistically, these are latent noise inputs and other models such as MR-SDS also have inferred latent noise inputs. The primary difference here is that in this case, the latent noise inputs are assumed to be much lower dimensional than the size of the RNN state (as in the LFADS setup), which induces something akin to a low-rank noise covariance.
方法与评估标准
Overall, the methods and evaluation criteria generally make sense. The simulations are important to evaluate the model's ability to recover ground truth quantities across multiple complex settings.
However, there is an inconsistency in the generative model that must be addressed. As written the proposed model does not appear to be consistent with a single generative model. Eq. (6) places a prior over the messages but Eq. (7) then defines a different distribution over the messages. This does not correspond to a proper generative model, as there should only be one generative distribution over the messages. In other responses, I discuss potential issues with the JS metric and the limited ability to draw conclusions from the inferred effectome plots.
理论论述
N/A
实验设计与分析
Yes, I checked the procedures for the simulated experiments and have noted potential issues in other sections.
补充材料
Yes, I read through the suppmat sections on the models, synthetic datasets, and evaluation metrics.
与现有文献的关系
Neuroscience technologies are enabling neural recordings across brain areas, and it is important to develop computational methods for the analysis of such data. This paper presents an approach that is related to other work in this area. The primary innovations are extending the LFADS framework with low-dimensional inferred inputs to the multi-region setting and constraining the communication between brain areas to occur through the inferred firing rates.
遗漏的重要参考文献
N/A
其他优缺点
It is a nice idea to constrain the communication to be a function of the rates.
其他意见或建议
In equation (5), appears to be a vector that represents the diagonal of the covariance matrix. I suggest modifying the notation here to ensure the MVN distribution has a matrix-valued covariance.
The ground truth and inferred effectome plots with arrows between areas are helpful for qualitative presentation. However, as these are thresholded and clipped, they are not a good approach to visualize for rigorous evaluation. I'd recommend using the imshow plots as in 2e for the inferred communication.
伦理审查问题
N/A
We thank the reviewer for their thoughtful and constructive feedback. We also appreciate the reviewer’s positive assessment of our effort to address limitations in common architectural choices for communication models. Below, we respond to each of the points raised. Rebuttal figures referenced throughout our response can be accessed at the following link: https://drive.google.com/drive/folders/1R94up1vl04bkkE12tpT80YeHo39vEbqw?usp=share_link
-
On the use of Jensen-Shannon divergence (JSD): Indeed, JSD is classically defined over probability distributions. We used it heuristically by treating the normalized message-norm vectors as categorical distributions and used JSD to assess how closely they match in shape. We agree that this is not the most principled application of JSD. We have now re-evaluated our approach using cosine similarity, , as an alternative metric for comparing inferred message-norm vectors to ground truth. We find that all model comparisons and trends remain consistent when using cosine similarity. We will update our paper, exchanging JSD for cosine similarity throughout. We have provided the corresponding comparisons between JSD and cosine similarity in Figure 1 of our rebuttal figures PDF.
-
On inferred latent inputs in MR-SDS: We appreciate the reviewer's insight into inferred latent noise in MR-SDS (Karniol-Tambour el al., ICLR, 2024). Eq 4 of the MR-SDS paper states: In the absence of provided external inputs, , the components that could account for unobserved inputs are the dynamics, , or the covariance of the noise term, . Since the dynamics function is agnostic to external inputs, as the reviewer points out, is the only parameter that can reflect the influence of unobserved inputs, potentially through a low-rank parameterization to reflect low-dimensional inputs. However, this term captures only the variance of the inferred inputs, ignoring the time-varying mean component. That is, the inputs in LFADS are parameterized as (Eq 23 in the LFADS paper), with both and being inferred. Therefore, it is not entirely clear to us whether the noise in MR-SDS serves an identical role to the inferred inputs in LFADS. We would greatly appreciate further clarification from the reviewer in case we are misunderstanding important details here.
-
On generative model notation (Eq. 5–7):
We appreciate the reviewer’s attention to this detail and we appologize for the confusion. We do have a proper generative model, and we will update our paper to clarify as much. In our submission, Equation (6) describes the prior over the messages (and other latent variables), and Equation (7) provides the approximate posterior over the messages (which is used for inference and not if generating data from the model). The latter was not clear in our initial submission, and we will clarify this in the revised paper. In addition, we have updated Equation (5) in our paper: , to indicate the diagonal covariance, as suggested. -
On effectome visualizations: We thank the reviewer for this helpful suggestion. We agree that the thresholded diagrams serve primarily a qualitative purpose. In response, we have added unthresholded heatmap visualizations (as in Fig. 2e of our paper) to provide a more rigorous presentation of the inferred effectome (Figure 3 of our rebuttal figures PDF).
This study addresses the challenge of identifying connectivity patterns between brain regions from neural recordings. It introduces Multi-Region LFADS, an extension of the Latent Factor Analysis via Dynamical Systems for modeling multiple interacting neural populations. The key innovation is the introduction of constrained communication channels, ensuring that each region can be activated not only by its own inferred inputs but also by message signals from other recorded regions. The model is evaluated using synthetic data assessing the impact of unobserved inputs, the effectiveness of communication learning, and the ability to infer random connectivity patterns. The author demonstrate that their new method performs better compared to a static linear model and two recent dynamic state-space models. The reviewers appreciated that the paper makes a case for being accepted in ICML but also raised suggestions both of citation of literature and of additional analyses and results to be added, which are expected to be implement in a revision.