PaperHub
6.5
/10
Poster4 位审稿人
最低5最高8标准差1.1
5
6
8
7
3.8
置信度
正确性3.0
贡献度2.3
表达3.8
NeurIPS 2024

Global Distortions from Local Rewards: Neural Coding Strategies in Path-Integrating Neural Systems

OpenReviewPDF
提交: 2024-05-14更新: 2024-11-06

摘要

关键词
neurosciencerecurrent neural networksgrid cellsgeometry

评审与讨论

审稿意见
5

This paper develops a model meant to explain recent experimental work showing that the orderly receptive fields of grid cells in the rodent brain can be distorted by spatial landmarks. Here, the model is a biologically-inspired ANN which is trained to path-integrate, with non-uniform weighting across the environment to induce distortion. The resulting distortions due to reward are then quantified, which offers some concrete hypotheses about the corresponding behavior expected from grid cells in the brain.

优点

  • Grid cell distortion is a strong empirical finding that calls into question some key assumptions in systems neuroscience, and developing explanations and models for it will be a major contribution to theoretical neuroscience.
  • The model the authors consider here is eminently simple and easy to explain, but captures the problem quite well.
  • The writing is polished, and the paper is clear and easy to read.

缺点

  • The theory in Section 3 is not very deep. As I understand it, it basically lists a few plausible distortions (diffused, attracted, banded) and considers what impact the first two would have on the network. However, only diffused units are studied in the experiments in Section 4, even though banded units are also found, and an additional type (ring units) identified in the experiments is not accounted for. Overall, I feel that the theory doesn't have much explanation to offer of the empirical phenomena.
  • The experiments are not particularly in-depth. Some distortions are identified, but only diffusion is carefully examined. How particular changes to the loss correspond to particular distortions is not studied. This is a passable study of how the manifold geometry is warped (which could still benefit from more thorough experiments), but a suitable contribution for this venue would have to involve a more comprehensive analysis of distortion.

问题

  • In Fig 3, why does the diffused firing field have different periodicity from the original?
  • Line 228 states that "we observed the emergence of diffused and band units, as predicted by our theoretical framework". However, I think Section 3 only examines the effect of diffused and attracted units on the manifold; as far as I can tell, it doesn't make any claims about when units with these tunings are expected to appear.
  • It makes sense to me that attracted units -- which would increase the resolution for highly salient regions of space (correct me if I'm wrong) -- are a relevant distortion. However, they don't seem to be found in Section 4! This is surprising to me.
  • The text of the introduction strongly suggests that reward will be incorporated into the loss function. However, the actual method simply allows for weighting the prediction error for different locations, which is more generic (this is an advantage, in my opinion) and should be better reflected in the introduction.
  • It may be helpful to quantify distortion with some other measures (e.g. distribution distance).

局限性

Yes

作者回复

Thank you for taking the time to read and review our paper. Your feedback was invaluable, as it highlighted that we had not properly emphasized several crucial aspects of our work. In particular, we had not sufficiently explained the generality of our theoretical framework and the fact that our experiments were made possible only when freezing the place cell read-out weights. We address your comments and questions in detail below.

Theoretical contributions

We thank you for this important remark. Motivated by your feedback, we provide additional elements to our theory, as discussed in the global answer to all reviewers.

We now explicitly model the emergence of the four deformations: diffused units, band units, ring units, and attracted units. Interestingly, this only required a simple modification of the theory. We look forward to hearing your thoughts about it.

Experimental contributions

You noted that we did not sufficiently study how particular changes in the loss function correspond to particular changes in the rate map distortions. Indeed, the earlier experiments were simple in the sense that differences in reward location, magnitude, or spread were not well explored. We now performed these experiments, please see our global response for details.

As you mentioned, the most relevant modification in our work is the differential weighing of the loss function to incorporate the reward location. Here, we wish to emphasize another of our unique contributions that was missing from your summary. When we first performed the saliency training, we observed that the topology of the neural manifold was not preserved (Fig. 7B). This was a surprising result, one we had not anticipated. Only after we froze the pre-trained output weights to the place cells, we were able to observe both the global distortions and the preservation of the topology (Fig. 7C).

So, when referring to saliency training, we do not mean a simple modification of the loss function, but also this non-trivial aspect of weight freezing. We believe setting up this framework to study the effects of local rewards is an important technical contribution of our work, which the future work can build on.

Signal periodicity

In Fig 3, showing results on synthetic neural responses, the diffused units have the same spatial periodicity as the original units – as the diffused units were created by applying a Gaussian convolution to the original units. That is, the locations of the signal peaks are the same.

Wording on diffused and band units

This is a very good point, thank you for bringing it up. As you correctly point out, our theory does not explain why some distortions appear and some others don’t. Instead, we provide a mathematical formulation of their distortions, and explain what is the effect of this distortion on the neural manifold. We apologize for the confusing wording, which we propose to correct as follows: "we observed the emergence of diffused, band and ring units, which fall into the category of distortion via isotropic and anisotropic deformations from our theoretical framework".

Attracted units

You mentioned that the lack of attracted units in the trained piRNN was surprising. We also expected to see some local distortions of the grid cell firing patterns, given the standard interpretation of neuroscience experiments like Boccara et al. However, to our surprise, we did not observe this kind of local “magnification” or deformations in any of our experiments. Instead, we observed more global changes in the rate maps – which inspired the title of the work, “Global Distortions from Local Rewards”.

We believe this is a strength of our work, not a weakness. Our results challenge the conventional interpretation that reward modulation results in local magnification of the grid cell lattice, which may be a simplistic explanation. Indeed, in hindsight, the original findings might have been inconsistent with the conventional wisdom that grid cells provide a global code for space. Thus, our results provide the first evidence that reward modulation may result in global deformations of spatial responses in MEC, and the local distortions may arise due to interactions with other experimental variables not controlled in Boccara et al., 2019, or a reasonable but incorrect interpretation of the data.

It may also be true that future changes to the saliency training may recapitulate the experimental findings. Our work is just the first step (which sets up the framework) to study this problem theoretically and in detail.

Rephrasing introduction.

We agree with the suggestion of rephrasing the introduction so that it no longer suggests that “reward will be incorporated into the loss function”. Indeed, as you mention, the actual method is simpler. We plan to rephrase as follows: “we modify the loss function to differentially weigh position estimation error based on a saliency map of space.” Thank you for the suggestion!

Quantifying distortion

This is an excellent suggestion! To address this, we performed statistical tests with structural similarity index and Wasserstein distances. Specifically, we computed these distances between images before and after distortion for each class of distortions, and now defined the distortion units as a result of this statistical analysis.

Overall, we believe your feedback was very helpful for increasing our impact. To address your concerns, we have performed additional analyses (leading to new results, see our general response!) and expanded on our theory. If you believe we have adequately addressed your concerns, would you consider increasing your score to support the acceptance of our work?

评论

Thank you for the thorough response to my comments. I sincerely appreciate that they've been taken seriously. I am convinced that you've addressed my concerns on the experimental side of things, and I'm increasing my score to a 5 (from 3) accordingly. Thank you also for drawing my attention to the necessity of freezing the readout weights, which is another (i) an important starting point for future work in this direction and (ii) a prediction of this model for neuroscience which should be highlighted.

While I believe the new theoretical results as described will significantly strengthen the paper, I'm unwilling to evaluate them sight unseen and so am not incorporating them into my revised score.

评论

Thank you very much for your feedback throughout the process, taking our rebuttal into consideration, and engaging with us constructively. Thanks to you, our results are much better presented and the paper is strengthened!

审稿意见
6

The overarching goal of this paper is to provide a theoretical framework to understand the experimentally observed effects of rewards on grid cells, which have historically been known to encode spatial information. To do so, they focus on path-integrating RNNs whose units behave like grid cells post training. They show that modeling the influence of rewards as a diffeomorphism of the environment (with constant firing rate budget per neuron), results in changing fire rates of grid cells such that the underlying manifold (toroid) remains unchanged in topology, however its geometry get distorted. That is, grid cells are able to preserve spatial information as before, while accounting for rewards through a global distortion. They further fine-tune piRNNs with a modified loss that captures rewards, and show that under the right training scheme the resultant grid cell firing patterns change in a way that only changes the geometry and not the topology of the neural manifold. In particular, they observe ring like structures, diffused firing grids and band like structures in the presence of rewards in the environment.

优点

  1. Firstly, the paper is well-written and very clear.
  2. The motivation is very strong: understand what grid cells encode is indeed an important scientific question with recent discoveries showing changes in grid cell firing patterns in the presence of environment cues.
  3. The theoretical framework describing the effect of rewards as a diffeomorphism and the consequent effect on the neural manifold is solid, and a strong conjecture re coding in grid cells.
  4. The addition to the loss function, and fine-tuning schemes that preserved the toroid topology are indeed unique contributions.

缺点

  1. My main concern here is that while the theoretical ideas underlying this work make sense, in practice we do not see firing patterns that have been observed empirically. I understand that the diffeomorphism theory can in principle explain attractive firing patterns that have been experimentally observed, however, the path-integrating models trained by the authors assuming a circular reward function do not show such patterns. As a result, while intuitively the theory makes sense, it seems to me that it is crucial to address the disconnect in the observed results compared to experimental findings. I am curious to hear the authors thoughts on this.

  2. I also wish that the experiments were more thorough, for example investigating a range of salience functions potentially uncovering distinct firing rate patterns. Perhaps a different salience map would lead to attraction firing patterns?

问题

  1. With respect to continual learning, would this scheme be capable of training a salience-trained piRNN on a new changed reward function? I am curious as to whether a salience-remapping can be observed in such a setting, while preserving the same spatial encoding. (To be clear, I am not suggesting this experiment be run during the rebuttal phase, but curious to hear the authors thoughts on this).

  2. While the authors claim global distortion in the presence of rewards, in my understanding references 16,17,18 which experimentally observe firing rate distortions report them as local changes as opposed to global distortions How do the authors reconcile their findings with these experimental works?

  3. Is there a link to be made between the reward-based loss function and the diffeomorphism theory presented earlier? I.e. does this loss function encourage such a mapping?

局限性

The main limitation of this paper seem to be its disconnect with existing experimental work that observed reward-based firing rate distortions in grid cells. Since this is a paper attempting to explain a scientific phenomenon, it is important to address this. I am relatively torn as I think the theory itself is a novel contribution and one that deserves to see the light of the day, I would appreciate if the authors could answer my questions in the weakness and questions section.

作者回复

Thank you very much for your thorough review. We appreciate your attention and address your comments and questions below.

Main concern: link between theory, and firing patterns observed in piRNN experiments versus real experiments

Thank you for raising the point that distortions of firing patterns in real rodent experiments differ from those observed in piRNN experiments. We think that this is an important point, and is actually one of our most interesting findings, which is answered below in response to your question #2.

On piRNN experiments

We agree that more thorough experiments would be beneficial to this work. This concern was shared across several reviewers. Consequently, we have run additional experiments. These experiments include a wider range of saliency functions as per your suggestion. Please see our global answer where we present and discuss their results.

Answer to question 1. on continual learning

You asked whether this scheme would be capable of training a salience-trained piRNN on a new changed reward function. This is a very interesting question! In the same way that we pretrained a network with uniform saliency (no reward) and then fine tuned the network with non-uniform saliency s(x), a good starting point would be to introduce additional phases of fine-tuning with different saliency maps, and analyze the resulting deformations after each phase. We expect that the neural responses will adapt by deforming in the ways we have described (developing a mixture of band units, diffused units, and ring units), while phases of training without rewards would see a convergence back to the typical hexagonal grid cells.

Answer to question 2. on global distortions

You noted that previous works reporting experimental observations of firing rate distortions interpret them as local changes as opposed to global distortions. This is what we also expected to see a priori, given the claims from neuroscience experiments in Boccara et al. However, to our initial surprise, we did not observe this kind of local “magnification” or deformations in any of our experiments. Instead, we observed more global changes in the rate maps – which inspired the title of the work, “Global Distortions from Local Rewards”. Thus we challenge the conventional interpretation that reward modulation results in local magnification of the grid cell lattice as too simplistic an explanation. Indeed, because grid cells provide a global code for space, we propose that reward modulation should result in global deformations of spatial responses in MEC, as observed in our piRNN experiments.

Answer to question 3. on link between loss function and diffeomorphism theory

You raise a thought-provoking question about how a grid cell deformation, modeled as a diffeomorphism of the 2D environment, depends on the chosen loss function. Dorrell & Latham et al., 2023 showed that hexagonal grid cell representations result from minimizing a particular loss function subject to biological and representational constraints. Their loss function encourages the separation of neural representations of different positions in space, and includes a term that weighs the importance of separating any pair of positions x and x’. This term is directly analogous to the saliency map we introduce in the loss of our piRNN. Because Dorrell & Latham et al. do not study grid cell deformations, we believe that the framework we introduce in our paper for studying grid cell deformations opens a new route to make progress on this question, complementing existing work in the literature. We note that this question is of utmost importance to neuroscience and AI: given some task and objective function, can we predict and explain the representations learned by neural systems?

Conclusion

We thank you for your thorough review and specifically for prompting us to discuss the disconnect between the results from the piRNN experiments and the real experiments, as well as its link with our theoretical framework. We hope that we have addressed your concerns and we look forward to discussing these topics further.

评论

Thank you for your response. I appreciate the new experiments. Thanks also for discussing the disconnect between the two findings. I hope that this will be clarified in the manuscript as well. As I said earlier, the theory itself is a novel contribution and I hope going forward this will help us understand experimental findings. I am raising my score to a 6 in response to the authors detailed rebuttal.

评论

Thank you very much for your feedback throughout the process and taking our rebuttal into consideration. We highly appreciate your detailed questions and constructive criticism -- we believe our paper is stronger thanks to you!

审稿意见
8

This paper provides a mathematical theory of how spatially local reward distortions can lead to global representational distortions in grid cells.

优点

The paper is extremely well-written, self-contained, nicely motivated from biology, and mathematically elegant. As someone who does not know the grid cell theory literature that well, I really appreciated learning from this paper.

缺点

The paper seems to be somewhat similar to previous work in the literature (e.g., Xu et al, Conformal Isometry of Lie Group Representation in Recurrent Network of Grid Cells), impacting its novelty. Similarly, a look at the literature shows that there are hundreds of theory papers on grid cells in the literature (which makes sense, they are a beautiful and simple phenomenon), so the paper is not particularly original in this regard.

问题

  • Could the authors please clarify equations (1) and (3)? I am confused by them. In particular, (1) seems to suggest that ϕ(x)=Qr(x)\phi(x) = Qr(x). But equation (3) suggests that they are not equal (otherwise LerrorL_{error} would always be zero). Perhaps relatedly, where does the position estimate x^\hat{x} enter into the loss?

局限性

Yes.

作者回复

We thank you for the thoughtful review, and address your comment and questions below. We look forward to hearing from you in case you have any additional comments!

Answer to weakness on Originality/Novelty: We agree that the originality and novelty of our approach could have been made clearer in the original paper, which did not emphasize these points enough. In the final version, we propose to add a subsection 2.3 in Section 2 on Background & Related Works to explain the following:

In this work, we use the supervised piRNN approach as a framework to study the deformations in grid cell firing patterns reported in the experimental neuroscience literature. We do so by modifying the piRNN’s supervised loss in an interpretable way to account for varying saliency of space. Particularly, our saliency training allows for systematic study of two highly entangled behavioral variables: i) where the rewards are located and ii) which locations the agents have the tendency to visit the most. To our knowledge, this approach is completely novel in the literature.

In fact, in our new Fig. N4, we show that the grid cells contributing to the coding of the origin (the most visited location), but not the saliency map (the reward location), are robust to global distortions during the saliency training. In other words, as a secondary objective is introduced, grid cells with projections to the place fields at the edges are more likely to become distorted. To our knowledge, we are the first to make this prediction, which can be tested experimentally, for instance, by introducing both reward and landmark locations. Furthermore, we introduce a mathematical, geometric framework to explain and quantify the firing patterns’ deformations observed in our experiments; this is also novel and original. Finally, we show “that global distortion (geometry) can be added while maintaining the toroidal topology for path integration” (citing reviewer yMbD), which is true once the read-out weights to place fields are frozen, but not otherwise!

One related work is Nayebi, et. al. “Explaining heterogeneity in medial entorhinal cortex with task-driven neural networks” (NeurIPS 2021). The authors were also motivated by results from experimental neuroscience on reward-modulated neural responses by grid cells. While not the central question of their work, Nayebi, et al. propose a modification of training piRNNs to account for reward modulation: they modify the way training paths are generated in the presence of reward. Specifically, the authors modified the training paths ge to be either (1) direct movement toward a reward location, called “pure exploitation”, (2) random walk, as is standard in the piRNN literature, called “pure exploration”, or (3) an intermediate policy consisting of a mix of movement toward reward and random walk exploration. Their results are consistent with the existence of nontrivial reward-modulated response changes in piRNNs. However, they did not seek to characterize the specific deformations to the neural responses, as we do in our work.

Answer to question on Equations (1) and (3): Thank you for pointing this out. On the one hand, there is ϕ(xt)\phi(x_t), which is the vector of place cell activity corresponding to position xtx_t. This is an “encoding” of 2D position into a place cell code.

On the other hand we have ϕ^t=Qrt\hat{\phi}_t = Qr_t, which is a linear readout of the recurrent grid cell activity rtr_t. We apologize for the typo in Equation (1), which thus should have been: ϕ^=Qr\hat\phi = Qr and x^=argmaxϕ^.\hat x = \text{argmax} \hat \phi.

Successful path integration here means that ϕ^tϕ(xt)\hat \phi_t \simeq \phi(x_t), captured in the LerrorL_{error} term. To further highlight this point, we can rewrite Equation (3) as

Lerror=t=1Tϕ(x+)ϕ^(x+)2=t=1Tϕ(x+)Qr(x+)2.L_{error} = \sum_{t=1}^T ||\phi(x + …) - \hat \phi (x + …)||^2 = \sum_{t=1}^T ||\phi(x + …) -Qr (x + …)||^2.

We thank you for catching this.

Conclusion

We once again thank you for your time and attention. We hope that we were able to resolve your concern. If not, we look forward to discussing these topics further! If you believe we were able to adequately address your concerns, we hope that you will consider supporting our work with a strong acceptance!

评论

I thank the authors for addressing my concerns, and for clearing up my confusion regarding Equation (3). I have accordingly increased my score to a Strong Accept.

评论

Thank you very much for your kind words, time, and consideration. We appreciate your encouraging words and please let us know if you end up having additional questions!

审稿意见
7

The paper investigates the phenomenon of grid cell distortion in rewarded environments, a topic of interest in neuroscience. The authors propose a theoretical framework to understand how the 2D firing fields of grid cells deform while preserving their topological structure in high-dimensional neural space. By providing a position dependent weighted loss term, spatial saliency loss, the experiments showed that global distortion (ring, diffused and band patterns) appeared in the firing patterns of grid cells. The authors also provided detailed analyses of piRNNs under various conditions, including frozen and trainable place cells.

优点

The paper is well written with a comprehensive review of related works and makes a good point on a interesting topic.

The authors propose a theory which offers a potential explanation of deformation. It shows that global distortion (geometry) can be added while maintain the toroidal topology for path integration. This finding could inspire future research in the field.

I really like the simplicity of this work. The introduction of the spatial saliency loss (Eq. 7) is elegantly simple, building upon previous literature in a clear and interpretable way. The author leverages the flexibility of piRNN and specify an explicit scale at different locations while preserving the topological properties.

缺点

The work is all based on simulated results and evaluate on mostly on the change of the firing patterns. It would be better for the authors to show some comparison with the "real" data from rodent brain, i.e. the diffused or band unit is also found in biology experiments. If the simulation results can match with the experiment in some metrics, it will greatly enhance the confidence of this work.

More experiment can be provided with the change of the settings, such as different numbers of grid cells or change of s(x), to show the generality of the theory and method.

问题

  1. In the experiment, s(x) is defined as a Gaussian and the mean is at the center of the map, which gives more weight on the transformation at the center area. I may expect to see firing patterns that magnify towards the center. Do you have any insights on why the distortion seems to be uniformly appeared on the whole map?

  2. For s(x), have the authors tried to change the scale or the position of the distribution to see how the patterns changed?

  3. I'm curious if the authors tried different number of grid cells or modules in the experiments. Does the distortion need a larger number of cells to form?

  4. In Figure 5, grid cells in the same module may emerge different distortion patterns. For example, in module 2, there are diffused and ring units. Are there any insights why this happens?

局限性

The limitation of this work is well illustrated in the appendix.

作者回复

We thank you very much for the thorough review and for your time. We address your comments and questions below.

On real neuroscience experiments:

You make the very important point that linking the theory with “real” data from rodent brains is important to increase the confidence of this work. Thank you for bringing it up. We think that this point deserves additional attention and we discuss it below.

The existing experimental neuroscience data on such reward-modulated deformations of grid cells firing patterns is limited in number and in the strength of the conclusions. We are aware of the results from [Boccara et al. 2019, Butler et al. 2019, Krupic et al. 2018, and Wang et al. 2021]. Boccara et al. 2019 in particular motivated our approach and we explained and discussed its results with our theoretical framework. To summarize: Boccara et al. observed an attraction of grid cells firing fields towards the reward – a phenomenon called “attracted units” in our paper. Furthermore, band units are well described in the literature – see Krupic et al. 2012. Are you aware of any other recent public datasets that may be relevant? If so, we would be happy to discuss them as well.

Due to the current limited experimental data, we chose to study in silico ANN-based experiments inspired by the existing, real neuroscience experiments. Therefore, the scope of our work is: we investigate how the representations learned by artificial path-integrating RNNs (piRNNs) change to account for “saliency” in the environment, in a way that is (1) interpretable, and (2) inspired by neuroscience experiments. We observe that after training with saliency conditions, our piRNN learns a more diverse set of neural representations (ring neurons, diffused neurons, band neurons) compared to observed in Xu et al. and Gao et al. earlier – while still maintaining toroidal topology. We then build a theoretical framework to explain how different deformations in the individual rate maps affect the geometry of the neural representation manifold, which is novel in the literature.

Unfortunately, being a theory lab, we cannot perform additional experiments on real rodents ourselves. Yet, we believe that our geometric study of piRNNs already provides interesting results for the study of RNNs, which is an important field of research in AI/ML. Additionally, we hope that our work will inspire experimental neuroscientists to perform corresponding experiments on rodents. In that perspective, it might even be preferable that theory and experiments come from two separate research teams, as its final validation on real data would even be stronger.

On additional piRNN experiments: Thank you for proposing this more comprehensive set of experiments. We have run them; please see the global response on experiments. We believe that these additional results strengthen our paper and we appreciate your suggestion.

Answer to question 1.): why such global distortions?

You asked why the distortions of grid cells firing patterns appear uniformly, while the reward (saliency) is local and localized at the center of the space. This is a great question! We also expected to see some local distortions of the grid cell firing patterns, given the claims from neuroscience experiments in Boccara et al. However, to our initial surprise, we did not observe this kind of local “magnification” or deformations in any of our experiments. Instead, we observed more global changes in the rate maps – which inspired the title of the work, “Global Distortions from Local Rewards”. Thus we challenge the conventional interpretation that reward modulation results in local magnification of the grid cell lattice as too simplistic an explanation. Indeed, because grid cells provide a global code for space, we propose that reward modulation should result in global deformations of spatial responses in MEC, as observed in our piRNN experiments.

Answer to question 2. on piRNN experiments that vary the saliency map

Thank you for proposing additional experiments with variations to the saliency map s(x)! Please see our results in the experiments section of the global response.

Answer to question 3. on the link between distortion and number of grid cells/modules

While we have not run experiments varying the number of grid cells, this is an interesting modification we will investigate in the near future. We would like to emphasize that the number of modules, as we have defined them, is an emergent property of the trained network and thus we do not have direct control over it. How the number of modules that emerge depend on various experimental parameters is a very interesting question that needs to be further explored, but is out of scope here.

Answer to question 4. on why different distortion patterns appear?

Thank you for the great thought-provoking question. We had not considered why different distortion patterns indeed jointly appear, i.e., why we observe both diffused and ring units after introduction of the reward. Your question also prompts the follow-up question: what role can diffused units, band units, and ring units play? To bring some insights to this question, in our new Fig. N4, we show that the grid cells contributing to the coding of the origin (the most visited location), and not the center of the saliency map (the reward location), are robust to global distortions during the saliency training. In other words, as a secondary objective is introduced, grid cells with projections to the place fields at the edges are more likely to become distorted.

Conclusion

We hope that we were able to address your comments and resolve your questions. These discussions have been beneficial for us, and we will gladly incorporate them into the final version of the paper. If you have any remaining questions, we'd be happy to discuss further!

评论

Thank you for your detailed rebuttal. The results are indeed inspiring. I hope the author can have theoretical breakthroughs along this direction. I believe this paper deserve to be accepted and would like to maintain my score.

评论

Thank you very much for your thorough engagement with our work -- your feedback and questions have helped us strengthen the paper. We appreciate your encouraging words!

作者回复

We thank the four reviewers for their time and attention. For this rebuttal, we ran additional experiments to draw a more complete picture of the phenomena of firing patterns distortions, and expanded on our theory to address reviewers’ concerns. Along the way, we also found some new exciting results!

Additional experiments

Following suggestions from all 4 reviewers, we performed additional experiments by varying:

  1. the location of the reward [xx_* in Eq. 8] xx_*in [(0.5,0.5), (0.8,0.8), (0.5,0.8)]
  2. the overall magnitude of the reward [s0s_0 in Eq. 8] in [1,10,100]
  3. the spread of the reward [σ\sigma_* in Eq. 8] in [0.05,0.1,0.2,0.5]

These experiments, coupled with a new statistical analysis (explained in Figs. N1-N3), confirmed the consistency of our finding that about one-third of the toroidal cells become distorted during the saliency training regardless of the hyperparameters. Moreover, we also realized that the distortions did not emerge in random populations of cells, rather in those that did not preferentially send read-out weights to the place field at the origin (Fig. N4). Notably, even when the saliency map was placed in a location other than the origin (reward location), the global distortion likelihood still decreased towards the origin (most visited location); providing an interesting contrast to the prior work that has observed local distortions towards reward locations.

To date, prior experimental work had not distinguished between the tendency of the animals to visit familiar locations from the reward valence. Thus, our synthetic training allowed us to disentangle these aspects and bring forth new biological predictions, i.e., that global (but not necessarily local) distortions to grid cells may be signs of continual learning towards secondary objectives. Future explicit tests of these ideas can be performed experimentally by introducing both reward and landmark locations to the path-integrating RNNs and/or freely behaving animals!

Overall, after saliency training across several hyperparameters, we observed that

  • the toroidal topology of the neural manifolds is preserved (data not shown),
  • diffused, band, and ring neurons consistently emerged as the leading distortions (Figs N1-N3 for the new quantification analysis), and
  • the distortions did not emerge in random populations of cells, rather those that were coding locations furthest away from the origin (Fig. N4, new result).

Finally, we want to note that we ran slightly more than 650 experimental configurations (saliency training), which took on average around 3 hours per experiment. Overall, we used 2400 GPU hours, not counting the additional analyses we performed on top of the saliency training. We will make all of these models and our analyses publicly available, which we believe is an important step forward for the field. We kindly ask that the reviewers consider this as a contribution!

Link between theory, results on piRNN experiments versus results on real experiments

In addition to our experimental efforts, we have also performed additional theoretical investigations and expanded on our geometrical framework (to address primarily the concerns by Reviewers STz6 and BFFn). Notably, we complemented our current theory with a framework of global distortions based on diffusion processes. This represents a simple modification, and mathematical refinement, of section 3.2 “Diffused Units: Smoothing of Firing Fields”. Yet, and despite its simplicity, it now characterizes the three phenomena: diffused units, band units, and ring units. Below, we provide the intuition behind the new framework that can model ring, band, and diffused deformations of the grid cell lattice:

  • Diffused units: In this work, we show that diffused units, observed in practice in our piRNN experiments, lead to smaller neural manifolds in neural state space and leave the energy budget unchanged. Diffused units can be explained by 2D isotropic smoothing of the original grid lattice.
  • Band units: These units emerge as anisotropic smoothing of the firing fields, which can be useful for example to incorporate direction information. We show how band units, if emerge in large fractions, can lead to a collapse of the toroidal manifold into a circle. In our experiments, we observe only a small fraction of these, and thus the toroidal manifold structure is preserved.
  • Ring units: These units emerge after angular smoothing of firing fields, which can be useful for example to encode angle-invariant distance information. Similar to before, the energy budget is unchanged. Notably, we observed ring units even when the saliency map was not at the origin, hinting that these units may be more relevant with coding frequently visited space information (here origin), which is also angle-independent.

Conclusion

To conclude, our theory does not explain which firing patterns should or should not emerge (which we now explicitly discuss in the paper). Instead, we introduce a rich mathematical framework that (1) quantifies all four firing field deformations, observed across piRNN and real experiments, with group actions, and (2) relate the firing field deformations with a neural manifold deformations. However, we recognize that one part of the theory (group action by diffeomorphisms) explains the results observed in real experimental data, while another part (group action by diffusion) explains the results observed in our piRNN experiments. Accordingly, results from real and piRNN experiments do not agree: the former finds locally attracted units, while the latter finds global distortions.

While we were shocked by this disconnect at first, we now believe that it represents one of our most interesting findings in this work. All in all, we believe that truly uncovering these mysteries will require a collaborative effort. Since all our datasets will be made public, we are also looking forward to the inputs from the full community!

最终决定

This paper uses recurrent neural networks trained on path integration (piRNNs) to model and understand the effect of local rewards on spatial representation in the brain, which is a well-established question in neuroscience and has been explored in empirical animal studies. As noted by all the reviewers, the theoretical model is both novel and intriguing, and attempts to provide a framework to explain previous empirical observations. The reviewers unanimously agree that the paper is appropriate for publication at NeurIPS.

One issue raised by the reviewers is that certain aspects of the model’s predictions appear to contradict some existing empirical findings in neuroscience. However, this could be seen as an interesting interaction between theory and experiment, which could be further explored in future work. I concur with the reviewers that the novelty of the theoretical contribution justifies the paper's acceptance at NeurIPS.