[PDF] If deep learning is the answer, then what is the question?

Abstract

Neuroscience research is undergoing a minor revolution. Recent advances in machine learning and artificial intelligence (AI) research have opened up new ways of thinking about neural computation. Many researchers are excited by the possibility that deep neural networks may offer theories of perception, cognition and action for biological brains. This perspective has the potential to radically reshape our approach to understanding neural systems, because the computations performed by deep networks are learned from experience, not endowed by the researcher. If so, how can neuroscientists use deep networks to model and understand biological brains? What is the outlook for neuroscientists who seek to characterise computations or neural codes, or who wish to understand perception, attention, memory, and executive functions? In this Perspective, our goal is to offer a roadmap for systems neuroscience research in the age of deep learning. We discuss the conceptual and methodological challenges of comparing behaviour, learning dynamics, and neural representation in artificial and biological systems. We highlight new research questions that have emerged for neuroscience as a direct consequence of recent advances in machine learning.

Full PDF

IIf deep learning is the answer, then what is the question?

Andrew Saxe, Stephanie Nelli and Christopher Summerfield Department of Experimental Psychology University of Oxford Oxford, UK Correspondence: [email protected] [email protected] [email protected] bstract

Perspective , our goal is to offer a roadmap for systems neuroscience research in the age of deep learning. We discuss the conceptual and methodological challenges of comparing behaviour, learning dynamics, and neural representation in artificial and biological systems. We highlight new research questions that have emerged for neuroscience as a direct consequence of recent advances in machine learning. . Introduction

Recent years have seen a dramatic resurgence in optimism about the progress of AI research, driven by advances in deep learning . Deep learning is the name given to a methodological toolkit for building multi-layer (or deep) neural networks that can solve challenging problems in supervised classification , generative modelling , or reinforcement learning . Neuroscience and AI research have a rich shared history , and deep networks are now increasingly being considered as promising theories of neural computation. The recent literature is studded with comparisons of behaviour and brain activity in biological and artificial systems , summarised in a growing number of review articles . In this Perspective , we assess the opportunities and challenges presented by this new wave of intellectual synergy between neuroscience and AI research.

2. Neoconnectionism?

The idea that neural networks can serve as theories of neural computation is not new. During the parallel distributed processing (PDP) movement of the 1980s, psychologists and computer scientists proposed neural networks as solutions to key problems in perception, memory and language . Contemporary deep networks resemble scaled-up connectionist models, and recent advances in machine learning are also heavily indebted to the ubiquity of digital data and the relatively low cost of computation in the 21 st century . It might thus be tempting to dismiss current excitement around deep learning models for neuroscience as a rehashing of earlier ideas, owing more to slow churn of scientific fashion than to genuine intellectual progress. However, many researchers believe that deep learning models have the potential to radically reshape neural theory, and to open new avenues for symbiotic research between neuroscience and AI research . This is because contemporary deep networks are different from their connectionist ancestors in a crucial way: their learning is grounded in quasi-naturalistic sensory signals, such as image pixels or auditory spectrograms , rather than their input and output units being hand-labelled by the researcher. Contemporary deep networks can thus learn ‘end -to- end’ in a sensory ecology that resembles our own: natural sounds and scenes for supervised learning and generative modelling, and 3d environments with realistic physics for deep reinforcement learning. This advent of end-to-end models of biological function has allowed researchers to attempt to model, for the first time, the de novo emergence of neural computations that are capable of solving real-world problems. One major line of research has examined the representations formed by supervised deep networks that are trained to label objects in natural scenes (Fig. 1). A striking observation is that biologically plausible neural representations can emerge in networks that combine gradient descent with a handful of simple computational principles . When deep networks are endowed with local receptivity, convolutions, pooling and normalisation, the early layers acquire simple filters for orientation and spatial frequency , just like neurons in area V1; whereas in deeper layers, the distributions and similarity structure of neural representations for objects and categories resemble those in the primate ventral stream . Representational equivalence may be stronger in more accurate networks , and network activations even can be harnessed for novel image synthesis, allowing causal assays of the predictive links between artificial and biological networks . One corollary of these findings is that the sophisticated behaviours and structured neural representations observed in humans nd other animals might emerge from a limited set of computational principles, as long as the input data are sufficiently rich, and the network is appropriately optimised .

3. Deep learning as a framework for neural computation

This claim has potentially profound implications for neuroscience. It has already prompted calls for systems neuroscientists to refrain from building theories that impose intuitive functional significance on neural circuits by fiat, and instead to study the computations that emerge spontaneously during the training of deep networks . This so-called deep learning framework bypasses the handcrafting in classical neuroscience, where for example constraints are placed on neural encoding by assuming a shape for tuning curves, or population dynamics are explicitly engineered via the wiring diagram of excitatory and inhibitory neurons. Instead, the role of the researchers is now principally to specify the overall architecture, the learning rule and the cost function; control is relinquished over the microstructure of computation, which instead emerges organically over the course of training . An extension of this argument draws an analogy between optimisation over computation in neural networks, and over biological forms by evolution: in both cases, interpretable functional adaptations emerge without meaningful constraints being imposed on the search process . In other words, it is claimed that neural systems are fundamentally uninterpretable, and structured theories of perception and cognition are “just so stories” that reflect more closely the researcher’s quest for meaning than the reality of neural computation . One appealing aspect of the deep learning framework is that it relieves researchers of the burden of exhaustively documenting and interpreting the coding properties of single neurons. As methodological advances have permitted simultaneous recordings from large numbers of neurons , a doctrine has emerged by which neural representation is dynamically multiplexed across populations . From this perspective, single neurons code for multiple experimental variables and their interactions , exhibiting the nonlinear, mixed selectivity that is also a hallmark of units in deep networks . This tendency seems to be most pronounced in higher cortical areas, such as the parietal and prefrontal cortex, that support working memory and action selection . In these regions, the coding properties of single neurons can be highly heterogenous and vary in mystifying ways over the course of a given trial . However, when neural activity is examined at the population level, for example by using dimensionality reduction, neural patterns emerge that meaningfully distinguish experimental variables . Another key observation is that these patterns of population activity can be recreated when the same analysis is applied to unit activations in recurrent neural networks trained to evaluate time-varying decision evidence , judge the length of a time interval , or maintain information over a delay period . Accordingly, a major theoretical perspective is emerging that proposes deep recurrent neural networks as computational theories for sensorimotor integration and working memory processes. In the domain of working memory, a particularly interesting new line of research has used recurrent networks to ask when codes for stored information should be static or dynamic, addressing a key question in systems neuroscience . This work has contributed to claims that it is futile to characterise the coding properties of individual cells or infer how they participate in computation . Instead, it is argued, the computational model is only explainable at the aggregate level of the population, which is ultimately driven by the structure of the network and the way it is optimised.

4. Limitations of the deep learning framework

The deep learning framework proposes powerful new tools for modelling the bewildering volumes of data that are now routinely recorded in systems neuroscience labs. However, we hope that enthusiasm for deep networks as computational models will be tempered with a sober consideration of how they can be usefully deployed to understand neural mechanism

Figure 1. Representational equivalence between neural networks and the primate brain. A. Left: schematic illustration of simple and complex cell receptive fields from mammalian V1. Right: example filters learned in the first hidden layer of a deep convolutional neural network. [REF 2] B. Example representational similarity matrices illustrating the similarity in population activity evoked by objects in early visual areas of the primate brain (recorded with electrophysiology) and in the intermediate layers in a deep convolutional neural network. [REF 14] C. Schematic neural firing rates in response to a series of natural images (black trace; examples above) and activity predicted as a linear transform of neural network activity (red trace). [REF 13] D. Representational similarity matrices as in B except comparing inferior temporal (IT) cortex with the final layers of a convolutional neural network. [REF 14]. E. Schematic showing relationship between variance explained in IT signals and classification accuracy for pseudo-randomly generated neural networks that are trained either to maximise classification performance (blue dots) or variance explained in neural signals (yellow dots) [REF 13]. F. Left: state space analysis of neural signals from macaque area LIP recorded during a dot motion categorisation task. Red and blue lines show different motion directions belonging to opposing categories, trajectories are plotted in two dimensions. Right: same analysis conducted on hidden units of a recurrent neural network [REF 47]. G. Left: state space analysis performed on neural signals recorded from macaque dorsomedial prefrontal cortex during performance of fast or slow interval reproduction task, plotted in three dimensions. Right: same analysis conducted on hidden units of a recurrent neural network [REF 50]. nd cognitive function. In other words, if deep learning is the answer – then what are the questions that neuroscientists should ultimately be asking? Unfortunately, as currently articulated, the deep learning framework offers only the scantiest roadmap for neuroscience research . If neural computation emerges uncontrollably through blind, unconstrained optimisation, then how can neuroscientists formulate new, empirically testable hypotheses about neural mechanism? Such hypotheses are argued to take the form of design choices about learning rules, regularisation principles, or architectural constraints in deep networks . There is some evidence that more judicious design choices for deep networks may permit a closer match to biology . For example, adding recurrent connections improves the fit to neural data , especially for more challenging natural images, and at later post-stimulus time points , whereas including a biologically plausible front end (a ‘retina net’) encourages the formation of realistic coding properties, including cell types typically found in the thalamus . In general, however, we lack overall guiding principles for making such design choices. In machine learning research, networks are rarely built with biological plausibility in mind, and so there is relatively little prior guidance in how they might be used to model neural systems. Moreover, understanding the mapping from design to performance in deep networks is challenging, which is presumably why AI has a relatively poor track record in conducting interpretable or overtly hypothesis-driven research, preferring to focus instead on whether the system works rather than why it works . At worst, the deep learning framework seems to broadside neuroscience with an existential threat. The novel research programme asks researchers to document how different architectures or algorithms can encourage deep networks to form semantically meaningful representations or exhibit complex behaviours, like humans and other animals. This endeavour sounds suspiciously similar to contemporary AI research itself. As such, the project does not seem to build upon the comprehensive understanding of neural computation that has been furnished by decades of research into biological brains. Rather, it seems to propose sweeping this knowledge away, merging the goals of theoretical neuroscience with those of contemporary AI research.

5. Are deep networks promising theories of object recognition?

The deep learning framework is built upon the proposal that neural networks learn representations and computations that resemble those in biological brains. However, it is possible that the equivalence between deep networks and animal brains has been overstated . Indeed, comparing the multivariate representations in brains and neural models is fraught with statistical challenges . Currently, one popular approach is to learn a linear mapping from neurons to network units, and to evaluate the predictive validity of the resulting regression model in a held-out dataset. Adopting this approach for image classification, the highest-performing deep networks can explain an impressive 60% of the variance in neuronal responses in primate area IT. However, neural networks that perform substantially worse at image classification explain just 5% less . Indeed, directly comparing predictive accuracy (on BOLD signals) for trained and untrained (initialised to random) networks, the difference is quite small – on the order of 5-10% accuracy difference for most visual regions . It is often forgotten that landmark studies on which claims of representational equivalence are based actually used untrained deep networks . Testing whether neural signals are an affine transform of model activations is a good start, but such a relationship could exist even if the neural patterns differ wildly in terms of sparsity or dimensionality. Stricter tests of shared coding are provided by methods that restrict the freedom of the mapping function, such as representational similarity analysis (RSA) . RSA discloses a superficial resemblance between brains and networks but it is hard to tell whether this agreement is driven principally by stimuli that are physically similar, such as faces, and the differing patterns evoked by animate vs. inanimate objects . An important goal for future research, thus is going to be to more rigorously assess the status of the claim that deep networks and biological brains learn equivalent neural codes. Another way to test the equivalence between biological and artificial systems is to study their behaviour, i.e. to examine how their response patterns evolve over the course of training, and how they can be experimentally manipulated. This is vital because the computations in a neural system can only be understood in the context of the behaviour they produce . Revealingly, humans and machines make sharply different sorts of errors in assays of object recognition. In one study, networks were prone to confuse object classes that humans and even monkeys could safely tell apart, such as dogs and guitars, and patterns of confusion among individual images were shared by humans and macaques, but not deep networks . Similarly, humans generalise far better than deep networks to images that have been perturbed by adding pixelated noise, or bandpass filtering , and are less prone to be fooled by deliberately misleading images . There is a widespread view that biological vision exhibits a robustness that is currently lacking in supervised deep neural networks. More generally, animal behaviour is richly structured, in theory permitting researchers to make systematic comparisons with machine performance . For example, animal decisions are subject to stereotyped biases, but also irreducibly noisy ; animals are flexible but forgetful, behaving as if memory and control systems were capacity limited , and the rate and effectiveness of information acquisition depends strongly on the structure and ordering of the study materials . Mature theories of biological brain function should be able account for these phenomena, and we hope that future deep network models will be held to this standard.

6. Theory and understanding of deep learning models

The computations performed by deep networks are enmeshed in millions of trainable parameters.

It is not surprising, thus, that they have been dubbed “black boxes” . Despite this complexity, in neural networks we can access every synaptic weight and unit activation over the course of learning, a feat that remains impossible in animal models. These considerations raise thorny questions concerning the utility of deep networks as neural models, and more generally, raise the question of what it means to “understand” a n eural process via a computational model . Thus far, neuroscientists have preferred to employ off-the-shelf deep convolutional networks as neural models . However, collaborations between theoretical neuroscientists, physicists and computer scientists have paved the way for a new approach that uses idealised neural network models to understand the mathematical principles by which they learn , and deploys the results to predict or explain phenomena in psychology or neuroscience . For this endeavour to be tractable, deep network models must be simplified, for example by employing linear activation functions (“deep linear” networks) , structured environments , or by studying limit cases, such the limits of infinite width or depth, the high-dimensional limit , or the shallow limit (Fig. 2). Paradoxically, these infinite-size networks are often more interpretable than those with fewer units, because their learning trajectory is more stable and not prone to be waylaid by bad local minima . Some network idealisations have offered exact solutions for the learning trajectories that every single synapse will follow , and Figure 2. Understanding deep networks using idealized models. A-B. By simplifying the neural nonlinearity, deep linear networks permit exact solutions to training error dynamics (A) and weight dynamics (B) for certain initializations [REF 65]. Full simulations from small random weights track these dynamics closely. C. Training speed in deep linear networks depends on initialization. Small random weights scale exponentially in depth, unsupervised pretraining scales linearly, and orthogonal initializations are depth-independent. D. Training error of nonlinear networks of different sizes trained on the XOR binary classification task. Networks with few hidden units exhibit complex trajectories, often ending at nonzero error. Large networks reliably find a solution and take similar trajectories. This trajectory can be described analytically in the infinite width limit under a particular initialization regime [REF 73-74]. E. Analytical training and testing dynamics for a ‘student’ network learning from a ‘teacher’ network, permitting analysis of generalization performance on novel examples and the overtraining phenomenon [REF 67-69]. F. Analytical predictions for the generalization error after extensive training in the high-dimensional regime where data is scarce relative to the number of weights. Generalization error peaks at the transition from over- to under-parameterization [REF 69-70]. nswered perplexing questions about network behaviour: for example, why learning often involves transitions between quasi-discrete stages, why deep networks are often slower to train, or why an initial epoch of layer by layer statistical learning (“unsupervised pretraining”) can accelerate future learning with gradient descent. This work questions the notion that deep neural networks are “black boxes” and promises interpretable neural network models of biological phenomena (Fig. 2).

Figure 3. Developmental trajectories in deep linear neural networks. A. An idealized hierarchical environment. Items (leaf nodes) possess many properties like “can fly” or “has roots.” Nearby items in the tree are more likely to share properties. B. Schematic 2D embedding of internal representations for each item over learning in a deep linear network trained to output each item’s properties. The network passes through a series of stages in which higher hierarchical distinctions are learned before lower distinctions. C. Only deep networks exhibit quasi-stage-like transitions in learning, which arise from saddle points in the error surface. D. For a class of hierarchies, learning speed is a decreasing function of hierarchy level, and the network will exhibit progressive differentiation. E. Deep but not shallow networks can make transient errors on specific items and properties during learning. F. Internal neural representational similarity reliably mirrors behavioural similarity in networks trained from small weights. Networks trained from large weights exhibit correct behaviour but idiosyncratic neural representations. [A-F: REF 65]. G. This dependence on initialization can be understood through a transition in learning dynamics between a “feature learning” regime and a “lazy learning” regime [REF 64]. ecently, this approach has been applied to the study of semantic cognition (Fig. 3). . During development, children transition through discrete stages in which they rapidly acquire new categories or concepts. Their learning is also highly structured: for example, semantic knowledge is progressively differentiated, as children pick up on broader hierarchical distinctions (animal vs. plant) before finer distinctions (rose vs. daisy) and display stereotyped errors, such as thinking that worms have bones . Deep networks trained on richly structured data are known to exhibit these phenomena , but only recently has it been shown that stage-like transitions arise due to saddle points in the error surface, progressive differentiation from the way the singular values of the input-output correlations drive learning over time, and semantic illusions from pressure to sacrifice accuracy on exceptions to meet the global supervised objective . Moreover, it can be shown that these phenomena are an inevitable consequence of depth itself, arising in deep linear networks but not shallow networks even though the two classes of model converge to identical terminal solutions. This highlights the importance for neuroscientists of studying learning dynamics , i.e. the trajectory that learning takes, rather than simply examining representations in networks that have converged. One potential concern is that insights acquired in this way might not scale, because models are idealisations that eschew the messy complexity of state-of-the art deep networks and make assumptions that are false for biology (e.g. linear transduction or layers of infinite width). However, we argue that neural theory is well served by analytic formulations of complex phenomena that give rise to specific, falsifiable predictions for neural circuits and systems. We hope that neuroscientists will incorporate reductions of deep network models into their canonical set of neural theories, rather than blindly seeking correspondences between brains and fully-fledged deep learning systems that offer little hope of being understood.

7. Learning rules for sensory systems

Research using idealized neural networks offers the hope that we can understand how learning occurs in biological brains. But specifically, what research questions should we ask? What empirical phenomena might deep networks predict that conventional models from classical neuroscience might not? Psychologists and neuroscientists have traditionally debated the extent to which perceptual representations are prespecified by evolution or learned via experience . For example, it remains controversial whether primate face representations are innate or acquired . The deep learning framework reframes this debate by asking how neural codes emerge from different learning principles. One strong candidate is supervised learning with gradient descent, in which representations are sculpted by feedback about the label, name or category associated with a sensory input . Thus far, supervised models have been the major focus of comparisons between deep networks and biology . However, a long tradition in neuroscience emphasizes unsupervised principles such as Hebbian learning, or argues that representations are formed by a pressure to accurately predict the spatially or temporally local environment under an efficiency constraint . Indeed, recent deep generative models show a remarkable ability to disentangle complex, high-dimensional signals into their latent factors under this self-supervised objective . Finally, a successful AI model that has yet to impact neuroscience proposes instead that representation formation is driven by the need to accurately predict the motivational value of experience . One way to evaluate these schemes is to compare their ability to furnish deep networks with rich representations and complex behaviours when exposed to naturalistic data. To date, most successful candidates use some form of gradient descent. However, standard supervised models, such as those popular for explaining primate object recognition, seem to require improbable quantities of labelled data – unlike human infants, who gain sophisticated object understanding even before language is acquired. Another challenge for networks trained with gradient descent is the problem of assigning credit among parameters. Deep networks cascade Figure 4. Testing principles of learning using perceptual learning paradigms. A. A simple deep network model of perceptual learning. Clockwise or counter clockwise oriented visual inputs flow through two layers of weights to an output layer that reports direction of rotation. Initially, the input layer weights have bell-curve tuning to orientation and the output layer weights are untuned. B. Schematic of tuning curve slope changes due to gradient descent learning in the model (left) and primate V1 (right). C. Schematic of total synaptic change in input weights and output weights over training. The higher layer changes more. D. Schematic of behavioural performance transfer to an untrained retinal location over learning. Early learning transfers well but late learning is retinotopically specific. E. A schematized conceptual space of learning rules, in which specific learning rules are points. Experimental observations may be consistent with regions of this space (coloured circles) containing several learning rules. Intersecting many constraints can begin to narrow the set of candidate learning algorithms. F. Schematic predictions of gradient descent — for which the most informative neurons change most — and correlation Hebbian learning, for which the most active neurons change most. G. Schematic predictions of contrastive Hebbian learning, for which higher layers change more than lower, and a predictive coding scheme for which lower layers change more than higher. imple operations across layers to permit complex input-output transformations, for example learning a hierarchy of detectors for object edges, parts and wholes in successive layers. Whilst this divide-and-conquer strategy maximizes representational power, it demands that a change at one synapse accounts for how this adjustment will propagate through the rest of the network. A grand challenge for neuroscience is to test whether learning in the brain can in fact assign credit across the neural hierarchy, and if so, to identify a biologically realistic implementation, i.e. one where updates are local, and forward and backward connectivity in the network is not required to be symmetric. While credit assignment was once thought to be biologically implausible, we now have a growing set of candidate implementations in need of empirical tests . Learning principles also make divergent predictions about how representations should emerge or change during prolonged training. This opens the door to studies of perceptual learning that can attempt to confirm or refute these predictions . For example, Fig. 4 shows the predictions of a neural network model trained to classify tilted gratings with gradient descent, under the assumption that input-layer units have initially bell-shaped tuning curves . Extant neural and behavioural phenomena emerge seamlessly from the model, such as stronger sharpening of the tuning functions of the most informative neurons, earlier and stronger representational changes in higher cortical stages (i.e. deeper layers) during training, greater proneness to transfer of coarse than fine discrimination tasks across space to new retinal locations, and transfer of fine discrimination tasks early but not late during training. These phenomena, which are characterized mathematically in deep linear networks, also occur in the nonlinear case . Critically, other learning principles make qualitatively different predictions (Fig. 4E-G). For example, under correlational Hebbian learning, the most active (rather than most informative) neurons change the most. Contrastive Hebbian learning causes the higher layer to change far more than the lower layer, whereas the converse is true for a predictive-coding scheme with top-down negative feedback. Whilst these constraints do not pin down the exact algorithm at work in perceptual learning, they offer collective evidence about the learning principles likely to be at work in biology. More generally, this approach opens the door to a new programme of experiments in which principles of learning are interrogated by measuring the dynamics of representational change across cortical stages using macroscopic imaging techniques such as fMRI or wide-field calcium imaging.

8. Deep learning principles for cognition

Deep neural networks excel at classifying complex inputs into distinct classes like objects or words. Equally important, however, is what comes next: we link objects and items into diverse knowledge structures that describe our world. We know, for instance, that a dog can bark and that a maple is a type of tree. Moreover, we form semantic categories from multimodal features, connecting the written and spoken name for an object with its shape, odour, and texture. This conceptual knowledge of the world transcends physical appearance, interlinking diverse and even unobservable object properties (for instance, that a dog has a spleen). The abstractions we acquire over the course of development form the building blocks for flexible generalization and higher-level cognition in maturity . valuating deep learning insights beyond the realm of perceptual tasks is a key open opportunity for neuroscientists. The behaviour of humans and other animals is governed by a rich array of cognitive functions, including modular memory processes, attentional and task-level control, and neural systems for navigation, planning, mental simulation, reasoning, and abstract inference. These cognitive functions are implemented in a regionally specialised brain, in which a patchwork of subcortical and allocortical structures interconnects with both granular and infragranular cortical zones, each housing unique cell types and circuits. If we are committed to deploying deep learning models as theories for biology, then we need to take seriously the question of how such elaborate structure in cognition and behaviour emerges from end-to-end optimization. How do humans learn abstract representations, divorced from physical object properties? How do we assemble knowledge into relational structures like trees, rings, and grids? How do we compose new behaviours from existing subcomponents? How do we rapidly acquire and generalise new memories? These are important questions for AI researchers as well, and indeed, some have expressed a hope that machine learning will soon offer more powerful models in which higher cognitive functions emerge naturally via a “blind search” process, allowing neu roscientists to sidestep the problem . Indeed, recent advances in AI research have followed the successful fusion of deep learning with other methods, such as reinforcement learning , context-addressable memory , or Monte-Carlo tree search , demonstrating a proof of concept for end-to-end learning in complex cognitive architectures. However, we argue that a more fruitful research agenda for neuroscientists builds off the work of past decades, in which researchers have experimentally dissected cognitive systems, in many cases providing a detailed, computationally grounded account of their function. For example, we understand a great deal about the navigation system in the rodent medial temporal lobe , the motor system in song birds , or the saccadic system in the macaque monkey . We argue for a research programme that embraces the deep learning framework but seeks to address concrete questions about theory and implementation that are recognisable to neuroscientists and cognitive scientists.

9. Abstraction and generalisation

Deep networks excel when data is abundant, and training is exhaustive. However, they struggle to extrapolate this knowledge to new environments populated by previously unseen features and objects. Humans, by contrast, seem to generalise effectively . For example, most people can navigate a foreign city where the language, coinage and customs are unfamiliar, because they understand concepts such as greeting, taxi and map . A popular view is that deep networks fail to transfer because they do not form neural codes that abstract over physically dissimilar domains. Building deep networks that can generalise in this way would be a major milestone for machine learning. This provides an incentive for neuroscientists to study how biological brains encode, compose and generalise abstract knowledge . Unfortunately, key methodological challenges arise for neuroscientists seeking to address this question. Firstly, it is unknown whether experimental animals such as rodents and macaques (or even our closer primate cousins ) have evolved neural mechanisms that permit the strong, flexible transfer of knowledge that characterise human intelligence. It is thus unclear whether invasive tools for recording and interference (such as electrophysiology or optogenetics) can be used to study generalisation and transfer in animals. To study human abstraction, we are obliged to use macroscopic imaging methods such as fMRI, MEG and EEG, that are less well uited to revealing how computation unfolds in neural circuits. Nevertheless, inventive new ways of using these tools are being developed, allowing researchers to probe replay , changes in excitatory-inhibitory balance , or hexagonal (grid) coding in human brain signals. Secondly, humans (and other animals) usually enter the laboratory with rich past experiences that sculpt the ways that they learn. This complicates direct comparisons between humans and neural networks, because it is difficult to imbue artificial systems with equivalent priors, or to eliminate human priors using wholly novel stimuli. Thirdly, humans and neural networks learn over very different timescales. For example, deep reinforcement learning systems exceed human performance on Atari video games, but require many times more training than a human player . In an end-to-end learning system, abstract representations need to be grounded in experience. One possibility is that lifelong exposure to huge volumes of sensory data might allow strong invariances to emerge naturally via either supervised or unsupervised learning. In the medial temporal lobe (MTL), which sits at the apex of the primate ventral stream, there is evidence that cells develop physically invariant coding properties. For example, in humans, ‘concept’ cells code for famous individuals or landmarks, irrespective of whether they are denoted by pictures or words . Echoes of this coding scheme can be seen in other animals, where MTL coding is tied more tightly to allocentric space. For example, hippocampal place cells code for locations in a way that is invariant to the viewpoint and heading direction , and in primates, ‘schema’ cells remap between environments in a way that allows for generalisation over common spatial configurations . These neural codes for high-level concepts can form when different features, objects or locations are repeatedly associated in space or time, for example via Hebbian learning . Indeed, fMRI studies of statistical learning have revealed that neural similarities (e.g. multivoxel pattern overlap) in the MTL recapitulate association strengths for pairs, lines, maps or hierarchies of stimuli . Moreover, in an bandit task, the entorhinal cortex is one brain region where a consistent mapping exists between neural patterns and the covariance among stimuli and rewards, irrespective of the physical images involved . Stitching together multiple patterns of association, and learning their structure, could allow animals to learn a comprehensive model of the world that can be used for navigation, inference and planning . In parallel with this growing emphasis on the virtues of model-based computation in neuroscience, machine learning researchers are building powerful deep generative models that are capable of disentangling the world into its latent factors, and recomposing these to construct realistic synthetic images in 3D . However, to date, wiring these generative models up with control systems to build intelligent agents has proved challenging, despite some promising efforts . Indeed, AI researchers have struggled to build model-based systems that can hold their own against model-free agents in benchmark problems such as Atari . It is paradoxical that this is occurring against a rich backdrop of neuroscience research that emphasises the virtues of model-based inference. In fact, neuroscientists have even begun to unravel how seemingly idiosyncratic coding properties in the MTL and other structures may be hallmarks of a normative scheme for computational efficient planning and inference . For example, grid cell codes in the medial entorhinal cortex and elsewhere may be signatures of a neural code that has learned the geometry by which space itself is structured, potentially supporting transfer learning for navigation . There are even hints that this coding scheme may apply to nonspatial as well as spatial domains , potentially laying the foundations for a heory of higher-order human reasoning . Although machine learning researchers have noted that lattice-like codes may emerge when deep RL systems are trained to navigate , they have yet to build on these insights for building stronger AI. More generally, understanding how to simulate biologically plausible model-based computations in a way that is useful to machine learning researchers is a potentially rich intellectual seam that neuroscientists are only just beginning to exploit.

10. Neural resource allocation during task learning

Humans and other animals continue to learn across their lifespan.

This “continual” learning might allow a human to acquire a second language, a monkey to adopt a new social role, or a rodent to navigate in a novel environment. This is in stark contrast to most current AI systems, that lack the flexibility to acquire new behaviours once they have achieved convergence on an initial task. Building machines that can learn continually, like humans and other animals, is proving one of the thorniest challenges in contemporary machine learning research . Fortunately, however, this question has opened up new avenues for neuroscience research focussed on how biology may have solved continual learning . It has long been noted that in neural networks, learning pursuant to an initial task A is often overwritten during subsequent training on task B ( “ catastrophic interference ” ) . This occurs because a parameterisation that solves task A is not guaranteed to solve any other task, and so during training on task B, gradient descent drives network weights away from the local minimum for task A. It occurs even when the network has sufficient capacity to perform both tasks, because simultaneous (or “interleaved”) training allows the discovery of a setting that jointly solves tasks A and B. In humans, new learning can sometimes degrade extant performance, for example when memorising associate pairs A-C after having encoded pairs A-B, but in general interference effects are far less dramatic than for neural networks . One popular model suggests that mammals have evolved to solve continual learning by using complementary learning systems in the hippocampus and neocortex . Unlike the cortex, hippocampus can rapidly learn sparse (or “pattern - separated”) re presentations of specific experiences, often called “episodic” memories , and these memories are replayed offline during periods of rest or sleep . Hippocampal replay provides an opportunity for virtual interleaving of past and present experience, potentially allowing memories to be gradually consolidated into neocortical circuits in a way that circumvents the problem of catastrophic interference. This theory is supported by a wealth of evidence, including the finding that hippocampal damage leads to a gradient of retrograde amnesia , and reports of double dissociations between instance-based memory (or “recollection”) in the hippocampus and summaries of past experience (or “familiarity”) in neocortex . In more recent years artificial replay of past experiences has emerged as a critical factor that allows deep networks to exhibit strong performance in temporally correlated environments , including deep reinforcement learning agents for dynamic video games . Pleasingly, this has allowed theorists to draw a link between computational solutions to continual learning in biological and artificial intelligence . Adaptations of the CLS framework allow it to account for seemingly contradictory phenomena, such as the involvement of medial temporal lobe structures in rapid statistical learning . hilst evidence grows that offline replay may be important for memory consolidation, the problem of continual learning has provoked new questions for neuroscientists. Is biological learning actively partitioned so as to avoid catastrophic interference? Unlike neural networks, animals do not always benefit from interleaved study conditions (imagine learning the violin and the cello at once). For example, humans who have been trained in a blocked fashion to classify naturalistic stimuli (trees) according to two orthogonal boundaries (their “leafiness” vs. “branchiness”) perform better on a later interleaved test (compared to those who experienced the same conditions at training and test) . Other evidence from human category learning implies that human knowledge may be actively partitioned by time and context . Indeed, promising solutions to continual learning in the machine learning literature rely on the identification of weight subspaces where new learning is least likely to cause retrospective interference, for example by “freezing” synapses that are more likely to participate in extant tasks . These tools are more effective when coupled with a gating process that overtly earmarks neural subspaces for new learning, in a way that resembles top-down attention in the primate neocortex . Another intriguing possibility is that unsupervised processes facilitate continual learning in biological systems by clustering neural representations according to their context. Hebbian learning might encourage the formation of orthogonal neural codes for different temporally contexts , which in turn allows tasks to be learned in different neural subspaces . The curious phenomenon of “representational drift” (where neural codes meander unpredictably over time) might reflect the allocation of information to different neural circuits in distinct contexts, allowing for task knowledge to be partitioned in a way that minimises interference . A more general question is how biological systems have evolved both to minimise negative transfer (interference) and maximise positive transfer (generalisation) among tasks. One fascinating theoretical perspective argues that the capacity limits inherent in biological control processes are a response to this conundrum . Using simulations involving deep networks, the authors show that shared and separate task representations have mixed costs and benefits, with shared codes allowing for generalisation between tasks at this risk of interference between tasks. They suggest that the brain has found a solution by promoting shared neural codes, which in turns allows for strong transfer, but deploying control processes to gate out irrelevant tasks that might provoke interference. They suggest that this answers the question of why, despite a brain that comprises billions of neurons and trillions of connections, humans struggle with multi-tasking problems such typing a line of computer code whilst answering a question .

11. Conclusions

Deep learning models have much to offer neuroscience. Most exciting is the potential to go beyond handcrafting of function, and to understand how computation emerges from experience. Neuroscientists have recognised this opportunity, but its exploitation has only just begun. In this

Perspective , we have tried to offer a roadmap for researchers wishing to use deep networks as neural theories. Our principal exhortation for neuroscientists is to use deep networks as predictive models that make falsifiable predictions, and to use model idealisation methods to provide genuine understanding of how and why they might capture biological phenomena. We caution against using increasingly complex models and simulations which outpace our conceptual insight and discourage the blind search for correspondences in neural odes formed by biological and artificial systems. Instead, we hope that neuroscientists will build models that explain human behaviour, learning dynamics and neural coding in rich and fruitful ways, but without losing the interpretability inherent to classical neural models.

Acknowledgements

This work was supported by generous funding from the European Research Council (ERC Consolidator award to C.S. and Special Grant Agreement 3 of the Human Brain Project) and a Wellcome Trust Sir Henry Dale Fellowship to A.M.S. eferences

1 LeCun, Y., Bengio, Y. & Hinton, G. Deep learning.

Nature , 436-444, doi:10.1038/nature14539 (2015). 2 Krizhevsky, A., Sutskever, I. & Hinton, G. E. in

Proceedings of the 25th International Conference on Neural Information Processing Systems.

3 Eslami, S. M. A. et al.

Neural scene representation and rendering.

Science , 1204-1210, doi:10.1126/science.aar6170 (2018). 4 Mnih, V. et al.

Human-level control through deep reinforcement learning.

Nature , 529-533, doi:10.1038/nature14236 (2015). 5 Silver, D. et al.

Mastering the game of Go without human knowledge.

Nature , 354-359, doi:10.1038/nature24270 (2017). 6 Hassabis, D., Kumaran, D., Summerfield, C. & Botvinick, M. Neuroscience-Inspired Artificial Intelligence.

Neuron , 245-258, doi:10.1016/j.neuron.2017.06.011 (2017). 7 Golan, T., Raju, P. C. & Kriegeskorte, N. Controversial stimuli: pitting neural networks against each other as models of human recognition Arxiv preprint (2019). 8 Flesch, T., Balaguer, J., Dekker, R., Nili, H. & Summerfield, C. Comparing continual task learning in minds and machines.

Proc Natl Acad Sci U S A , E10313-E10322, doi:10.1073/pnas.1800755115 (2018). 9 Geirhos, R. et al. in (Montréal, Canada, 2018). 10 Zhou, Z. & Firestone, C. Humans can decipher adversarial images. Nat Commun , 1334, doi:10.1038/s41467-019-08931-6 (2019). 11 Rajalingham, R. et al. Large-Scale, High-Resolution Comparison of the Core Visual Object Recognition Behavior of Humans, Monkeys, and State-of-the-Art Deep Artificial Neural Networks.

J Neurosci , 7255-7269, doi:10.1523/JNEUROSCI.0388-18.2018 (2018). 12 Yamins, D. L. & DiCarlo, J. J. Using goal-driven deep learning models to understand sensory cortex. Nat Neurosci , 356-365, doi:10.1038/nn.4244 (2016). 13 Yamins, D. L. et al. Performance-optimized hierarchical models predict neural responses in higher visual cortex.

Proc Natl Acad Sci U S A , 8619-8624, doi:10.1073/pnas.1403112111 (2014). 14 Khaligh-Razavi, S. M. & Kriegeskorte, N. Deep supervised, but not unsupervised, models may explain IT cortical representation.

PLoS Comput Biol , e1003915, doi:10.1371/journal.pcbi.1003915 (2014). 15 Kell, A. J. E., Yamins, D. L. K., Shook, E. N., Norman-Haignere, S. V. & McDermott, J. H. A Task-Optimized Neural Network Replicates Human Auditory Behavior, Predicts Brain Responses, and Reveals a Cortical Processing Hierarchy. Neuron , 630-644 e616, doi:10.1016/j.neuron.2018.03.044 (2018). 16 Cichy, R. M., Khosla, A., Pantazis, D., Torralba, A. & Oliva, A. Comparison of deep neural networks to spatio-temporal cortical dynamics of human visual object recognition reveals hierarchical correspondence. Sci Rep , 27755, doi:10.1038/srep27755 (2016). 17 Kar, K., Kubilius, J., Schmidt, K., Issa, E. B. & DiCarlo, J. J. Evidence that recurrent circuits are critical to the ventral stream's execution of core object recognition behavior. Nat Neurosci , 974-983, doi:10.1038/s41593-019-0392-5 (2019). 8 Kietzmann, T. C. et al. Recurrence is required to capture the representational dynamics of the human visual system.

Proc Natl Acad Sci U S A , 21854-21863, doi:10.1073/pnas.1905544116 (2019). 19 Guclu, U. & van Gerven, M. A. Deep Neural Networks Reveal a Gradient in the Complexity of Neural Representations across the Ventral Stream.

J Neurosci , 10005-10014, doi:10.1523/JNEUROSCI.5023-14.2015 (2015). 20 Elsayed, G. F. et al. Adversarial Examples that Fool both Computer Vision and Time-Limited Humans. arXiv (2018). 21 Ullman, S., Assif, L., Fetaya, E. & Harari, D. Atoms of recognition in human and computer vision.

Proc Natl Acad Sci U S A , 2744-2749, doi:10.1073/pnas.1513198113 (2016). 22 Sinz, F. H., Pitkow, X., Reimer, J., Bethge, M. & Tolias, A. S. Engineering a Less Artificial Intelligence.

Neuron , 967-979, doi:10.1016/j.neuron.2019.08.034 (2019). 23 Marblestone, A. H., Wayne, G. & Kording, K. P. Toward an Integration of Deep Learning and Neuroscience.

Front Comput Neurosci , 94, doi:10.3389/fncom.2016.00094 (2016). 24 Kell, A. J. & McDermott, J. H. Deep neural network models of sensory systems: windows onto the role of task constraints. Curr Opin Neurobiol , 121-132, doi:10.1016/j.conb.2019.02.003 (2019). 25 Kriegeskorte, N. Deep Neural Networks: A New Framework for Modeling Biological Vision and Brain Information Processing. Annu Rev Vis Sci , 417-446, doi:10.1146/annurev-vision-082114-035447 (2015). 26 Bowers, J. S. Parallel Distributed Processing Theory in the Age of Deep Networks. Trends Cogn Sci , 950-961, doi:10.1016/j.tics.2017.09.013 (2017). 27 Cichy, R. M. & Kaiser, D. Deep Neural Networks as Scientific Models. Trends Cogn Sci , 305-317, doi:10.1016/j.tics.2019.01.009 (2019). 28 Lake, B. M., Ullman, T. D., Tenenbaum, J. B. & Gershman, S. J. Building machines that learn and think like people. Behav Brain Sci , e253, doi:10.1017/S0140525X16001837 (2017). 29 Lindsay, G. Convolutional Neural Networks as a Model of the Visual System: Past, Present, and Future. J Cogn Neurosci , 1-15, doi:10.1162/jocn_a_01544 (2020). 30 Zador, A. M. A critique of pure learning and what artificial neural networks can learn from animal brains.

Nat Commun , 3770, doi:10.1038/s41467-019-11786-6 (2019). 31 Rogers, T. T. & McClelland, J. L. Parallel Distributed Processing at 25: further explorations in the microstructure of cognition. Cogn Sci , 1024-1077, doi:10.1111/cogs.12148 (2014). 32 Hasson, U., Nastase, S. A. & Goldstein, A. Direct Fit to Nature: An Evolutionary Perspective on Biological and Artificial Neural Networks. Neuron , 416-434, doi:10.1016/j.neuron.2019.12.002 (2020). 33 Richards, B. A. et al.

A deep learning framework for neuroscience.

Nat Neurosci , 1761-1770, doi:10.1038/s41593-019-0520-2 (2019). 34 Lillicrap, T. & Kording, K. What does it mean to understand a neural network? BiorXiv preprint (2019). 35 Bashivan, P., Kar, K. & DiCarlo, J. J. Neural population control via deep image synthesis.

Science , doi:10.1126/science.aav9436 (2019). 6 Ponce, C. R. et al.

Evolving Images for Visual Neurons Using a Deep Generative Network Reveals Coding Principles and Neuronal Preferences.

Cell , 999-1009 e1010, doi:10.1016/j.cell.2019.04.005 (2019). 37 Saxe, A., Bhand, M., Mudur, R., Suresh, B. & Ng, A. Unsupervised learning models of primary cortical receptive fields and receptive field plasticity.

Advances in Neural Information Processing Systems (2011). 38 Stevenson, I. H. & Kording, K. P. How advances in neural recording affect data analysis. Nat Neurosci , 139-142, doi:10.1038/nn.2731 (2011). 39 Saxena, S. & Cunningham, J. P. Towards the neural population doctrine. Curr Opin Neurobiol , 103-111, doi:10.1016/j.conb.2019.02.002 (2019). 40 Fusi, S., Miller, E. K. & Rigotti, M. Why neurons mix: high dimensionality for higher cognition. Curr Opin Neurobiol , 66-74, doi:10.1016/j.conb.2016.01.010 (2016). 41 Rigotti, M. et al. The importance of mixed selectivity in complex cognitive tasks.

Nature , 585-590, doi:10.1038/nature12160 (2013). 42 Johnston, W. J., Palmer, S. E. & Freedman, D. J. Nonlinear mixed selectivity supports reliable neural computation.

PLoS Comput Biol , e1007544, doi:10.1371/journal.pcbi.1007544 (2020). 43 Raposo, D., Kaufman, M. T. & Churchland, A. K. A category-free neural population supports evolving demands during decision-making. Nat Neurosci , 1784-1792, doi:10.1038/nn.3865 (2014). 44 Yuste, R. From the neuron doctrine to neural networks. Nat Rev Neurosci , 487-497, doi:10.1038/nrn3962 (2015). 45 Park, I. M., Meister, M. L., Huk, A. C. & Pillow, J. W. Encoding and decoding in parietal cortex during sensorimotor decision-making. Nat Neurosci , 1395-1403, doi:10.1038/nn.3800 (2014). 46 Mante, V., Sussillo, D., Shenoy, K. V. & Newsome, W. T. Context-dependent computation by recurrent dynamics in prefrontal cortex. Nature , 78-84, doi:10.1038/nature12742 (2013). 47 Chaisangmongkon, W., Swaminathan, S. K., Freedman, D. J. & Wang, X. J. Computing by Robust Transience: How the Fronto-Parietal Network Performs Sequential, Category-Based Decisions.

Neuron , 1504-1517 e1504, doi:10.1016/j.neuron.2017.03.002 (2017). 48 Engel, T. A., Chaisangmongkon, W., Freedman, D. J. & Wang, X. J. Choice-correlated activity fluctuations underlie learning of neuronal category representation. Nat Commun , 6454, doi:10.1038/ncomms7454 (2015). 49 Remington, E. D., Narain, D., Hosseini, E. A. & Jazayeri, M. Flexible Sensorimotor Computations through Rapid Reconfiguration of Cortical Dynamics. Neuron , 1005-1019 (2018). 50 Remington, E. D., Egger, S. W., Narain, D., Wang, J. & Jazayeri, M. A Dynamical Systems Perspective on Flexible Motor Timing. Trends Cogn Sci , 938-952, doi:10.1016/j.tics.2018.07.010 (2018). 51 Masse, N. Y., Yang, G. R., Song, H. F., Wang, X. J. & Freedman, D. J. Circuit mechanisms for the maintenance and manipulation of information in working memory. Nat Neurosci , 1159-1167, doi:10.1038/s41593-019-0414-3 (2019). 52 Masse, N. Y., Rosen, M. C. & Freedman, D. J. Reevaluating the Role of Persistent Neural Activity in Short-Term Memory. Trends Cogn Sci , 242-258, doi:10.1016/j.tics.2019.12.014 (2020). 3 Orhan, A. E. & Ma, W. J. A diverse range of factors affect the nature of neural representations underlying short-term memory. Nat Neurosci , 275-283, doi:10.1038/s41593-018-0314-y (2019). 54 Lindsey, J., Ocko, S. A., Ganguli, S. & Deny, S. A unified theory of early visual representation from retina to cortex through anatomically constrained deep CNNs. BiorXiv preprint (2019). 55 Rahwan, I. et al.

Machine behaviour.

Nature , 477-486, doi:10.1038/s41586-019-1138-y (2019). 56 Thompson, J. A. F., Bengio, Y., Formisano, E. & Schonwiesner, M. How can deep learning advance computational modeling of sensory information processing? arXiv (2018). 57 Schrimpf, M. et al.

Brain-Score: Which Artificial Neural Network for Object Recognition is most Brain-Like?

BiorXiv preprint (2018). 58 Kriegeskorte, N., Mur, M. & Bandettini, P. Representational similarity analysis - connecting the branches of systems neuroscience.

Front Syst Neurosci , 4, doi:10.3389/neuro.06.004.2008 (2008). 59 Krakauer, J. W., Ghazanfar, A. A., Gomez-Marin, A., MacIver, M. A. & Poeppel, D. Neuroscience Needs Behavior: Correcting a Reductionist Bias. Neuron , 480-490, doi:10.1016/j.neuron.2016.12.041 (2017). 60 Gomez-Marin, A. & Ghazanfar, A. A. The Life of Behavior. Neuron , 25-36, doi:10.1016/j.neuron.2019.09.017 (2019). 61 Rich, A. S. & Gurekis, T. M. Lessons for artificial intelligence from the study of natural stupidity.

Nature Machine Intelligence , 174 –

180 (2019). 62 Shenhav, A., Botvinick, M. M. & Cohen, J. D. The expected value of control: an integrative theory of anterior cingulate cortex function.

Neuron , 217-240, doi:10.1016/j.neuron.2013.07.007 (2013). 63 Pashler, H., Rohrer, D., Cepeda, N. J. & Carpenter, S. K. Enhancing learning and retarding forgetting: choices and consequences. Psychon Bull Rev , 187-193, doi:10.3758/bf03194050 (2007). 64 Bahri, Y. et al. Statistical Mechanics of Deep Learning.

Annual Review of Condensed Matter Physics , 501:528 (2020). 65 Saxe, A. M., McClelland, J. L. & Ganguli, S. A mathematical theory of semantic development in deep neural networks. Proc Natl Acad Sci U S A , 11537-11546, doi:10.1073/pnas.1820226116 (2019). 66 Saxe, A.

Deep linear networks: a theory of learning in the brain and mind

PhD thesis, Stanford University, (2015). 67 Seung, H. S., Sompolinsky, H. & Tishby, N. Statistical mechanics of learning from examples.

Physical Review A , 6056 – NeurIPS.

69 Advani, M. & Saxe, A. M. High-dimensional dynamics of generalization error in neural networks. arXiv (2017). 70 Krogh, A. & Hertz, J. A. Generalization in a linear perceptron in the presence of noise.

Journal of Physics A: Mathematical and General , 1135 – et al. in Advances in Neural Information Processing Systems 26.

72 Saxe, A. M., McClelland, J. L. & Ganguli, S. in

International Conference on Learning Representations.

3 Lee, J. et al.

Wide Neural Networks of Any Depth Evolve as Linear Models Under Gradient Descent. arXiv (2019). 74 Jacot, A., Gabriel, F. & Hongler, C. in

Advances in Neural Information Processing Systems. – Behav Brain Sci , 113-124; discussion 124-162, doi:10.1017/S0140525X10000919 (2011). 76 Rogers, T. T. & McClelland, J. L. Semantic cognition: A parallel distributed processing approach . (MIT Press, 2004). 77 Op de Beeck, H. P., Pillet, I. & Ritchie, J. B. Factors Determining Where Category-Selective Areas Emerge in Visual Cortex.

Trends Cogn Sci , 784-797, doi:10.1016/j.tics.2019.06.006 (2019). 78 Deen, B. et al. Organization of high-level visual cortex in human infants.

Nat Commun , 13995, doi:10.1038/ncomms13995 (2017). 79 Arcaro, M. J., Schade, P. F., Vincent, J. L., Ponce, C. R. & Livingstone, M. S. Seeing faces is necessary for face-domain formation. Nat Neurosci , 1404-1412, doi:10.1038/nn.4635 (2017). 80 Olshausen, B. A. & Field, D. J. Emergence of simple-cell receptive field properties by learning a sparse code for natural images. Nature , 607-609, doi:10.1038/381607a0 (1996). 81 Simoncelli, E. P. & Olshausen, B. A. Natural image statistics and neural representation.

Annu Rev Neurosci , 1193-1216, doi:10.1146/annurev.neuro.24.1.1193 (2001). 82 Friston, K. The free-energy principle: a unified brain theory? Nat Rev Neurosci , 127-138, doi:10.1038/nrn2787 (2010). 83 Kingma, D. P. & Welling, M. Auto-Encoding Variational Bayes. arXiv:1312.6114 (2013). 84 Burgess, C. P. et al. MONet: Unsupervised Scene Decomposition and Representation. 85 Lillicrap, T. P., Cownden, D., Tweed, D. B. & Akerman, C. J. Random synaptic feedback weights support error backpropagation for deep learning.

Nat Commun , 13276, doi:10.1038/ncomms13276 (2016). 86 Detorakis, G., Bartley, T. & Neftci, E. Contrastive Hebbian Learning with Random Feedback Weights. arXiv (2018). 87 Wenliang, L. K. & Seitz, A. R. Deep Neural Networks for Modeling Visual Perceptual Learning. J Neurosci , 6028-6044, doi:10.1523/JNEUROSCI.1620-17.2018 (2018). 88 Murphy, G. L. The Big Book of Concepts . (MIT Press, 2002). 89 Graves, A. et al.

Hybrid computing using a neural network with dynamic external memory.

Nature , 471-476, doi:10.1038/nature20101 (2016). 90 Wayne, G. et al.

Unsupervised Predictive Memory in a Goal-Directed Agent. arXiv (2018). 91 Silver, D. et al.

Mastering the game of Go with deep neural networks and tree search.

Nature , 484-489, doi:10.1038/nature16961 (2016). 92 Moser, E. I., Kropff, E. & Moser, M. B. Place cells, grid cells, and the brain's spatial representation system.

Annu Rev Neurosci , 69-89, doi:10.1146/annurev.neuro.31.061307.090723 (2008). 93 Fee, M. S., Kozhevnikov, A. A. & Hahnloser, R. H. Neural mechanisms of vocal sequence generation in the songbird. Ann N Y Acad Sci , 153-170, doi:10.1196/annals.1298.022 (2004). 94 Hanes, D. P. & Schall, J. D. Neural control of voluntary movement initiation.

Science , 427-430 (1996). 5 Tervo, D. G. R., Tenenbaum, J. B. & Gershman, S. J. Toward the neural implementation of structure learning.

Curr Opin Neurobiol , 99-105, doi:10.1016/j.conb.2016.01.014 (2016). 96 Tenenbaum, J. B., Kemp, C., Griffiths, T. L. & Goodman, N. D. How to grow a mind: statistics, structure, and abstraction. Science , 1279-1285, doi:10.1126/science.1192788 (2011). 97 Behrens, T. E. J. et al.

What Is a Cognitive Map? Organizing Knowledge for Flexible Behavior.

Neuron , 490-509, doi:10.1016/j.neuron.2018.10.002 (2018). 98 Penn, D. C., Holyoak, K. J. & Povinelli, D. J. Darwin's mistake: explaining the discontinuity between human and nonhuman minds.

Behav Brain Sci , 109-130; discussion 130-178, doi:10.1017/S0140525X08003543 (2008). 99 Schuck, N. W. & Niv, Y. Sequential replay of nonspatial task states in the human hippocampus. Science , doi:10.1126/science.aaw5181 (2019). 100 Kurth-Nelson, Z., Economides, M., Dolan, R. J. & Dayan, P. Fast Sequences of Non-spatial State Representations in Humans.

Neuron , 194-204, doi:10.1016/j.neuron.2016.05.028 (2016). 101 Liu, Y., Dolan, R. J., Kurth-Nelson, Z. & Behrens, T. E. J. Human Replay Spontaneously Reorganizes Experience. Cell , 640-652 e614, doi:10.1016/j.cell.2019.06.012 (2019). 102 Barron, H. C. et al.

Unmasking Latent Inhibitory Connections in Human Cortex to Reveal Dormant Cortical Memories.

Neuron , 191-203, doi:10.1016/j.neuron.2016.02.031 (2016). 103 Koolschijn, R. S. et al. The Hippocampus and Neocortical Inhibitory Engrams Protect against Memory Interference.

Neuron , 528-541 e526, doi:10.1016/j.neuron.2018.11.042 (2019). 104 Doeller, C. F., Barry, C. & Burgess, N. Evidence for grid cells in a human memory network.

Nature , 657-661, doi:10.1038/nature08704 (2010). 105 Constantinescu, A. O., O'Reilly, J. X. & Behrens, T. E. J. Organizing conceptual knowledge in humans with a gridlike code.

Science , 1464-1468, doi:10.1126/science.aaf0941 (2016). 106 Tsividis, P. A., Pouncy, T., Xu, J. L., Tenenbaum, J. B. & Gershman, S. J. in

AAAI Spring Symposium on Science of Intelligence: Computational Principles of Natural and Artificial Intelligence. .

107 Quiroga, R. Q., Reddy, L., Kreiman, G., Koch, C. & Fried, I. Invariant visual representation by single neurons in the human brain.

Nature , 1102-1107, doi:10.1038/nature03687 (2005). 108 O'Keefe, J. & Dostrovsky, J. The hippocampus as a spatial map. Preliminary evidence from unit activity in the freely-moving rat.

Brain Res , 171-175, doi:10.1016/0006-8993(71)90358-1 (1971). 109 Baraduc, P., Duhamel, J. R. & Wirth, S. Schema cells in the macaque hippocampus. Science , 635-639, doi:10.1126/science.aav5404 (2019). 110 Miyashita, Y. Neuronal correlate of visual associative long-term memory in the primate temporal cortex.

Nature , 817-820, doi:10.1038/335817a0 (1988). 111 Schapiro, A. C., Rogers, T. T., Cordova, N. I., Turk-Browne, N. B. & Botvinick, M. M. Neural representations of events arise from temporal community structure.

Nat Neurosci , 486-492, doi:10.1038/nn.3331 (2013). 12 Garvert, M. M., Dolan, R. J. & Behrens, T. E. A map of abstract relational knowledge in the human hippocampal-entorhinal cortex. Elife , doi:10.7554/eLife.17086 (2017). 113 Schapiro, A. C., Kustner, L. V. & Turk-Browne, N. B. Shaping of object representations in the human medial temporal lobe based on temporal regularities. Curr Biol , 1622-1627, doi:10.1016/j.cub.2012.06.056 (2012). 114 Schapiro, A. C., Turk-Browne, N. B., Norman, K. A. & Botvinick, M. M. Statistical learning of temporal community structure in the hippocampus. Hippocampus , 3-8, doi:10.1002/hipo.22523 (2016). 115 Schlichting, M. L., Mumford, J. A. & Preston, A. R. Learning-related representational changes reveal dissociable integration and separation signatures in the hippocampus and prefrontal cortex. Nat Commun , 8151, doi:10.1038/ncomms9151 (2015). 116 Zeithamova, D., Dominick, A. L. & Preston, A. R. Hippocampal and ventral medial prefrontal activation during retrieval-mediated learning supports novel inference. Neuron , 168-179, doi:10.1016/j.neuron.2012.05.010 (2012). 117 Park, S. A., Miller, D. S., Nili, H., Ranganath, C. & Boorman, E. D. Map making: Constructing, combining, and navigating abstract cognitive maps. BiorXiv preprint (2019). 118 Kumaran, D., Banino, A., Blundell, C., Hassabis, D. & Dayan, P. Computations Underlying Social Hierarchy Learning: Distinct Neural Mechanisms for Updating and Representing Self-Relevant Information.

Neuron , 1135-1147, doi:10.1016/j.neuron.2016.10.052 (2016). 119 Baram, A. B., Muller, T. H., Nili, H., Garvert, M. & Behrens, T. E. J. Entorhinal and ventromedial prefrontal cortices abstract and generalise the structure of reinforcement learning problems. BiorXiv preprint (2019). 120 Dolan, R. J. & Dayan, P. Goals and habits in the brain.

Neuron , 312-325, doi:S0896-6273(13)00805-2 [pii] 10.1016/j.neuron.2013.09.007 (2013). 121 Higgins, I. et al. Early Visual Concept Learning with Unsupervised Deep Learning. arXiv:1606.05579 (2016). 122 Higgins, I. et al.

SCAN: Learning Hierarchical Compositional Visual Concepts. arXiv:1707.03389 (2017). 123 Hessel, M. et al.

Rainbow: Combining Improvements in Deep Reinforcement Learning. arXiv (2017). 124 Stachenfeld, K. L., Gershman, S. J. & Botvinick, M. in

Advances in Neural Information Processing Systems 27 (2014). 125 Whittington, J. C. R. et al.

The Tolman-Eichenbaum Machine: Unifying space and relational memory through generalisation in the hippocampal formation.

BiorXiv preprint , doi:https://doi.org/10.1101/770495 (2019). 126 Bellmund, J. L. S., Gardenfors, P., Moser, E. I. & Doeller, C. F. Navigating cognition: Spatial codes for human thinking.

Science , doi:10.1126/science.aat6766 (2018). 127 Cueva, C. J. & Wei, X. X. Emergence of grid-like representations by training recurrent neural networks to perform spatial localization. arXiv (2018). 128 Banino, A. et al.

Vector-based navigation using grid-like representations in artificial agents.

Nature , 429-433, doi:10.1038/s41586-018-0102-6 (2018). 129 Parisi, G., Kemker, R., Part, J. L., Kanan, C. & Wermter, S. Continual Lifelong Learning with Neural Networks: A Review.

Neural Networks , doi:10.1016/j.neunet.2019.01.012. (2019). 30 Schapiro, A. C., Turk-Browne, N. B., Botvinick, M. M. & Norman, K. A. Complementary learning systems within the hippocampus: a neural network modelling approach to reconciling episodic memory with statistical learning.

Philos Trans R Soc Lond B Biol Sci , doi:10.1098/rstb.2016.0049 (2017). 131 French, R. M. Catastrophic forgetting in connectionist networks.

Trends Cogn Sci , 128-135, doi:10.1016/s1364-6613(99)01294-2 (1999). 132 McCloskey, M. & Cohen, N. J. Catastrophic Interference in Connectionist Networks: The Sequential Learning Problem. Psychology of Learning and Motivation , 109-165 (1989). 133 McClelland, J. L., McNaughton, B. L. & O'Reilly, R. C. Why there are complementary learning systems in the hippocampus and neocortex: insights from the successes and failures of connectionist models of learning and memory. Psychol Rev , 419-457 (1995). 134 O'Reilly, R. C., Bhattacharyya, R., Howard, M. D. & Ketz, N. Complementary learning systems.

Cogn Sci , 1229-1248, doi:10.1111/j.1551-6709.2011.01214.x (2014). 135 Tulving, E. Episodic memory: from mind to brain. Annu Rev Psychol , 1-25, doi:10.1146/annurev.psych.53.100901.135114 (2002). 136 Carr, M. F., Jadhav, S. P. & Frank, L. M. Hippocampal replay in the awake state: a potential substrate for memory consolidation and retrieval. Nat Neurosci , 147-153, doi:10.1038/nn.2732 (2011). 137 Zola-Morgan, S. M. & Squire, L. R. The primate hippocampal formation: evidence for a time-limited role in memory storage. Science , 288-290, doi:10.1126/science.2218534 (1990). 138 Yonelinas, A. P. The Nature of Recollection and Familiarity: A Review of 30 Years of Research.

Journal of Memory and Language , 441-517 (2002). 139 van den Ven, G. M. & Tolias, A. S. Generative replay with feedback connections as a general strategy for continual learning. arXiv:1809.10635 (2019). 140 Kumaran, D., Hassabis, D. & McClelland, J. L. What Learning Systems do Intelligent Agents Need? Complementary Learning Systems Theory Updated. Trends Cogn Sci , 512-534, doi:10.1016/j.tics.2016.05.004 (2016). 141 Qian, T. & Aslin, R. N. Learning bundles of stimuli renders stimulus order as a cue, not a confound. Proc Natl Acad Sci U S A , 14400-14405, doi:10.1073/pnas.1416109111 (2014). 142 Collins, A. G. & Frank, M. J. Cognitive control over learning: creating, clustering, and generalizing task-set structure.

Psychol Rev , 190-229, doi:10.1037/a0030852 (2013). 143 Zenke, F., Poole, B. & Ganguli, S. Continual Learning Through Synaptic Intelligence. arXiv:1703.04200 (2017). 144 Kirkpatrick, J. et al.

Overcoming catastrophic forgetting in neural networks.

Proc Natl Acad Sci U S A , 3521-3526, doi:10.1073/pnas.1611835114 (2017). 145 Masse, N. Y., Grant, G. D. & Freedman, D. J. Alleviating catastrophic forgetting using context-dependent gating and synaptic stabilization.

Proc Natl Acad Sci U S A , E10467-E10475, doi:10.1073/pnas.1803839115 (2018). 146 Zeng, G., Chen, Y., Cui, B. & Yu, S. Continual learning of context-dependent processing in neural networks.

Nature Machine Intelligence , 364-372 (2019). 47 Bouchacourt, F., Palminteri, S., Koechlin, E. & Ostojic, S. Temporal chunking as a mechanism for unsupervised learning of task-sets. Elife , doi:10.7554/eLife.50469 (2020). 148 Harvey, C. D., Coen, P. & Tank, D. W. Choice-specific sequences in parietal cortex during a virtual-navigation decision task. Nature , 62-68, doi:10.1038/nature10918 (2012). 149 Rule, M. E., O'Leary, T. & Harvey, C. D. Causes and consequences of representational drift.

Curr Opin Neurobiol , 141-147, doi:10.1016/j.conb.2019.08.005 (2019). 150 Musslick, S. et al. in –834.