On the Self-Repair Role of Astrocytes in STDP Enabled Unsupervised SNNs
Mehul Rastogi, Sen Lu, Nafiul Islam, Abhronil Sengupta
Abstract—Neuromorphic computing is emerging to be a disruptive computational paradigm that attempts to emulate various facets of the underlying structure and functionalities of the brain in the algorithm and hardware design of next-generation machine learning platforms. This work goes beyond the focus of current neuromorphic computing architectures on computational models for neuron and synapse to examine other computational units of the biological brain that might contribute to cognition and especially self-repair. We draw inspiration and insights from computational neuroscience regarding functionalities of glial cells and explore their role in the fault-tolerant capacity of Spiking Neural Networks (SNNs) trained in an unsupervised fashion using Spike-Timing Dependent Plasticity (STDP). We characterize the degree of self-repair that can be enabled in such networks with varying degrees of faults ranging from 50%-90% and evaluate our proposal on the MNIST and Fashion-MNIST datasets.
Index Terms—Spiking Neural Networks, Astrocytes, Spike-Timing Dependent Plasticity, Unsupervised learning
I. INTRODUCTION
Neuromorphic computing has made significant strides over the past few years, both from hardware [1]–[4] and algorithmic perspectives [5]–[8]. However, the quest to decode the operation of the brain has mainly focused on spike based information processing in the neurons and plasticity in the synapses. Over the past few years, there has been increasing evidence that glial cells, and in particular astrocytes, play a crucial role in a multitude of brain functions [9]. As a matter of fact, astrocytes represent a large proportion of the cell population in the human brain [9]. There have also been suggestions that the complexity of astrocyte functionality can significantly contribute to the computational power of the human brain. Astrocytes are strategically positioned to ensheath tens of thousands of synapses, axons and dendrites, among others, thereby having the capability to serve as a communication channel between multiple components and behave as a sensing medium for ongoing brain activity [10]. This has led neuroscientists to conclude that astrocytes play a major role in higher order brain functions like learning and memory, in addition to neurons and synapses. Over the past few years, there have been multiple studies to revise the neuron-circuit model for describing higher order brain functions to incorporate astrocytes as part of the neuron-glia network model [9], [11]. These investigations clearly indicate and quantify that incorporating astrocyte functionality in network models influences neuron excitability, synaptic strengthening and, in turn, plasticity mechanisms like Short-Term Plasticity and Long-Term Potentiation, which are important learning tools used by neuromorphic engineers.

Manuscript dated September, 2020. The authors are with the School of Electrical Engineering and Computer Science, The Pennsylvania State University, University Park, PA 16802, USA. M. Rastogi is also affiliated with the Department of Computer Science and Information Systems, Birla Institute of Technology and Science Pilani, Goa Campus, Goa 403726, India. E-mail: [email protected].

The key distinguishing factors of our work against prior efforts can be summarized as follows: (i)
While recent literature reports astrocyte computational models and their impact on fault-tolerance and synaptic learning [9], [11]–[14], the studies have been mostly confined to small scale networks. This work is a first attempt to explore the self-repair role of astrocytes at scale in unsupervised SNNs in standard visual recognition tasks. (ii)
In parallel, there is a long history of implementing astrocyte functionality in analog and digital CMOS [15]–[21]. More recently, emerging physics in post-CMOS technologies like spintronics is also being leveraged to mimic glial functionalities at a one-to-one level [22]. However, the primary focus has been on a brain-emulation perspective, i.e. implementing astrocyte computational models with a high degree of detail in the underlying hardware. We explore the aspects of astrocyte functionality that would be relevant to self-repair in the context of SNN based machine learning platforms and evaluate the degree of bio-fidelity required. (iii)
While Refs. [23], [24] discuss the impact of faults in unsupervised STDP enabled SNNs, self-repair functionality in such networks has not been studied previously.

While neuromorphic hardware based on emerging post-CMOS technologies [3], [25]–[28] has made significant advancements to reduce the area and power efficiency gap of Artificial Intelligence (AI) systems, such emerging hardware is characterized by a host of non-idealities which has greatly limited its scalability. Our work provides motivation toward autonomous self-repair of such faulty neuromorphic hardware platforms. The efficacy of our proposed astrocyte enabled self-repair process is measured by the following steps: (i)
Training SNNs using unsupervised STDP learning rules in networks equipped with lateral inhibition and homeostasis, (ii)
Introducing “faults” in the trained weight maps by setting a randomly chosen subset of the weights to zero, and (iii) Implementing self-repair by re-training the faulty network with astrocyte functionality augmented STDP learning rules. We also compare our proposal with a sole STDP based re-training strategy and substantiate our results on the MNIST and F-MNIST datasets.

Note that “faults” are disjoint from the concept of “dropout” [29] used in neural network training. In dropout, neurons are randomly deleted (along with their connections) only during training to avoid overfitting. In contrast, faults in our work refer to static non-ideal stuck-at-zero synaptic connections present during both the training and inference stages.

II. MATERIALS AND METHODS
A. Astrocyte Preliminaries
In addition to astrocyte mediated meta-plasticity for learning and memory [12], [30]–[32], there has been indication that retrograde signalling via astrocytes probably underlies self-repair in the brain. Computational models demonstrate that when faults occur in synapses corresponding to a particular neuron, an indirect feedback signal (mediated through retrograde signalling by the astrocyte via endocannabinoids, a type of retrograde messenger) from other neurons in the network implements repair functionality by increasing the transmission probability across all healthy synapses for the affected neuron, thereby restoring the original operation [12]. Astrocytes modulate this synaptic transmission probability (PR) through two feedback signalling pathways: direct and indirect, responsible for synaptic depression (DSE) and potentiation (e-SP) respectively. Multiple astrocyte computational models [12], [30]–[32] describe the interaction of astrocytes and neurons via the tripartite synapse where the astrocyte's sensitivity to 2-arachidonyl glycerol (2-AG), a type of endocannabinoid, is considered. Each time a post-synaptic neuron fires, 2-AG is released from the post-synaptic dendrite and can be described as:

$$\frac{d(AG)}{dt} = -\frac{AG}{\tau_{AG}} + r_{AG}\,\delta(t - t_{sp}) \quad (1)$$

where $AG$ is the quantity of 2-AG, $\tau_{AG}$ is the decay rate of 2-AG, $r_{AG}$ is the 2-AG production rate and $t_{sp}$ is the time of the post-synaptic spike.

The 2-AG binds to receptors (CB1Rs) on the astrocyte process and instigates the generation of IP$_3$, which subsequently binds to IP$_3$ receptors on the Endoplasmic Reticulum (ER) to open channels that allow the release of Ca$^{2+}$. It is this increase in cytosolic Ca$^{2+}$ that causes the release of gliotransmitters into the synaptic cleft that is ultimately responsible for the indirect e-SP signaling.

Fig. 1. (a) Network with no faults, (b) Network with fault occurring in synapse associated with neuron N2 [12]. 2-AG is a local signal associated with each synapse while e-SP is a global signal.
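As a rough illustration, the 2-AG dynamics of Eq. (1) can be integrated with a simple forward-Euler scheme. The constants below are placeholders chosen for readability, not the calibrated values of Ref. [12]:

```python
# Minimal sketch of Eq. (1): d(AG)/dt = -AG/tau_AG + r_AG * delta(t - t_sp).
# All parameter values are illustrative assumptions, not the model's constants.
TAU_AG = 10.0   # 2-AG decay time constant (s), assumed
R_AG = 1.0      # 2-AG produced per post-synaptic spike, assumed
DT = 1e-3       # integration time step (s)

def step_2ag(ag, spiked):
    """One forward-Euler step of the 2-AG dynamics."""
    ag += DT * (-ag / TAU_AG)   # exponential decay between spikes
    if spiked:                  # delta term: instantaneous jump on a spike
        ag += R_AG
    return ag

ag = 0.0
trace = []
for t in range(5000):           # 5 s of simulation
    spiked = (t % 1000 == 0)    # toy input: post-neuron fires once per second
    ag = step_2ag(ag, spiked)
    trace.append(ag)
```

Each post-synaptic spike steps the 2-AG level up by $r_{AG}$, after which it relaxes exponentially with time constant $\tau_{AG}$; the resulting trace is the local signal that drives both DSE and, via the astrocyte, e-SP.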
A1 is the astrocyte.

The Li-Rinzel model [33] uses three channels to describe the Ca$^{2+}$ dynamics within the astrocyte: $J_{chan}$ models the opening of Ca$^{2+}$ channels by the mutual gating of Ca$^{2+}$ and IP$_3$ concentrations, $J_{leak}$ describes Ca$^{2+}$ leakage into the cytoplasm, and $J_{pump}$ models how Ca$^{2+}$ is pumped into the ER from the cytoplasm via the Sarco-Endoplasmic-Reticulum Ca$^{2+}$-ATPase (SERCA) pumps. The Ca$^{2+}$ dynamics is thus given by:

$$\frac{d[Ca^{2+}]}{dt} = J_{chan} + J_{leak} - J_{pump} \quad (2)$$

The details of the equations and their derivations can be obtained from Refs. [12] and [34]. The intracellular astrocytic calcium dynamics control the glutamate release from the astrocyte, which drives e-SP. This release can be modelled by:

$$\frac{d(Glu)}{dt} = -\frac{Glu}{\tau_{Glu}} + r_{Glu}\,\delta(t - t_{Ca}) \quad (3)$$

where $Glu$ is the quantity of glutamate, $\tau_{Glu}$ is the glutamate decay rate, $r_{Glu}$ is the glutamate production rate and $t_{Ca}$ is the time of the Ca$^{2+}$ threshold crossing. To model e-SP:

$$\tau_{eSP}\,\frac{d(eSP)}{dt} = -eSP + m_{eSP}\,Glu(t) \quad (4)$$

where $\tau_{eSP}$ is the decay rate of e-SP and $m_{eSP}$ is a scaling factor. Eq. (4) substantiates that the level of e-SP is dependent on the quantity of glutamate released by the astrocyte.

The released 2-AG also binds directly to pre-synaptic CB1Rs (direct signaling). A linear relationship is assumed between DSE and the level of 2-AG released by the post-synaptic neuron:

$$DSE = -AG \times K_{AG} \quad (5)$$

where $AG$ is the amount of 2-AG released by the post-synaptic neuron, found from Eq. (1), and $K_{AG}$ is a scaling factor. The PR associated with each synapse is given by the following equation:

$$PR(t) = PR(t_0) + PR(t_0) \times \left(\frac{DSE(t) + eSP(t)}{100}\right) \quad (6)$$

where PR($t_0$) is the initial PR of the synapse, and e-SP and DSE are given by Eqs. (4) and (5) respectively. In the computational models, the effect of DSE is local to the synapses connected to a particular neuron whereas all the tripartite synapses connected to the same astrocyte receive the same e-SP.
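A minimal sketch of how Eqs. (5) and (6) interact may help: DSE (negative, local to one neuron's synapses, in percent) and e-SP (positive, global to all synapses under one astrocyte, in percent) jointly rescale the release probability. The numeric values of $K_{AG}$, DSE and e-SP below are assumed purely for illustration:

```python
def synaptic_release_probability(pr0, dse, esp):
    """Eq. (6): PR(t) = PR(t0) + PR(t0) * (DSE(t) + eSP(t)) / 100.

    dse is negative (depression, local), esp is positive (potentiation,
    global); both are expressed in percent.
    """
    return pr0 + pr0 * (dse + esp) / 100.0

def dse_from_2ag(ag, k_ag=60.0):
    """Eq. (5): DSE = -AG * K_AG. k_ag is an assumed scaling factor."""
    return -ag * k_ag

# Healthy network: DSE and e-SP balance, so PR holds at its initial value.
pr0 = 0.5
pr_healthy = synaptic_release_probability(pr0, dse=-40.0, esp=40.0)

# After a fault the affected neuron fires less, releases less 2-AG, so the
# magnitude of its DSE shrinks while the global e-SP persists; the PR of its
# remaining healthy synapses therefore rises.
pr_repaired = synaptic_release_probability(pr0, dse=-20.0, esp=40.0)
```

The second call captures the self-repair mechanism qualitatively: reduced 2-AG from the faulty neuron weakens its local depression term, and the global e-SP then pushes PR above its initial value on the surviving synapses.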
Under the no-fault condition, the DSE and e-SP reach a dynamic equilibrium where the PR is unchanged over time, resulting in a fixed firing rate for the neurons. When a fault occurs, this balance subsides and the PR changes according to Eq. (6) to restore the firing rate to its previous value. To showcase this effect consider, for instance, Fig. 1 where a simple SNN with two post-synaptic neurons is depicted. Let us assume that each post-neuron receives input spikes from 10 pre-neurons. The initial PR of the synapses was set to 0.5. Fig. 1(a) is the case with no faults, while in Fig. 1(b), faults have occurred after some time in 70% of the synapses associated with post-neuron N2 (Fig. 2). Note, here “faults” imply that the synapses do not take part in transmission of the input spikes, i.e. have a PR of zero.

Fig. 2. Simulation results of the network in Fig. 1 using the computational model of astrocyte mediated self-repair from [12]. Total simulation time is 400s. At 200s, faults are introduced in 70% of the synapses connected to N2. All the synapses have PR($t_0$)=0.5. (a) e-SP of N1 and N2. It is the same for both N1 and N2 since e-SP is a global function, (b) DSE of N1 and N2. It is different for each neuron as it is dependent upon the neuron output. At 200s, after the introduction of the faults in N2, only the DSE of N2 changes, (c) PR of the different types of synapses connected to N1 and N2, and (d) Firing rate of neurons N1 and N2.
This results in a drop of the firing frequency associated with N2 while the operation of N1 is not impacted. Thus, the amount of 2-AG released by N2 decreases, which increases DSE and in turn increases the PR of the associated synapses of N2 where no faults have occurred. Hence, we observe in Fig. 2(d) that the increased PR recovers the firing rate and approaches the ideal firing frequency. Note that the degree of self-recovery, i.e. the difference between the recovered and ideal frequency, is a function of the fault probability. The simulation conditions and parameters for the modelling are based on Ref. [12]. Interested readers are directed to Ref. [12] for an extensive discussion on the astrocyte computational model and the underlying processes governing the retrograde signalling.

A key question that we have attempted to address in this work is the computational complexity at which we need to model the feedback mechanism to implement autonomous repair in such self-learning networks. Simplifying the feedback modelling would enable us to implement such functionalities with efficient hardware primitives. For instance, the core functionality of astrocyte self-repair occurs in conjunction with STDP based learning in synapses. Fig. 3 shows a typical STDP learning rule where the change in synaptic weight varies exponentially with the spike time difference between the pre- and post-neuron [35], according to measurements performed in rat glutamatergic synapses [36]. Typically, the height of the STDP weight update window for potentiation/depression is constant ($A_+$/$A_-$). However, astrocyte mediated self-repair suggests that the weight update should be a function of the firing rate of the post-neuron [35]. Assuming the fault-less expected firing rate of the post-neuron to be $f_{ideal}$ and the non-ideal firing rate to be $f$, the synaptic weight update window height should be a function of $\Delta f = f_{ideal} - f$. The concept has been explained further in Fig. 3 and is also in accordance with Fig.
2, where the PR increase after fault introduction varies in a non-linear fashion over time and eventually stabilizes once the self-repaired firing frequency approaches the ideal value. The functional dependence is assumed to be that of a sigmoid function, indicating that as the magnitude of the fault, i.e. the deviation from the ideal firing frequency of the neuron, increases, the height of the learning window increases in proportion to compensate for the fault [35]. Note that the term “fault” for the machine learning workloads described herein refers to synaptic weights (symbolizing PR) stuck at zero. Therefore, with increasing amount of synaptic faults, $f < f_{ideal}$.

Fig. 3. A typical STDP learning rule, where the change in synaptic weight varies exponentially with the spike timing difference $\Delta t$ between pre- and post-neuron:

$$\Delta w = \begin{cases} A_+ \exp(-\Delta t/\tau_+), & \Delta t > 0 \\ -A_- \exp(\Delta t/\tau_-), & \Delta t < 0 \end{cases}$$

Macro-modelling the astrocyte functionality, the learning window height $A_+$ becomes a non-linear (sigmoidal) increasing function of the deviation $\Delta f$ from the ideal firing frequency of the post-neuron.

B. Neuron and Synapse Models

We utilized the Leaky Integrate and Fire (LIF) spiking neuron model in our work. The temporal LIF neuron dynamics are described as:

$$\tau_{mem}\,\frac{\partial v(t)}{\partial t} = -v(t) + v_{rest} + I(t) \quad (7)$$

where $v(t)$ is the membrane potential, $\tau_{mem}$ is the membrane time constant, $v_{rest}$ is the resting potential and $I(t)$ denotes the total input to the neuron at time $t$. The weighted summation of synaptic inputs is represented by $I(t)$. When the neuron's membrane potential crosses a threshold value, $v_{th}(t)$, it fires an output spike and the membrane potential is reset to $v_{reset}$. The neuron's membrane voltage is fixed at the reset potential for a refractory period, $\delta_{ref}$, after it spikes, during which it does not receive any inputs.

In order to ensure that single neurons do not dominate the firing pattern, homeostasis [6] is also implemented through an adaptive thresholding scheme.
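A sketch of the LIF dynamics of Eq. (7), including the threshold, reset and refractory behaviour just described, is given below. All constants are assumed illustrative values, not the simulation parameters used in this work:

```python
# Forward-Euler simulation of Eq. (7): tau_mem * dv/dt = -(v - v_rest) + I(t),
# with spike/reset and a refractory clamp. Constants are assumptions.
TAU_MEM = 100e-3   # membrane time constant (s)
V_REST = -65.0     # resting potential (mV)
V_RESET = -70.0    # reset potential (mV)
V_TH = -52.0       # firing threshold (mV); fixed here, adaptive in the paper
T_REF = 5e-3       # refractory period (s)
DT = 0.5e-3        # integration time step (s)

def lif_step(v, refrac_left, i_in):
    """One Euler step; returns (new_v, refractory_time_left, fired)."""
    if refrac_left > 0:                        # clamped during refractory period
        return V_RESET, refrac_left - DT, False
    v += (DT / TAU_MEM) * (-(v - V_REST) + i_in)
    if v >= V_TH:                              # threshold crossing: spike and reset
        return V_RESET, T_REF, True
    return v, 0.0, False

v, refrac, spikes = V_REST, 0.0, 0
for _ in range(2000):                          # 1 s with a constant input drive
    v, refrac, fired = lif_step(v, refrac, i_in=20.0)
    spikes += fired
```

With a constant drive the membrane charges toward $v_{rest} + I$, fires when it crosses threshold, resets, and sits at $v_{reset}$ for the refractory window, producing a regular spike train whose rate grows with the input.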
The membrane threshold of each neuron is given by the following temporal dynamics:

$$v_{th}(t) = \theta_0 + \theta(t), \qquad \tau_{theta}\,\frac{\partial\theta(t)}{\partial t} = -\theta(t) \quad (8)$$

where $\theta_0 > v_{rest}, v_{reset}$ and is a constant, and $\tau_{theta}$ is the adaptive threshold time constant. The adaptive threshold, $\theta(t)$, is increased by a constant quantity $\theta_+$ each time the neuron fires, and decays exponentially according to the dynamics in Eq. (8).

A trace [37] based synaptic weight update rule was used for the online learning process [6], [23]. The pre- and post-synaptic traces are given by $x_{pre}$ and $x_{post}$ respectively. Whenever the pre (post) synaptic neuron fires, the variable $x_{pre}$ ($x_{post}$) is set to 1; otherwise it decays exponentially to 0 with spike trace decay time constant $\tau_{trace}$. The STDP weight update rule is characterized by the following dynamics:

$$\Delta w = \begin{cases} \eta_{post} \cdot x_{pre} & \text{on post-synaptic spike} \\ -\eta_{pre} \cdot x_{post} & \text{on pre-synaptic spike} \end{cases} \quad (9)$$

where $\eta_{pre}$/$\eta_{post}$ denote the learning rates for pre-synaptic/post-synaptic updates respectively. The weights of the neurons are bounded in the range $[0, w_{max}]$. It is worth mentioning here that the sum of the weights associated with each post-synaptic neuron is normalized to a constant factor, $w_{norm}$ [23].

C. Network Architecture

Fig. 4. The single layer SNN architecture with lateral inhibition and homeostasis used for unsupervised learning: an Input Layer densely connected to an Output Layer with recurrent inhibitory connections.

Our SNN based unsupervised machine learning framework is based on single layer architectures inspired from cortical microcircuits [6]. Fig. 4 shows the network connectivity of spiking neurons utilized for pattern-recognition problems. Such a network topology has been shown to be efficient in several pattern-recognition problems, such as digit recognition [6] and sparse encoding [38]. The SNN under consideration has an Input Layer with the number of neurons equivalent to the dimensionality of the input data.
Input neurons generate spikes by converting each pixel in the input image to a Poisson spike train whose average firing frequency is proportional to the pixel intensity. This layer connects in an all-to-all fashion to the Output Layer through excitatory synapses. The Output Layer has $n$ LIF neurons characterized by homeostasis functionality. It also has static (constant weight) recurrent inhibitory synapses with weight values, $w_{recurrent}$, for lateral inhibition to achieve a soft Winner-Take-All (WTA) condition. Each neuron in the Output Layer has an inhibitory connection to all the neurons in that layer except itself. The trace-based STDP mechanism is used to learn the weights of all synapses between the Input and Output Layers. The neurons in the Output Layer are assigned classes based on their highest response (spike frequency) to input training patterns [6].

D. Challenges and Astrocyte Augmented STDP (A-STDP) Learning Rule Formulation

One of the major challenges in extending the astrocyte based macro-modelling to such self-learning networks lies in the fact that the ideal neuron firing frequency is a function of the specific input class the neuron responds to. This is substantiated by Fig. 5, which depicts the histogram distribution of the ideal firing rate of the winning neuron in the fault-less network. Further, due to sparse neural firing, the total number of output spikes of the winning neurons over the inference window is also small, thereby limiting the amount of information (number of discrete levels) that can be encoded in the frequency deviation, $\Delta f$. This leads to the question: Can we utilize another surrogate signal that gives us information about the degree of self-repair occurring in the network over time while being independent of the class of the input data?

Fig. 5. Histogram distribution of the ideal firing rate of the winning neurons in the fault-less network.
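The Poisson rate coding of input pixels used by the Input Layer can be sketched as follows. The maximum rate and the presentation window are assumed values for illustration, not the parameters used in our experiments:

```python
import numpy as np

def poisson_encode(image, duration_steps=350, dt=1e-3, max_rate=63.75, seed=None):
    """Convert pixel intensities (0-255) to Poisson spike trains whose mean
    firing rate is proportional to intensity. max_rate (Hz) is an assumed
    scaling; returns a boolean array of shape (duration_steps, *image.shape)."""
    rng = np.random.default_rng(seed)
    rates = image.astype(float) / 255.0 * max_rate   # per-pixel rate in Hz
    p_spike = rates * dt                             # per-step spike probability
    return rng.random((duration_steps,) + image.shape) < p_spike

# Toy input: a single bright pixel on an otherwise dark 28x28 image.
img = np.zeros((28, 28), dtype=np.uint8)
img[10, 10] = 255
spikes = poisson_encode(img, seed=0)
```

Only the bright pixel emits spikes (dark pixels have zero rate), and the brighter the pixel the more spikes fall inside the presentation window, which is exactly the rate code the Input Layer feeds to the excitatory synapses.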