Adaptive Extreme Edge Computing for Wearable Devices
Erika Covi, Elisa Donati, Hadi Heidari, David Kappel, Xiangpeng Liang, Melika Payvand, and Wei Wang*

NaMLab gGmbH, Nöthnitzer Strasse 64a, 01187 Dresden, Germany
Institute of Neuroinformatics, University of Zurich and ETH Zurich, Switzerland
Microelectronics Lab (meLAB), James Watt School of Engineering, University of Glasgow, G12 8QQ, UK
Bernstein Center for Computational Neuroscience, III. Physikalisches Institut - Biophysik, Georg-August-Universität, Göttingen, Germany
The Andrew and Erna Viterbi Department of Electrical Engineering, Technion - Israel Institute of Technology, Haifa 32000, Israel; formerly with Dipartimento di Elettronica, Informazione e Bioingegneria (DEIB), Politecnico di Milano and IU.NET, Milan, Italy

* All authors contributed equally to this work
ABSTRACT
Wearable devices are a fast-growing technology with impact on personal healthcare for both society and economy. Due to the widespread use of sensors in pervasive and distributed networks, power consumption, processing speed, and system adaptation are vital in future smart wearable devices. The vision of bringing computation to the edge in smart sensors has already begun to take shape, with the aspiration to provide adaptive extreme edge computing. Here, we provide a holistic view of hardware and theoretical solutions towards smart wearable devices that can provide guidance to research in this pervasive computing era. We propose various solutions for biologically plausible models for continual learning in neuromorphic computing technologies for wearable sensors. To envision this concept, we provide a systematic outline in which prospective low-power and low-latency scenarios of wearable sensors on neuromorphic platforms are expected. We successively describe the vital potential landscape of neuromorphic processors exploiting complementary metal-oxide-semiconductor (CMOS) and emerging memory technologies (e.g. memristive devices). Furthermore, we evaluate the requirements for edge computing within wearable devices in terms of footprint, power consumption, latency, and data size. We additionally investigate the challenges beyond neuromorphic computing hardware, algorithms, and devices that could impede enhancement of adaptive edge computing in smart wearable devices.
Keywords:
Neuromorphic computing, Edge computing, Wearable devices, Learning algorithms, Memristive devices
Wearable devices can monitor various human body signals, ranging from heart, respiration, and movement to brain activity. Such miniaturized devices, using different sensors, can detect, predict, and analyze the physical performance, physiological status, biochemical composition, and mental alertness of the human body. Despite advances in novel materials that can improve the resolution and sensitivity of sensors, modern wearable devices face various challenges, such as low computing capability, high power consumption, large amounts of data to be transmitted, and low data transmission speed. Conventional wearable sensing solutions mostly transmit the collected data to external servers for off-chip computing and processing. This approach typically creates an information bottleneck that acts as one of the major limiting factors in lowering the power consumption and improving the operating speed of the sensing systems. In addition, the use of conventional remote servers with conventional signal processing techniques for processing these temporal, real-time sensing data is computationally intensive and results in significant power consumption and hardware occupation. Moreover, standard von Neumann architectures feature a physical separation between memory and processing unit, further increasing the power consumed to shuttle data between units. Such solutions always involve a trade-off between battery lifetime and computing capability. Bringing computing to the edge enables faster response times and opens the possibility of personalized, always-on wearable devices able to continuously interact with and learn from the environment. However, a radical change of paradigm that uses innovative algorithms, circuits, and memory devices is needed to maximize system performance whilst keeping power and memory budgets to a minimum.

Conventional computers, using Boolean, bit-precise digital representations and executing operations with time-multiplexed and clocked signals, are not optimized for fuzzy inputs and complex cognitive tasks such as pattern recognition, time series prediction, and decision making. Deep Artificial Neural Networks (ANNs), on the other hand, have demonstrated impressive results in a wide range of pattern recognition tasks, including machine vision, Natural Language Processing (NLP), and speech recognition. Dedicated hardware ANN accelerators, including Graphics Processing Units (GPUs), Tensor Processing Units (TPUs), and custom Application Specific Integrated Circuits (ASICs) with parallel architectures, are being developed to execute these algorithms and obtain high-accuracy inference results. GPUs provide a substrate for the parallel processing nature of ANNs and, thanks to their very wide memory bus, are well suited for running the Vector Matrix Multiplications (VMMs) that are at the core of the processing in deep neural networks. GPUs thus support the kind of parallelism that exists in massive form in the brain for cognitive purposes, but they consume orders of magnitude more power than the brain, since they are clocked and their memory access is not localized. To address this problem, ASIC accelerators reduce the complexity of the structure by making the system more application specific and by using clock gating and a hardware structure that best matches the mapped neural network, reducing power consumption through fewer memory reads and data accesses. To go even further in power savings, two problems must be solved: (i) removing the clock and (ii) performing computation with co-localization of memory and processor. The first problem calls for the development of event-based systems, where processing is performed "asynchronously", i.e. only when there are input "events".
The algorithmic basis for this kind of "asynchronous" processing is the Spiking Neural Network (SNN), in which neurons spike asynchronously, only to communicate information to each other. To avoid data movement between the memory and the processor, the memory element should be used not only to store data but also to perform computation inside the processor. This approach is called "in-memory computing". These two approaches of (i) event-based systems and (ii) in-memory computing, together with (iii) massive parallelism, are the three fundamental principles that have led to the development of neuromorphic computing and to the realization of highly efficient neuromorphic platforms. Therefore, in this article, we will refer to event-based, highly parallel systems that are able to perform real-time sensory processing.

Although current fully Complementary Metal-Oxide-Semiconductor (CMOS) implementations of neuromorphic platforms have shown remarkable performance in terms of power efficiency and classification accuracy, there are still some bottlenecks hindering the design of embedded sensing and processing systems. First, the memory used is typically Static Random Access Memory (SRAM), which has very low static power consumption, but is a large element (6 transistors per cell) and is volatile. The latter feature implies that the information about the network configuration has to be stored elsewhere and transferred to the system at startup. For large networks, it may take tens of minutes before the system is ready for normal operation. Second, always-on adaptive systems need to work with time constants that span the same time scale as the task being learned (e.g. longer than seconds).
Implementing such long time constants in neuromorphic CMOS circuits is impractical, since it requires large-area capacitors.

To overcome the limitations of fully CMOS-based approaches, the intrinsic physical properties of emerging memristive devices can be exploited for both long-term (non-volatile) weight storage and short-term (volatile) task-relevant timescales. In particular, non-volatile devices feature retention times on a long time scale (>10 years) while showing weight reconfigurability at voltages compatible with typical CMOS circuits, whereas volatile devices relax spontaneously on much shorter timescales and are thus able to emulate biological time constants. This non-volatile/volatile property of memristive devices, together with a small footprint and power efficiency, has indeed attracted a lot of interest in the last ten years. However, memristive technology has to be supported by ad hoc, theoretically sound, biologically plausible algorithms that enable continual learning and are capable of exploiting the intrinsic physical properties of memristive devices, such as stochasticity, to achieve accuracy comparable to state-of-the-art ANNs whilst reducing power consumption.

This review discusses the challenges to undertake for designing extreme edge computing wearable devices in four different categories: (i) state-of-the-art wearable sensors and the main restrictions towards low-power, high-performance learning capabilities; (ii) different algorithms for modeling biologically plausible continual learning; (iii) CMOS-based neuromorphic processors and signal processing techniques enabling low-power local edge computing strategies; (iv) emerging memristive devices for more efficient and scalable embedded intelligent systems. As graphically summarized in Fig. 1, we argue that a holistic approach that combines and exploits all the strengths of these four categories in a co-designed system is the key factor enabling future generations of smart sensing systems.

Figure 1. A graphical overview of adaptive edge computing in wearable biomedical devices. The figure shows the pathway from wearable sensors to their application through intelligent learning.

Sensors act as the information collectors of a machine or a system that can respond to its physical ambient environment. They are able to translate a specific type of information from a physical environment, such as the human body, into an electrical signal. For collecting information from the human body, wearable versions of such machines or systems, i.e. wearable devices, are most convenient and helpful. Wearable devices require miniaturized, flexible, and highly sensitive sensors to capture clear information from the body. However, from the processing perspective, and to make a signal meaningful towards personalized devices, further development is still needed.

Since the sensing signal is relatively weak and noisy, a readout circuit (normally composed of an amplifier, a conditioning circuit, and an analogue signal processing unit) is necessary to make the signal readable by a system. The subsequent high-level system processes the data and sends commands to actuators for closed-loop control or interaction. For various applications, ranging from the human-machine interface to health monitoring, different combinations of sensors and systems have been developed over the past decade. The use of machine learning empowers sensors to build novel smart applications; examples will be provided in the next section.

Recently, the field of artificial intelligence has further boosted the possibility of smart wearable sensory systems. Emerging intelligent applications and high-performance systems require more complexity and demand that sensory units accurately describe the physical object, so that the decision-making unit or algorithm can output a more reliable result. Depending on the signal acquisition position, Fig. 1 summarizes the four biopotential sensors and two other widely used wearable sensors along with their learning systems and applications. The sensors for biopotentials will be introduced first, and the other two wearable sensors will be presented separately.

Biopotential signals can be extracted from the human body using a sensor with direct electrode contact. The electrochemical activity of the cells in nervous, muscular, and glandular tissue generates ionic currents in the body.
An electrode-electrolyte transducer is needed to convert the ionic current into an electric current for the front-end circuit. The electrode, normally made of metal, can be oxidized by the electrolyte, generating metal ions and free electrons. In addition, the anions in the electrolyte can also be oxidized to neutral atoms, releasing free electrons. These free electrons result in a current flow through the electrode. Thus, the surface potential generated by the electrochemical activities in cells can be sensed by the electrode. However, the bio-signals sensed by the electrode are weak and noisy. Before the collected signals are digitized by an analog-to-digital converter, an analogue front-end is essential to provide a readable signal. The design requirements of the front-end for biopotential electrodes can be summarized as follows: i) high common-mode rejection ratio; ii) high signal-to-noise ratio; iii) low power consumption; iv) signal filtering; and v) configurable gain.

Electrocardiography (ECG).
ECG is the electrical activity generated by the electrochemistry around cardiac tissue. Containing morphological and statistical features, ECG provides comprehensive information for analyzing and diagnosing cardiovascular diseases. In previous studies, automatic ECG classification has been achieved using machine learning algorithms such as Deep Neural Networks (DNNs), Support Vector Machines (SVMs), and Recurrent Neural Networks (RNNs). According to the Association for the Advancement of Medical Instrumentation, there are five classes of ECG beat of interest: normal, ventricular, supraventricular, fusion of normal and ventricular, and unknown beats. These methodologies can be evaluated on available ECG databases and yield over 90% accuracy and sensitivity for the five classes, which is essential for future cardiovascular health monitoring. In wearable applications, systems have been presented that measure ECG and send it to the cloud for classification and health monitoring.

Electroencephalography (EEG). Our brain neurons communicate with each other through electrical impulses. An EEG electrode can help to detect the potential information associated with this activity by investigating EEG at the surface of the skull. In comparison with other biopotential signals, surface EEG is relatively weak (normally in the microvolt range) and noisy. Therefore, it requires a high-input-impedance readout circuit and intensive signal pre-processing to obtain clean EEG data. While wet electrodes (Ag/AgCl) are more precise and more suitable for clinical purposes, passive dry electrodes are more suitable for daily health monitoring and brain-machine interfaces. Applications also include mental disorder assessment, driving safety, and emotion evaluation. A commercial biopotential data acquisition system, the Biosemi ActiveTwo, provides up to 256 channels for EEG analysis.
For a specific application, the number of electrodes can be reduced to cover only the relevant areas, such as 19 channels for depression diagnosis, four channels for evaluating driver vigilance, and 64 channels for emotional state classification. Although EEG is an on-body biopotential, most existing EEG research employs offline learning and analysis because of the system complexity and the high number of channels. In wearable real-time applications, usually a smaller number of channels is selected and the data are wirelessly sent to the cloud for further processing.

Electrooculography (EOG).
Eye movement, which results in potential variations around the eyes measured as EOG, is a combined effect of environmental and psychological changes. It returns a relatively weak voltage (0.01-0.1 mV) at low frequency (0-10 Hz). Unlike other eye tracking techniques that use a video camera and infrared light, EOG provides a lightweight, inexpensive, and fully wearable solution to access human eye movement. It is the most widely used approach for wearable human-machine interfaces, especially for assisting quadriplegics. It has been used to control a wheelchair, to control a prosthetic limb, and to evaluate sleep. Additionally, recent studies fuse EEG and EOG to increase the degrees of freedom of the signal and enhance system reliability, because they carry similar implicit information, such as sleepiness and mental health indicators. EOG can also act as a supplement to provide additional functions or commands to an EEG system.

Electromyography (EMG).
EMG is an electrodiagnostic method for recording and analyzing the electrical activity generated by skeletal muscles. EMG is generated by skeletal muscle movement, which frequently occurs in the arms and legs. It yields higher amplitude (up to 10 millivolts) and bandwidth (20-1000 Hz) compared to the other biopotentials. Near an active muscle, different oscillation signals can be measured by a dry electrode array, which allows a computer to sense and decode body motion. A prime example is the Myo armband from Thalmic Labs, a commercial multi-sensor device that consists of EMG sensors, a gyroscope, an accelerometer, and a magnetometer. The sensory data are sent to a phone or PC via Bluetooth, where various body movements can be recognized through feature extraction and machine learning. Moreover, the application of EMG is frequently linked to target control, such as a wheelchair or a prosthetic hand for assisting disabled people. Its applications also include sign language recognition, diagnosis of neuromuscular disorders, analysis of walking strides, and virtual reality. Machine learning enables the system to overcome the variation of EMG signals across different users.

Photoplethysmography (PPG).
PPG is a non-invasive and low-cost optical measurement method that is often used for blood pressure and heart rate monitoring in wearable devices. The optical properties of skin and tissue change periodically due to the blood flow driven by the heartbeat. By directing a light emitter towards the skin surface, a photosensor can detect the variations in light absorption, normally at the wrist or finger. This variation signal is called PPG and is highly correlated with the rhythm of the cardiovascular system. Compared with ECG, PPG is easily accessible and low cost, which makes it an ideal intermediary for wearable heart rate measurement. Its main disadvantage with respect to ECG is that PPG is not consistent across different persons and body positions. Thus, further analysis of PPG requires machine learning or other statistical tools to calibrate the signal to different scenarios. For example, it can be used for biometric identification after deep learning. It is worth mentioning that PPG is a strong complement in ECG applications.

Bioimpedance spectroscopy (BIS).
BIS is another low-cost and powerful sensing technique that provides informative body parameters. Its principle is that the cell membrane behaves like a frequency-dependent capacitor and impedance. Emitter electrodes generate a multifrequency excitation signal (0.1-100 MHz) on the skin, while receiver electrodes collect the resulting current to demodulate the impedance spectrum of the tissue in between. Compared to homogeneous materials, body tissue presents more complicated impedance spectra because of cell membranes and macromolecules. Therefore, tissue conditions, such as muscle concentration and structural and chemical composition, can be analysed through BIS. BIS can measure body composition, such as fat and water content. Based on different setups in terms of position and frequency, it can also help in the early detection of diseases such as lymphedema, organ ischemia, and cancer. Furthermore, multiple pairwise electrodes can form electrical impedance tomography, which describes the impedance distribution. By embedding these electrodes in a wristband, the tomography can estimate hand gestures after training, which is another novel solution for an inexpensive human-machine interface.

2.2 Multisensory fusion in wearable devices

Every sensor has its own limitations. In some demanding cases, an individual sensor by itself cannot satisfy system requirements such as accuracy or robustness. The solution is to increase the number and type of sensors to form a multisensory system or sensor network for one measurement purpose. Multiple types of sensors working synergistically in a system provide more input dimensions to fully map an object onto the data stream. Different sensors return different data with respect to sampling rate, number of inputs, and the information behind the data. Machine learning models, such as ANNs and SVMs, can be designed to combine multiple sources of data.
Depending on the application, sensor types, and data structure, several approaches have been proposed for multisensory fusion. Generally, in such a system, machine learning is frequently used and plays a vital role in merging different sources of sensory data thanks to its multidimensional data processing capability. Machine learning algorithms allow sensor fusion to occur at the signal, feature, or decision level. Results have shown that a multisensory system is advantageous in improving system performance. For example, the fusion of ECG and PPG patterns can provide informative physiological parameters for robust medical assessment. Counting the peak intervals between PPG and ECG can estimate arterial blood pressure. Interestingly, a recent study showed that the QRS complex of ECG can be reconstructed from PPG by a novel attention-based neural network after training. This could be beneficial for the accessibility of wearable ECG.

Given the potential of sensory systems with machine learning, the main challenge that arises is the shortage of power and computing efficiency. Novel applications using multiple sensors and requiring high learning ability usually demand more energy from the wearable computing unit. Nevertheless, the power supply in the wearable domain remains a difficulty with existing battery technologies. This weakness limits the further development of smart wearable devices. The existing solution is to wirelessly transfer the raw data to a cloud where the computationally intensive algorithm is implemented. However, this solution is not ideal considering 1) the complexity of using a wireless module, 2) its non-negligible power consumption, 3) the amount of data, 4) the space limitation due to the range of wireless transmission, 5) privacy issues due to the broadcast of signals, and 6) the non-negligible latency of the communication channel.
These drawbacks strongly limit the application of wearable sensors. Implementations of ANNs on von Neumann architectures, which have frequently been used with sensors, are power-hungry. Conversely, it has been reported that signal processing activity in the brain is several orders of magnitude more power-efficient, and one order of magnitude better in processing rate, than digital systems. Compared to conventional approaches based on binary digital systems, brain-inspired neuromorphic hardware has yet to be advanced in the contexts of data storage and removal as well as data transmission between different units. In this perspective, a neuromorphic chip with a built-in intelligent algorithm can act as a front-end processor next to the sensor. Conventional Analog-to-Digital Converters (ADCs) could be replaced by a delta encoder or feature extractor converting the sensor's analog output into spike-based signals for the hardware (see Section 4). In the end, the output becomes the result of recognition or prediction instead of an intensive data stream. In this way, computation occurs at the local edge with low power and a brain-like architecture.

In this section we highlight some recently introduced methods to port the power of modern machine learning to neuromorphic edge devices. In the last couple of years, machine learning has made big steps forward, reaching close-to-human performance on a wide range of tasks. Many of the most successful machine learning methods are based on artificial neural networks (ANNs), which are inspired by the organization of information processing in the brain. However, somewhat contradictorily, mapping modern ANN learning methods to brain-inspired hardware poses considerable challenges to the algorithm and hardware design.
The main reason for this is that the development of machine learning algorithms has been strongly influenced by the availability of powerful mainframe computers that perform learning offline in big server farms, only eventually sending results back to the user. While this development has paved the way for today's success of ANNs, it has also led the field away from following the principles used in biology for efficient learning. In the following Section 3.1 we review recent approaches to combining the strengths of modern machine learning and brain-inspired algorithms that are of particular interest for edge computing applications. In Section 3.2 we focus on the problem of coping with extreme memory constraints by exploiting sparsity. In Section 3.3 we highlight additional open challenges and future work.
Today, the dominant method for training artificial neural networks is the error backpropagation (Backprop) algorithm, which provides an efficient and scalable solution for adapting the network parameters to a set of training data.

Figure 2. Biologically inspired models of learning in spiking neural networks. (a) The e-prop algorithm approximates back-propagation through time using random feedback to propagate error signals to the synapses of a recurrent SNN. (b) Synaptic sampling exploits the variability of learning rules and redundancy in the task solution space to learn sparse and robust network configurations. (c) Overcoming forgetting by selectively slowing down weight changes. After learning a first task A, parameter distributions are absorbed into a prior distribution that confines the motility of synaptic weights in subsequent tasks (task B).

Backprop is an iterative, gradient-based, supervised learning algorithm that operates in three phases. First, a given input activation is propagated through the network to generate the output based on the current set of parameters. Then, the mismatch between the generated outputs and the target values is computed using a loss function and propagated backwards through the network architecture to compute suitable weight changes. Finally, the network parameters are updated to reduce the loss. We will not go into the details behind Backprop here; see the literature for an excellent review and historical survey of the development of the algorithm. The problem of porting Backprop to neuromorphic hardware stems from a well-known shortcoming of the algorithm known as locking: the weights of a network can only be updated after a full forward propagation of the data through the network, followed by loss evaluation, and finally after waiting for the back-propagation of error gradients. Locking prevents an efficient implementation of Backprop on online distributed architectures. Also, Backprop is not well suited for spiking neural networks, which have non-differentiable output functions. These problems have recently been addressed in brain-inspired variants of the Backprop algorithm.
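To make the three phases concrete, the following toy sketch trains a tiny fully connected network on XOR in plain Python. The network size, learning rate, seed, and task are illustrative choices for exposition, not taken from any work cited here.

```python
import math
import random

random.seed(0)

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

# Toy 2-4-1 multilayer perceptron trained on XOR.
# W1[j] = [w_x1, w_x2, bias] for hidden unit j; W2 = 4 hidden weights + bias.
W1 = [[random.uniform(-1.0, 1.0) for _ in range(3)] for _ in range(4)]
W2 = [random.uniform(-1.0, 1.0) for _ in range(5)]
data = [([0, 0], 0), ([0, 1], 1), ([1, 0], 1), ([1, 1], 0)]

def forward(x):
    # Phase 1: propagate the input activation through the network.
    h = [sigmoid(W1[j][0] * x[0] + W1[j][1] * x[1] + W1[j][2]) for j in range(4)]
    y = sigmoid(sum(W2[j] * h[j] for j in range(4)) + W2[4])
    return h, y

def mse():
    return sum((forward(x)[1] - t) ** 2 for x, t in data) / len(data)

def train_step(x, t, lr=0.5):
    h, y = forward(x)
    # Phase 2: compute the output mismatch and propagate it backwards.
    dy = (y - t) * y * (1.0 - y)                               # output delta
    dh = [dy * W2[j] * h[j] * (1.0 - h[j]) for j in range(4)]  # hidden deltas
    # Phase 3: update the parameters to reduce the loss.
    W2[4] -= lr * dy
    for j in range(4):
        W2[j] -= lr * dy * h[j]
        W1[j][2] -= lr * dh[j]
        for i in range(2):
            W1[j][i] -= lr * dh[j] * x[i]

loss_before = mse()
for _ in range(5000):
    for x, t in data:
        train_step(x, t)
loss_after = mse()
print(loss_after < loss_before)
```

Note that Phase 2 reuses the forward weights W2 to route the error backwards; this weight symmetry is exactly what the locking problem and the feedback-alignment variants discussed below are concerned with.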
In recent years, a number of methods have been proposed to approximate the gradient computation performed by Backprop in order to prevent locking. One line of work proposed to replace the non-local error back-propagating term of the Backprop algorithm by sending the loss through a fixed feedback network with random weights that are excluded from training. In this approach, named random feedback alignment, the back-propagated error signal acts as a local feedback to each synapse, similar to a reward signal in reinforcement learning. The fixed random feedback network de-correlates the error signals, providing individual feedback to each synapse. Lillicrap et al. showed that this simple approach already provides a viable approximation to the exact Backprop algorithm and performs well for practical machine learning problems of moderate size. An event-based version of random feedback alignment, well suited for neuromorphic hardware, was later introduced, and this approach was further generalized to include a larger class of algorithms that use error feedback signals.

An efficient model for learning complex sequences in spiking neural networks, named Superspike, was also introduced. The model likewise uses a learning rule that is modulated by error feedback signals and locally minimizes the mismatch between the network output and a target spike train. To overcome the problem of the non-differentiable output, Superspike uses a surrogate gradient approach that replaces the infinitely steep spike events with a finite auxiliary function at the time points of network spike events. As in random feedback alignment, learning signals are communicated to the synapses via a feedback network with fixed weights. Using this approach, Zenke and colleagues demonstrated efficient learning of complex sequences in spiking networks.

Another approach to approximating Backprop in spiking neural networks exploits an anatomical detail of cortical neurons: a biologically inspired two-compartment neuron model was introduced that approximates the error backpropagation algorithm by minimizing a local dendritic prediction error. Related work ports learning by Backprop to neuromorphic hardware by incorporating dynamics with finite time constants and by optimizing the backward pass with respect to substrate variability, demonstrating the algorithm on the BrainScaleS analog neuromorphic architecture.
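The core modification that random feedback alignment makes to Backprop can be sketched in a few lines: the backward pass reads the error through a fixed random feedback vector B instead of the transposed forward weights. The toy network, task, and hyperparameters below are illustrative assumptions, not the exact setup of any cited work.

```python
import math
import random

random.seed(1)

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

# Toy 2-8-1 network; B is a fixed random feedback vector that replaces the
# transposed output weights in the backward pass (feedback alignment).
n_h = 8
W1 = [[random.uniform(-1.0, 1.0) for _ in range(3)] for _ in range(n_h)]
W2 = [random.uniform(-1.0, 1.0) for _ in range(n_h + 1)]
B = [random.uniform(-1.0, 1.0) for _ in range(n_h)]  # fixed, never trained
data = [([0, 0], 0), ([0, 1], 1), ([1, 0], 1), ([1, 1], 0)]

def forward(x):
    h = [sigmoid(W1[j][0] * x[0] + W1[j][1] * x[1] + W1[j][2]) for j in range(n_h)]
    y = sigmoid(sum(W2[j] * h[j] for j in range(n_h)) + W2[n_h])
    return h, y

def mse():
    return sum((forward(x)[1] - t) ** 2 for x, t in data) / len(data)

def train_step(x, t, lr=0.5):
    h, y = forward(x)
    dy = (y - t) * y * (1.0 - y)
    # The error reaches each hidden synapse through the FIXED random
    # feedback B, not through the symmetric transpose of W2 as in Backprop.
    dh = [dy * B[j] * h[j] * (1.0 - h[j]) for j in range(n_h)]
    W2[n_h] -= lr * dy
    for j in range(n_h):
        W2[j] -= lr * dy * h[j]
        W1[j][2] -= lr * dh[j]
        for i in range(2):
            W1[j][i] -= lr * dh[j] * x[i]

loss_before = mse()
for _ in range(5000):
    for x, t in data:
        train_step(x, t)
loss_after = mse()
print(loss_after < loss_before)
```

Because each synapse only needs its local activity and one scalar feedback signal, this update pattern maps far more naturally onto distributed, event-based hardware than exact Backprop does.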
Recurrent neural network (RNN) architectures often show superior learning results for tasks that involve a temporal dimension, which is often the case for edge computing applications. Porting learning algorithms for RNNs is therefore of utmost importance for efficient machine learning on the edge. Backpropagation through time (BPTT), the standard RNN learning method used in most GPU implementations, unfolds the network in time and keeps this extended structure in memory to propagate information forward and backward, which poses a severe challenge to the power and area constraints of edge computing. Recent theoretical results show that the power of BPTT can be brought to biologically inspired spiking neural networks (SNNs) while avoiding the unfolding through an approximation that operates only forward in time, enabling online, always-on learning. This algorithm operates at every synapse in parallel and incrementally updates the synaptic weights. As for random feedback alignment and Superspike discussed above, the weight update depends only on three factors: the first two are determined by the states of the two connected input/output neurons, and the third is given by a synapse-specific feedback signal conveying the mismatch between the target and the actual output (see Fig. 2a for an illustration). The temporal gap between these factors is bridged by an eligibility trace describing a transient synaptic dynamic. Eligibility traces have been theoretically predicted for a long time and have also recently been observed experimentally in the brain.

3.2 Efficient learning under stringent memory constraints

The amount of available resources in neuromorphic systems is kept low to increase energy efficiency. Memory elements are especially impactful on the energy budget. Therefore, algorithms are needed that make efficient use of the available memory resources. The largest amount of memory in a network is usually consumed by the synaptic weights.
Since, in practice, the weights of many connections in a network converge to values close to zero, several methods have been proposed to reduce the memory footprint of machine learning algorithms by exploiting sparsity in the network connectivity. Two types of algorithms are discussed in the following sections: (1) those that prune connections after learning and (2) those that learn online with sparse networks.
Many approaches to exploiting sparsity in learning algorithms focus on pruning the network after training. Simple methods rely on pruning by magnitude, eliminating the weakest (closest to zero) weights in the network. Some methods based on this idea have reported impressive sparsity rates of over 95% on standard machine learning benchmarks with negligible performance loss. Other methods build on theoretical motivations and classical sparsification and regularization techniques and reach high compression rates. A further method iteratively grows and prunes a network to generate a compact yet precise solution; its authors provide a detailed comparison with state-of-the-art dense networks and other pruning methods, reaching sparsity above 99% on the LeNet-5 benchmark.
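Magnitude pruning itself is straightforward; a minimal sketch (the 95% target mirrors the rates reported above, everything else is illustrative):

```python
import numpy as np

def prune_by_magnitude(w, sparsity=0.95):
    """Zero out the weights closest to zero until the target sparsity
    (fraction of zero-valued weights) is reached."""
    k = int(sparsity * w.size)
    if k == 0:
        return w.copy()
    # threshold = magnitude of the k-th smallest weight
    thresh = np.sort(np.abs(w), axis=None)[k - 1]
    return np.where(np.abs(w) <= thresh, 0.0, w)

rng = np.random.default_rng(1)
w = rng.normal(size=(100, 100))
w_sparse = prune_by_magnitude(w, sparsity=0.95)
print((w_sparse == 0).mean())   # fraction of pruned weights, 0.95
```

In practice a short fine-tuning phase usually follows pruning to recover the small accuracy loss; on sparse-matrix-aware hardware only the surviving 5% of weights need to be stored.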
A number of authors have also introduced methods that work directly with sparse networks during training, which is often the more interesting case for neuromorphic applications with online training. One algorithm performs online stochastic rewiring in deep neural networks with a fixed number of synaptic connections throughout learning and showed close to state-of-the-art performance at up to 98% sparsity. Sparse evolutionary training (SET) introduced a heuristic approach that prunes the smallest weights and regrows new weights at random locations. Dynamic sparse reparameterization introduced a prune-redistribute-regrowth cycle and demonstrated compelling performance levels even for very deep neural network architectures. A single-shot pruning algorithm yields sparse networks based on a saliency criterion prior to the actual training, and a refined method for online pruning and redistribution surpasses the previous methods in terms of sparsity and learning performance.
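A toy sketch of one SET-style prune-and-regrow step, under the assumption of a fixed connection budget (the pruned fraction and all sizes are illustrative):

```python
import numpy as np

rng = np.random.default_rng(2)

def set_rewire(w, mask, frac=0.3):
    """One SET-style rewiring step: drop the weakest active connections,
    then regrow the same number at random empty locations, keeping the
    total number of connections fixed."""
    active = np.flatnonzero(mask)
    n_drop = int(frac * active.size)
    # prune: smallest-magnitude active weights
    drop = active[np.argsort(np.abs(w.flat[active]))[:n_drop]]
    mask.flat[drop] = False
    w.flat[drop] = 0.0
    # regrow: random empty positions, initialized with small fresh weights
    empty = np.flatnonzero(~mask)
    grow = rng.choice(empty, size=n_drop, replace=False)
    mask.flat[grow] = True
    w.flat[grow] = rng.normal(0.0, 0.01, size=n_drop)
    return w, mask

w = np.zeros((20, 20))
mask = np.zeros((20, 20), dtype=bool)
init = rng.choice(400, size=40, replace=False)   # 10% initial density
mask.flat[init] = True
w.flat[init] = rng.normal(size=40)
w, mask = set_rewire(w, mask)
print(mask.sum())   # still 40 connections after rewiring
```

The fixed budget is what makes such schemes attractive for neuromorphic hardware: memory for synaptic state can be allocated once, at design time.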
As outlined above, edge computing poses quite specific challenges to learning algorithms that are substantially different from the requirements of classical applications. Some of the algorithms outlined above have already been successfully ported to neuromorphic hardware. For example, the e-prop algorithm has been implemented on the SpiNNaker 2 chip, yielding an additional energy reduction of two orders of magnitude compared to an x86 implementation. See Section 4 for more details on available neuromorphic hardware and its applications.
In the remainder of this section we highlight open challenges that remain to be solved for efficient learning in edge computing applications. In addition to the stringent memory and power constraints, learning at the edge also has to function in an online scenario where data arrive in a continuous stream. Some dedicated hardware resources, e.g. the memristive devices discussed in Section 5, may also show high levels of intrinsic variability, so the learning algorithm should be robust against these noise sources. In this section we discuss recent advances in this line of research and provide food for thought on how these specific challenges can be approached in future work.
Here we review recent advances in using inspiration from biology to make learning algorithms robust against device variability. Several authors have suggested that device noise and variability should not be seen as a nuisance, but can rather serve as a computational resource for network simulation and learning algorithms. It has been shown that variability in neuronal outputs can be exploited to learn complex statistical dependencies between sensory stimuli. The stochastic behavior of the neurons is used in this model to compute probabilistic inference, while biologically motivated learning rules that only require local information at the synapses are used to update the synaptic weights. A theoretical foundation of the model shows that the spiking network performs a Markov chain Monte Carlo sampling process that allows the network to 'reason' about statistical problems.
This idea was taken one step further by showing that the variability of synaptic transmission can also be used for stochastic computing. The intrinsic noise of synaptic release is used to drive a sampling process. It was shown that this model can be implemented in an event-based fashion; benchmarked on the MNIST digit classification task, it achieved high accuracy. It has further been shown that the variability of learning rules and weight parameters gives rise to a biologically plausible model of online learning. The intrinsic noise of synaptic weight changes drives a sampling process that can be used to exploit redundancies in the task solution space (see Fig. 2b for an illustration). This model was applied to unsupervised learning in spiking neural networks and to closed-loop reinforcement learning problems, and was also ported to the SpiNNaker 2 neuromorphic many-core system.
Neuromorphic systems often operate in an environment where they are permanently on and learn from a continuous stream of data. This mode of operation is quite different from most other machine learning applications, which work with hand-labeled batches of training data. Always-on learning on a system with limited resources inevitably leads to situations where the system reaches the limits of its memory capacity and thus starts forgetting previously learned sensory experiences. Inspiration for overcoming the forgetting of relevant information comes from biology. The mammalian brain seems to combat forgetting by actively protecting previously acquired knowledge in neocortical circuits. When a new skill is acquired, a subset of synapses is strengthened, stabilized, and persists despite the subsequent learning of other tasks.
A theoretical treatment of the forgetting problem was conducted in the cascade model of Stefano Fusi and others. They showed that learning an increasing number of patterns in a single neural network unavoidably leads to a state they called catastrophic forgetting: training further patterns into the network interferes with all previously learned ones, effectively wiping out the information stored in the network. The cascade model proposed to overcome this problem uses multiple parameters per synapse that are linked through a cascade of local interactions.
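The core of such synaptic-sampling models is gradient ascent on a log posterior plus per-synapse noise; in the stationary state the weights sample from the posterior rather than converging to a single point. A toy sketch (the Gaussian 'posterior' and all constants are illustrative, not the cited model):

```python
import numpy as np

rng = np.random.default_rng(3)

def sampling_step(w, grad_log_posterior, eta=1e-3, temperature=1.0):
    """Stochastic weight update: a gradient step on the log posterior plus
    Gaussian noise (Langevin-style). The noise keeps the weights moving
    within the set of good solutions instead of freezing at one point."""
    noise = rng.normal(size=w.shape)
    return (w + eta * grad_log_posterior(w)
              + np.sqrt(2.0 * eta * temperature) * noise)

# toy posterior: Gaussian centred at 1.0, so grad log p(w) = -(w - 1.0)
grad = lambda w: -(w - 1.0)
w = np.zeros(1000)                  # 1000 independent weights
for _ in range(5000):
    w = sampling_step(w, grad)
print(w.mean())                     # fluctuates around the posterior mean 1.0
```

Note that the weights never settle: the spread of the stationary distribution is itself informative, encoding how tightly each parameter is constrained by the task.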
This cascade of parameters selectively slows down weight changes, stabilizing synapses when required, and effectively combats the effects of forgetting. A related model that uses multiple parameters per synapse to combat forgetting takes a Bayesian approach (a variation of the model was also introduced recently): it infers a prior distribution over parameter values at each synapse. Synapses that stabilize during learning (converge to a fixed solution) are considered relevant in subsequent learning, and Bayesian priors help to maintain their values (see Fig. 2c for an illustration).
Distributed computing architectures at the edge need to make decisions by integrating information from different sensors and sensor modalities, and they should make the best use of the sensory information across a wide range of tasks. It is clearly not very efficient to learn from scratch when confronted with a new task. Therefore, to boost the performance of edge computing, we consider two aspects of transferring information to new situations: transfer of knowledge between sensors (sensor fusion), which has been treated in Section 2.2, and transfer of knowledge between multiple different tasks (transfer learning).
Transfer learning denotes the improvement of learning in a new task through the use of knowledge from a related task that has already been learned previously. This contrasts with most of today's machine learning applications, which focus on one very specific task. In transfer learning, when a new task is learned, knowledge from previous skills can be reused without interfering with them. For example, the ability to perform a tennis swing can be transferred to playing ping pong, while maintaining the ability to do both sports. The literature on transfer learning is extensive, and many different strategies have been developed depending on the relationship between the different task domains; several systematic reviews are available.
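The Bayesian consolidation idea discussed above can be caricatured with a per-synapse quadratic pull toward previously learned values, weighted by an importance estimate. This is a hypothetical toy model of the mechanism, not the cited algorithm; all names and constants are illustrative:

```python
import numpy as np

def consolidated_update(w, grad_new_task, w_anchor, importance,
                        eta=0.1, c=1.0):
    """Gradient step on a new task with a per-synapse quadratic prior:
    synapses judged important for old tasks are pulled back towards
    their consolidated values; unimportant ones are free to move."""
    prior_pull = c * importance * (w - w_anchor)
    return w - eta * (grad_new_task + prior_pull)

w_anchor = np.array([1.0, -0.5])      # values learned on the old task
importance = np.array([10.0, 0.0])    # synapse 0 matters, synapse 1 does not
w = w_anchor.copy()
g = np.array([1.0, 1.0])              # new task pushes both weights down
for _ in range(100):
    w = consolidated_update(w, g, w_anchor, importance)
# the important weight barely moves; the unimportant one drifts freely
```

The important synapse settles where the new-task gradient and the prior pull balance, so old knowledge is protected exactly in proportion to how much it mattered.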
In machine learning, a number of approaches have been applied to a wide range of problems, including the classification of images, text, or human activity.
A very general approach to learning across multiple domains is followed in the learning-to-learn framework. These models feature networks that are able to modify their own weights through the network activity and can therefore tinker with their own processing properties. This approach has been taken to its most extreme form in networks that learn to implement an optimization algorithm by themselves. Such a model consists of an outer-loop learning network (the optimizer) that controls the parameters of an inner-loop network (the optimizee). The training algorithm of the inner-loop network works on single tasks that are presented sequentially, whereas the outer-loop learner operates across tasks and can acquire strategies to transfer knowledge. This learning-to-learn framework was recently applied to SNNs to obtain properties of LSTM networks and use them to solve complex sequence learning tasks, and it has also been applied to a neuromorphic hardware platform.
Neuromorphic engineering is a branch of electrical engineering dedicated to the design of analog/digital data processors that aim to emulate biological neurons and synapses. Such processors typically consume less energy than conventional computing systems and offer additional properties, such as massively parallel event-based computation, distributed local memory, and adaptation. The increasing interest in neuromorphic engineering shows that hardware SNNs are considered a key future technology with high potential in key applications such as edge computing and wearable devices. Neuromorphic technologies have sparked interest from universities and from companies such as IBM and Intel. In this section, we provide an overview of the neuromorphic platforms that, to the best of our knowledge, have been deployed for biomedical signal processing, showing promising results to be exploited in wearable devices.
TrueNorth.
TrueNorth is IBM's fully digital neuromorphic chip with one million neurons arranged in a tiled array of 4096 neurosynaptic cores, enabling massively parallel processing. Each core contains 13 kB of local SRAM memory to keep the neuron and synapse states along with the axonal delays and the fan-out destination information. There are 256 Leaky Integrate-and-Fire (LIF) neurons per core implemented by time-multiplexing, and 256 million synapses implemented in the form of SRAM memory. Each core supports up to 256 fan-in and fan-out connections, and this connectivity can be configured such that a neuron in any core can communicate its spikes to any other neuron in any other core. Thanks to its event-driven operation, the co-location of memory and processing units in each core, and the use of low-leakage silicon CMOS technology, TrueNorth can perform 46 billion synaptic operations per second (SOPS) per watt in real-time operation, with 26 pJ per synaptic event. Its power density of 20 mW/cm² is about three orders of magnitude smaller than that of typical CPUs.
SpiNNaker.
The SpiNNaker machine, designed by the University of Manchester, is a custom ASIC with a massively parallel architecture designed to efficiently simulate large spiking neural networks. It consists of ARM968 processing cores arranged in a 2D array, into which the precise details of the neurons and their dynamics can be programmed. Although the processing cores are synchronous microprocessors, the event-based aspect of SpiNNaker is apparent in its message-handling paradigm: a message (event) delivered to a core generates a request to be processed. The communication infrastructure between the nodes is specially optimized to carry very large numbers of very small packets, which is optimal for spiking neurons.
A second generation of SpiNNaker was designed by the Technical University of Dresden. SpiNNaker2 continues the line of dedicated digital neuromorphic chips for brain simulation, increasing the simulation capacity by a factor >10 while staying within the same power budget (i.e. 10x better power efficiency). The full-scale SpiNNaker2 consists of 10 million ARM cores distributed across 70,000 chips in 10 server racks. The system takes advantage of an advanced 22 nm FDSOI technology node with adaptive body biasing, enabling reliable and ultra-low-power processing, and it incorporates numerical accelerators for the most common operations.
Loihi.
Loihi is Intel's many-core neuromorphic chip with on-line learning, designed in 14 nm FinFET technology. The chip supports about 130,000 neurons and 130 million synapses distributed over 128 cores. Spikes are transported between the cores as packetized messages over an asynchronous network-on-chip. The chip includes three embedded x86 processors and provides a very flexible learning engine on which diverse online learning algorithms, such as Spike-Timing Dependent Plasticity (STDP) and various three-factor and trace-based learning rules, can be implemented. The chip also provides hierarchical connectivity, dendritic compartments, and synaptic delays as features that can enrich a spiking neural network. The synaptic weights are stored in local SRAM memory, and the bit precision can vary between 1 and 9 bits. All logic in the chip is digital, functionally deterministic, and implemented in an asynchronous bundled-data design style.
DYNAP-SE.
DYNAP-SE implements a multi-core neuromorphic processor with a scalable architecture, fabricated in a standard 0.18 µm CMOS technology. It is a full-custom asynchronous mixed-signal processor with a fully asynchronous inter-core and inter-chip hierarchical routing architecture. Each core comprises 256 adaptive exponential integrate-and-fire (AEI&F) neurons, for a total of 1k neurons per chip. Each neuron has a Content Addressable Memory (CAM) block containing 64 addresses representing the pre-synaptic neurons it is subscribed to. Rich synaptic dynamics are implemented on the chip using Differential Pair Integrator (DPI) circuits. These circuits produce Excitatory and Inhibitory Post-Synaptic Currents (EPSCs and IPSCs) with time constants that can range from a few µs to hundreds of ms. The analog circuits operate in the sub-threshold domain, minimizing the dynamic power consumption and enabling implementations of neural and synaptic behaviors with biologically plausible temporal dynamics. The asynchronous CAMs on the synapses store the tags of the source neuron addresses connected to them, while SRAM cells are used to program the address of the destination core/chip that the neuron targets.
ODIN/MorphIC.
The ODIN (Online-learning DIgital spiking Neuromorphic) processor occupies an area of only 0.086 mm² in 28 nm FDSOI CMOS. It consists of a single neurosynaptic core with 256 neurons and 64k synapses. Each neuron can be configured to phenomenologically reproduce the 20 Izhikevich behaviors of spiking neurons. The synapses embed a 3-bit weight and a mapping-table bit that allows enabling or disabling Spike-Dependent Synaptic Plasticity (SDSP) locally, thus allowing for the exploration of both off-chip training and on-chip online learning setups.
MorphIC is a quad-core digital neuromorphic processor with 2k LIF neurons and more than 2M synapses in 65 nm CMOS. MorphIC was designed for high-density large-scale integration of multi-chip setups. The four 512-neuron crossbar cores are connected with a hierarchical routing infrastructure that enables neuron fan-in and fan-out values of 1k and 2k, respectively. The synapses are binary and can either be programmed with offline-trained weights or trained online with a stochastic version of SDSP.

Table 1. Summary of neuromorphic platforms and biomedical applications

Neuromorphic chip:  DYNAP-SE | SpiNNaker | Loihi | TrueNorth | ODIN
CMOS technology:    0.18 µm | — | 14 nm FinFET | — | 28 nm FDSOI
Implementation:     Mixed-signal | Digital | Digital ASIC | Digital ASIC | Digital ASIC
Energy per SOP:     17 pJ @ 1.8 V | peak power 1 W per chip | 23.6 pJ @ 0.75 V | 26 pJ @ 0.775 V | 12.7 pJ
Size:               — | — | — | — | 0.086 mm²
On-chip learning:   No | Yes (configurable) | Yes (configurable) | No | Yes (SDSP)
Applications:       EMG, ECG, HFO | EMG and EEG | EMG | EEG and Local Field Potential (LFP) | EMG
Table 1 summarizes the neuromorphic processors described above and the biomedical signal processing applications in which they were used. These works show promising results for always-on embedded biomedical systems. The first chip in the table, DYNAP-SE, was used to implement SNNs for the classification or detection of EMG and ECG signals, and to implement a simple spiking perceptron as part of a design to detect High Frequency Oscillations (HFO) in human intracranial EEG. In particular, a spiking RNN was deployed for ECG/EMG signal separation to facilitate classification with a linear read-out. An SVM and a linear least-squares approximation were used in the read-out layer, and overall accuracies of 91% and 95% for anomaly detection were reached, respectively. The state property of the spiking RNN on EMG was also investigated for different hand gestures. Furthermore, the performance of a feedforward SNN with a hardware-friendly spiking learning algorithm for hand-gesture recognition using superficial EMG was investigated and compared to traditional machine learning approaches such as SVM. The results show that applying an SVM to the spiking output of the hidden layer achieved a classification rate of 84%, while the spiking learning method achieved 74% with sub-milliwatt power consumption. The consumption was compared to a state-of-the-art embedded system, showing that the proposed spiking network is two orders of magnitude more power efficient.
Recently, the benchmark hand-gesture classification was processed and compared on two other digital neuromorphic platforms, Loihi and ODIN/MorphIC. A spiking Convolutional Neural Network (CNN) was implemented on Loihi and a spiking Multilayer Perceptron (MLP) on ODIN/MorphIC. Because of the properties of the neuromorphic chips, a late fusion was implemented on Loihi, combining the output of the spiking CNN for vision with that of a spiking MLP for EMG signals, while on the ODIN/MorphIC hardware the two spiking MLPs were fused in the last layer.
The comparison with an embedded GPU was performed in terms of accuracy, power consumption, and latency, showing that the neuromorphic chips achieve the same accuracy with a significantly smaller energy-delay product: 30x and 600x more efficient for Loihi and ODIN/MorphIC, respectively.
In SNNs, a single spike by itself does not carry any information; what matters is the number and the timing of the spikes produced by a neuron. Just like their biological counterparts, silicon neurons in neuromorphic devices produce spike trains at a rate that is proportional to their input current. At the input side, synapse circuits integrate the spikes they receive to produce analog currents, with temporal dynamics and time constants that can be made equivalent to their biological counterparts. The sum of all the positive (excitatory) and negative (inhibitory) synaptic currents afferent to the neuron is then injected into the neuron. To provide biomedical signals to the synapses of the SNN input layer, it is necessary to first convert them into spikes. A common way to do this is to use a delta-modulator circuit functionally equivalent to the one used in the Dynamic Vision Sensor (DVS). This circuit is, in practice, an ADC that produces two asynchronous digital pulse outputs (UP or DOWN) for every biosignal channel in the input. An UP (DOWN) spike is generated every time the difference between the current and the previous value exceeds a pre-defined threshold, with the sign of the difference determining whether the spike is produced on the UP or the DOWN channel. This approach was used to convert EMG signals in mixed-signal and digital neuromorphic chips, as well as ECG signals, and EEG and HFO signals.
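A software model of this UP/DOWN delta modulation is compact; the sketch below is illustrative (the threshold value and the synthetic sine stand-in for a biosignal are arbitrary choices):

```python
import numpy as np

def delta_modulate(signal, threshold=0.05):
    """Convert a sampled biosignal into UP/DOWN spike events, as in a
    DVS-style delta modulator: emit a spike whenever the signal moves
    more than `threshold` away from the last reconstructed value."""
    up, down = [], []
    last = signal[0]
    for t, x in enumerate(signal):
        while x - last > threshold:       # signal rose past threshold
            up.append(t)
            last += threshold
        while last - x > threshold:       # signal fell past threshold
            down.append(t)
            last -= threshold
    return up, down

t = np.linspace(0.0, 1.0, 200)
biosignal = 0.3 * np.sin(2 * np.pi * 3 * t)   # toy stand-in for EMG/ECG
up, down = delta_modulate(biosignal)
print(len(up), len(down))
```

Note the output rate tracks the signal's rate of change, not the sampling rate: slowly varying segments generate almost no events, which is exactly where the power savings of event-driven front-ends come from.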
Local adaptation is an important aspect of extreme edge computing, especially when it comes to wearable devices. Current methods for training networks on biomedical signals rely on large datasets collected from different patients. However, when it comes to biological data, there is no "one size fits all": each patient has their own unique biological signature. The field of Personalized Medicine (PM) has therefore gained a lot of attention in the past few years, and the online on-edge adaptation feature of neuromorphic chips can be a game changer for PM.
As discussed in Section 3.1, there is much effort in designing spike-based online learning algorithms that can be implemented on neuromorphic chips. Examples of today's state of the art in on-chip learning are Intel's Loihi, the DynapSEL and ROLLS chips from UZH/ETHZ, BrainScaleS from Heidelberg, and ODIN from UCLouvain. Intel's Loihi includes a learning engine that can implement different learning rules, such as simple pairwise STDP, triplet STDP, reinforcement learning with synaptic tag assignments, or any three-factor learning rule. DynapSEL, ROLLS, and ODIN implement SDSP, also known as the Fusi learning rule, a form of semi-supervised learning that can support both unsupervised clustering applications and supervised learning with labels for shallow networks. The BrainScaleS chip implements the STDP rule. Moreover, SpiNNaker 1 and 2 can implement a wide variety of on-chip learning algorithms, since their designs make use of ARM microcontrollers, providing a lot of configurability for the users. Generally, implementing on-chip online learning is challenging for two core reasons: locality of the weight update and weight storage.
Locality
The learning information for updating the weights of any on-chip network should be locally available to the synapse, since otherwise this information has to be "routed" to the synapse by wires, which take a significant amount of chip area. The simplest form of learning that satisfies this requirement is Hebbian learning, which has been implemented on a variety of neuromorphic chips in forms of unsupervised/semi-supervised learning. However, Hebbian-based algorithms are limited in the tasks they can learn and, to the best of our knowledge, no large-scale task has been demonstrated using this rule. Since gradient descent-based algorithms such as Backprop have had great success in deep learning, more and more spike-based error Backprop rules are being developed, as discussed in Section 3.1. These types of learning algorithms have recently been custom designed in the form of a spike-based delta rule, the backbone of the Backprop algorithm. For example, a single-layer implementation of the delta rule has been designed and employed for EMG classification. Expanding this to multi-layer networks involves non-local weight updates, which limits its on-chip implementation. Making the Backprop algorithm local is a topic of ongoing research, which we have discussed in Section 3.1. Recently, a multi-layer perceptron error-triggered learning architecture has been proposed to overcome the non-locality of multi-layer networks, solving the spatial credit assignment problem on chip.
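The locality of the single-layer delta rule is easy to see in code: each weight update factorizes into a presynaptic activity and a postsynaptic error, both available at the synapse's two endpoints. A rate-based toy sketch (network sizes and the learning rate are illustrative):

```python
import numpy as np

def delta_rule_step(w, x, target, eta=0.1):
    """Single-layer delta rule: the update for w[i, j] needs only the
    presynaptic activity x[j] and the postsynaptic error (target - y)[i],
    so it is fully local and suitable for on-chip implementation."""
    y = w @ x                       # linear read-out (rate coding)
    err = target - y
    w = w + eta * np.outer(err, x)  # local outer-product update
    return w, err

rng = np.random.default_rng(4)
w = np.zeros((2, 8))
x = rng.random(8)                   # fixed toy input pattern
target = np.array([1.0, -1.0])
for _ in range(200):
    w, err = delta_rule_step(w, x, target)
print(np.abs(err).max())            # error shrinks towards zero
```

For a hidden layer, the error term would depend on downstream weights, which is precisely the non-locality mentioned above.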
Weight storage
The ideal weight storage for online on-chip learning should have the following properties: (i) non-volatility, to keep the state of the learnt weights even when the power shuts down, reducing the time and energy footprint of reloading the weights onto the chip; (ii) linear update, which allows the state of the memory to change linearly with the calculated update; (iii) analog states, which allow full precision for the weights. Non-volatile memristive devices have been proposed as promising candidates for weight storage, and there is a large body of work combining CMOS technology with memristive devices to get the best of both worlds. In the next section we provide a thorough review of the state of the art in emerging memory devices and of the efforts to integrate and use them in conjunction with neuromorphic chips.
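Why the linear-update requirement (ii) matters can be seen from a common soft-bound model of memristive conductance change, in which identical programming pulses produce progressively smaller weight increments as the device approaches its bound (the model and its constants are illustrative, not measured device data):

```python
import numpy as np  # imported for consistency with the other sketches

def pulse_update(g, n_pulses, g_max=1.0, alpha=0.05):
    """Soft-bound conductance model: each potentiating pulse moves the
    device a fraction alpha of the remaining distance to g_max, so equal
    pulses give unequal (nonlinear) weight changes - one of the main
    obstacles to accurate on-chip learning with memristive synapses."""
    for _ in range(n_pulses):
        g = g + alpha * (g_max - g)   # step size shrinks as g -> g_max
    return g

# conductance reached after 1, 2, 4, 8, 16 identical pulses from g = 0
steps = [pulse_update(0.0, n) for n in (1, 2, 4, 8, 16)]
# early pulses change the weight far more than later ones
```

A learning rule that assumes a fixed weight increment per pulse will therefore systematically over- or under-shoot, which is why linearity (or compensation schemes for its absence) is listed among the ideal storage properties.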
The severe power and area constraints under which a neuromorphic processor for edge computing must work have opened ways towards the investigation of beyond-CMOS solutions. Although still at the dawn of their technological development, memristive devices have been drawing attention in the last decade thanks to their scalability, low-power operation, compatibility with CMOS chip power supplies and CMOS fabrication processes, and volatile/non-volatile properties. In Section 5.1, we introduce memristive devices and the properties that make them appealing for adaptive extreme edge computing paradigms. In Section 5.2, we explore the role of memristive devices in neuromemristive systems and give examples of possible applications. In Section 5.3, we discuss the current challenges and future perspectives of memristive technology.

Figure 3. Memristive devices for neuromorphic computing. (a) Interface-type RRAM device; (b) filamentary RRAM device; (c) phase change memory device; (d) MRAM device with in-plane spin polarization; (e) MRAM device with perpendicular spin polarization; (f) FTJ device.
Memristive devices, as the name suggests, are devices that can change and memorize their resistance state. They are usually two-terminal devices but can be implemented with various physical mechanisms, resulting in versatile forms, e.g. resistive random access memory (RRAM, Fig. 3a and 3b), phase change memory (PCM, Fig. 3c), magnetic random access memory (MRAM, Fig. 3d and 3e), and ferroelectric tunneling junctions (FTJ, Fig. 3f). The resistance memory of these devices can mimic the memory effect of the basic components of the biological neural system, while the resistance change can mimic the plasticity of biological synapses. Thanks to the simplicity of their two-terminal configuration and their scalability to the nanoscale, they are inherently suitable for the hardware implementation of brain-inspired computation materializing an artificial neural network, i.e. neuromorphic computation. This notion has in recent years incited wide investigations of the various memristive devices and of their applications in neural network learning and recognition, in short, memristive learning. Memristive learning can enable energy-efficient and low-latency information processing within a reduced system size, abandoning the conventional von Neumann architecture. Among other benefits, this will also make it possible to process information where it is acquired, i.e. within sensors, and to reduce the bandwidth needed for transferring sensor data to data centers, accelerating the coming of the era of the Internet of Things (IoT). Table 2 summarizes the key features of the main memristive device technologies for neuromorphic/wearable applications in terms of cell area, electrical characteristics, main advantages, and challenges. It is worth noticing that some figures of merit in this context are radically different from standard memory requirements.
Indeed, while in the memory scenario higher read currents enable faster reading, in neuromorphic applications currents as low as possible are preferred, since the current is a limiting factor for the neurons' fan-out. Similarly, SET and RESET times should be as fast as possible in memory applications, while in our applications this requirement can be relaxed thanks to the lower operating frequency of the neurons (20 Hz to 100 Hz). Moreover, the number of achievable conductance levels has to be increased. Some non-idealities that are usually detrimental for memory applications, for instance the stochasticity of switching parameters, are even beneficial for neural networks.
In addition to the commonly referred-to non-volatile type of memristive switching, RRAM devices can also show volatile behavior, which usually occurs when active materials such as silver or copper are used as electrodes. The relatively long retention time of this volatile behavior (tens of milliseconds to seconds) is similar to the timescale of short-term memory and was naturally proposed to mimic the short-term memory effect of biological synapses.
Although most research on memristive devices is carried out on rigid silicon substrates, the simple structure of memristive devices can also be realized on flexible substrates, which opens new interesting possibilities for realizing local computation within wearable devices.

Table 2. Key features of non-volatile memristive devices.

                               RRAM | PCM | MRAM | FTJ
Cell area (min. feature size): — | — | — | 4 F²
Retention:                     >10 years | >10 years | >10 years | >10 years
Endurance:                     — | — | — | —
SET / RESET time:              100 ps | >100 ns / 10 ns | 20-30 ns | 85 ps / 3 ns
Read current:                  100 pA | 25 µA | 20 µA | 0.8 nA (device diameter 300 nm)
Write energy per bit:          20 fJ | ~100 fJ | 90 fJ | <10 fJ
Main features:                 Scalability, speed, low energy | Scalability, multilevel, low voltage | Endurance, low power | Endurance, low power, speed
Challenges:                    Variability | RESET current, temperature stability, resistance drift | Density, scalability, variability | Scalability

As mentioned in Section 5.1, the primary function of memristive devices is their usage as synaptic devices to implement the memory and plasticity of biological synapses. However, there is increasing interest in utilizing these devices to implement nanoscale artificial neurons.
On the neuron side, the gradual internal state change of a memristive device and its consequent abrupt switching closely mimic the integrate-and-fire behavior of biological neurons (Fig. 4a-c). Due to their simple structure and nanometer-level scalability, memristive neurons can be much more compact than current CMOS neurons, which may consist of a current sensor, an analog-to-digital converter (ADC), a digital-to-analog converter (DAC), and capacitors, all of which are expensive to implement in current CMOS technology in terms of area and/or power consumption. The implementation of memristive neurons will also enable fully memristive neuromorphic computing, which promises further increases in the integration of neuromorphic computing hardware.
On the synaptic side, the key feature of biological synapses is their plasticity, i.e. tunable weight, which can generally be implemented by resistance or conductance modification in memristive devices (Fig. 4d). Fundamental learning rules based on STDP have already been widely explored. Spatial spiking pattern recognition, spiking coincidence detection, and spatial-temporal correlation have been reported recently. Synaptic metaplasticity, such as paired-pulse facilitation, can also be achieved via various device operation mechanisms.
There are generally two approaches for a hardware neuromorphic system employing memristive devices as synapses: (i) deep learning accelerators, which accelerate artificial neural network computing with multiple layers and error back-propagation, as well as its variations such as convolutional and recurrent neural networks; (ii) brain-like computing, which attempts to closely mimic the behaviors of biological neural systems, such as spike representation (Fig. 4d) and collective decision making. In the deep learning accelerator approach, on-line training places more requirements on the memristive synapses. For instance, a linear and symmetric weight update is crucial for on-line training ( ), while off-line training can ignore it, since the synaptic weight can be programmed into the memristive device by fine tuning and iterative verification ( ). Collective decision making is an important feature of brain computing, which requires high parallelism and, consequently, low-current devices. For instance, this feature is essential for Hopfield neural networks ( ), cellular neural networks ( ), and coupled oscillators ( ). In a Hopfield neural network, the system automatically evolves towards its energy minima, giving rise to the functionality of an associative memory. The use of Hopfield-like recurrent neural networks (RNNs) with memristive devices has already been successfully demonstrated in a variety of tasks ( ). As an example of a memristive coupled oscillator network, one work ( ) used a network of self-sustained van der Pol oscillators coupled with oxide-based memristive devices to investigate the temporal binding problem, a well-known issue in the field of cognitive neuroscience. In this experiment, the network is able to emulate an optical illusion which shows one of two patterns depending on the influence of attention. This means that the network is able to select relevant information from a pool of inputs, as in the case of a system collecting signals from multiple sensors.

Figure 4. Memristive devices as synapse or neuron for neuromorphic computing. (a)-(c) A memristive device acting as threshold device for the firing function of a biological neuron ( , reproduced under the CC BY license). (d) Conceptual illustration of a memristive device as artificial synapse for brain-like neuromorphic computing ( , reproduced under the CC BY-NC license).
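The energy-minimizing associative recall attributed to Hopfield networks above can be sketched in a few lines. The Hebbian outer-product weights and the tiny 8-bit patterns are illustrative choices, not taken from any of the cited hardware demonstrations.

```python
import numpy as np

def hopfield_recall(patterns, probe, sweeps=5):
    """Toy Hopfield associative memory: Hebbian outer-product weights and
    sign-threshold updates. The state slides towards an energy minimum,
    i.e. the stored pattern closest to the (possibly corrupted) probe."""
    P = np.array(patterns, dtype=float)    # stored +/-1 patterns, one per row
    W = P.T @ P                            # Hebbian weight matrix
    np.fill_diagonal(W, 0.0)               # no self-connections
    s = np.array(probe, dtype=float)
    for _ in range(sweeps):
        s = np.sign(W @ s)                 # threshold update sweep
        s[s == 0] = 1.0                    # break ties deterministically
    return s

stored = [[1, 1, 1, 1, -1, -1, -1, -1],
          [1, -1, 1, -1, 1, -1, 1, -1]]
probe = [1, 1, 1, -1, -1, -1, -1, -1]      # first pattern with one bit flipped
print(hopfield_recall(stored, probe))      # recovers the first stored pattern
```

In a memristive realization, the matrix-vector product `W @ s` is carried out in a single step by a crossbar array, which is precisely why low-current devices and high parallelism matter here.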
At present, memristive technology has mainly been used in relatively simple networks with Hebbian-based learning algorithms. More recently, however, systems capable of solving different tasks, such as speech recognition ( ), and exploring different architectures and learning algorithms are being investigated. In particular, the benefits of exploiting sparsity, mentioned in Section 3.2, have been demonstrated for feature extraction and image classification in networks trained with stochastic gradient descent and winner-take-all learning algorithms ( ), as well as in hierarchical temporal memory, which does not need training ( ). In recent years, memristive devices have been used in applications closer to biology, enabling hybrid biological-artificial systems ( ) and biomedical applications ranging from speech and emotion recognition ( ) to biosignal ( ) and medical image ( ) processing. Finally, an interesting application is that of memristive biosensors, which have been used to implement a system for cancer diagnostics. The innovative use of memristive properties was demonstrated in hardware and opens the way to a broader use of memristive technology in which sensors and computing co-exist in the same system or, possibly, in the same device.
Implementing mainstream deep learning algorithms with the Backprop learning rule and memristive synapses imposes several requirements on the memristive device, including a linear current-voltage relation for reading, analog conductance tuning, linear and symmetric weight update, long retention time, high endurance, etc. ( ). However, no single device can fulfill all these requirements simultaneously. Various techniques have been proposed to compensate for the device non-idealities. For instance, to compensate for a non-linear current-voltage relation during reading, a fixed read voltage with variable pulse width or pulse number can be used for synaptic weight reading, with the readout represented by the charge accumulated at the output nodes ( ). A linear and symmetric weight update is crucial for accurate on-line learning in a memristive multilayer neural network with the Backprop learning rule ( ). However, PCM devices usually only show gradual switching in the set direction (weight potentiation), while RRAM devices show gradual switching in the reset direction (weight depression). To achieve a linear and symmetric weight update, a differential pair of two such devices is usually used. For a differential pair of two PCM devices, potentiation is achieved by applying set pulses to the positive device and depression by applying set pulses to the negative device, so that gradual weight updates can be achieved in both directions. To further enhance the linearity of the weight update, a minor conductance pair consisting of capacitors can be used for frequent but smaller weight updates, which are periodically transferred to the major pair ( ). Another option to improve device linearity is to limit the device dynamic range to a region far from saturation, where the weight update is linear. In addition to mitigating the non-idealities of memristive devices, growing research effort is being devoted to exploiting these non-idealities for brain-like computation.
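The differential-pair scheme described above (effective weight W = G+ − G−, updated with set pulses only) can be sketched as follows; the conductance step and bounds are illustrative, not calibrated to PCM data.

```python
def make_diff_pair(g_min=0.0, g_max=1.0, g_set=0.125):
    """Toy differential conductance pair for a set-only device such as PCM.

    The effective weight is W = G+ - G-. Potentiation applies a set pulse
    to the positive device, depression applies one to the negative device,
    so the weight moves in equal steps in both directions until a device
    saturates at g_max (at which point a refresh/transfer is needed).
    """
    state = {"gp": g_min, "gn": g_min}

    def weight():
        return state["gp"] - state["gn"]

    def potentiate():                      # set pulse on the positive device
        state["gp"] = min(state["gp"] + g_set, g_max)

    def depress():                         # set pulse on the negative device
        state["gn"] = min(state["gn"] + g_set, g_max)

    return weight, potentiate, depress

weight, potentiate, depress = make_diff_pair()
for _ in range(3):
    potentiate()
depress()
print(weight())   # (3 - 1) * 0.125 = 0.25
```

The saturation clamp at `g_max` is exactly where the minor/major pair transfer mentioned in the text becomes necessary in practice.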
For instance, the stochasticity or read noise of memristive devices can be used for probabilistic computation in restricted Boltzmann machines ( ), or to escape local minima in a Hopfield neural network ( ). Ag-filament-based resistive switching devices show short retention times and fast switching dynamics, and have thus been proposed for reservoir computing ( ) and spatiotemporal computing ( ) to process time-encoded information.
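A rough sketch of why a short-retention device suits reservoir-style temporal processing: each spike raises the conductance, which then decays, so a single read encodes both the count and the recency of recent inputs. The exponential relaxation and all constants below are modeling assumptions, not measured Ag-filament dynamics.

```python
import math

def volatile_readout(spike_times, t_read, g_pulse=0.3, tau=5.0):
    """Toy volatile device as a reservoir node: every spike grows the
    filament (conductance += g_pulse); between events the state relaxes
    exponentially with a short retention time tau."""
    g, t_prev = 0.0, 0.0
    for t in sorted(spike_times):
        g = g * math.exp(-(t - t_prev) / tau) + g_pulse  # decay, then grow
        t_prev = t
    return g * math.exp(-(t_read - t_prev) / tau)        # decay until read

# Identical spike counts, different timing: spikes arriving just before the
# read leave a larger state than the same spikes arriving early.
print(volatile_readout([7, 8, 9], t_read=10) > volatile_readout([1, 2, 3], t_read=10))  # True
```

Because the readout separates inputs by their temporal structure rather than only their sum, a simple linear classifier on such states can discriminate time-encoded patterns.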
The main steps to be taken to exploit the full potential of an ASIC for an end-to-end processing system involve the integration of memristive devices and sensors with CMOS technology. Indeed, the works presented so far are based either on simulations, on real device data, or on memristive chips interfaced with standard digital hardware. Although the integration of non-volatile resistive switching devices with CMOS technology has already been demonstrated at a commercial level ( ), the design of co-integrated memristive-based neuromorphic processors is still under development. We envisage a three-phase process to achieve a fully integrated system. The first phase is the co-integration of non-volatile memristive devices with peripheral circuits ( ) to implement logic and multiply-and-accumulate (MAC) operations ( ), which reached maturity with the demonstration of a fully co-integrated SNN with analog neurons and memristive synapses ( ). The second phase is the co-integration of different technologies. Although this approach results in higher fabrication costs, it offers several advantages in terms of system performance, which can be more compact and potentially more power efficient. In particular, the co-integration of non-volatile and volatile memristive devices can lead to a fully memristive approach. As an example, one work ( ) exploits volatile memristive devices to emulate stochastic neurons and non-volatile memristive devices to store the synaptic weights on the same chip, demonstrating the feasibility and advantages of the dual-technology co-integration process. Eventually, the final step towards a dedicated ASIC for wearable edge computing is the co-integration of sensors and memristive-based systems.
One work ( ) tackled this challenge by designing and fabricating a gas sensing system capable of gas classification. The system uses RRAM arrays as memory and carbon nanotube field-effect transistors (CNFETs) for computation and gas sensing, both 3D monolithically integrated on CMOS circuits, which carry out computation and allow memory access.
Adaptability is a feature of paramount importance in smart wearable devices, which need to be able to learn the unique features of their user. This calls for the implementation of lifelong learning paradigms, i.e. the ability to continuously learn new features from experience. Typically, a network has a limited memory capacity, dependent on the network size and architecture. Once the maximum number of experiences has been recorded, newly learned features will erase old ones, giving rise to the phenomenon of catastrophic forgetting. The problem of an efficient implementation of continual learning has been thoroughly investigated ( ). In the current scenario, a dichotomy exists between backprop-based ANNs, which have very high accuracy but a limited memory capacity, and brain-inspired SNNs, which feature higher memory capacity thanks to their greater flexibility, but at the cost of lower accuracy. Models used to overcome forgetting are described in Section 3.3. The use of memristive devices in such networks is still an open question. Memristive devices may be beneficial in increasing the network capacity ( ) at no extra computational cost thanks to their slow approach to the conductance boundaries ( ), but so far this topic remains largely unexplored. An interesting approach is proposed in ( ), where the key strengths of supervised convolutional ANNs, unsupervised SNNs, and memristive devices are combined in a single system. The results indicate that this approach is robust against catastrophic forgetting, while reaching 93% accuracy when tested with both trained and non-trained classes.
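As a concrete instance of the weight-consolidation family of anti-forgetting models referenced above (EWC-style penalties), the sketch below pulls each weight back towards the value it held after previous tasks, in proportion to an importance estimate. The learning rate, penalty strength, and toy numbers are illustrative assumptions, not taken from any cited model.

```python
def consolidated_step(w, grad, importance, w_anchor, lr=0.05, lam=10.0):
    """One gradient step with a quadratic consolidation penalty: weights that
    were important for earlier tasks (large importance) are dragged back
    towards their anchored values, while unimportant ones follow the new
    task's gradient freely, mitigating catastrophic forgetting."""
    return [wi - lr * (gi + lam * fi * (wi - ai))
            for wi, gi, fi, ai in zip(w, grad, importance, w_anchor)]

# Both weights have drifted to 2.0 from an anchor of 1.0 and see the same
# new-task gradient; the important weight (importance 1.0) is pulled back
# towards its anchor, while the unimportant one (importance 0.0) is not.
new_w = consolidated_step([2.0, 2.0], grad=[1.0, 1.0],
                          importance=[1.0, 0.0], w_anchor=[1.0, 1.0])
print(new_w)
```

The conjecture in the text is that the gradually saturating conductance of memristive devices could realize such an anchoring pull for free, without the explicit per-weight importance bookkeeping this software version needs.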
In this study, we presented the state-of-the-art core elements that enable the development of wearable devices with extreme edge adaptive computing capability. Various sensors that can collect different bio-signals from the human body were investigated. A variety of sensing specifications in terms of size, resolution, mechanical flexibility, and output signals need to be considered, along with their analogue readout circuits, within a limited power budget. However, when the real-time processing of these signals is deployed at the edge, severe constraints arise in terms of power efficiency, fast response times, and accuracy of data classification. The widely used solution is to find a trade-off between energy and computational capacity, or to send the data to the cloud. However, these strategies are not ideal and slow down the development of wearable smart sensing. To meet all the requirements, the development of a platform needs to be optimized in synergy with the other elements and every aspect of the design, from the learning algorithms to the architecture. In particular, continual learning is required for adaptive wearable devices. In this respect, brain-inspired algorithms promise to be valid alternatives to standard machine learning approaches such as Backprop and BPTT. The exploitation of sparsity in network connectivity increases power efficiency by optimizing the use of the available memory. However, the problem of algorithmic robustness to non-ideal hardware (such as noise and variability) and the problems of forgetting and information transfer between tasks still persist and have to be solved in combination with neuromorphic and emerging technologies. SNNs are conceptually ideal for low-power in-memory computing. Their event-based approach, together with the use of analog subthreshold circuits to reproduce biological timescales, allows fast network response times while enabling smooth real-time processing of data.
The encoding of the incoming signals into spikes is, however, still challenging. Moreover, a fully CMOS-based approach has two major technological issues. First, the synaptic weight is usually stored in SRAMs, which hold their state only in the presence of a power supply. Second, the capacitors used to implement biological time constants are massive and may consume up to 60% of the chip area. Memristive technology can be beneficial in this respect. Non-volatile devices can potentially replace SRAMs, and volatile devices offer a compact alternative to CMOS capacitors. Besides low-power operation in a small footprint, memristive devices also offer noisy properties which, if exploited in the right way, might facilitate the implementation of stochastic learning algorithms. However, the technology is still in its infancy and fabrication processes are still under development, yielding high device variability, which makes it difficult to produce reliable multi-bit memories. In summary, the ultimate goal of smart wearable sensing with edge computing capabilities relies on a bespoke platform consisting of embedded sensors, a front-end circuit interface, a neuromorphic processor, and memristive devices. This platform requires high compatibility of existing sensing technologies with CMOS circuitry and memristive devices to move the intelligent algorithm to the wearable edge without significantly increasing the energy cost. New solutions are needed to enhance the performance of local adaptive learning rules so that they become competitive with the accuracy of Backprop. Novel encoding techniques allowing seamless communication from sensors to the neuromorphic chip have to be developed and flanked by efficient event-based algorithms. So far there is no single ideal solution, but we envisage that a holistic approach, in which all the elements of the system are co-designed as a whole, is the key to building low-power end-to-end real-time adaptive systems for next-generation smart wearable devices.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could beconstrued as a potential conflict of interest.
Author Contributions
All the Authors contributed equally to the manuscript, actively participating in the discussions and in the writing. The main contributors for each Section are as follows: X.L. and H.H. – wearable sensors; D.K. – biologically plausible models; M.P. and E.D. – signal processing and neuromorphic computing; E.C. and W.W. – memristive devices. E.C. led and coordinated the cooperative writing and all discussions.
Funding
This work was partially supported by the UK EPSRC under grant EP/R511705/1. E.C. and M.P. acknowledge funding by theEuropean Union‘s Horizon 2020 research and innovation programme under grant agreement No 871737.
Acknowledgments
The Authors would like to thank Prof. Thomas Mikolajick and Dr. Stefan Slesazeck for useful discussion on ferroelectric andmemristive devices.
References

Schmidhuber, J. Deep learning in neural networks: An overview.
Neural networks , 85–117 (2015). . LeCun, Y., Bengio, Y. & Hinton, G. Deep learning.
Nature , 436–444, DOI: 10.1038/nature14539 (2015). Silver, D. et al.
Mastering the game of go with deep neural networks and tree search. nature , 484–489 (2016). Chen, Y.-H., Krishna, T., Emer, J. S. & Sze, V. Eyeriss: An Energy-Efficient Reconfigurable Accelerator for DeepConvolutional Neural Networks.
IEEE J. Solid-state Circuits , 127–138 (2016). Cavigelli, L. & Benini, L. Origami: A 803-GOp/s/W Convolutional Network Accelerator.
IEEE Transactions on CircuitsSyst. for Video Technol. , 2461–2475 (2016). Song, J. et al.
An 11.5TOPS/W 1024-MAC Butterfly Structure Dual-Core Sparsity-Aware Neural Processing Unit in 8nmFlagship Mobile SoC. In
Proceedings of the IEEE International Solid-State Circuits Conference (ISSCC) , 130–132 (SanFrancisco, CA., 2019). Lee, J. et al.
LNPU: A 25.3 TFLOPS/W Sparse Deep-Neural-Network Learning Processor with Fine-Grained MixedPrecision of FP8-FP16. In
Proceedings of the IEEE International Solid-State Circuits Conference (ISSCC) , 142–144 (SanFrancisco, CA., 2019). Furber, S. B., Galluppi, F., Temple, S. & Plana, L. A. The spinnaker project.
Proc. IEEE , 652–665 (2014). Merolla, P. A. et al.
A million spiking-neuron integrated circuit with a scalable communication network and interface.
Science , 668–673 (2014).
Davies, M. et al.
Loihi: A neuromorphic manycore processor with on-chip learning.
IEEE Micro , 82–99 (2018). Schemmel, J. et al.
A wafer-scale neuromorphic hardware system for large-scale neural modeling. In
Proceedings ofIEEE International Symposium on Circuits and Systems (ISCAS) , 1947–1950 (2010).
Moradi, S., Qiao, N., Stefanini, F. & Indiveri, G. A scalable multicore architecture with heterogeneous memory structuresfor dynamic neuromorphic asynchronous processors (dynaps).
IEEE transactions on biomedical circuits systems ,106–122 (2017). Frenkel, C., Lefebvre, M., Legat, J.-D. & Bol, D. A 0.086-mm IEEE Transactions on Biomed. Circuits Syst. , 145–158 (2019). Cheng, H. Y. et al.
A thermally robust phase change memory by engineering the ge/n concentration in (ge, n) x sb y te z phase change material. In , 31.1.1–31.1.4 (2012). Udayakumar, K. R. et al.
Low-power ferroelectric random access memory embedded in 180nm analog friendly cmostechnology. In , 128–131 (2013).
Goux, L. et al.
Role of the Ta scavenger electrode in the excellent switching control and reliability of a scalable low-currentoperated TiN / Ta O / Ta RRAM device. In , 1–2 (2014). Golonzka, O. et al.
Mram as embedded non-volatile memory solution for 22ffl finfet technology. In , 18.1.1–18.1.4 (2018).
Jo, S. H., Kumar, T., Narayanan, S. & Nazarian, H. Cross-Point Resistive RAM Based on Field-Assisted SuperlinearThreshold Selector.
IEEE Transactions on Electron Devices , 3477–3481, DOI: 10.1109/TED.2015.2426717 (2015). Yang, H. et al.
Threshold switching selector and 1S1R integration development for 3D cross-point STT-MRAM. In , 38.1.1–38.1.4, DOI: 10.1109/IEDM.2017.8268513 (2017).
Wang, Z. et al.
Memristors with diffusive dynamics as synaptic emulators for neuromorphic computing.
Nat. Mater. ,101–108, DOI: 10.1038/nmat4756 (2017). Wang, W. et al.
Surface diffusion-limited lifetime of silver and copper nanofilaments in resistive switching devices.
Nat.Commun. , 81, DOI: 10.1038/s41467-018-07979-0 (2019). Wang, W., Covi, E., Lin, Y., Ambrosi, E. & Ielmini, D. Modeling of switching speed and retention time in volatileresistive switching memory by ionic drift and diffusion. In ,32.3.1–32.3.4, DOI: 10.1109/IEDM19573.2019.8993625 (2019).
Covi, E. et al.
A volatile rram synapse for neuromorphic computing. In , 903–906, DOI: 10.1109/ICECS46596.2019.8965044 (2019).
Linares-Barranco, B. & Serrano-Gotarredona, T. Memristance can explain spike-time-dependent-plasticity in neuralsynapses.
Nat. Preced. (2009). Ielmini, D. & Wong, H.-S. P. In-memory computing with resistive switching devices.
Nat. Electron. , 333–343, DOI:10.1038/s41928-018-0092-2 (2018). 1801.06601. Chicca, E. & Indiveri, G. A recipe for creating ideal hybrid memristive-cmos neuromorphic processing systems.
Appl.Phys. Lett. , 120501, DOI: 10.1063/1.5142089 (2020). https://doi.org/10.1063/1.5142089.
Gao, W. et al.
Fully integrated wearable sensor arrays for multiplexed in situ perspiration analysis.
Nature , 509–514,DOI: 10.1038/nature16521 (2016).
Kanoun, O. & Tränkler, H. R. Sensor technology advances and future trends.
IEEE Transactions on InstrumentationMeas. , 1497–1501, DOI: 10.1109/TIM.2004.834613 (2004). López, A., Fernández, M., Rodríguez, H., Ferrero, F. & Postolache, O. Development of an eog-based system to control aserious game.
Meas. J. Int. Meas. Confed. , 481–488, DOI: 10.1016/j.measurement.2018.06.017 (2018).
Nweke, H. F., Teh, Y. W., Al-garadi, M. A. & Alo, U. R. Deep learning algorithms for human activity recognition usingmobile and wearable sensor networks: State of the art and research challenges.
Expert. Syst. with Appl. , 233–261,DOI: 10.1016/j.eswa.2018.03.056 (2018).
Witkowski, M. et al.
Enhancing brain-machine interface (bmi) control of a hand exoskeleton using electrooculography(eog).
J. NeuroEngineering Rehabil. , 1–6, DOI: 10.1186/1743-0003-11-165 (2014). Herry, C. L., Frasch, M., Seely, A. J. E. & Wu, H. T. Heart beat classification from single-lead ecg using the syn-chrosqueezing transform.
Physiol. Meas. , 171–187, DOI: 10.1088/1361-6579/aa5070 (2017). Pantelopoulos, A. & Bourbakis, N. G. A survey on wearable sensor-based systems for health monitoring and prognosis.
IEEE Transactions on Syst. Man, Cybern. Part C (Applications Rev. , 1–12, DOI: 10.1109/TSMCC.2009.2032660(2010). Li, H., Shrestha, A., Heidari, H., Le Kernec, J. & Fioranelli, F. A multisensory approach for remote health monitoring ofolder people.
IEEE J. Electromagn. RF Microwaves Med. Biol. , 102–108 (2018). Liang, X. et al.
Fusion of wearable and contactless sensors for intelligent gesture recognition.
Adv. Intell. Syst. , 1900088,DOI: 10.1002/aisy.201900088 (2019). He, S., Yang, C., Wang, M., Cheng, L. & Hu, Z. Hand gesture recognition using myo armband.
Proc. - 2017 Chin. Autom.Congr. CAC 2017 , 4850–4855, DOI: 10.1109/CAC.2017.8243637 (2017).
Khezri, M. & Jahed, M. Real-time intelligent pattern recognition algorithm for surface emg signals.
BioMedical Eng.Online , 1–12, DOI: 10.1186/1475-925X-6-45 (2007). Liang, X., Ghannam, R. & Heidari, H. Wrist-worn gesture sensing with wearable intelligence.
IEEE Sensors J.
DOI:10.1109/JSEN.2018.2880194 (2018).
Wu, W., Nagarajan, S. & Chen, Z. Bayesian machine learning: Eegmeg signal processing measurements.
IEEE SignalProcess. Mag. , 14–36, DOI: 10.1109/MSP.2015.2481559 (2016). Yazicioglu, R. F., Van Hoof, C. & Puers, R.
Biopotential readout circuits for portable acquisition systems (SpringerScience & Business Media, 2008).
Luz, E. J. d. S., Schwartz, W. R., Cámara-Chávez, G. & Menotti, D. Ecg-based heartbeat classification for arrhythmiadetection: A survey.
Comput. Methods Programs Biomed. , 144–164, DOI: https://doi.org/10.1016/j.cmpb.2015.12.008(2016).
Kiranyaz, S., Ince, T. & Gabbouj, M. Real-time patient-specific ecg classification by 1-d convolutional neural networks.
IEEE Transactions on Biomed. Eng. , 664–675, DOI: 10.1109/TBME.2015.2468589 (2016). Rahhal, M. M. A. et al.
Deep learning approach for active classification of electrocardiogram signals.
Inf. Sci. ,340–354, DOI: 10.1016/j.ins.2016.01.082 (2016).
Raj, S., Ray, K. C. & Shankar, O. Cardiac arrhythmia beat classification using dost and pso tuned svm.
Comput. MethodsPrograms Biomed. , 163–177, DOI: 10.1016/j.cmpb.2016.08.016 (2016).
Zhang, Z., Dong, J., Luo, X., Choi, K.-S. & Wu, X. Heartbeat classification using disease-specific feature selection.
Comput. Biol. Medicine , 79–89, DOI: https://doi.org/10.1016/j.compbiomed.2013.11.019 (2014). Alfaras, M., Soriano, M. C. & Ortín, S. A fast machine learning model for ecg-based heartbeat classification andarrhythmia detection.
Front. Phys. , DOI: 10.3389/fphy.2019.00103 (2019). Ortín, S., Soriano, M. C., Alfaras, M. & Mirasso, C. R. Automated real-time method for ventricular heartbeat classification.
Comput. Methods Programs Biomed. , 1–8, DOI: 10.1016/J.CMPB.2018.11.005 (2019). Hossain, M. S. & Muhammad, G. Cloud-assisted industrial internet of things (iiot) – enabled framework for healthmonitoring.
Comput. Networks , 192–202, DOI: https://doi.org/10.1016/j.comnet.2016.01.009 (2016).
Yang, Z., Zhou, Q., Lei, L., Zheng, K. & Xiang, W. An iot-cloud based wearable ecg monitoring system for smarthealthcare.
J. Med. Syst. , 286, DOI: 10.1007/s10916-016-0644-9 (2016). Jebelli, H., Hwang, S. & Lee, S. Eeg signal-processing framework to obtain high-quality brain waves from an off-the-shelfwearable eeg device.
J. Comput. Civ. Eng. , 04017070, DOI: 10.1061/(ASCE)CP.1943-5487.0000719 (2018). Lin, C. et al.
Wireless and wearable eeg system for evaluating driver vigilance.
IEEE Transactions on Biomed. CircuitsSyst. , 165–176, DOI: 10.1109/TBCAS.2014.2316224 (2014). Gargiulo, G. et al.
A new eeg recording system for passive dry electrodes.
Clin. Neurophysiol. , 686–693, DOI:https://doi.org/10.1016/j.clinph.2009.12.025 (2010).
Thakor, N. V. Biopotentials and electrophysiology measurements. In
Telehealth and Mobile Health , 595–614 (CRC press,2015).
Li, G., Lee, B. & Chung, W. Smartwatch-based wearable eeg system for driver drowsiness detection.
IEEE Sensors J. ,7169–7180, DOI: 10.1109/JSEN.2015.2473679 (2015). Shen, K.-Q., Li, X.-P., Ong, C.-J., Shao, S.-Y. & Wilder-Smith, E. P. V. Eeg-based mental fatigue measurementusing multi-class support vector machines with confidence estimate.
Clin. Neurophysiol. , 1524–1533, DOI: https://doi.org/10.1016/j.clinph.2008.03.012 (2008).
Wang, X.-W., Nie, D. & Lu, B.-L. Emotional state classification from eeg data using machine learning approach.
Neurocomputing , 94–106, DOI: https://doi.org/10.1016/j.neucom.2013.06.046 (2014).
Hosseinifard, B., Moradi, M. H. & Rostami, R. Classifying depression patients and normal subjects using machinelearning techniques and nonlinear features from eeg signal.
Comput. Methods Programs Biomed. , 339–345, DOI:10.1016/j.cmpb.2012.10.008 (2013).
Hwang, S., Jebelli, H., Choi, B., Choi, M. & Lee, S. Measuring workers’ emotional state during construction tasks usingwearable eeg.
J. Constr. Eng. Manag. , 04018050, DOI: 10.1061/(ASCE)CO.1943-7862.0001506 (2018).
Xu, J., Mitra, S., Hoof, C. V., Yazicioglu, R. F. & Makinwa, K. A. A. Active electrodes for wearable eeg acquisition:Review and electronics design methodology.
IEEE Rev. Biomed. Eng. , 187–198, DOI: 10.1109/RBME.2017.2656388(2017). Duchowski, A.
Eye Tracking Methodology - Theory and Practice (Springer, Cham, 2007).
Eid, M. A., Giakoumidis, N. & Saddik, A. E. A novel eye-gaze-controlled wheelchair system for navigating unknownenvironments: Case study with a person with als.
IEEE Access , 558–573, DOI: 10.1109/ACCESS.2016.2520093(2016). Duvinage, M., Castermans, T. & Dutoit, T. Control of a lower limb active prosthesis with eye movement sequences.In , 1–7, DOI:10.1109/CCMB.2011.5952116 (2011).
Barua, S., Ahmed, M. U., Ahlström, C. & Begum, S. Automatic driver sleepiness detection using eeg, eog and contextualinformation.
Expert. Syst. with Appl. , 121–135, DOI: https://doi.org/10.1016/j.eswa.2018.07.054 (2019).
Piñero, P. et al.
Sleep stage classification using fuzzy sets and machine learning techniques.
Neurocomputing ,1137–1143, DOI: https://doi.org/10.1016/j.neucom.2004.01.178 (2004).
Zhu, X. et al.
Eog-based drowsiness detection using convolutional neural networks. In , 128–134, DOI: 10.1109/IJCNN.2014.6889642 (2014).
Martin, W. B. et al.
Pattern recognition of eeg-eog as a technique for all-night sleep stage scoring.
Electroencephalogr.Clin. Neurophysiol. , 417–427, DOI: https://doi.org/10.1016/0013-4694(72)90009-0 (1972). Stevens, J. R. et al.
Telemetered eeg-eog during psychotic behaviors of schizophrenia.
Arch. Gen. Psychiatry , 251–262,DOI: 10.1001/archpsyc.1979.01780030017001 (1979). Punsawad, Y., Wongsawat, Y. & Parnichkun, M. Hybrid eeg-eog brain-computer interface system for practical machinecontrol. In , 1360–1363, DOI:10.1109/IEMBS.2010.5626745 (2010). Wang, H., Li, Y., Long, J., Yu, T. & Gu, Z. An asynchronous wheelchair control by hybrid eeg–eog brain–computerinterface.
Cogn. Neurodynamics , 399–409, DOI: 10.1007/s11571-014-9296-y (2014). Mendez, I. et al.
Evaluation of the myo armband for the classification of hand motions. In , 1211–1214, DOI: 10.1109/ICORR.2017.8009414 (2017).
Rissanen, S. M. et al.
Surface emg and acceleration signals in parkinson’s disease: feature extraction and cluster analysis.
Med. & Biol. Eng. & Comput. , 849–858, DOI: 10.1007/s11517-008-0369-0 (2008). Wang, Q. et al.
A novel pedestrian dead reckoning algorithm using wearable emg sensors to measure walking strides. In , 1–8, DOI: 10.1109/UPINLBS.2010.5653821(2010).
Rawat, S., Vats, S. & Kumar, P. Evaluating and exploring the myo armband. In , 115–120, DOI: 10.1109/SYSMART.2016.7894501 (2016).
Inhyuk, M., Myungjoon, L., Junuk, C. & Museong, M. Wearable emg-based hci for electric-powered wheelchairusers with motor disabilities. In
Proceedings of the 2005 IEEE International Conference on Robotics and Automation ,2649–2654, DOI: 10.1109/ROBOT.2005.1570513 (2005).
Artemiadis, P. K. & Kyriakopoulos, K. J. A switching regime model for the emg-based control of a robot arm.
IEEETransactions on Syst. Man, Cybern. Part B (Cybernetics) , 53–63, DOI: 10.1109/TSMCB.2010.2045120 (2011). Cipriani, C., Zaccone, F., Micera, S. & Carrozza, M. C. On the shared control of an emg-controlled prosthetic hand:Analysis of user–prosthesis interaction.
IEEE Transactions on Robotics , 170–184, DOI: 10.1109/TRO.2007.910708(2008). Subasi, A. Classification of emg signals using pso optimized svm for diagnosis of neuromuscular disorders.
Comput.Biol. Medicine , 576–586, DOI: https://doi.org/10.1016/j.compbiomed.2013.01.020 (2013). Rincon, A. L., Yamasaki, H. & Shimoda, S. Design of a video game for rehabilitation using motion capture, emg analysisand virtual reality. In ,198–204, DOI: 10.1109/CONIELECOMP.2016.7438575 (2016).
Biswas, D., Simões-Capela, N., Hoof, C. V. & Helleputte, N. V. Heart rate estimation from wrist-worn photoplethysmog-raphy: A review.
IEEE Sensors J. , 6560–6570, DOI: 10.1109/JSEN.2019.2914166 (2019). Biswas, D. et al.
Cornet: Deep learning framework for ppg-based heart rate estimation and biometric identification inambulant environment.
IEEE Transactions on Biomed. Circuits Syst. , 282–291, DOI: 10.1109/TBCAS.2019.2892297(2019). Re¸sit Kavsao˘glu, A., Polat, K. & Recep Bozkurt, M. A novel feature ranking algorithm for biometric recognition withppg signals.
Comput. Biol. Medicine , 1–14, DOI: https://doi.org/10.1016/j.compbiomed.2014.03.005 (2014). Caytak, H., Boyle, A., Adler, A. & Bolic, M. Bioimpedance spectroscopy processing and applications. In Narayan, R.(ed.)
Encyclopedia of Biomedical Engineering , 265 – 279, DOI: https://doi.org/10.1016/B978-0-12-801238-3.10884-0(Elsevier, Oxford, 2019).
Matthie, J. R. Bioimpedance measurements of human body composition: critical analysis and outlook.
Expert. Rev. Med.Devices , 239–261, DOI: 10.1586/17434440.5.2.239 (2008). Sun, T.-P. et al.
The use of bioimpedance in the detection/screening of tongue cancer.
Cancer Epidemiol. , 207–211,DOI: https://doi.org/10.1016/j.canep.2009.12.017 (2010). Zhang, Y., Xiao, R. & Harrison, C. Advancing hand gesture recognition with high resolution electrical impedancetomography.
UIST 2016 - Proc. 29th Annu. Symp. on User Interface Softw. Technol.
Alsheikh, M. A., Lin, S., Niyato, D. & Tan, H. P. Machine learning in wireless sensor networks: Algorithms, strategies,and applications.
IEEE Commun. Surv. Tutorials , 1996–2018, DOI: 10.1109/COMST.2014.2320099 (2014). Gravina, R., Alinia, P., Ghasemzadeh, H. & Fortino, G. Multi-sensor fusion in body sensor networks: State-of-the-art andresearch challenges.
Inf. Fusion , 1339–1351, DOI: 10.1016/j.inffus.2016.09.005 (2017). Khaleghi, B., Khamis, A., Karray, F. O. & Razavi, S. N. Multisensor data fusion: A review of the state-of-the-art.
Inf.Fusion , 28–44 (2013). Rundo, F., Conoci, S., Ortis, A. & Battiato, S. An advanced bio-inspired photoplethysmography (ppg) and ecg patternrecognition system for medical assessment.
Sensors (Basel) , DOI: 10.3390/s18020405 (2018). He, X., Goubran, R. A. & Liu, X. P. Secondary peak detection of ppg signal for continuous cuffless arterial blood pressuremeasurement.
IEEE Transactions on Instrumentation Meas. , 1431–1439, DOI: 10.1109/TIM.2014.2299524 (2014). Chiu, H.-Y., Shuai, H.-H. & Chao, P. C.-P. Reconstructing qrs complex from ppg by transformed attentional neuralnetworks.
IEEE SENSORS JOURNAL (2020).
Patel, S. et al.