Gaussian Process Nowcasting: Application to COVID-19 Mortality Reporting
Iwona Hawryluk, Henrique Hoeltgebaum, Swapnil Mishra, Xenia Miscouridou, Ricardo P Schnekenberg, Charles Whittaker, Michaela Vollmer, Seth Flaxman, Samir Bhatt, Thomas A Mellan
Department of Infectious Disease Epidemiology, School of Public Health, Imperial College London, UK
Department of Mathematics, Imperial College London, UK
Nuffield Department of Clinical Neuroscience, University of Oxford, UK
* Contributed equally
Abstract
Updating observations of a signal due to delays in the measurement process is a common problem in signal processing, with prominent examples in a wide range of fields. An important instance of this problem is the nowcasting of COVID-19 mortality: given a stream of reported counts of daily deaths, can we correct for the delays in reporting to paint an accurate picture of the present, with uncertainty? Without this correction, raw data will often mislead by suggesting an improving situation. We present a flexible approach using a latent Gaussian process that is capable of describing the changing auto-correlation structure present in the reporting time-delay surface.[a] This approach also yields robust estimates of uncertainty for the nowcasted numbers of deaths. We test assumptions in model specification, such as the choice of kernel or hyperpriors, and evaluate model performance on a challenging real dataset from Brazil. Our experiments show that Gaussian process nowcasting performs favourably against both comparable methods and a small sample of expert human predictions. Our approach has substantial practical utility in disease modelling: by applying it to COVID-19 mortality data from Brazil, where reporting delays are large, we can make informative predictions of important epidemiological quantities such as the current effective reproduction number.

[a] Code is available at https://github.com/ihawryluk/GP_nowcasting
* [email protected]
† [email protected]
‡ [email protected]

1 INTRODUCTION

In many real-world settings, current observations from a noisy signal can be systematically biased, with these biases only being corrected after subsequent updates create more complete data. Often, these updates occur much later due to data processing or reporting delays. Not accounting for these delays would result in biased predictions, while waiting for updates would result in a lack of timely estimates.
The need for timely estimates to predict the present is colloquially known as nowcasting, and despite its importance to a wide range of fields such as actuarial science, economics, and epidemiology [Kaminsky, 1987, Lawless, 1994, Bastos et al., 2019, McGough et al., 2020], relatively little literature focuses exclusively on the problem. Nowcasting, as defined by Banbura et al. [2010] at the European Central Bank, is the process of predicting the present, the very recent past, and the very near future using time series data known to be incomplete. An example from economics is using monthly data to nowcast the current state of important indicators for an economy such as GDP or income. More broadly, nowcasting is relevant not only for scenarios where the data are incomplete, but also where the data comprise a biased subsample that will be updated retrospectively, following lengthy delays.

In epidemiology, nowcasting is required due to delays in reporting arising from limitations in testing capacity, data curation, and the requirement for pseudonymisation of patient data [Bastos et al., 2019]. These delays are further compounded by the noise inherent in such data due to limited sampling (typically only a subset of the population is sampled). Throughout this paper, we specifically focus on delays in the reporting of deaths. An individual dies of a disease on a given day, but the delay between this event and the death being reported (and appearing in the dataset) can be substantial, for the reasons noted above. These reporting delays mask the true current state of the epidemic, and have material consequences for our understanding of both the present and future evolution of the epidemic. For example, estimation of key epidemiological quantities such as the effective reproduction number (R_t) would be systematically biased. Contemporary, real-time and unbiased estimates are necessary for effective public health planning and policy.

Figure 1: A) Reported daily hospital deaths are censored at recent times due to reporting delays. This can be seen by comparing the raw data with a ground truth from two months in the future, when the records have been backdated. B-C) The effective reproduction number R_t for SARS-CoV-2 infections in Brazil from 30-Jun-2020 to 23-Nov-2020, estimated using deaths from the raw reported data released on 23-Nov-2020, and using a backdated ground truth based on data released on 08-Feb-2021. D) R_t estimates based on nowcasted mortality data. Whereas the raw data result in misleading estimates of R_t, with the estimated R_t < 1, applying nowcasting to the death counts yields a picture of the epidemic closer to the truth.

In this paper we propose a nowcasting framework based on latent Gaussian processes (GPs). This methodology is used to address the specific problem of delayed reporting of the true incidence of deaths due to COVID-19 in Brazil.

Previous methods for nowcasting exist in several different contexts. Bańbura and Modugno [2014] propose a maximum likelihood approach with a dynamic factor model to predict GDP. Shi et al. [2015] use a deep learning approach based on LSTMs to nowcast rainfall intensity. Codeco et al. [2018] provide a framework to gather epidemiological information and correct for delays in reporting in Brazilian data. Bastos et al. [2019] present a Bayesian hierarchical model for nowcasting applied to data on dengue fever and severe acute respiratory infection cases. McGough et al. [2020] propose a Bayesian nowcasting approach that produces accurate estimates capturing the time evolution of the epidemic curve. Specifically for COVID-19, Bayesian nowcasting approaches have been used to correct for reporting delays in Bavaria by Günther et al. [2020]. The challenges in estimating reporting delays are further discussed in Seaman and De Angelis [2020]. Finally, the problem and background context of reporting delays in Brazil, with corrected data, are further explained in Bastos et al. [2020] and Villela [2020].

Our methods build upon and generalise the NobBS (Nowcasting by Bayesian Smoothing) method originally proposed by McGough et al. [2020]. NobBS is a Bayesian method that produces smooth and accurate nowcasted estimates for multiple diseases.
NobBS allows for both uncertainty in the delay distribution and the evolution of the epidemic curve. While an effective method, NobBS has several limitations, such as an inability to pick up fast-occurring changes in the delay distribution, which we overcome in this paper. The extensions we present result in comparable performance for COVID-19 mortality surveillance in Brazil, but provide a better fit to the dynamic delay distribution.
The problem tackled in this paper is conceptually illustrated in Figure 1. The black points are the data available to us at a given time, and the red points the ground truth that only becomes available much later. The discrepancy between the presently available data and the underlying ground truth grows markedly as we approach the present, a distinguishing characteristic of reporting delays. Alongside this, Figure 1 also shows three estimates of the effective reproduction number R_t (defined as the average number of infections an infected individual will go on to cause), obtained using a Bayesian hierarchical renewal-type model [Flaxman et al., 2020, Mellan et al., 2020, Mishra et al., 2020]. Understanding this epidemiological quantity is vital: R_t > 1 indicates a growing epidemic, while R_t < 1 indicates a declining one. Figure 1B shows estimates of R_t derived from the raw data, while Figures 1C and 1D show estimates of R_t derived from the ground truth data and our nowcasting approach, respectively. These plots show that not correcting for delays can lead to a fundamentally different picture of the current epidemic state. Delays in death reporting lead to an underestimation of the true number of deaths in the observed data; the result is a suggestion of a declining epidemic, despite the fact that the epidemic is actually growing.

In this paper we focus on Brazilian death data from the publicly available hospitalisation database, which includes deaths with both confirmed and suspected COVID-19 diagnostic status [Ministério da Saúde, 2020]. Our central premise is that using these daily death data alone results in policy decisions being made based on false statistics and trends [Villela, 2020]. To facilitate well-informed policy making based on unreliable data streams, we propose and implement a nowcasting method using latent Gaussian processes. These GPs are capable of capturing the complex correlation structure in delayed data and present an effective means to correct for the reporting delays.
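To give a concrete sense of the renewal-type estimate of R_t discussed here, a crude point-estimate version of the renewal equation can be sketched as follows. This is only an illustrative simplification: the cited models are hierarchical Bayesian renewal models, and the incidence series and one-step serial interval below are hypothetical.

```python
import numpy as np

def renewal_rt(incidence, serial_interval):
    """Crude R_t from the renewal equation:
    I_t ~= R_t * sum_s I_{t-s} w_s, so R_t ~= I_t / infection pressure.
    Illustrative only, not the hierarchical models cited in the text."""
    incidence = np.asarray(incidence, dtype=float)
    w = np.asarray(serial_interval, dtype=float)
    w = w / w.sum()  # normalise the serial-interval distribution
    rt = np.full(len(incidence), np.nan)
    for t in range(1, len(incidence)):
        # infection pressure at time t from past incidence, most recent first
        past = incidence[max(0, t - len(w)):t][::-1]
        pressure = np.sum(past * w[:len(past)])
        if pressure > 0:
            rt[t] = incidence[t] / pressure
    return rt

# Hypothetical example: exponentially growing incidence with a
# one-step serial interval gives R_t equal to the growth factor.
growth = 1.2
cases = 100 * growth ** np.arange(10)
rt = renewal_rt(cases, serial_interval=[1.0])
```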
We use this corrected death data to calculate the effective reproduction number R_t using the raw retrospectively observed data, the nowcasted data, and the ground truth updated dataset (Figure 1).

Our contributions are the following:

• We provide a new, flexible and accurate way to correct for delays in reporting. Our framework solves the nowcasting problem using latent GPs, and provides realistic estimates of the deaths today given incomplete data. Our approach closely predicts the non-observed/missing values and simultaneously learns the underlying (latent) data-generating mechanisms of the delays.

• We compare our approach to an established alternative method (NobBS) and, in a novel contribution, also provide a comparison to a small panel of human experts in infectious disease epidemiology. Domain knowledge is of primary importance for such applications, and is frequently the primary approach taken to interpret data. In generating estimates that improve over both existing computational methods and human experts, we demonstrate the utility of our approach.

• An important contribution of this work is the results and estimates provided. Implementing our approach enables the generation of more accurate estimates of the reproduction number and, in turn, a better understanding of the evolution of the COVID-19 epidemic in Brazil. Our framework is implemented in the easy-to-use probabilistic program PyStan, and therefore facilitates use in low- and middle-income settings with limited technical expertise.

The structure of the paper is as follows: in section 3 we briefly introduce Gaussian processes and describe the latent GP nowcasting models with several variants. In section 4 we describe the data and perform retrospective tests to evaluate the accuracy of the new models and compare them with a sample of human expert predictions. Finally, we discuss the advantages and limitations of the GP nowcasting framework in section 5.
Let n_t denote the response variable of interest that needs to be nowcasted at time t. In this paper n_t represents the reported COVID-19 mortality. The mortality observations, in general, consist of measurements from an online data source, subject to distributed observation delays. The central task of nowcasting approaches is to identify a regular time-delay structure, and to use this to estimate n_t at a time when it has only been partially observed. The bias that nowcasting identifies and corrects in this scenario is the additive decomposition of the observable over the reporting delay d. That is, the true signal at a given time t is the sum over all the delayed partial observations for that time:

n_t = \sum_d n_{t,d}. (1)

The intuition behind this formulation is that the 'true' deaths that occurred at time t are distributed over various delays d due to the delays in reporting them.

A visual example of partial observation at recent times is the right-censored epidemiological data shown in Figure 2A. For all data releases, we observe precipitous declines in contemporary data, which are then subsequently revised upwards as the data become more complete. In the COVID-19 context this occurs due to time lags in registering and reporting death certificates [Villela, 2020]. Figure 2B shows that most deaths are reported to near completeness after around 5 weeks, and 90% are reported within 10 weeks. The splitting of the data by delay index, to form the 2D array n_{t,d} in time and delay, called a reporting triangle, is shown in Figure 2C. The figure also highlights that both the temporal and the delay dimensions have structured correlations.
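The reporting-triangle decomposition of Eqn. 1, and the censoring pattern it induces, can be illustrated with a toy example. The counts below are hypothetical; the point is only the indexing of n_{t,d} and which entries are observable at the current time.

```python
import numpy as np

# Hypothetical reporting triangle n[t, d]: rows are event weeks,
# columns are reporting delays in weeks (Eqn. 1: n_t = sum_d n_{t,d}).
T, D = 5, 3
rng = np.random.default_rng(0)
n = rng.poisson(lam=50, size=(T, D)).astype(float)

# At "now" (time T-1), only entries with delay d <= (T-1) - t have been
# reported; the lower-right triangle of the array is still missing.
observed = n.copy()
for t in range(T):
    for d in range(D):
        if d > (T - 1) - t:
            observed[t, d] = np.nan  # not yet reported

# The true weekly total is the row sum over all delays; a nowcast must
# fill in the NaNs before this sum is meaningful at recent times t.
true_totals = n.sum(axis=1)
known_part = np.nansum(observed, axis=1)
```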
The lower-triangle part of this 2D array is missing, since at any time T only the numbers of deaths reported with delay d ≤ T − t are known for each epidemiological week t.

The representation of the data by time and delay, rather than time and reporting date, induces a regular structure, one that is auto-correlated and approximately monotonically decreasing in delay. This relatively simple structure makes the problem amenable to statistical modelling. The lower triangle of the n_{t,d} matrix can be predicted with the model, and therefore an estimate of the true signal is available for any time up to the current time via Eqn. 1, by summing over the delays. This is a common theme from which variations of nowcasting branch out.

Figure 2: A) Daily COVID-19 deaths in Brazil as reported in releases of data between July-2020 and Feb-2021. Each line represents a single release. B) Total number of deaths reported per reporting delay in weeks; over 90% of deaths are reported with delay ≤ 10 weeks. C) The reporting triangle n_{t,d} by epidemiological week and reporting delay.

To model the entries n_{t,d} of the reporting triangle, we can use a Poisson or a negative-binomial likelihood for overdispersed data:

n_{t,d} ∼ NB(λ_{t,d}, r). (2)

In the negative-binomial case, the dispersion parameter r is a hyperparameter that can be learnt or given an informative prior based on the problem.
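The mean-and-dispersion parameterisation of the negative binomial in Eqn. 2 can be sketched via the standard Gamma-Poisson mixture. This is a generic illustration of the likelihood family, not the paper's Stan implementation, and the values of λ and r below are hypothetical.

```python
import numpy as np

def sample_negative_binomial(mean, dispersion, size, rng):
    """Draw NB(mean, r) counts via the Gamma-Poisson mixture:
    rate ~ Gamma(shape=r, scale=mean/r), count ~ Poisson(rate).
    Variance is mean + mean**2 / r, so small r means more overdispersion."""
    rate = rng.gamma(shape=dispersion, scale=mean / dispersion, size=size)
    return rng.poisson(rate)

rng = np.random.default_rng(1)
lam, r = 40.0, 5.0
draws = sample_negative_binomial(lam, r, size=200_000, rng=rng)
empirical_mean = draws.mean()
empirical_var = draws.var()
# Theoretical variance under this mean/dispersion parameterisation:
theory_var = lam + lam**2 / r
```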
The latter approach is common among established Bayesian nowcasting methods [Bastos et al., 2019, Günther et al., 2020, McGough et al., 2020]. The mean of the negative binomial, λ_{t,d}, is often modelled as a random walk [Bastos et al., 2019] or as an auto-regressive process [McGough et al., 2020] along the time dimension, jointly independent with a learnt vector of delays. This approach has been successful for dengue and influenza surveillance [Codeco et al., 2018, Bastos et al., 2019], but has limitations in terms of the generality of the time-delay covariance structure, which can become apparent in more dynamic nowcasting scenarios, such as an evolving epidemic with changing delay distributions. Such issues can be minimised by tuning the window over which the static delay vector is estimated, or by manually adding cross-term covariates. Here we employ Gaussian processes as a generic, flexible alternative to model arbitrarily structured λ_{t,d}. The details are set out in the following section.

The introductory model we consider consists of a latent GP with a 1D kernel. In general terms, GPs are a class of Bayesian non-parametric models that define a prior over functions. They are a powerful tool in machine learning for learning complex functions, with applications in both regression and classification problems [Rasmussen and Williams, 2006, Wilson and Adams, 2013]. In recent years GPs have gained popularity in statistics and machine learning due to their flexibility and excellent performance on many spatial and spatiotemporal problems [Wilson and Adams, 2013, Flaxman et al., 2015]. The covariance function, or kernel, together with the mean function completely defines a GP. The mean function is the base function around which all realisations of the GP are distributed. The covariance kernel is a crucial component of the Gaussian process, as it describes the covariance of the Gaussian process random variables, i.e. how similar two points are. Therefore, the kernel defines the shape of the distribution and which types of functions are more probable.

One of the most popular choices of covariance kernel, and the one we use to introduce the model, is the squared exponential kernel, k_SE, with entries defined by a covariance function k_SE(·,·) such that

k_SE(t_i, t_j) = α² exp( − ||t_i − t_j||² / ρ² ). (3)

The parameter α defines the kernel's variance scale, and ρ is a lengthscale parameter that specifies how nearsighted the correlation between pairs of time points (t_i) is. The kernel results in a prior over a set of functions describing λ_{t,d}, the mean of the statistical model in Eqn. 2. This is modelled as a zero-mean latent Gaussian process in log space,

log(λ_{t,d}) ∼ GP(0, k_SE). (4)

Due to weak identifiability [Rasmussen and Williams, 2006], a strategy to identify the hyperparameters ρ and α is to fix the lengthscale ρ to the maximum delay time considered in the nowcasting problem, and learn the scale parameter α. Markov chain Monte Carlo (MCMC) is used to generate posterior summaries for arbitrary (non-normal) latent Gaussian processes.

3.3 GENERALISED MODEL

3.3.1 Additive Kernel Model

The basic model introduced above can be extended to provide a more expressive description of the data. The purpose of this is to be able to describe the complex structure in n_{t,d}. Using the compositional kernel approach [Duvenaud et al., 2013, Wilson and Adams, 2013, Wilson et al., 2016], we can create a new additive kernel over multiple lengthscales, indexed by s,

k_add = \sum_s k_s,    log(λ_{t,d}) ∼ GP(0, k_add). (5)

The lengthscale hyperparameters ρ_s are fixed or given strongly informative priors, while each α_s is learnt.
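The squared-exponential latent GP of Eqns. 3-4 can be sketched numerically: build the Gram matrix, take a Cholesky factor, and draw one realisation of log λ. This is a hypothetical illustration (note the common 2ρ² convention in the exponent, which may differ slightly from the paper's parameterisation), not the paper's Stan implementation.

```python
import numpy as np

def squared_exponential(x, alpha, rho):
    """SE Gram matrix: k(x_i, x_j) = alpha^2 exp(-|x_i - x_j|^2 / (2 rho^2)).
    alpha sets the variance scale, rho the correlation lengthscale."""
    diff = x[:, None] - x[None, :]
    return alpha**2 * np.exp(-diff**2 / (2 * rho**2))

rng = np.random.default_rng(2)
t = np.arange(20, dtype=float)        # weekly time index
K = squared_exponential(t, alpha=0.5, rho=10.0)
jitter = 1e-6 * np.eye(len(t))        # numerical stabiliser for Cholesky
L = np.linalg.cholesky(K + jitter)

# One draw of log(lambda_t) ~ GP(0, k_SE); exponentiating gives a
# positive mean for the negative-binomial observation model.
log_lam = L @ rng.standard_normal(len(t))
lam = np.exp(log_lam)
```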
In the simplest case we consider a kernel with two lengthscale contributions, capturing short- and long-range correlation structure:

k_add(t_i, t_j) = k_long(t_i, t_j) + k_short(t_i, t_j) + σ² δ_ij, (6)

plus a regularising term with a Kronecker delta function ensuring σ² Gaussian noise is only added when i = j. The choice of kernel confers bias that can result in better generalisation. The logic of this kernel is to split the covariance into two components: (a) a smooth long-range component, used to extrapolate the trend into the unknown part of the reporting triangle, where large distances from the observed points exist; and (b) a component describing variation in n_{t,d} over shorter lengthscales. Additionally, the separation of kernels provides a generic method to describe more complex data-generating processes; for example, the long-range kernel can be squared exponential, while the short-range one can be a less smooth type with a different power spectrum, such as a Matérn (1/2). This can be used to create a general statistical model for all of n_{t,d}. Furthermore, in this regard the δ contribution provides a source of regularisation, which may be useful if there is reason to believe the n_{t,d} values are subject to variation beyond the scope of the basic nowcasting framework. For example, if a death can switch category from a COVID-19-suspected death to a cause other than COVID-19 in later data releases, this could result in a negative n_{t,d} count, which can be modelled as an error to be regularised.

A further modification that can be applied if the time-delay surface n_{t,d} has a complex structure is to split the data into two components and model each with separate kernels. For example, if delays of 0 or 1 weeks account for a large fraction of total counts, they can be considered separately from delays > 1. This approach is considered later in section 4.2. A more generic formulation, however, is to consider a 2D kernel to fully account for the time-delay correlation structure, which is introduced below.
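The additive construction of Eqn. 6 (smooth SE trend plus rough Matérn-1/2 variation plus white noise), and the Kronecker combination of per-dimension Gram matrices used for the separable time-delay kernel, can be sketched as follows. All hyperparameter values are hypothetical.

```python
import numpy as np

def se_kernel(x, alpha, rho):
    # Squared-exponential kernel: smooth, long-range trend component.
    d = np.abs(x[:, None] - x[None, :])
    return alpha**2 * np.exp(-d**2 / (2 * rho**2))

def matern12_kernel(x, alpha, rho):
    # Matern-1/2 (exponential) kernel: rougher sample paths than SE.
    d = np.abs(x[:, None] - x[None, :])
    return alpha**2 * np.exp(-d / rho)

t = np.arange(15, dtype=float)
# Additive kernel in the spirit of Eqn. 6: long-range SE + short-range
# Matern-1/2 + white-noise regulariser (the Kronecker-delta term).
sigma = 0.1
K_add = (se_kernel(t, alpha=1.0, rho=8.0)
         + matern12_kernel(t, alpha=0.3, rho=1.0)
         + sigma**2 * np.eye(len(t)))

# A separable time-delay kernel combines per-dimension Gram matrices
# with a Kronecker product: K_{t,d} = K_t (x) K_d.
d_idx = np.arange(4, dtype=float)
K_t = se_kernel(t, alpha=1.0, rho=8.0)
K_d = matern12_kernel(d_idx, alpha=1.0, rho=2.0)
K_2d = np.kron(K_t, K_d)
```

The white-noise term also guarantees a strictly positive-definite matrix, which stabilises Cholesky factorisation in practice.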
3.3.2 2D Kernel Model

As a further expansion of the approach described above, we introduce a separable two-dimensional kernel over time and delay, k((t_i, t_j), (d_i, d_j)) = k_t(t_i, t_j) k_d(d_i, d_j). Separable kernels can be efficiently implemented using Kronecker product algebra, as described in Flaxman et al. [2015]. Specifically, individual Gram matrices for time and delay are combined using the Kronecker product such that

K_{t,d} = K_t ⊗ K_d. (7)

As before, the kernel can be given an additive structure over multiple lengthscales. For example,

k_long(t, d) = k_t^long k_d^long,   k_short(t, d) = k_t^short k_d^short,
log(λ_{t,d}) ∼ GP(0, k_long(t, d) + k_short(t, d)). (8)

This approach captures the relationship between t and d. In both the 1D and 2D kernel approaches it is possible to perform partial pooling of the model's parameters by combining two or more spatial locations with similar features, for example neighbouring states, if limited data are available. In practice, however, we found limited gain in doing so, as our approach works well with relatively few observations.

4.1 DATA

The number of deaths per date has been extracted from the Brazilian Ministry of Health's Sistema de Informação de Vigilância Epidemiológica da Gripe (SIVEP-Gripe) database [Ministério da Saúde, 2020]. SIVEP-Gripe is a large publicly available database providing anonymised patient-level records of all individuals who died or were hospitalised with suspected or confirmed COVID-19 [Bastos et al., 2020, de Souza et al., 2020, Niquini et al., 2020]. New data have been released regularly online, on a weekly basis during the second half of 2020 considered here. In this study, we extracted all SIVEP-Gripe data releases from 7-Jul-2020 to 8-Feb-2021. We consider all cases of suspected or confirmed COVID-19 (classes 4 and 5).

There are a number of potential sources of error in the reported SIVEP data.
One is under-ascertainment: systematic biases which are beyond the scope of correction by this nowcasting methodology. Another source of error is delayed classification. After the initial input of a patient's data into the database (usually at the time of hospitalisation), the entry might later be updated with clinical and laboratory data, including confirmatory COVID-19 testing. Further updates will include the outcome and its date (i.e. date of death or date of hospital discharge), and cases receive a final classification. Cases can be classified as COVID-19 (classi_fin=5), other causes (classi_fin=1-3) or unknown (classi_fin=4). Despite being described as a "final classification", reclassification does occur, and it is especially common for unknown cases to be reclassified as COVID-19 once results from confirmatory tests are reported to the health authorities. On the other hand, some deaths attributed to suspected SARS-CoV-2 infection are later 'removed' from the SIVEP database, due to duplicate filtering or because they are eventually attributed to other diseases. This can cause the number of deaths on certain days to decrease in consecutive data releases, as shown in Figure S1 in the Supplement.

The number of deaths per day as reported by each release is presented in Figure 2, together with a reporting triangle showing the distribution of the reporting delays across time. According to the SIVEP-Gripe dataset, over 90% of all deaths have been reported with a delay of less than 10 weeks (Figure 2B). We therefore choose the maximum reporting delay for our data to be D = 10, and sum up all deaths reported with a delay longer than 10 weeks. Finally, to create the reporting triangle appropriate for our model, we aggregate the data into weeks.
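The construction just described, binning deaths by week and delay while collapsing delays longer than D = 10 weeks into the final delay bin, can be sketched as follows. The (death week, report week) records are hypothetical stand-ins for what would be derived by comparing consecutive data releases.

```python
import numpy as np

# Hypothetical records: (death_week, report_week) pairs. Delays longer
# than D weeks are collapsed into the final delay bin, as in the text.
D = 10
records = [(0, 0), (0, 1), (0, 14), (1, 2), (1, 1), (2, 2), (2, 13)]
T = 1 + max(death_week for death_week, _ in records)

triangle = np.zeros((T, D + 1), dtype=int)
for death_week, report_week in records:
    delay = report_week - death_week
    triangle[death_week, min(delay, D)] += 1  # cap long delays at D

# Row sums recover the (eventually complete) weekly death totals.
weekly_totals = triangle.sum(axis=1)
```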
We fit and present 9 models. For 1D kernel GPs, we consider a single SE kernel (1D SE), and additive long- and short-range component kernels (1D SE+SE and 1D SE+Mat). For the additive long- and short-range component kernels we also consider splitting the data across delays greater than and less than one week (1D SE+SE data-split and 1D SE+Mat data-split). Finally, we consider a 2D kernel GP model with additive long- and short-range components (2D). The NobBS model of McGough et al. [2020] is fitted and presented as a reference for the current state of the art. All models are fitted to the SIVEP-Gripe weekly COVID-19 deaths reported in Brazil, available up to 8-Feb-2021.

Posterior samples of the parameters in the models were generated using Hamiltonian Monte Carlo with Stan [Hoffman and Gelman, 2014, Carpenter et al., 2017], via the PyStan interface (version 2.19.0.0). For each fit we used 4 chains and 1000 iterations, with 400 iterations dedicated to warm-up. The convergence of each model fit was evaluated by ensuring that R̂ < 1.01 for each parameter. Traceplots and other MCMC diagnostic measures were also investigated (Tables S3-S4 and Figures S13-S14).

Each of the models, characterised by the likelihood given in Eqn. 2 and a latent GP part modelling λ_{t,d} (section 3.2), is trained by supplying the reporting triangle n_{t,d} filled with the data available up to the point of the nowcast. Each of the parameters governing the model, such as the overdispersion r or the lengthscales and variances of the GPs, is learnt during the model fit. The best performing hyperparameters of the prior distributions were selected conditioned on the observed results. All of these parameters and their prior densities are given in the Supplementary Material, Table S1. The training of the model and nowcasting, through sampling each element of the n_{t,d} matrix, are done simultaneously. Specifically, at each iteration parameter values are sampled and immediately used to sample from the negative binomial distribution to obtain all elements of the n_{t,d} matrix.

Other nowcasting methods, including NobBS, focus primarily on estimating only the "missing" part of the n_{t,d} array and comparing the total numbers n_t, that is, the sums of each row of the array. Here, we aim to obtain a statistical model explaining all elements of the n_{t,d} matrix. The reason for this is twofold: firstly, having a model that describes the whole n_{t,d} surface well increases the reliability of the model, which is vital in any healthcare setting. Secondly, the SIVEP-Gripe database contains hard-to-identify errors, discussed in section 4.1, so it is preferable to treat the reported data with additional statistical uncertainty. The fit of the 2D GP and NobBS models to the n_{t,d} matrix is presented in Figure 3, and shows that the GP-based nowcasting method fits the time-delay structure much more closely than NobBS.
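The R̂ < 1.01 convergence criterion used for the fits above can be illustrated with a simplified split-R̂ computation (Stan's actual diagnostic additionally rank-normalises the draws; the chains below are synthetic):

```python
import numpy as np

def split_rhat(chains):
    """Simplified split-R-hat: split each chain in half and compare
    between-half and within-half variances. Values near 1 indicate
    mixing; the text uses the common threshold R-hat < 1.01."""
    chains = np.asarray(chains, dtype=float)
    n = chains.shape[1] // 2
    halves = np.vstack([chains[:, :n], chains[:, n:2 * n]])  # m chains -> 2m halves
    m, n = halves.shape
    chain_means = halves.mean(axis=1)
    within = halves.var(axis=1, ddof=1).mean()   # W: within-half variance
    between = n * chain_means.var(ddof=1)        # B: between-half variance
    var_plus = (n - 1) / n * within + between / n
    return np.sqrt(var_plus / within)

rng = np.random.default_rng(3)
mixed = rng.standard_normal((4, 1000))               # 4 well-mixed chains
stuck = mixed + np.array([[0.0], [0.0], [0.0], [5.0]])  # one chain far away
rhat_good = split_rhat(mixed)
rhat_bad = split_rhat(stuck)
```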
Figure 3: Reported and nowcasted numbers of deaths with reporting delay 0 to 5 weeks, generated by the 2D GP and NobBS models. These plots show the columns of the reporting triangle n_{t,d}. The reported data are shown with solid lines, and the 50% CrI for the nowcasts with the ribbons.

4.3 RETROSPECTIVE TESTING