[PDF] A generalized linear threshold model for an improved description of the spreading dynamics

Abstract

Many spreading processes in our real-life can be considered as a complex contagion, and the linear threshold (LT) model is often applied as a very representative model for this mechanism. Despite its intensive usage, the LT model suffers several limitations in describing the time evolution of the spreading. First, the discrete-time step that captures the speed of the spreading is vaguely defined. Second, the synchronous updating rule makes the nodes infected in batches, which can not take individual differences into account. Finally, the LT model is incompatible with existing models for the simple contagion. Here we consider a generalized linear threshold (GLT) model for the continuous-time stochastic complex contagion process that can be efficiently implemented by the Gillespie algorithm. The time in this model has a clear mathematical definition and the updating order is rigidly defined. We find that the traditional LT model systematically underestimates the spreading speed and the randomness in the spreading sequence order. We also show that the GLT model works seamlessly with the susceptible-infected (SI) or susceptible-infected-recovered (SIR) model. One can easily combine them to model a hybrid spreading process in which simple contagion accumulates the critical mass for the complex contagion that leads to the global cascades. Overall, the GLT model we proposed can be a useful tool to study complex contagion, especially when studying the time evolution of the spreading.

Full PDF

AA generalized linear threshold model

A generalized linear threshold model for an improved description of thespreading dynamics

Yijun Ran, Xiaomin Deng, Xiaomeng Wang, and Tao Jia a) College of Computer and Information Science, Southwest University, Beibei, Chongqing,400715 P. R. China (Dated: 18 August 2020)

Many spreading processes in our real-life can be considered as a complex contagion, and the linear threshold (LT)model is often applied as a very representative model for this mechanism. Despite its intensive usage, the LT modelsuffers several limitations in describing the time evolution of the spreading. First, the discrete-time step that capturesthe speed of the spreading is vaguely deﬁned. Second, the synchronous updating rule makes the nodes infected inbatches, which can not take individual differences into account. Finally, the LT model is incompatible with existingmodels for the simple contagion. Here we consider a generalized linear threshold (GLT) model for the continuous-timestochastic complex contagion process that can be efﬁciently implemented by the Gillespie algorithm. The time in thismodel has a clear mathematical deﬁnition and the updating order is rigidly deﬁned. We ﬁnd that the traditional LTmodel systematically underestimates the spreading speed and the randomness in the spreading sequence order. We alsoshow that the GLT model works seamlessly with the susceptible-infected (SI) or susceptible-infected-recovered (SIR)model. One can easily combine them to model a hybrid spreading process in which simple contagion accumulates thecritical mass for the complex contagion that leads to the global cascades. Overall, the GLT model we proposed can bea useful tool to study complex contagion, especially when studying the time evolution of the spreading.

The linear threshold (LT) model is a typical model for thecomplex contagion process. However, it systematically un-derestimates the spreading speed and the randomness inthe spreading sequence order. To cope with this issue,we propose a generalized linear threshold (GLT) model,where the time evolution is controlled by the continuous-time stochastic process. The GLT model can be efﬁ-ciently implemented by the Gillespie algorithm, providinga useful tool to investigate and simulate more complicatedspreading processes, especially when the time evolution isthe focus.

I. INTRODUCTION

The process of adoption such as the adoption ofinnovations , commercial products and socialbehavior , and the process of diffusion such as thespread of rumors , opinions and knowledge canall be described as a kind of contagion process . In theseprocesses, things like information or ideas pass from oneperson to another through the association between the twoindividuals, analogous to the infection of diseases. This kindof contagion process is of particular interest when it occurs insparsely connected networks, where the topology of the net-work has a big impact on the outcome of the spreading ,giving rise to a set of interesting phenomena .The underlying mechanisms generally fall into two cate-gories: simple contagion and complex contagion . The sim-ple contagion is based on disease spreading. An individual, orequivalently a node of a network, has a non-zero probability a) Electronic mail: [email protected]. to be infected if one of the connected neighbors is infected.The infection probability also increases monotonically withthe number of infected neighbors. The complex contagion isinspired by collective behaviors in social systems. It assumesthat the infection will occur only when some critical mass hasreached , which can be either the number of contacts orthe number of infected neighbors . Correspondingly, the in-fection probability is non-monotonic, typically captured by astep function that goes directly from 0 to 1 when the criticalmass has been reached.The linear threshold (LT) model is widely used to studycomplex contagion . In the model, a node will deﬁ-nitely become infected if the fraction of its infected neigh-boring nodes goes beyond a threshold value. Previous worksusing the LT model usually focus on the ﬁnal consequenceof the spreading, such as when the global cascading couldoccur or how to select effective seed nodes to maximizethe spreading . When it comes to the spreading dynam-ics, however, the LT model suffers three limitations. First, theevolution in LT model is controlled by discrete-time steps thatlack a proper deﬁnition, which gives rise to issues when thespeed of the spreading needs to be investigated. Second, thestatus of a node is updated in a synchronous manner. At eachtime step, all nodes currently satisfying the spreading thresh-old will turn into the infected state. This can be an issue inapplication such as machine learning where the order of in-fection can be important information . Finally, the LT modelis not very ﬂexible. It is both theoretically and practicallychallenging if one plans to combine the LT model and othersimple contagion model to model some complicated hybridspreading processes.To overcome these limitations, we consider a generalizedlinear threshold (GLT) model for the continuous-time stochas-tic spreading process that can be efﬁciently implemented bythe Gillespie algorithm . The evolutionary time in the newmodel has a physical meaning, which is associated with the a r X i v : . [ phy s i c s . s o c - ph ] A ug generalized linear threshold model 2rate of the underlying stochastic process. We ﬁnd that com-pared with the GLT model, the traditional LT model tends tounderestimate the spreading speed. The order of nodes beinginfected is properly deﬁned in the GLT model, allowing us tobetter generate synthetic spreading node sequence to modelthe spreading in real systems. Finally, the GLT model iscompatible with the susceptible-infected (SI) or susceptible-infected-recovered (SIR) model , because they are deﬁnedunder the same mathematical framework. One can easily builda hybrid spreading by combing both simple and complex con-tagion, or adding the recovery process into the complex con-tagion. The remainder of the paper is structured as follows.We ﬁrst give a brief description of the classical LT model withboth the synchronous and asynchronous updating rules. Wethen propose the GLT model and show how to model it efﬁ-ciently with the Gillespie algorithm. To further shed light onthis model, we compare the spreading results from the GLTand LT model. Finally, we show how the GLT model can becombined with other spreading models. II. RESULTSA. The linear threshold (LT) model

The LT model was ﬁrst introduced in the ﬁeld of socialscience to analyze the effects of social reinforcement by as-suming that each adoption requires a certain fraction of expo-sures. The community of network science may be more fa-miliar with the work by Duncan Watts where the LT modelis used to study the condition for global spreading. The samemodel was also applied in the community of computer scienceto ﬁnd the optimal initiator set that maximizes the spreadingoutcome . In the LT model, each node is in one of the twopossible states: 0 (inactive, susceptible, etc. ) or 1 (active, in-fected, etc. ). A node i in the network can switch only fromstate 0 to state 1. The transition probability depends on thefraction of its neighbors that are on state 1, denoted by φ i , as p ( φ i ) = (cid:40) φ i < φ ∗ i φ i ≥ φ ∗ i , (1)where φ ∗ i is the threshold value of node i , which can be chosenfrom a probability distribution or stay ﬁxed for all nodes.Note that there are other variations for the choice of threshold,such as the number of contacts or the number of infectedneighbors . In this paper, we adopt the model by Watts thatuses the fraction of infected neighbors.The evolution of the system is characterized by discrete-time steps in the LT model. At each time step, we go throughthe network and calculate the transition probability of eachnode according to Eq.(1). All nodes that can change the stateare updated synchronously in that time step. The process is re-peated until no more nodes can change the state. The time stepthat characterizes the system evolution, however, is never ex-plicitly deﬁned. This may be because that initial studies that proposed the model mainly focused on the outcome of thespreading, which does not depend on the choice of time step or how the system actually evolves with time. Nevertheless,the time step needs a proper deﬁnition when the spreading dy-namics are concerned. Indeed, while the discrete-time step isused in both the LT and SI model, they are inherently differ-ent with distinct physical meanings. In the SI model, the timestep is associated with the probability that the disease is trans-ferred from one node to another, or the chance that a nodegets infected from an infected neighbor. If the time step isequivalent to a longer period of real time, the infection proba-bility would be tuned larger, which eventually gives the samespreading dynamics. As an example, the infection probabilityin disease spreading would be different if the time step refersto an hour or a day. In the LT model, however, the time step isassociated with a node’s status updating, which is independentof the transmission probability. Its physical meaning, relatedto why every node updates its status within one time step, isnot clearly interpreted.Another issue is the order of the infection. Under the syn-chronous updating rule, all nodes satisfying the threshold con-dition change the state together in one time step. Consideringthe case that node i changes the state from 0 to 1, which makesits neighboring node j reach the threshold. While the transi-tion condition is satisﬁed, node j can not change the state inthat time step. In other words, node j ’s state is frozen till allnodes in the same batch of node i complete the transition. Interms of the infection order, node j always ranks behind them.Note that the infection order is important in tasks such astracking the spreading source or learning the embedding ofthe underlying network . The simpliﬁcation of LT modelmay limit its application in generating the synthetic spreadingnode sequence in real systems. A simple ﬁx of this issue isto use the asynchronous updating rule. One option is that ateach time step, we randomly pick only one node from thosewhose threshold is reached and update the node’s state. Inthis way, the spreading order would be more realistic. How-ever, the spreading dynamics would become unrealistic as thenumber of infected nodes increases linearly with time steps.An alternative option is to randomly pick an arbitrary node ateach time step regardless of its threshold condition and up-date its states according to its p ( φ i ) . This actually becomesa Monte Carlo simulation . But the computational com-plexity raised to O ( N ) where N is the number of nodes ina network. More importantly, even though the asynchronousupdating rule can ﬁx the order, it is very difﬁcult to modela system with individual differences. For example, if we as-sume that some nodes are more active and would change thestates faster than others, it would be very difﬁcult to imple-ment this feature in the model.Finally, the LT model is not very ﬂexible. This is partiallyrelated with the vague deﬁnition of the discrete-time step. Ifwe want to model a system with both simple and complexcontagion, we need to deﬁne two types of time steps. Onetype of time step is for the deterministic infection in the LTmodel and the other for the probabilistic infection in the SImodel. The conversion between the two types of time stepcan be an interesting interplay, which, however, lacks a properdeﬁnition and brings challenges for theoretical interpretation.Because the node status is updated synchronous at the end generalized linear threshold model 3of LT time step, the spreading curve will not be smooth butcontaining multiple bursts separated by ﬁxed time intervals.We also need to propose a rule to decide which action shouldoccur ﬁrst when the two types of time step coincides. All thesedifﬁculties increase when more dynamics are involved, suchas adding a recovery process to have the susceptible?infected-susceptible (SIS) or SIR model in the system. Therefore, itis challenging to apply the traditional LT model for complexspreading process. B. A generalized linear threshold model and its stochasticsimulation

To cope with the issues mentioned, we consider a sim-ple variation of the original LT model and generalize it tocontinuous-time stochastic process. In the generalized linearthreshold (GLT) model, a node i has a certain rate to transferfrom state 0 to state 1, which is given by β i ( φ i ) = (cid:40) φ i < φ ∗ i k i if φ i ≥ φ ∗ i . (2)Eq.(2) is similar to Eq.(1), both capturing a threshold dy-namic. When φ i is below the threshold, the transition (or in-fection) can not occur. When φ i is above the threshold, thetransition will occur in certain. The extra information givenby Eq.(2) is the rate k i , which controls the speed of the transi-tion and to what extent node i would be infected ahead of othernodes. By assigning different k i value to different nodes, theindividual differences on the transition are well characterized.To efﬁciently simulate the GLT model, we apply the Gille-spie algorithm . It is an efﬁcient simulation method forthe stochastic process and was heavily used to investigate theinteractions of molecules in chemical systems or the cel-lular growth and division in biological systems . It canalso be used to simulate the epidemic spreading such as SIand SIR model . Indeed, though it is not explicitly spec-iﬁed, when we use a rate k to quantify a dynamic process, weimply that it is a Poisson process with a rate k . The inter-eventtime or waiting time τ is random and follows an exponentialdistribution with a rate k . This property can be generalized tocases when multiple Poisson processes coexist. Assume thatthere are N nodes in the network, each has a transition rate β i . The inter-event time τ for the occurrence of next transitionfollows an exponential distribution with rate ˜ β = ∑ Ni = β i . Inpractice, τ can be efﬁciently calculated from a random num-ber r uniformed picked from the interval (0,1) as τ = − ln r ˜ β . (3)The probability that the transition takes place on node j lin-early depends on its transition rate as p = β j ˜ β . (4) Algorithm 1

The generalized linear threshold model based onthe Gillespie algorithm

Input:

Network G , infection rate β , threshold φ ∗ , initial seeds ρ Output:

Time series list T, susceptible number list S, infected num-ber list I function GLT ( G , β , φ ∗ , ρ ) T , S , I ← [ ] , [ | G | − ρ ] , [ ρ ] where the | G | is the number ofnodes nodes ← nodes in the G in f ected _ nodes ← random.sample( nodes , ρ ) risk _ nodes ← the susceptible neighbors of the in f ected _ nodes susceptible _ nodes ← [] for u in nodes do in f ected _ rate [ u ] ← β end for τ ← for n in risk _ nodes do num [ n ] ← the number of infected neighbors of n degree [ n ] ← the number of neighbors of n if num [ n ] degree [ n ] ≥ φ ∗ then add n into susceptible _ nodes remove n from risk _ nodes end if end for total _ rate ← ∑ n ∈ susceptible _ nodes in f ected _ rate [ n ] while total _ rate > do n = random . choice ( susceptible _ nodes ) (cid:46) If each node has different rate β i in a network, please see belowfor an optimization. remove n from susceptible _ nodes add n into in f ected _ nodes τ ← τ − ln ( random . uni f orm ( . , . )) total _ rate Update T , S , I susceptible _ neighbors ← the susceptible neighbors ofthe n for u in susceptible _ neighbors do if u not in susceptible _ nodes then risk _ nodes ← u end if end for for n in risk _ nodes do num [ n ] ← the number of infected neighbors of n degree [ n ] ← the number of neighbors of n if num [ n ] degree [ n ] ≥ φ ∗ then add n into susceptible _ nodes remove n from risk _ nodes end if end for total _ rate ← ∑ n ∈ susceptible _ nodes in f ected _ rate [ n ] end while return T , S , I end function The Gillespie algorithm takes this property of the stochasticprocess. At each simulation step, it decides, in a randommanner, which event would occur and when it would occur.The procedure can be summarized as follows:1. At the time t , ﬁnd all events that may occur (with a positiverate) and get the sum of the rate ˜ β .2. Generate a random variable τ from an exponential distri- generalized linear threshold model 4bution with rate ˜ β .3. Randomly draw an event according to the probability of p in Eq.(4)4. Update the system according to the event drawn. Updatethe time from t to t + τ .5. Repeat from step 1.To illustrate the simulation of the GLT model, we providethe pseudocode in Algorithm 1. At each simulation step, weneed to determine which action would occur from the rates ofall actions. When the k i is the same for all nodes, or thereare only a few choice of k i values, we can do a random selec-tion of actions to simplify this process, which takes only O ( ) complexity. In comparison with the Monte Carlo version ofthe LT model with complexity O ( N ) , the Gillespie al-gorithm signiﬁcantly reduces the computation cost. When allnodes have different k i values, the Monte Carlo version of theLT model would fail because it assumes that all nodes arepicked to update the states with equal probability . TheGillespie algorithm can handle this situation by deciding theprocess that happens according the rate k i . The selection is atypical ﬁtness proportionate selection, also known as roulette-wheel selection . The complexity is usually O ( N ) becausewe need to calculate β j / ˜ β for every node at each simulationstep. However, using a recently proposed optimization, thecomplexity can be reduced to O ( ) type . Taken togetherwith the N nodes in the system, the complexity to simulatethe whole evolution is roughly O ( N ) . C. The application of the GLT model

To show features of the GLT model, we compare its timeevolution with that of the LT model. Because the transitionrate k i can be any value, we have to ﬁrst adjust the continuousrate and the discrete-time step to make the continuous-timeand discrete-time model comparable. Unlike SI model wherethe relationship among the rate, the infection probability andthe discrete-time step is known , there is no method yet tohandle the parameter conversion in the threshold model. Tocope with this issue, we consider re-scaling the spreading timewindow. We choose average cascade size S = .

98 as our ref-erence point and record the time (either discrete-time steps orcontinuous time) takes from the beginning of the spreadingto S = .

98 as the time window. The discrete-time steps andthe continuous time are then re-scaled such that the spreadingtime window is the same. We consider S = .

98 instead of S = as the baseline, where an arbitrary node is pickedat random at each time step regardless of its threshold condi-tion. The state of the node is then updated according to Eq.(1).This baseline is compared with the spreading generated by the GLT model and the traditional LT model with synchronousupdating rule. The dynamics by the GLT model matches withthe baseline, but the dynamics of the LT model is different,where the infected size grows slower than both the GLT modeland the baseline (Fig. 1(b)). This indicates that the LT modelunderestimates the speed of spreading, supporting our initialstatement of the LT model’s limitations. (a) (b) S t

SI(continuous-time)

SI(discrete-time)

S t LT GLT

LT(Monte Carlo)

FIG. 1: Time evolution of the average cascade size S (usually known as infectedfraction in epidemic researches) on Erdös-Rényi network with the size N = (cid:104) k (cid:105) =

4. (a) The continuous-time and discrete-time SI model. Thecontinuous-time SI model is implemented by the Gillespie algorithm in which eachnode has the same infection rate β =

1. In the discrete-time SI model, the infectionprobability is p = .

01 and nodes’ states are updated synchronously. (b) The GLT andLT model in which each node has the same threshold value φ ∗ = .

16. In the GLTmodel, each node has the same rate β =

1. All curves in (a) and (b) are based on theaverage over 10 runs of simulation. In each run, we choose 1 same node as initiator. The LT model underestimates the spreading dynamics dueto its coarse-grained description of the time evolution. Thespreading speed is not instantaneously updated according tothe number of nodes satisfying the threshold condition. Asan example, let us assume there are 10 nodes satisfying thethreshold in the LT model. Naturally, these 10 nodes will beinfected in the next time step. If we assume one time stepcorresponds to a continuous time T , the infection of each ofthe 10 nodes takes T /

10 time on average. In the GLT model,if there are 10 nodes satisfying the threshold, the infection rateof the ﬁrst node will be 10 × k (assuming k i = k for all nodes).The average waiting time to infect the ﬁrst node is T /

10 if k is set as k = / T , which is the same as that in the LT model.However, the infection of one node will activate more nodesduring the spreading. Therefore, after the infection of thisnode, there will be more than 10 nodes in the system satisfyingthe threshold. The rate for the next infection to occur is greaterthan 10 × k and the average waiting time takes less than T / k i of the node selected, giving it amuch higher priority to change the state, its rank is still very generalized linear threshold model 5 P rank (a) (b)

P rank (c)

P rank

FIG. 2: The distribution of the rank of an infected node in spreading sequences on an Erdös-Rényi network with N = (cid:104) k (cid:105) =

4. The threshold valueis φ ∗ = .

16. We ﬁx the node in all models and check when it will be infected. (a) The LT model, (b) the GLT model in which every node has the same infection rate β =

1, (c) theGLT model in which the node selected has the infection rate β =

10 and other nodes have infection rate β =

1. The results are based on 10 runs of simulation. We select 1 node asthe initiator and ﬁx it in each run. S t

Hybrid

GLT

S t

SI(continuous-time) Hybrid GLT (a) (b)

FIG. 3: Time evolution of the average cascade size S for among the continuous-time SImodel (the blue diamond), the hybrid model (the black square) and the GLT model (thered circle) on the Erdös-Rényi network with N = (cid:104) k (cid:105) =

4. (a) The threshold value φ ∗ = .

25 and every node has the same infection rate β = φ ∗ = .

16 and every nodehas the same infection rate β =

4. The fraction of nodes that follow the GLT model andthe SI model is 1:1 in the hybrid model. We select 1 node as the initiator and ﬁx it ineach run. The curve is based on 10 runs of simulation. randomly distributed (Fig. 2(c)). Therefore, when using theLT model to generate synthetic spreading data, we may under-estimate the complexity of spreading brought by the inherentrandomness.Finally, because the GLT model is based on thecontinuous-time stochastic process, it is compatible with othercontinuous-time stochastic processes. We only need to addmore reactions in the queue when multiple processes coexist.As an example, we apply the GLT model as a tool to simulatethe hybrid spreading process. There are works in epidemicsassuming that the disease infection rate can be different underdifferent conditions . Hence there will be two infectionrates in the system. This feature is hard to implement in theLT model but can be easily added by the GLT model. Here weconsider a more interesting situation. In the threshold spread-ing, to have a global cascade triggered by a single initiator, thethreshold value φ ∗ needs to be small ( φ ∗ ≤ / (cid:104) k (cid:105) ) . Thisbrings questions on how a social spreading, which is usuallybelieved to be the complex contagion, could occur since thethreshold of a real social system may not be that small. Oneexplanation is that there can be multiple initiators . Al-ternatively, we may also assume that both simple and complexcontagion are active . Here we analyze co-evolutionary con-tagion which is a hybrid model combining the SI and GLTmodel. In the model, we assume that there are two types of nodes, one evolves according to the SI model and the otherto the GLT model. The result shows that complex contagionalone can not take place, but the global cascade could occurwhen simple contagion co-exist (Fig. 3(a)). The simulationresult demonstrates the model’s capability in combining otherspreading mechanisms.We further ﬁnd that the dynamic of the hybrid model alwayslocates between the SI model and the GLT model whatever theinfected rate β is when the threshold φ ∗ is smaller than the1 / (cid:104) k (cid:105) (Fig. 3(b)). This shows that the simple contagion accu-mulates the critical mass for the complex contagion, which al-lows other nodes to be infected earlier than when the complexcontagion alone takes place. It also implies that the simpleand complex contagion may demonstrate identical spreadingdynamics under certain parameters. III. CONCLUSION AND DISCUSSION

To summarize, we propose a GLT model for thecontinuous-time complex contagion process. It overcomesthe limitations of the LT model in studying the system evo-lution. The GLT model can be efﬁciently implemented by theGillespie Algorithm. We ﬁnd that the traditional LT modeltends to underestimate the speed of spreading and the random-ness of the spreading sequence, compared with cases whenthe dynamics are more properly deﬁned. We show that theGLT model can be very efﬁcient to simulate more compli-cated spreading. Taken together, the GLT model we proposedcan be a useful tool to study complex contagion, especiallywhen the time evolution of the spreading is the focus. Ourresult not only sheds light on a series of important questionsthat were not emphasized previously, but also brings insightinto the modeling process of real spreading data. Previous re-search shows that real spreading process is usually more com-plex. There are examples of combining multiple spreadingmechanisms . More importantly, the recovery process isincluded in real spreading . This urges us to combine the lin-ear threshold model with SIR model, which is readily doablewith the GLT model proposed in this paper. These more so-phisticated models together with real spreading data would generalized linear threshold model 6deﬁnitely help us understand the underlying patterns in infor-mation spreading.Our model also has some shortcomings, the events basedon the spreading dynamics are described as a Poisson ran-dom process for the GLT model, which may not deal with thereal information spreading well. In the GLT model, whethera node becomes active depends only on the number of cur-rent exposures from its neighbors, without memory effects.The previous records, however, could impact the informationspreading in current time in the real data . Miller stud-ied the equivalence between the generalized epidemic processand the LT model through the percolation theory. They ﬁndthat the generalized epidemic process is completely equiva-lent to the LT model. Using the GLT model, we can extendthe analyses to the continuous-time dynamics. In the future,we can study the equivalence based on the temporal dynamicsbetween the GLT model and the simple contagion. In addi-tion, we can study under what circumstances the two modelscan be distinguished, and factors that make the two modelsequivalent. ACKNOWLEDGMENTS

This research is supported by the Chongqing Graduate Re-search and Innovation Project (Grant No. CYB18080), andthe S-Tech Internet Communication Academic Support Plan.

DATA AVAILABILITY STATEMENT

Data sharing is not applicable to this article as data weregenerated by the theoretical model.

REFERENCES E. M. Rogers,

Diffusion of innovations (Simon and Schuster, 2010). C. H. Weiss, J. Poncela-Casasnovas, J. I. Glaser, A. R. Pah, S. D. Persell,D. W. Baker, R. G. Wunderink, and L. A. N. Amaral, “Adoption of ahigh-impact innovation in a homogeneous population,” Physical review x , 041008 (2014). Z.-K. Zhang, C. Liu, X.-X. Zhan, X. Lu, C.-X. Zhang, and Y.-C. Zhang,“Dynamics of information diffusion and its applications on complex net-works,” Physics Reports , 1–34 (2016). F. M. Bass, “A new product growth for model consumer durables,” Man-agement science , 215–227 (1969). S. Aral, L. Muchnik, and A. Sundararajan, “Distinguishing inﬂuence-basedcontagion from homophily-driven diffusion in dynamic networks,” Pro-ceedings of the National Academy of Sciences , 21544–21549 (2009). C. Jin, C. Song, J. Bjelland, G. Canright, and D. Wang, “Emergence ofscaling in complex substitutive systems,” Nature human behaviour , 837–846 (2019). J. H. Fowler and N. A. Christakis, “Cooperative behavior cascades in hu-man social networks,” Proceedings of the National Academy of Sciences , 5334–5338 (2010). M. Zheng, L. Lü, M. Zhao, et al. , “Spreading in online social networks:The role of social reinforcement,” Physical Review E , 012818 (2013). T. Jia, D. Wang, and B. K. Szymanski, “Quantifying patterns of research-interest evolution,” Nature Human Behaviour , 0078 (2017). Y. Moreno, M. Nekovee, and A. F. Pacheco, “Dynamics of rumor spreadingin complex networks,” Physical Review E , 066130 (2004). D. M. Lazer, M. A. Baum, Y. Benkler, A. J. Berinsky, K. M. Greenhill,F. Menczer, M. J. Metzger, B. Nyhan, G. Pennycook, D. Rothschild, et al. ,“The science of fake news,” Science , 1094–1096 (2018). S. Vosoughi, D. Roy, and S. Aral, “The spread of true and false newsonline,” Science , 1146–1151 (2018). G. Travieso and L. da Fontoura Costa, “Spread of opinions and proportionalvoting,” Physical Review E , 036112 (2006). F. Battiston, G. Cencetti, I. Iacopini, V. Latora, M. Lucas, A. Patania, J.-G.Young, and G. Petri, “Networks beyond pairwise interactions: structureand dynamics,” Physics Reports (2020). J. A. Evans and J. G. Foster, “Metaknowledge,” Science , 721–725(2011). I. Iacopini, S. Milojevi´c, and V. Latora, “Network dynamics of innovationprocesses,” Physical review letters , 048301 (2018). S. Liu, N. Perra, M. Karsai, and A. Vespignani, “Controlling contagionprocesses in activity driven networks,” Physical review letters , 118702(2014). D. Guilbeault, J. Becker, and D. Centola, “Complex contagions: A decadein review,” in

Complex spreading phenomena in social systems (Springer,2018) pp. 3–25. D. Centola,

How behavior spreads: The science of complex contagions ,Vol. 3 (Princeton University Press, 2018). D. Centola, “The spread of behavior in an online social network experi-ment,” science , 1194–1197 (2010). W. Wang, Q.-H. Liu, J. Liang, Y. Hu, and T. Zhou, “Coevolution spreadingin complex networks,” Physics Reports (2019). M. Karsai, M. Kivelä, R. K. Pan, K. Kaski, J. Kertész, A.-L. Barabási, andJ. Saramäki, “Small but slow world: How network topology and burstinessslow down spreading,” Physical Review E , 025102 (2011). L. Lü, D.-B. Chen, and T. Zhou, “The small world yields the most effectiveinformation spreading,” New Journal of Physics , 123005 (2011). J. Xian, D. Yang, L. Pan, W. Wang, and Z. Wang, “Misinformation spread-ing on correlated multiplex networks,” Chaos: An Interdisciplinary Journalof Nonlinear Science , 113123 (2019). C. Castellano and R. Pastor-Satorras, “Thresholds for epidemic spreadingin networks,” Physical review letters , 218701 (2010). J. Borge-Holthoefer, R. A. Baños, S. González-Bailón, and Y. Moreno,“Cascading behaviour in complex socio-technical networks,” Journal ofComplex Networks , 3–24 (2013). D. J. Watts, “A simple model of global cascades on random networks,”Proceedings of the National Academy of Sciences , 5766–5771 (2002). F. Karimi and P. Holme, “Threshold model of cascades in empirical tempo-ral networks,” Physica A: Statistical Mechanics and its Applications ,3476–3483 (2013). W. Wang, M. Tang, P. Shu, and Z. Wang, “Dynamics of social contagionswith heterogeneous adoption thresholds: crossover phenomena in phasetransition,” New Journal of Physics , 013029 (2016). M. Granovetter, “Threshold models of collective behavior,” American jour-nal of sociology , 1420–1443 (1978). D. Kempe, J. Kleinberg, and É. Tardos, “Maximizing the spread of inﬂu-ence through a social network,” in

Proceedings of the ninth ACM SIGKDDinternational conference on Knowledge discovery and data mining (ACM,2003) pp. 137–146. P. Singh, S. Sreenivasan, B. K. Szymanski, and G. Korniss, “Threshold-limited spreading in social networks with multiple initiators,” Scientiﬁc re-ports , 2330 (2013). Q.-H. Liu, F.-M. Lü, Q. Zhang, M. Tang, and T. Zhou, “Impacts of opin-ion leaders on social contagions,” Chaos: An Interdisciplinary Journal ofNonlinear Science , 053103 (2018). D.-B. Chen, H.-L. Sun, Q. Tang, S.-Z. Tian, and M. Xie, “Identifying in-ﬂuential spreaders in complex networks by propagation probability dynam-ics,” Chaos: An Interdisciplinary Journal of Nonlinear Science , 033120(2019). Q. Cao, H. Shen, K. Cen, W. Ouyang, and X. Cheng, “Deephawkes: Bridg-ing the gap between prediction and understanding of information cascades,”in

Proceedings of the 2017 ACM on Conference on Information and Knowl-edge Management (2017) pp. 1149–1158. D. T. Gillespie, “A general method for numerically simulating the stochas-tic time evolution of coupled chemical reactions,” Journal of computationalphysics , 403–434 (1976). generalized linear threshold model 7 D. T. Gillespie, “Exact stochastic simulation of coupled chemical reac-tions,” The journal of physical chemistry , 2340–2361 (1977). J. P. Gleeson, “High-accuracy approximation of binary-state dynamics onnetworks,” Physical Review Letters , 068701 (2011). J. P. Gleeson, “Binary-state dynamics on complex networks: Pair approxi-mation and beyond,” Physical Review X , 021004 (2013). Z. Shen, S. Cao, W.-X. Wang, Z. Di, and H. E. Stanley, “Locating thesource of diffusion in complex networks by time-reversal backward spread-ing,” Physical Review E , 032301 (2016). S. Bourigault, S. Lamprier, and P. Gallinari, “Representation learningfor information diffusion through social networks: an embedded cascademodel,” in

Proceedings of the Ninth ACM international conference on WebSearch and Data Mining (2016) pp. 573–582. C. Gou, H. Shen, P. Du, D. Wu, Y. Liu, and X. Cheng, “Learning sequen-tial features for cascade outbreak prediction,” Knowledge and InformationSystems , 721–739 (2018). M. A. Porter and J. P. Gleeson, “Dynamical systems on networks,” Frontiersin Applied Dynamical Systems: Reviews and Tutorials (2016). N. Sinitsyn, N. Hengartner, and I. Nemenman, “Adiabatic coarse-grainingand simulations of stochastic biochemical networks,” Proceedings of theNational Academy of Sciences , 10546–10551 (2009). R. Ramaswamy and I. F. Sbalzarini, “A partial-propensity formulation ofthe stochastic simulation algorithm for chemical reaction networks with de-lays,” The Journal of chemical physics , 014106 (2011). T. Jia and R. V. Kulkarni, “Intrinsic noise in stochastic models of gene ex-pression with molecular memory and bursting,” Physical review letters ,058102 (2011). S. Qiu, T. Jia, et al. , “Quantifying the noise in bursty gene expression un-der regulation by small rnas,” International Journal of Modern Physics C(IJMPC) , 1–14 (2019). N. Kumar, T. Jia, K. Zarringhalam, and R. V. Kulkarni, “Frequency mod-ulation of stochastic gene expression bursts by strongly interacting smallrnas,” Physical Review E , 042419 (2016). P. G. Fennell, S. Melnik, and J. P. Gleeson, “Limitations of discrete-timeapproaches to continuous-time contagion dynamics,” Physical Review E , 052125 (2016). A. Lipowski and D. Lipowska, “Roulette-wheel selection via stochasticacceptance,” Physica A: Statistical Mechanics and its Applications ,2193–2196 (2012). S. Altizer, A. Dobson, P. Hosseini, P. Hudson, M. Pascual, and P. Rohani,“Seasonality and the dynamics of infectious diseases,” Ecology letters ,467–484 (2006). E. E. Freeman, H. A. Weiss, J. R. Glynn, P. L. Cross, J. A. Whitworth, andR. J. Hayes, “Herpes simplex virus 2 infection increases hiv acquisitionin men and women: systematic review and meta-analysis of longitudinalstudies,” Aids , 73–83 (2006). J. Jankowski, B. K. Szymanski, P. Kazienko, R. Michalski, and P. Bródka,“Probing limits of information spread with sequential seeding,” Scientiﬁcreports , 1–9 (2018). J. Jankowski, P. Bródka, P. Kazienko, B. K. Szymanski, R. Michalski, andT. Kajdanowicz, “Balancing speed and coverage by sequential seeding incomplex networks,” Scientiﬁc reports , 1–11 (2017). Q.-H. Liu, L.-F. Zhong, W. Wang, T. Zhou, and H. Eugene Stanley, “In-teractive social contagions and co-infections on complex networks,” Chaos:An Interdisciplinary Journal of Nonlinear Science , 013120 (2018). X. Wang, Y. Lan, and J. Xiao, “Anomalous structure and dynamics in newsdiffusion among heterogeneous individuals,” Nature human behaviour ,709–718 (2019). J. Wu, M. Zheng, Z.-K. Zhang, W. Wang, C. Gu, and Z. Liu, “A modelof spreading of sudden events on social networks,” Chaos: An Interdisci-plinary Journal of Nonlinear Science , 033113 (2018). Y. Hu, S. Ji, Y. Jin, L. Feng, H. E. Stanley, and S. Havlin, “Local structurecan identify and quantify inﬂuential global spreaders in large scale socialnetworks,” Proceedings of the National Academy of Sciences , 7468–7472 (2018). J. C. Miller, “Equivalence of several generalized percolation models on net-works,” Physical Review E94