IEEE Transactions on Knowledge and Data Engineering | 2019

Viral Cascade Probability Estimation and Maximization in Diffusion Networks

 
 

Abstract


People use social networks to share millions of stories every day, but these stories rarely become viral. Can we estimate the probability that a story becomes a <italic>viral cascade</italic>? If so, can we find a set of users that are more likely to trigger viral cascades? These estimation and maximization problems are very challenging since both rare-event nature of viral cascades and efficiency requirement should be considered. Unfortunately, this problem still remains largely unexplored to date. In this paper, given temporal dynamics of a network, we first develop an efficient <bold>vi</bold>ral <bold>c</bold>ascade probability <bold>e</bold>stimation method, <sc>ViCE</sc>, that leverages an special <italic>importance sampling</italic> approximation to achieve high accuracy, even in the cases of very <italic>small probability</italic> of influence. We then show that the most influential nodes in this model is NP-hard, and develop an efficient <bold>vi</bold>ral <bold>c</bold>ascade probability <bold>m</bold>aximization method, <sc>ViCEM</sc>, that maximizes a surrogate submodular function using a greedy algorithm. Experiments on both synthetic and real-world data show that <sc>ViCE</sc> can accurately estimate viral cascade probabilities using fewer samples than naive sampling by at least two orders of magnitude, and also <sc>ViCEM</sc> finds a set of users with higher viral cascade probability than alternatives. Additionally, experiments show that these algorithms are robust across different network topologies.

Volume 31
Pages 589-600
DOI 10.1109/TKDE.2018.2840998
Language English
Journal IEEE Transactions on Knowledge and Data Engineering

Full Text