Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Nicholas Generous is active.

Publication


Featured researches published by Nicholas Generous.


PLOS Computational Biology | 2014

Global Disease Monitoring and Forecasting with Wikipedia

Nicholas Generous; Geoffrey Fairchild; Alina Deshpande; Sara Y. Del Valle; Reid Priedhorsky

Infectious disease is a leading threat to public health, economic stability, and other key social structures. Efforts to mitigate these impacts depend on accurate and timely monitoring to measure the risk and progress of disease. Traditional, biologically-focused monitoring techniques are accurate but costly and slow; in response, new techniques based on social internet data, such as social media and search queries, are emerging. These efforts are promising, but important challenges in the areas of scientific peer review, breadth of diseases and countries, and forecasting hamper their operational usefulness. We examine a freely available, open data source for this use: access logs from the online encyclopedia Wikipedia. Using linear models, language as a proxy for location, and a systematic yet simple article selection procedure, we tested 14 location-disease combinations and demonstrate that these data feasibly support an approach that overcomes these challenges. Specifically, our proof-of-concept yields models with up to 0.92, forecasting value up to the 28 days tested, and several pairs of models similar enough to suggest that transferring models from one location to another without re-training is feasible. Based on these preliminary results, we close with a research agenda designed to overcome these challenges and produce a disease monitoring and forecasting system that is significantly more effective, robust, and globally comprehensive than the current state of the art.


PLOS Computational Biology | 2015

Forecasting the 2013-2014 influenza season using Wikipedia.

Kyle S. Hickmann; Geoffrey Fairchild; Reid Priedhorsky; Nicholas Generous; James M. Hyman; Alina Deshpande; Sara Y. Del Valle

Infectious diseases are one of the leading causes of morbidity and mortality around the world; thus, forecasting their impact is crucial for planning an effective response strategy. According to the Centers for Disease Control and Prevention (CDC), seasonal influenza affects 5% to 20% of the U.S. population and causes major economic impacts resulting from hospitalization and absenteeism. Understanding influenza dynamics and forecasting its impact is fundamental for developing prevention and mitigation strategies. We combine modern data assimilation methods with Wikipedia access logs and CDC influenza-like illness (ILI) reports to create a weekly forecast for seasonal influenza. The methods are applied to the 2013-2014 influenza season but are sufficiently general to forecast any disease outbreak, given incidence or case count data. We adjust the initialization and parametrization of a disease model and show that this allows us to determine systematic model bias. In addition, we provide a way to determine where the model diverges from observation and evaluate forecast accuracy. Wikipedia article access logs are shown to be highly correlated with historical ILI records and allow for accurate prediction of ILI data several weeks before it becomes available. The results show that prior to the peak of the flu season, our forecasting method produced 50% and 95% credible intervals for the 2013-2014 ILI observations that contained the actual observations for most weeks in the forecast. However, since our model does not account for re-infection or multiple strains of influenza, the tail of the epidemic is not predicted well after the peak of flu season has passed.


EPJ Data Science | 2015

Enhancing disease surveillance with novel data streams: challenges and opportunities

Benjamin M. Althouse; Samuel V. Scarpino; Lauren Ancel Meyers; John W. Ayers; Marisa Bargsten; Joan Baumbach; John S. Brownstein; Lauren Castro; Hannah E. Clapham; Derek A. T. Cummings; Sara Y. Del Valle; Stephen Eubank; Geoffrey Fairchild; Lyn Finelli; Nicholas Generous; Dylan B. George; David Harper; Laurent Hébert-Dufresne; Michael A. Johansson; Kevin Konty; Marc Lipsitch; Gabriel J. Milinovich; Joseph D. Miller; Elaine O. Nsoesie; Donald R. Olson; Michael J. Paul; Philip M. Polgreen; Reid Priedhorsky; Jonathan M. Read; Isabel Rodriguez-Barraquer

Novel data streams (NDS), such as web search data or social media updates, hold promise for enhancing the capabilities of public health surveillance. In this paper, we outline a conceptual framework for integrating NDS into current public health surveillance. Our approach focuses on two key questions: What are the opportunities for using NDS and what are the minimal tests of validity and utility that must be applied when using NDS? Identifying these opportunities will necessitate the involvement of public health authorities and an appreciation of the diversity of objectives and scales across agencies at different levels (local, state, national, international). We present the case that clearly articulating surveillance objectives and systematically evaluating NDS and comparing the performance of NDS to existing surveillance data and alternative NDS data is critical and has not sufficiently been addressed in many applications of NDS currently in the literature.


PLOS ONE | 2014

Advancing a Framework to Enable Characterization and Evaluation of Data Streams Useful for Biosurveillance

Kristen Margevicius; Nicholas Generous; Kirsten Taylor-McCabe; Mac G. Brown; W. Brent Daniel; Lauren Castro; Andrea Hengartner; Alina Deshpande

In recent years, biosurveillance has become the buzzword under which a diverse set of ideas and activities regarding detecting and mitigating biological threats are incorporated depending on context and perspective. Increasingly, biosurveillance practice has become global and interdisciplinary, requiring information and resources across public health, One Health, and biothreat domains. Even within the scope of infectious disease surveillance, multiple systems, data sources, and tools are used with varying and often unknown effectiveness. Evaluating the impact and utility of state-of-the-art biosurveillance is, in part, confounded by the complexity of the systems and the information derived from them. We present a novel approach conceptualizing biosurveillance from the perspective of the fundamental data streams that have been or could be used for biosurveillance and to systematically structure a framework that can be universally applicable for use in evaluating and understanding a wide range of biosurveillance activities. Moreover, the Biosurveillance Data Stream Framework and associated definitions are proposed as a starting point to facilitate the development of a standardized lexicon for biosurveillance and characterization of currently used and newly emerging data streams. Criteria for building the data stream framework were developed from an examination of the literature, analysis of information on operational infectious disease biosurveillance systems, and consultation with experts in the area of biosurveillance. To demonstrate utility, the framework and definitions were used as the basis for a schema of a relational database for biosurveillance resources and in the development and use of a decision support tool for data stream evaluation.


The Journal of Infectious Diseases | 2016

Epidemic forecasting is messier than weather forecasting: The role of human behavior and internet data streams in epidemic forecast

Kelly Renee Moran; Geoffrey Fairchild; Nicholas Generous; Kyle S. Hickmann; Dave Osthus; Reid Priedhorsky; James M. Hyman; Sara Y. Del Valle

Mathematical models, such as those that forecast the spread of epidemics or predict the weather, must overcome the challenges of integrating incomplete and inaccurate data in computer simulations, estimating the probability of multiple possible scenarios, incorporating changes in human behavior and/or the pathogen, and environmental factors. In the past 3 decades, the weather forecasting community has made significant advances in data collection, assimilating heterogeneous data steams into models and communicating the uncertainty of their predictions to the general public. Epidemic modelers are struggling with these same issues in forecasting the spread of emerging diseases, such as Zika virus infection and Ebola virus disease. While weather models rely on physical systems, data from satellites, and weather stations, epidemic models rely on human interactions, multiple data sources such as clinical surveillance and Internet data, and environmental or biological factors that can change the pathogen dynamics. We describe some of similarities and differences between these 2 fields and how the epidemic modeling community is rising to the challenges posed by forecasting to help anticipate and guide the mitigation of epidemics. We conclude that some of the fundamental differences between these 2 fields, such as human behavior, make disease forecasting more challenging than weather forecasting.


conference on computer supported cooperative work | 2017

Measuring Global Disease with Wikipedia: Success, Failure, and a Research Agenda

Reid Priedhorsky; Dave Osthus; Ashlynn R. Daughton; Kelly Renee Moran; Nicholas Generous; Geoffrey Fairchild; Alina Deshpande; Sara Y. Del Valle

Effective disease monitoring provides a foundation for effective public health systems. This has historically been accomplished with patient contact and bureaucratic aggregation, which tends to be slow and expensive. Recent internet-based approaches promise to be real-time and cheap, with few parameters. However, the question of when and how these approaches work remains open. We addressed this question using Wikipedia access logs and category links. Our experiments, replicable and extensible using our open source code and data, test the effect of semantic article filtering, amount of training data, forecast horizon, and model staleness by comparing across 6 diseases and 4 countries using thousands of individual models. We found that our minimal-configuration, language-agnostic article selection process based on semantic relatedness is effective for improving predictions, and that our approach is relatively insensitive to the amount and age of training data. We also found, in contrast to prior work, very little forecasting value, and we argue that this is consistent with theoretical considerations about the nature of forecasting. These mixed results lead us to propose that the currently observational field of internet-based disease surveillance must pivot to include theoretical models of information flow as well as controlled experiments based on simulations of disease.


PLOS ONE | 2016

The Biosurveillance Analytics Resource Directory (BARD): Facilitating the Use of Epidemiological Models for Infectious Disease Surveillance

Kristen Margevicius; Nicholas Generous; Esteban Abeyta; Ben Althouse; Howard Burkom; Lauren Castro; Ashlynn R. Daughton; Sara Y. Del Valle; Geoffrey Fairchild; James M. Hyman; Richard K. Kiang; Andrew P. Morse; Carmen M. Pancerella; Laura L. Pullum; Arvind Ramanathan; Jeffrey Schlegelmilch; Aaron E. Scott; Kirsten Taylor-McCabe; Alessandro Vespignani; Alina Deshpande

Epidemiological modeling for infectious disease is important for disease management and its routine implementation needs to be facilitated through better description of models in an operational context. A standardized model characterization process that allows selection or making manual comparisons of available models and their results is currently lacking. A key need is a universal framework to facilitate model description and understanding of its features. Los Alamos National Laboratory (LANL) has developed a comprehensive framework that can be used to characterize an infectious disease model in an operational context. The framework was developed through a consensus among a panel of subject matter experts. In this paper, we describe the framework, its application to model characterization, and the development of the Biosurveillance Analytics Resource Directory (BARD; http://brd.bsvgateway.org/brd/), to facilitate the rapid selection of operational models for specific infectious/communicable diseases. We offer this framework and associated database to stakeholders of the infectious disease modeling field as a tool for standardizing model description and facilitating the use of epidemiological models.


PLOS ONE | 2014

Selecting essential information for biosurveillance--a multi-criteria decision analysis.

Nicholas Generous; Kristen Margevicius; Kirsten Taylor-McCabe; Mac G. Brown; W. Brent Daniel; Lauren Castro; Andrea Hengartner; Alina Deshpande

The National Strategy for Biosurveillancedefines biosurveillance as “the process of gathering, integrating, interpreting, and communicating essential information related to all-hazards threats or disease activity affecting human, animal, or plant health to achieve early detection and warning, contribute to overall situational awareness of the health aspects of an incident, and to enable better decision-making at all levels.” However, the strategy does not specify how “essential information” is to be identified and integrated into the current biosurveillance enterprise, or what the metrics qualify information as being “essential”. Thequestion of data stream identification and selection requires a structured methodology that can systematically evaluate the tradeoffs between the many criteria that need to be taken in account. Multi-Attribute Utility Theory, a type of multi-criteria decision analysis, can provide a well-defined, structured approach that can offer solutions to this problem. While the use of Multi-Attribute Utility Theoryas a practical method to apply formal scientific decision theoretical approaches to complex, multi-criteria problems has been demonstrated in a variety of fields, this method has never been applied to decision support in biosurveillance.We have developed a formalized decision support analytic framework that can facilitate identification of “essential information” for use in biosurveillance systems or processes and we offer this framework to the global BSV community as a tool for optimizing the BSV enterprise. To demonstrate utility, we applied the framework to the problem of evaluating data streams for use in an integrated global infectious disease surveillance system.


Online Journal of Public Health Informatics | 2018

Epi Archive: Automated Synthesis of Global Notifiable Disease Data

Hari Khalsa; Sergio Cordova; Nicholas Generous; Prabhu Singh Khalsa; Byron Tasseff; James Arnold

Introduction Most countries do not report national notifiable disease data in a machine-readable format. Data are often in the form of a file that contains text, tables and graphs summarizing weekly or monthly disease counts. This presents a problem when information is needed for more data intensive approaches to epidemiology, biosurveillance and public health. While most nations likely store incident data in a machinereadable format, governments are often hesitant to share data openly for a variety of reasons that include technical, political, economic, and motivational issues1. A survey conducted by LANL of notifiable disease data reporting in over fifty countries identified only a few websites that report data in a machine-readable format. The majority (>70%) produce reports as PDF files on a regular basis. The bulk of the PDF reports present data in a structured tabular format, while some report in natural language. The structure and format of PDF reports change often; this adds to the complexity of identifying and parsing the desired data. Not all websites publish in English, and it is common to find typos and clerical errors. LANL has developed a tool, Epi Archive, to collect global notifiable disease data automatically and continuously and make it uniform and readily accessible.


Scientific Reports | 2017

Corrigendum: An approach to and web-based tool for infectious disease outbreak intervention analysis

Ashlynn R. Daughton; Nicholas Generous; Reid Priedhorsky; Alina Deshpande

This corrects the article DOI: 10.1038/srep46076.

Collaboration


Dive into the Nicholas Generous's collaboration.

Top Co-Authors

Avatar

Alina Deshpande

Los Alamos National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Geoffrey Fairchild

Los Alamos National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Reid Priedhorsky

Los Alamos National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Ashlynn R. Daughton

Los Alamos National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Lauren Castro

Los Alamos National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Sara Y. Del Valle

Los Alamos National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Kirsten Taylor-McCabe

Los Alamos National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Andrea Hengartner

Los Alamos National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Kristen Margevicius

Los Alamos National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Nileena Velappan

Los Alamos National Laboratory

View shared research outputs
Researchain Logo
Decentralizing Knowledge