Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Reid A. Johnson is active.

Publication


Featured researches published by Reid A. Johnson.


international symposium on neural networks | 2014

Using HDDT to avoid instances propagation in unbalanced and evolving data streams

Andrea Dal Pozzolo; Reid A. Johnson; Olivier Caelen; Serge Waterschoot; Nitesh V. Chawla; Gianluca Bontempi

Hellinger Distance Decision Trees [10] (HDDT) has been previously used for static datasets with skewed distributions. In unbalanced data streams, state-of-the-art techniques use instance propagation and standard decision trees (e.g. C4.5 [27]) to cope with the unbalanced problem. However it is not always possible to revisit/store old instances of a stream. In this paper we show how HDDT can be successfully applied in unbalanced and evolving stream data. Using HDDT allows us to remove instance propagations between batches with several benefits: i) improved predictive accuracy ii) speed iii) single-pass through the data. We use a Hellinger weighted ensemble of HDDTs to combat concept drift and increase accuracy of single classifiers. We test our framework on several streaming datasets with unbalanced classes and concept drift.


intelligent data analysis | 2013

Classifier Evaluation with Missing Negative Class Labels

Andrew K. Rider; Reid A. Johnson; Darcy A. Davis; T. Ryan Hoens; Nitesh V. Chawla

The concept of a negative class does not apply to many problems for which classification is increasingly utilized. In this study we investigate the reliability of evaluation metrics when the negative class contains an unknown proportion of mislabeled positive class instances. We examine how evaluation metrics can inform us about potential systematic biases in the data. We provide a motivating case study and a general framework for approaching evaluation when the negative class contains mislabeled positive class instances. We show that the behavior of evaluation metrics is unstable in the presence of uncertainty in class labels and that the stability of evaluation metrics depends on the kind of bias in the data. Finally, we show that the type and amount of bias present in data can have a significant effect on the ranking of evaluation metrics and the degree to which they over- or underestimate the true performance of classifiers.


knowledge discovery and data mining | 2017

Structural Diversity and Homophily: A Study Across More Than One Hundred Big Networks

Yuxiao Dong; Reid A. Johnson; Jian Xu; Nitesh V. Chawla

A widely recognized organizing principle of networks is structural homophily, which suggests that people with more common neighbors are more likely to connect with each other. However, what influence the diverse structures embedded in common neighbors have on link formation is much less well-understood. To explore this problem, we begin by characterizing the structural diversity of common neighborhoods. Using a collection of 120 large-scale networks, we demonstrate that the impact of the common neighborhood diversity on link existence can vary substantially across networks. We find that its positive effect on Facebook and negative effect on LinkedIn suggest different underlying networking needs in these networks. We also discover striking cases where diversity violates the principle of homophily---that is, where fewer mutual connections may lead to a higher tendency to link with each other. We then leverage structural diversity to develop a common neighborhood signature (CNS), which we apply to a large set of networks to uncover unique network superfamilies not discoverable by conventional methods. Our findings shed light on the pursuit to understand the ways in which network structures are organized and formed, pointing to potential advancement in designing graph generation models and recommender systems.


knowledge discovery and data mining | 2012

ALIVE: a multi-relational link prediction environment for the healthcare domain

Reid A. Johnson; Yang Yang; Everaldo Aguiar; Andrew K. Rider; Nitesh V. Chawla

An underlying assumption of biomedical informatics is that decisions can be more informed when professionals are assisted by analytical systems. For this purpose, we propose ALIVE, a multi-relational link prediction and visualization environment for the healthcare domain. ALIVE combines novel link prediction methods with a simple user interface and intuitive visualization of data to enhance the decision-making process for healthcare professionals. It also includes a novel link prediction algorithm, MRPF, which outperforms many comparable algorithms on multiple networks in the biomedical domain. ALIVE is one of the first attempts to provide an analytical and visual framework for healthcare analytics, promoting collaboration and sharing of data through ease of use and potential extensibility. We encourage the development of similar tools, which can assist in facilitating successful sharing, collaboration, and a vibrant online community.


Information Fusion | 2019

Characterizing online health and wellness information consumption: A study

Aastha Nigam; Reid A. Johnson; Dong Wang; Nitesh V. Chawla

Abstract To seek answers to health queries, we often find ourselves on a quest to assimilate information from varied online sources. This information search and fusion from different sources elicits user preferences, which can be driven by demographics, context, and socio-economic factors. To that end, we study these factors as part of health-information seeking behavior of users on a large health and wellness-based knowledge sharing online platform. We begin by identifying the topical interests of users from different content consumption sources. Using these topical preferences, we explore information consumption and health-seeking behavior across three contextual dimensions: user-based demographic attributes, time-related features, and community-based socioeconomic factors. We then study how these context signals can be used to explain specific user health topic preferences. Our findings suggest that linking demographic features to user profiles is more effective in explaining health preferences than other features. Our work demonstrates the value of using contextual factors to characterize and understand the content consumption of users seeking health and wellness information online.


pacific-asia conference on knowledge discovery and data mining | 2015

Optimizing Classifiers for Hypothetical Scenarios

Reid A. Johnson; Troy Raeder; Nitesh V. Chawla

The deployment of classification models is an integral component of many modern data mining and machine learning applications. A typical classification model is built with the tacit assumption that the deployment scenario by which it is evaluated is fixed and fully characterized. Yet, in the practical deployment of classification methods, important aspects of the application environment, such as the misclassification costs, may be uncertain during model building. Moreover, a single classification model may be applied in several different deployment scenarios. In this work, we propose a method to optimize a model for uncertain deployment scenarios. We begin by deriving a relationship between two evaluation measures, H measure and cost curves, that may be used to address uncertainty in classifier performance. We show that when uncertainty in classifier performance is modeled as a probabilistic belief that is a function of this underlying relationship, a natural definition of risk emerges for both classifiers and instances. We then leverage this notion of risk to develop a boosting-based algorithm—which we call RiskBoost—that directly mitigates classifier risk, and we demonstrate that it outperforms AdaBoost on a diverse selection of datasets.


web search and data mining | 2015

Will This Paper Increase Your h -index?: Scientific Impact Prediction

Yuxiao Dong; Reid A. Johnson; Nitesh V. Chawla


IEEE Transactions on Big Data | 2016

Can Scientific Impact Be Predicted

Yuxiao Dong; Reid A. Johnson; Nitesh V. Chawla


european conference on machine learning | 2015

Will this paper increase your h -index?

Yuxiao Dong; Reid A. Johnson; Nitesh V. Chawla


conference on intelligent data understanding | 2012

Species distribution modeling and prediction: A class imbalance problem

Reid A. Johnson; Nitesh V. Chawla; Jessica J. Hellmann

Collaboration


Dive into the Reid A. Johnson's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Jian Xu

University of Notre Dame

View shared research outputs
Top Co-Authors

Avatar

Andrea Dal Pozzolo

Université libre de Bruxelles

View shared research outputs
Top Co-Authors

Avatar

Gianluca Bontempi

Université libre de Bruxelles

View shared research outputs
Top Co-Authors

Avatar

Olivier Caelen

Université libre de Bruxelles

View shared research outputs
Top Co-Authors

Avatar

Aastha Nigam

University of Notre Dame

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge