Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Bailey K. Fosdick is active.

Publication


Featured researches published by Bailey K. Fosdick.


Science | 2014

World population stabilization unlikely this century.

Patrick Gerland; Adrian E. Raftery; Hana Ševčíková; Nan Li; Danan Gu; Thomas Spoorenberg; Leontine Alkema; Bailey K. Fosdick; Jennifer Chunn; Nevena Lalic; Guiomar Bay; Thomas Buettner; Gerhard K. Heilig; John Wilmoth

The United Nations (UN) recently released population projections based on data until 2012 and a Bayesian probabilistic methodology. Analysis of these data reveals that, contrary to previous literature, the world population is unlikely to stop growing this century. There is an 80% probability that world population, now 7.2 billion people, will increase to between 9.6 billion and 12.3 billion in 2100. This uncertainty is much smaller than the range from the traditional UN high and low variants. Much of the increase is expected to happen in Africa, in part due to higher fertility rates and a recent slowdown in the pace of fertility decline. Also, the ratio of working-age people to older people is likely to decline substantially in all countries, even those that currently have young populations. The 21st century is unlikely to see the end of global population growth. [Also see Perspective by Smeeding] Global population growth continuing The United Nations released new population projections for all countries in July 2014. Gerland et al. analyzed the data and describe the probabilistic population projections for the entire world as well as individual regions and countries (see the Perspective by Smeeding). World population is likely to continue growing for the rest of the century, with at least a 3.5-fold increase in the population of Africa. Furthermore, the ratio of working-age people to older people is almost certain to decline substantially in all countries, not just currently developed ones. Science, this issue p. 234; see also p. 163


PLOS ONE | 2016

Women 1.5 Times More Likely to Leave STEM Pipeline after Calculus Compared to Men: Lack of Mathematical Confidence a Potential Culprit

Jessica Ellis; Bailey K. Fosdick; Chris Rasmussen

The substantial gender gap in the science, technology, engineering, and mathematics (STEM) workforce can be traced back to the underrepresentation of women at various milestones in the career pathway. Calculus is a necessary step in this pathway and has been shown to often dissuade people from pursuing STEM fields. We examine the characteristics of students who begin college interested in STEM and either persist or switch out of the calculus sequence after taking Calculus I, and hence either continue to pursue a STEM major or are dissuaded from STEM disciplines. The data come from a unique, national survey focused on mainstream college calculus. Our analyses show that, while controlling for academic preparedness, career intentions, and instruction, the odds of a woman being dissuaded from continuing in calculus is 1.5 times greater than that for a man. Furthermore, women report they do not understand the course material well enough to continue significantly more often than men. When comparing women and men with above-average mathematical abilities and preparedness, we find women start and end the term with significantly lower mathematical confidence than men. This suggests a lack of mathematical confidence, rather than a lack of mathematically ability, may be responsible for the high departure rate of women. While it would be ideal to increase interest and participation of women in STEM at all stages of their careers, our findings indicate that if women persisted in STEM at the same rate as men starting in Calculus I, the number of women entering the STEM workforce would increase by 75%.


The Annals of Applied Statistics | 2016

Dynamic social networks based on movement

Henry R. Scharf; Mevin B. Hooten; Bailey K. Fosdick; Devin S. Johnson; Josh M. London; John W. Durban

Network modeling techniques provide a means for quantifying social structure in populations of individuals. Data used to define social connectivity are often expensive to collect and based on case-specific, ad hoc criteria. Moreover, in applications involving animal social networks, collection of these data is often opportunistic and can be invasive. Frequently, the social network of interest for a given population is closely related to the way individuals move. Thus telemetry data, which are minimally-invasive and relatively inexpensive to collect, present an alternative source of information. We develop a framework for using telemetry data to infer social relationships among animals. To achieve this, we propose a Bayesian hierarchical model with an underlying dynamic social network controlling movement of individuals via two mechanisms: an attractive effect, and an aligning effect. We demonstrate the model and its ability to accurately identify complex social behavior in simulation, and apply our model to telemetry data arising from killer whales. Using auxiliary information about the study population, we investigate model validity and find the inferred dynamic social network is consistent with killer whale ecology and expert knowledge.


Journal of the American Statistical Association | 2015

Testing and Modeling Dependencies Between a Network and Nodal Attributes

Bailey K. Fosdick; Peter D. Hoff

Network analysis is often focused on characterizing the dependencies between network relations and node-level attributes. Potential relationships are typically explored by modeling the network as a function of the nodal attributes or by modeling the attributes as a function of the network. These methods require specification of the exact nature of the association between the network and attributes, reduce the network data to a small number of summary statistics, and are unable to provide predictions simultaneously for missing attribute and network information. Existing methods that model the attributes and network jointly also assume the data are fully observed. In this article, we introduce a unified approach to analysis that addresses these shortcomings. We use a previously developed latent variable model to obtain a low-dimensional representation of the network in terms of node-specific network factors. We introduce a novel testing procedure to determine if dependencies exist between the network factors and attributes as a surrogate for a test of dependence between the network and attributes. We also present a joint model for the network relations and attributes, for use if the hypothesis of independence is rejected, which can capture a variety of dependence patterns and be used to make inference and predictions for missing observations.


The Annals of Applied Statistics | 2014

SEPARABLE FACTOR ANALYSIS WITH APPLICATIONS TO MORTALITY DATA

Bailey K. Fosdick; Peter D. Hoff

Human mortality data sets can be expressed as multiway data arrays, the dimensions of which correspond to categories by which mortality rates are reported, such as age, sex, country and year. Regression models for such data typically assume an independent error distribution or an error model that allows for dependence along at most one or two dimensions of the data array. However, failing to account for other dependencies can lead to inefficient estimates of regression parameters, inaccurate standard errors and poor predictions. An alternative to assuming independent errors is to allow for dependence along each dimension of the array using a separable covariance model. However, the number of parameters in this model increases rapidly with the dimensions of the array and, for many arrays, maximum likelihood estimates of the covariance parameters do not exist. In this paper, we propose a submodel of the separable covariance model that estimates the covariance matrix for each dimension as having factor analytic structure. This model can be viewed as an extension of factor analysis to array-valued data, as it uses a factor model to estimate the covariance along each dimension of the array. We discuss properties of this model as they relate to ordinary factor analysis, describe maximum likelihood and Bayesian estimation methods, and provide a likelihood ratio testing procedure for selecting the factor model ranks. We apply this methodology to the analysis of data from the Human Mortality Database, and show in a cross-validation experiment how it outperforms simpler methods. Additionally, we use this model to impute mortality rates for countries that have no mortality data for several years. Unlike other approaches, our methodology is able to estimate similarities between the mortality rates of countries, time periods and sexes, and use this information to assist with the imputations.


Siam Review | 2018

Configuring Random Graph Models with Fixed Degree Sequences

Bailey K. Fosdick; Daniel B. Larremore; Joel Nishimura; Johan Ugander

Random graph null models have found widespread application in diverse research communities analyzing network datasets, including social, information, and economic networks, as well as food webs, protein-protein interactions, and neuronal networks. The most popular family of random graph null models, called configuration models, are defined as uniform distributions over a space of graphs with a fixed degree sequence. Commonly, properties of an empirical network are compared to properties of an ensemble of graphs from a configuration model in order to quantify whether empirical network properties are meaningful or whether they are instead a common consequence of the particular degree sequence. In this work we study the subtle but important decisions underlying the specification of a configuration model, and investigate the role these choices play in graph sampling procedures and a suite of applications. We place particular emphasis on the importance of specifying the appropriate graph labeling (stub-labeled or vertex-labeled) under which to consider a null model, a choice that closely connects the study of random graphs to the study of random contingency tables. We show that the choice of graph labeling is inconsequential for studies of simple graphs, but can have a significant impact on analyses of multigraphs or graphs with self-loops. The importance of these choices is demonstrated through a series of three vignettes, analyzing network datasets under many different configuration models and observing substantial differences in study conclusions under different models. We argue that in each case, only one of the possible configuration models is appropriate. While our work focuses on undirected static networks, it aims to guide the study of directed networks, dynamic networks, and all other network contexts that are suitably studied through the lens of random graph null models.


Biology Letters | 2016

Stress response, gut microbial diversity and sexual signals correlate with social interactions

Iris I. Levin; David M. Zonana; Bailey K. Fosdick; Se Jin Song; Rob Knight; Rebecca J. Safran

Theory predicts that social interactions are dynamically linked to phenotype. Yet because social interactions are difficult to quantify, little is known about the precise details on how interactivity is linked to phenotype. Here, we deployed proximity loggers on North American barn swallows (Hirundo rustica erythrogaster) to examine intercorrelations among social interactions, morphology and features of the phenotype that are sensitive to the social context: stress-induced corticosterone (CORT) and gut microbial diversity. We analysed relationships at two spatial scales of interaction: (i) body contact and (ii) social interactions occurring between 0.1 and 5 m. Network analysis revealed that relationships between social interactions, morphology, CORT and gut microbial diversity varied depending on the sexes of the individuals interacting and the spatial scale of interaction proximity. We found evidence that body contact interactions were related to diversity of socially transmitted microbes and that looser social interactions were related to signalling traits and CORT.


The American Statistician | 2012

Estimating the Correlation in Bivariate Normal Data With Known Variances and Small Sample Sizes

Bailey K. Fosdick; Adrian E. Raftery

We consider the problem of estimating the correlation in bivariate normal data when the means and variances are assumed known, with emphasis on the small sample case. We consider eight different estimators, several of them considered here for the first time in the literature. In a simulation study, we found that Bayesian estimators using the uniform and arc-sine priors outperformed several empirical and exact or approximate maximum likelihood estimators in small samples. The arc-sine prior did better for large values of the correlation. For testing whether the correlation is zero, we found that Bayesian hypothesis tests outperformed significance tests based on the empirical and exact or approximate maximum likelihood estimators considered in small samples, but that all tests performed similarly for sample size 50. These results lead us to suggest using the posterior mean with the arc-sine prior to estimate the correlation in small samples when the variances are assumed known.


Network Science | 2013

Likelihoods for fixed rank nomination networks

Peter D. Hoff; Bailey K. Fosdick; Alexander Volfovsky; Katherine Stovel

Many studies that gather social network data use survey methods that lead to censored, missing, or otherwise incomplete information. For example, the popular fixed rank nomination (FRN) scheme, often used in studies of schools and businesses, asks study participants to nominate and rank at most a small number of contacts or friends, leaving the existence of other relations uncertain. However, most statistical models are formulated in terms of completely observed binary networks. Statistical analyses of FRN data with such models ignore the censored and ranked nature of the data and could potentially result in misleading statistical inference. To investigate this possibility, we compare Bayesian parameter estimates obtained from a likelihood for complete binary networks with those obtained from likelihoods that are derived from the FRN scheme, and therefore accommodate the ranked and censored nature of the data. We show analytically and via simulation that the binary likelihood can provide misleading inference, particularly for certain model parameters that relate network ties to characteristics of individuals and pairs of individuals. We also compare these different likelihoods in a data analysis of several adolescent social networks. For some of these networks, the parameter estimates from the binary and FRN likelihoods lead to different conclusions, indicating the importance of analyzing FRN data with a method that accounts for the FRN survey design.


The Annals of Applied Statistics | 2016

Categorical data fusion using auxiliary information

Bailey K. Fosdick; Maria DeYoreo

In data fusion analysts seek to combine information from two databases comprised of disjoint sets of individuals, in which some variables appear in both databases and other variables appear in only one database. Most data fusion techniques rely on variants of conditional independence assumptions. When inappropriate, these assumptions can result in unreliable inferences. We propose a data fusion technique that allows analysts to easily incorporate auxiliary information on the dependence structure of variables not observed jointly; we refer to this auxiliary information as glue. With this technique, we fuse two marketing surveys from the book publisher HarperCollins using glue from the online, rapid-response polling company CivicScience. The fused data enable estimation of associations between peoples preferences for authors and for learning about new books. The analysis also serves as a case study on the potential for using online surveys to aid data fusion.

Collaboration


Dive into the Bailey K. Fosdick's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Peter D. Hoff

University of Washington

View shared research outputs
Top Co-Authors

Avatar

Iris I. Levin

University of Colorado Boulder

View shared research outputs
Top Co-Authors

Avatar

Leontine Alkema

University of Massachusetts Amherst

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Nevena Lalic

University of Washington

View shared research outputs
Top Co-Authors

Avatar

Rebecca J. Safran

University of Colorado Boulder

View shared research outputs
Top Co-Authors

Avatar

Ted Westling

University of Washington

View shared research outputs
Top Co-Authors

Avatar

Wesley Lee

University of Washington

View shared research outputs
Researchain Logo
Decentralizing Knowledge