Jake M. Hofman | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Jake M. Hofman is active.

Explore More

Publication

Featured researches published by Jake M. Hofman.

web search and data mining | 2011

Everyone's an influencer: quantifying influence on twitter

Eytan Bakshy; Jake M. Hofman; Winter A. Mason; Duncan J. Watts

In this paper we investigate the attributes and relative influence of 1.6M Twitter users by tracking 74 million diffusion events that took place on the Twitter follower graph over a two month interval in 2009. Unsurprisingly, we find that the largest cascades tend to be generated by users who have been influential in the past and who have a large number of followers. We also find that URLs that were rated more interesting and/or elicited more positive feelings by workers on Mechanical Turk were more likely to spread. In spite of these intuitive results, however, we find that predictions of which particular user or URL will generate large cascades are relatively unreliable. We conclude, therefore, that word-of-mouth diffusion can only be harnessed reliably by targeting large numbers of potential influencers, thereby capturing average effects. Finally, we consider a family of hypothetical marketing strategies, defined by the relative cost of identifying versus compensating potential influencers. We find that although under some circumstances, the most influential users are also the most cost-effective, under a wide range of plausible assumptions the most cost-effective performance can be realized using ordinary influencers---individuals who exert average or even less-than-average influence.

international world wide web conferences | 2011

Who says what to whom on twitter

Shaomei Wu; Jake M. Hofman; Winter A. Mason; Duncan J. Watts

We study several longstanding questions in media communications research, in the context of the microblogging service Twitter, regarding the production, flow, and consumption of information. To do so, we exploit a recently introduced feature of Twitter known as lists to distinguish between elite users - by which we mean celebrities, bloggers, and representatives of media outlets and other formal organizations - and ordinary users. Based on this classification, we find a striking concentration of attention on Twitter, in that roughly 50% of URLs consumed are generated by just 20K elite users, where the media produces the most information, but celebrities are the most followed. We also find significant homophily within categories: celebrities listen to celebrities, while bloggers listen to bloggers etc; however, bloggers in general rebroadcast more information than the other categories. Next we re-examine the classical two-step flow theory of communications, finding considerable support for it on Twitter. Third, we find that URLs broadcast by different categories of users or containing different types of content exhibit systematically different lifespans. And finally, we examine the attention paid by the different user categories to different news topics.

Cell | 2007

Opposing Effects of PKCθ and WASp on Symmetry Breaking and Relocation of the Immunological Synapse

Tasha N. Sims; Timothy J. Soos; Harry S. Xenias; Benjamin J. Dubin-Thaler; Jake M. Hofman; Janelle Waite; Thomas O. Cameron; V. Kaye Thomas; Rajat Varma; Chris H. Wiggins; Michael P. Sheetz; Dan R. Littman; Michael L. Dustin

The immunological synapse (IS) is a junction between the T cell and antigen-presenting cell and is composed of supramolecular activation clusters (SMACs). No studies have been published on naive T cell IS dynamics. Here, we find that IS formation during antigen recognition comprises cycles of stable IS formation and autonomous naive T cell migration. The migration phase is driven by PKCtheta, which is localized to the F-actin-dependent peripheral (p)SMAC. PKCtheta(-/-) T cells formed hyperstable IS in vitro and in vivo and, like WT cells, displayed fast oscillations in the distal SMAC, but they showed reduced slow oscillations in pSMAC integrity. IS reformation is driven by the Wiscott Aldrich Syndrome protein (WASp). WASp(-/-) T cells displayed normal IS formation but were unable to reform IS after migration unless PKCtheta was inhibited. Thus, opposing effects of PKCtheta and WASp control IS stability through pSMAC symmetry breaking and reformation.

Physical Review Letters | 2008

Bayesian Approach to Network Modularity

Jake M. Hofman; Chris H. Wiggins

We present an efficient, principled, and interpretable technique for inferring module assignments and for identifying the optimal number of modules in a given network. We show how several existing methods for finding modules can be described as variant, special, or limiting cases of our work, and how the method overcomes the resolution limit problem, accurately recovering the true number of modules. Our approach is based on Bayesian methods for model selection which have been used with success for almost a century, implemented using a variational technique developed only in the past decade. We apply the technique to synthetic and real networks and outline how the method naturally allows selection among competing models.

Biophysical Journal | 2009

Learning Rates and States from Biophysical Time Series: A Bayesian Approach to Model Selection and Single-Molecule FRET Data

Jonathan E. Bronson; Jingyi Fei; Jake M. Hofman; Ruben L. Gonzalez; Chris H. Wiggins

Time series data provided by single-molecule Förster resonance energy transfer (smFRET) experiments offer the opportunity to infer not only model parameters describing molecular complexes, e.g., rate constants, but also information about the model itself, e.g., the number of conformational states. Resolving whether such states exist or how many of them exist requires a careful approach to the problem of model selection, here meaning discrimination among models with differing numbers of states. The most straightforward approach to model selection generalizes the common idea of maximum likelihood--selecting the most likely parameter values--to maximum evidence: selecting the most likely model. In either case, such an inference presents a tremendous computational challenge, which we here address by exploiting an approximation technique termed variational Bayesian expectation maximization. We demonstrate how this technique can be applied to temporal data such as smFRET time series; show superior statistical consistency relative to the maximum likelihood approach; compare its performance on smFRET data generated from experiments on the ribosome; and illustrate how model selection in such probabilistic or generative modeling can facilitate analysis of closely related temporal data currently prevalent in biophysics. Source code used in this analysis, including a graphical user interface, is available open source via http://vbFRET.sourceforge.net.

international world wide web conferences | 2010

Inferring relevant social networks from interpersonal communication

Munmun De Choudhury; Winter A. Mason; Jake M. Hofman; Duncan J. Watts

Researchers increasingly use electronic communication data to construct and study large social networks, effectively inferring unobserved ties (e.g. i is connected to j) from observed communication events (e.g. i emails j). Often overlooked, however, is the impact of tie definition on the corresponding network, and in turn the relevance of the inferred network to the research question of interest. Here we study the problem of network inference and relevance for two email data sets of different size and origin. In each case, we generate a family of networks parameterized by a threshold condition on the frequency of emails exchanged between pairs of individuals. After demonstrating that different choices of the threshold correspond to dramatically different network structures, we then formulate the relevance of these networks in terms of a series of prediction tasks that depend on various network features. In general, we find: a) that prediction accuracy is maximized over a non-trivial range of thresholds corresponding to 5-10 reciprocated emails per year; b) that for any prediction task, choosing the optimal value of the threshold yields a sizable (~30%) boost in accuracy over naive choices; and c) that the optimal threshold value appears to be (somewhat surprisingly) consistent across data sets and prediction tasks. We emphasize the practical utility in defining ties via their relevance to the prediction task(s) at hand and discuss implications of our empirical results.

Proceedings of the National Academy of Sciences of the United States of America | 2009

Allosteric collaboration between elongation factor G and the ribosomal L1 stalk directs tRNA movements during translation

Jingyi Fei; Jonathan E. Bronson; Jake M. Hofman; Rathi L. Srinivas; Chris H. Wiggins; Ruben L. Gonzalez

Determining the mechanism by which tRNAs rapidly and precisely transit through the ribosomal A, P, and E sites during translation remains a major goal in the study of protein synthesis. Here, we report the real-time dynamics of the L1 stalk, a structural element of the large ribosomal subunit that is implicated in directing tRNA movements during translation. Within pretranslocation ribosomal complexes, the L1 stalk exists in a dynamic equilibrium between open and closed conformations. Binding of elongation factor G (EF-G) shifts this equilibrium toward the closed conformation through one of at least two distinct kinetic mechanisms, where the identity of the P-site tRNA dictates the kinetic route that is taken. Within posttranslocation complexes, L1 stalk dynamics are dependent on the presence and identity of the E-site tRNA. Collectively, our data demonstrate that EF-G and the L1 stalk allosterically collaborate to direct tRNA translocation from the P to the E sites, and suggest a model for the release of E-site tRNA.

PLOS ONE | 2008

Quantification of Cell Edge Velocities and Traction Forces Reveals Distinct Motility Modules during Cell Spreading

Benjamin J. Dubin-Thaler; Jake M. Hofman; Yunfei Cai; Harry S. Xenias; Ingrid Spielman; Anna V. Shneidman; Lawrence A. David; Hans-Günther Döbereiner; Chris H. Wiggins; Michael P. Sheetz

Actin-based cell motility and force generation are central to immune response, tissue development, and cancer metastasis, and understanding actin cytoskeleton regulation is a major goal of cell biologists. Cell spreading is a commonly used model system for motility experiments – spreading fibroblasts exhibit stereotypic, spatially-isotropic edge dynamics during a reproducible sequence of functional phases: 1) During early spreading, cells form initial contacts with the surface. 2) The middle spreading phase exhibits rapidly increasing attachment area. 3) Late spreading is characterized by periodic contractions and stable adhesions formation. While differences in cytoskeletal regulation between phases are known, a global analysis of the spatial and temporal coordination of motility and force generation is missing. Implementing improved algorithms for analyzing edge dynamics over the entire cell periphery, we observed that a single domain of homogeneous cytoskeletal dynamics dominated each of the three phases of spreading. These domains exhibited a unique combination of biophysical and biochemical parameters – a motility module. Biophysical characterization of the motility modules revealed that the early phase was dominated by periodic, rapid membrane blebbing; the middle phase exhibited continuous protrusion with very low traction force generation; and the late phase was characterized by global periodic contractions and high force generation. Biochemically, each motility module exhibited a different distribution of the actin-related protein VASP, while inhibition of actin polymerization revealed different dependencies on barbed-end polymerization. In addition, our whole-cell analysis revealed that many cells exhibited heterogeneous combinations of motility modules in neighboring regions of the cell edge. Together, these observations support a model of motility in which regions of the cell edge exhibit one of a limited number of motility modules that, together, determine the overall motility function. Our data and algorithms are publicly available to encourage further exploration.

knowledge discovery and data mining | 2009

Characterizing individual communication patterns

R. Dean Malmgren; Jake M. Hofman; Luís A. Nunes Amaral; Duncan J. Watts

The increasing availability of electronic communication data, such as that arising from e-mail exchange, presents social and information scientists with new possibilities for characterizing individual behavior and, by extension, identifying latent structure in human populations. Here, we propose a model of individual e-mail communication that is sufficiently rich to capture meaningful variability across individuals, while remaining simple enough to be interpretable. We show that the model, a cascading non-homogeneous Poisson process, can be formulated as a double-chain hidden Markov model, allowing us to use an efficient inference algorithm to estimate the model parameters from observed data. We then apply this model to two e-mail data sets consisting of 404 and 6,164 users, respectively, that were collected from two universities in different countries and years. We find that the resulting best-estimate parameter distributions for both data sets are surprisingly similar, indicating that at least some features of communication dynamics generalize beyond specific contexts. We also find that variability of individual behavior over time is significantly less than variability across the population, suggesting that individuals can be classified into persistent types. We conclude that communication patterns may prove useful as an additional class of attribute data, complementing demographic and network data, for user classification and outlier detection-a point that we illustrate with an interpretable clustering of users based on their inferred model parameters.

BMC Bioinformatics | 2010

Graphical models for inferring single molecule dynamics

Jonathan E. Bronson; Jake M. Hofman; Jingyi Fei; Ruben L. Gonzalez; Chris H. Wiggins

BackgroundThe recent explosion of experimental techniques in single molecule biophysics has generated a variety of novel time series data requiring equally novel computational tools for analysis and inference. This article describes in general terms how graphical modeling may be used to learn from biophysical time series data using the variational Bayesian expectation maximization algorithm (VBEM). The discussion is illustrated by the example of single-molecule fluorescence resonance energy transfer (smFRET) versus time data, where the smFRET time series is modeled as a hidden Markov model (HMM) with Gaussian observables. A detailed description of smFRET is provided as well.ResultsThe VBEM algorithm returns the model’s evidence and an approximating posterior parameter distribution given the data. The former provides a metric for model selection via maximum evidence (ME), and the latter a description of the model’s parameters learned from the data. ME/VBEM provide several advantages over the more commonly used approach of maximum likelihood (ML) optimized by the expectation maximization (EM) algorithm, the most important being a natural form of model selection and a well-posed (non-divergent) optimization problem.ConclusionsThe results demonstrate the utility of graphical modeling for inference of dynamic processes in single molecule biophysics.

Explore More