Benjamin A. Logsdon
Sage Bionetworks
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Benjamin A. Logsdon.
Nature Neuroscience | 2016
Menachem Fromer; Panos Roussos; Solveig K. Sieberts; Jessica S. Johnson; David H. Kavanagh; Thanneer M. Perumal; Douglas M. Ruderfer; Edwin C. Oh; Aaron Topol; Hardik Shah; Lambertus Klei; Robin Kramer; Dalila Pinto; Zeynep H. Gümüş; A. Ercument Cicek; Kristen Dang; Andrew Browne; Cong Lu; Lu Xie; Ben Readhead; Eli A. Stahl; Jianqiu Xiao; Mahsa Parvizi; Tymor Hamamsy; John F. Fullard; Ying-Chih Wang; Milind Mahajan; Jonathan Derry; Joel T. Dudley; Scott E. Hemby
Over 100 genetic loci harbor schizophrenia-associated variants, yet how these variants confer liability is uncertain. The CommonMind Consortium sequenced RNA from dorsolateral prefrontal cortex of people with schizophrenia (N = 258) and control subjects (N = 279), creating a resource of gene expression and its genetic regulation. Using this resource, ∼20% of schizophrenia loci have variants that could contribute to altered gene expression and liability. In five loci, only a single gene was involved: FURIN, TSNARE1, CNTN4, CLCN3 or SNAP91. Altering expression of FURIN, TSNARE1 or CNTN4 changed neurodevelopment in zebrafish; knockdown of FURIN in human neural progenitor cells yielded abnormal migration. Of 693 genes showing significant case-versus-control differential expression, their fold changes were ≤ 1.33, and an independent cohort yielded similar results. Gene co-expression implicates a network relevant for schizophrenia. Our findings show that schizophrenia is polygenic and highlight the utility of this resource for mechanistic interpretations of genetic liability for brain diseases.
PLOS Computational Biology | 2013
Erhan Bilal; Janusz Dutkowski; Justin Guinney; In Sock Jang; Benjamin A. Logsdon; Gaurav Pandey; Benjamin A. Sauerwine; Yishai Shimoni; Hans Kristian Moen Vollan; Brigham Mecham; Oscar M. Rueda; Jörg Tost; Christina Curtis; Mariano J. Alvarez; Vessela N. Kristensen; Samuel Aparicio; Anne Lise Børresen-Dale; Carlos Caldas; Stephen H. Friend; Trey Ideker; Eric E. Schadt; Gustavo Stolovitzky; Adam A. Margolin
Breast cancer is the most common malignancy in women and is responsible for hundreds of thousands of deaths annually. As with most cancers, it is a heterogeneous disease and different breast cancer subtypes are treated differently. Understanding the difference in prognosis for breast cancer based on its molecular and phenotypic features is one avenue for improving treatment by matching the proper treatment with molecular subtypes of the disease. In this work, we employed a competition-based approach to modeling breast cancer prognosis using large datasets containing genomic and clinical information and an online real-time leaderboard program used to speed feedback to the modeling team and to encourage each modeler to work towards achieving a higher ranked submission. We find that machine learning methods combined with molecular features selected based on expert prior knowledge can improve survival predictions compared to current best-in-class methodologies and that ensemble models trained across multiple user submissions systematically outperform individual models within the ensemble. We also find that model scores are highly consistent across multiple independent evaluations. This study serves as the pilot phase of a much larger competition open to the whole research community, with the goal of understanding general strategies for model optimization using clinical and molecular profiling data and providing an objective, transparent system for assessing prognostic models.
American Journal of Human Genetics | 2012
Paul L. Auer; Jill M. Johnsen; Andrew D. Johnson; Benjamin A. Logsdon; Leslie A. Lange; Michael A. Nalls; Guosheng Zhang; Nora Franceschini; Keolu Fox; Ethan M. Lange; Stephen S. Rich; Christopher J. O'Donnell; Rebecca D. Jackson; Robert B. Wallace; Zhao Chen; Timothy A. Graubert; James G. Wilson; Hua Tang; Guillaume Lettre; Alex P. Reiner; Santhi K. Ganesh; Yun Li
Researchers have successfully applied exome sequencing to discover causal variants in selected individuals with familial, highly penetrant disorders. We demonstrate the utility of exome sequencing followed by imputation for discovering low-frequency variants associated with complex quantitative traits. We performed exome sequencing in a reference panel of 761 African Americans and then imputed newly discovered variants into a larger sample of more than 13,000 African Americans for association testing with the blood cell traits hemoglobin, hematocrit, white blood count, and platelet count. First, we illustrate the feasibility of our approach by demonstrating genome-wide-significant associations for variants that are not covered by conventional genotyping arrays; for example, one such association is that between higher platelet count and an MPL c.117G>T (p.Lys39Asn) variant encoding a p.Lys39Asn amino acid substitution of the thrombopoietin receptor gene (p = 1.5 × 10(-11)). Second, we identified an association between missense variants of LCT and higher white blood count (p = 4 × 10(-13)). Third, we identified low-frequency coding variants that might account for allelic heterogeneity at several known blood cell-associated loci: MPL c.754T>C (p.Tyr252His) was associated with higher platelet count; CD36 c.975T>G (p.Tyr325(∗)) was associated with lower platelet count; and several missense variants at the α-globin gene locus were associated with lower hemoglobin. By identifying low-frequency missense variants associated with blood cell traits not previously reported by genome-wide association studies, we establish that exome sequencing followed by imputation is a powerful approach to dissecting complex, genetically heterogeneous traits in large population-based studies.
PLOS Genetics | 2012
Rehan Qayyum; Beverly M. Snively; Elad Ziv; Michael A. Nalls; Yongmei Liu; Weihong Tang; Lisa R. Yanek; Leslie A. Lange; Michele K. Evans; Santhi K. Ganesh; Melissa A. Austin; Guillaume Lettre; Diane M. Becker; Alan B. Zonderman; Andrew Singleton; Tamara B. Harris; Emile R. Mohler; Benjamin A. Logsdon; Charles Kooperberg; Aaron R. Folsom; James G. Wilson; Lewis C. Becker; Alex P. Reiner
Several genetic variants associated with platelet count and mean platelet volume (MPV) were recently reported in people of European ancestry. In this meta-analysis of 7 genome-wide association studies (GWAS) enrolling African Americans, our aim was to identify novel genetic variants associated with platelet count and MPV. For all cohorts, GWAS analysis was performed using additive models after adjusting for age, sex, and population stratification. For both platelet phenotypes, meta-analyses were conducted using inverse-variance weighted fixed-effect models. Platelet aggregation assays in whole blood were performed in the participants of the GeneSTAR cohort. Genetic variants in ten independent regions were associated with platelet count (N = 16,388) with p<5×10−8 of which 5 have not been associated with platelet count in previous GWAS. The novel genetic variants associated with platelet count were in the following regions (the most significant SNP, closest gene, and p-value): 6p22 (rs12526480, LRRC16A, p = 9.1×10−9), 7q11 (rs13236689, CD36, p = 2.8×10−9), 10q21 (rs7896518, JMJD1C, p = 2.3×10−12), 11q13 (rs477895, BAD, p = 4.9×10−8), and 20q13 (rs151361, SLMO2, p = 9.4×10−9). Three of these loci (10q21, 11q13, and 20q13) were replicated in European Americans (N = 14,909) and one (11q13) in Hispanic Americans (N = 3,462). For MPV (N = 4,531), genetic variants in 3 regions were significant at p<5×10−8, two of which were also associated with platelet count. Previously reported regions that were also significant in this study were 6p21, 6q23, 7q22, 12q24, and 19p13 for platelet count and 7q22, 17q11, and 19p13 for MPV. The most significant SNP in 1 region was also associated with ADP-induced maximal platelet aggregation in whole blood (12q24). Thus through a meta-analysis of GWAS enrolling African Americans, we have identified 5 novel regions associated with platelet count of which 3 were replicated in other ethnic groups. In addition, we also found one region associated with platelet aggregation that may play a potential role in atherothrombosis.
Alzheimers & Dementia | 2016
Genevera I. Allen; Nicola Amoroso; Catalina V Anghel; Venkat K. Balagurusamy; Christopher Bare; Derek Beaton; Roberto Bellotti; David A. Bennett; Kevin L. Boehme; Paul C. Boutros; Laura Caberlotto; Cristian Caloian; Frederick Campbell; Elias Chaibub Neto; Yu Chuan Chang; Beibei Chen; Chien Yu Chen; Ting Ying Chien; Timothy W.I. Clark; Sudeshna Das; Christos Davatzikos; Jieyao Deng; Donna N. Dillenberger; Richard Dobson; Qilin Dong; Jimit Doshi; Denise Duma; Rosangela Errico; Guray Erus; Evan Everett
Identifying accurate biomarkers of cognitive decline is essential for advancing early diagnosis and prevention therapies in Alzheimers disease. The Alzheimers disease DREAM Challenge was designed as a computational crowdsourced project to benchmark the current state‐of‐the‐art in predicting cognitive outcomes in Alzheimers disease based on high dimensional, publicly available genetic and structural imaging data. This meta‐analysis failed to identify a meaningful predictor developed from either data modality, suggesting that alternate approaches should be considered for prediction of cognitive performance.
American Journal of Epidemiology | 2012
James Y. Dai; Benjamin A. Logsdon; Ying Huang; Li Hsu; Alex P. Reiner; Ross L. Prentice; Charles Kooperberg
In this article, the authors propose to simultaneously test for marginal genetic association and gene-environment interaction to discover single nucleotide polymorphisms that may be involved in gene-environment or gene-treatment interaction. The asymptotic independence of the marginal association estimator and various interaction estimators leads to a simple and flexible way of combining the 2 tests, allowing for exploitation of gene-environment independence in estimating gene-environment interaction. The proposed test differs from the 2-df test proposed by Kraft et al. (Hum Hered. 2007;63(2):111-119) in two respects. First, for the genetic association component, it tests for marginal association, which is often the primary objective in inference, rather than the main effect in a model with gene-environment interaction. Second, the gene-environment testing component can easily exploit putative gene-environment independence using either the case-only estimator or the empirical Bayes estimator, depending on whether the goal is gene-treatment interaction in a randomized trial or gene-environment interaction in an observational study. The use of the proposed joint test is illustrated through simulations and a genetic study (1993-2005) from the Womens Health Initiative.
Bioinformatics | 2012
Benjamin A. Logsdon; Cara L. Carty; Alex P. Reiner; James Y. Dai; Charles Kooperberg
MOTIVATION For many complex traits, including height, the majority of variants identified by genome-wide association studies (GWAS) have small effects, leaving a significant proportion of the heritable variation unexplained. Although many penalized multiple regression methodologies have been proposed to increase the power to detect associations for complex genetic architectures, they generally lack mechanisms for false-positive control and diagnostics for model over-fitting. Our methodology is the first penalized multiple regression approach that explicitly controls Type I error rates and provide model over-fitting diagnostics through a novel normally distributed statistic defined for every marker within the GWAS, based on results from a variational Bayes spike regression algorithm. RESULTS We compare the performance of our method to the lasso and single marker analysis on simulated data and demonstrate that our approach has superior performance in terms of power and Type I error control. In addition, using the Womens Health Initiative (WHI) SNP Health Association Resource (SHARe) GWAS of African-Americans, we show that our method has power to detect additional novel associations with body height. These findings replicate by reaching a stringent cutoff of marginal association in a larger cohort. AVAILABILITY An R-package, including an implementation of our variational Bayes spike regression (vBsr) algorithm, is available at http://kooperberg.fhcrc.org/soft.html.
American Journal of Human Genetics | 2017
Mads E. Hauberg; Wen Zhang; Claudia Giambartolomei; Oscar Franzén; David L. Morris; Timothy J. Vyse; Arno Ruusalepp; Menachem Fromer; Solveig K. Sieberts; Jessica S. Johnson; Douglas M. Ruderfer; Hardik Shah; Lambertus Klei; Kristen Dang; Thanneer M. Perumal; Benjamin A. Logsdon; Milind Mahajan; Lara M. Mangravite; Laurent Essioux; Hiroyoshi Toyoshiba; Raquel E. Gur; Chang-Gyu Hahn; David A. Lewis; Vahram Haroutunian; Mette A. Peters; Barbara K. Lipska; Joseph D. Buxbaum; Keisuke Hirai; Enrico Domenici; Bernie Devlin
Genome-wide association studies (GWASs) have identified a multitude of genetic loci involved with traits and diseases. However, it is often unclear which genes are affected in such loci and whether the associated genetic variants lead to increased or decreased gene function. To mitigate this, we integrated associations of common genetic variants in 57 GWASs with 24 studies of expression quantitative trait loci (eQTLs) from a broad range of tissues by using a Mendelian randomization approach. We discovered a total of 3,484 instances of gene-trait-associated changes in expression at a false-discovery rate < 0.05. These genes were often not closest to the genetic variant and were primarily identified in eQTLs derived from pathophysiologically relevant tissues. For instance, genes with expression changes associated with lipid traits were mostly identified in the liver, and those associated with cardiovascular disease were identified in arterial tissue. The affected genes additionally point to biological processes implicated in the interrogated traits, such as the interleukin-27 pathway in rheumatoid arthritis. Further, comparing trait-associated gene expression changes across traits suggests that pleiotropy is a widespread phenomenon and points to specific instances of both agonistic and antagonistic pleiotropy. For instance, expression of SNX19 and ABCB9 is positively correlated with both the risk of schizophrenia and educational attainment. To facilitate interpretation, we provide this lexicon of how common trait-associated genetic variants alter gene expression in various tissues as the online database GWAS2Genes.
Genetic Epidemiology | 2014
Benjamin A. Logsdon; James Y. Dai; Paul L. Auer; Jill M. Johnsen; Santhi K. Ganesh; Nicholas L. Smith; James G. Wilson; Russell P. Tracy; Leslie A. Lange; Stephen S. Rich; Guillaume Lettre; Christopher S. Carlson; Rebecca D. Jackson; Christopher J. O'Donnell; Mark M. Wurfel; Deborah A. Nickerson; Hua Tang; Alex P. Reiner; Charles Kooperberg
Recently, many statistical methods have been proposed to test for associations between rare genetic variants and complex traits. Most of these methods test for association by aggregating genetic variations within a predefined region, such as a gene. Although there is evidence that “aggregate” tests are more powerful than the single marker test, these tests generally ignore neutral variants and therefore are unable to identify specific variants driving the association with phenotype. We propose a novel aggregate rare‐variant test that explicitly models a fraction of variants as neutral, tests associations at the gene‐level, and infers the rare‐variants driving the association. Simulations show that in the practical scenario where there are many variants within a given region of the genome with only a fraction causal our approach has greater power compared to other popular tests such as the Sequence Kernel Association Test (SKAT), the Weighted Sum Statistic (WSS), and the collapsing method of Morris and Zeggini (MZ). Our algorithm leverages a fast variational Bayes approximate inference methodology to scale to exome‐wide analyses, a significant computational advantage over exact inference model selection methodologies. To demonstrate the efficacy of our methodology we test for associations between von Willebrand Factor (VWF) levels and VWF missense rare‐variants imputed from the National Heart, Lung, and Blood Institutes Exome Sequencing project into 2,487 African Americans within the VWF gene. Our method suggests that a relatively small fraction (∼10%) of the imputed rare missense variants within VWF are strongly associated with lower VWF levels in African Americans.
PLOS Computational Biology | 2016
Maxim Grechkin; Benjamin A. Logsdon; Andrew J. Gentles; Su-In Lee
We present a computational framework, called DISCERN (DIfferential SparsE Regulatory Network), to identify informative topological changes in gene-regulator dependence networks inferred on the basis of mRNA expression datasets within distinct biological states. DISCERN takes two expression datasets as input: an expression dataset of diseased tissues from patients with a disease of interest and another expression dataset from matching normal tissues. DISCERN estimates the extent to which each gene is perturbed—having distinct regulator connectivity in the inferred gene-regulator dependencies between the disease and normal conditions. This approach has distinct advantages over existing methods. First, DISCERN infers conditional dependencies between candidate regulators and genes, where conditional dependence relationships discriminate the evidence for direct interactions from indirect interactions more precisely than pairwise correlation. Second, DISCERN uses a new likelihood-based scoring function to alleviate concerns about accuracy of the specific edges inferred in a particular network. DISCERN identifies perturbed genes more accurately in synthetic data than existing methods to identify perturbed genes between distinct states. In expression datasets from patients with acute myeloid leukemia (AML), breast cancer and lung cancer, genes with high DISCERN scores in each cancer are enriched for known tumor drivers, genes associated with the biological processes known to be important in the disease, and genes associated with patient prognosis, in the respective cancer. Finally, we show that DISCERN can uncover potential mechanisms underlying network perturbation by explaining observed epigenomic activity patterns in cancer and normal tissue types more accurately than alternative methods, based on the available epigenomic data from the ENCODE project.