Gwenaël G.R. Leday
VU University Amsterdam
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Gwenaël G.R. Leday.
Proceedings of the National Academy of Sciences of the United States of America | 2011
Núria Roura-Pascual; Cang Hui; Takayoshi Ikeda; Gwenaël G.R. Leday; Soledad Carpintero; Xavier Espadaler; Crisanto Gómez; Benoît S. Guénard; Stephen Hartley; Paul D. Krushelnycky; Philip J. Lester; Melodie A. McGeoch; Sean B. Menke; Jes S. Pedersen; Joel Pitt; Joaquin Reyes; Nathan J. Sanders; Andrew V. Suarez; Yoshifumi Touyama; Darren F. Ward; Philip S. Ward; Sue Worner
Because invasive species threaten the integrity of natural ecosystems, a major goal in ecology is to develop predictive models to determine which species may become widespread and where they may invade. Indeed, considerable progress has been made in understanding the factors that influence the local pattern of spread for specific invaders and the factors that are correlated with the number of introduced species that have become established in a given region. However, few studies have examined the relative importance of multiple drivers of invasion success for widespread species at global scales. Here, we use a dataset of >5,000 presence/absence records to examine the interplay between climatic suitability, biotic resistance by native taxa, human-aided dispersal, and human modification of habitats, in shaping the distribution of one of the worlds most notorious invasive species, the Argentine ant (Linepithema humile). Climatic suitability and the extent of human modification of habitats are primarily responsible for the distribution of this global invader. However, we also found some evidence for biotic resistance by native communities. Somewhat surprisingly, and despite the often cited importance of propagule pressure as a crucial driver of invasions, metrics of the magnitude of international traded commodities among countries were not related to global distribution patterns. Together, our analyses on the global-scale distribution of this invasive species provide strong evidence for the interplay of biotic and abiotic determinants of spread and also highlight the challenges of limiting the spread and subsequent impact of highly invasive species.
Biostatistics | 2013
Mark A. van de Wiel; Gwenaël G.R. Leday; Luba M. Pardo; Håvard Rue; Aad W. van der Vaart; Wessel N. van Wieringen
Next generation sequencing is quickly replacing microarrays as a technique to probe different molecular levels of the cell, such as DNA or RNA. The technology provides higher resolution, while reducing bias. RNA sequencing results in counts of RNA strands. This type of data imposes new statistical challenges. We present a novel, generic approach to model and analyze such data. Our approach aims at large flexibility of the likelihood (count) model and the regression model alike. Hence, a variety of count models is supported, such as the popular NB model, which accounts for overdispersion. In addition, complex, non-balanced designs and random effects are accommodated. Like some other methods, our method provides shrinkage of dispersion-related parameters. However, we extend it by enabling joint shrinkage of parameters, including those for which inference is desired. We argue that this is essential for Bayesian multiplicity correction. Shrinkage is effectuated by empirically estimating priors. We discuss several parametric (mixture) and non-parametric priors and develop procedures to estimate (parameters of) those. Inference is provided by means of local and Bayesian false discovery rates. We illustrate our method on several simulations and two data sets, also to compare it with other methods. Model- and data-based simulations show substantial improvements in the sensitivity at the given specificity. The data motivate the use of the ZI-NB as a powerful alternative to the NB, which results in higher detection rates for low-count data. Finally, compared with other methods, the results on small sample subsets are more reproducible when validated on their large sample complements, illustrating the importance of the type of shrinkage.
Neurobiology of Aging | 2013
Luba M. Pardo; Patrizia Rizzu; Margherita Francescatto; Morana Vitezic; Gwenaël G.R. Leday; Javier Simon Sanchez; Abdullah M. Khamis; Hazuki Takahashi; Wilma D.J. van de Berg; Yulia A. Medvedeva; Mark A. van de Wiel; Carsten O. Daub; Piero Carninci; Peter Heutink
To characterize the promoterome of caudate and putamen regions (striatum), frontal and temporal cortices, and hippocampi from aged human brains, we used high-throughput cap analysis of gene expression to profile the transcription start sites and to quantify the differences in gene expression across the 5 brain regions. We also analyzed the extent to which methylation influenced the observed expression profiles. We sequenced more than 71 million cap analysis of gene expression tags corresponding to 70,202 promoter regions and 16,888 genes. More than 7000 transcripts were differentially expressed, mainly because of differential alternative promoter usage. Unexpectedly, 7% of differentially expressed genes were neurodevelopmental transcription factors. Functional pathway analysis on the differentially expressed genes revealed an overrepresentation of several signaling pathways (e.g., fibroblast growth factor and wnt signaling) in hippocampus and striatum. We also found that although 73% of methylation signals mapped within genes, the influence of methylation on the expression profile was small. Our study underscores alternative promoter usage as an important mechanism for determining the regional differences in gene expression at old age.
BMC Bioinformatics | 2012
Wessel N. van Wieringen; Kristian Unger; Gwenaël G.R. Leday; Oscar Krijgsman; Renee X. de Menezes; Bauke Ylstra; Mark A. van de Wiel
BackgroundAn increasing number of genomic studies interrogating more than one molecular level is published. Bioinformatics follows biological practice, and recent years have seen a surge in methodology for the integrative analysis of genomic data. Often such analyses require knowledge of which elements of one platform link to those of another. Although important, many integrative analyses do not or insufficiently detail the matching of the platforms.ResultsWe describe, illustrate and discuss six matching procedures. They are implemented in the R-package sigaR (available from Bioconductor). The principles underlying the presented matching procedures are generic, and can be combined to form new matching approaches or be applied to the matching of other platforms. Illustration of the matching procedures on a variety of data sets reveals how the procedures differ in the use of the available data, and may even lead to different results for individual genes.ConclusionsMatching of data from multiple genomics platforms is an important preprocessing step for many integrative bioinformatic analysis, for which we present six generic procedures, both old and new. They have been implemented in the R-package sigaR, available from Bioconductor.
Biological Psychiatry | 2018
Gwenaël G.R. Leday; Petra E. Vértes; Sylvia Richardson; Jonathan R. Greene; Tim Regan; Shahid Khan; Robbie Henderson; Tom C. Freeman; Carmine M. Pariante; Neil A. Harrison; Edward T. Bullmore; Petra Eszter Vertes; Rudolf N. Cardinal; Tom Freeman; David A. Hume; Zhaozong Wu; C. Pariante; Annamaria Cattaneo; Patricia A. Zunszain; Alessandra Borsini; Robert Stewart; David Chandran; Livia A. Carvalho; Joshua A. Bell; Luis Souza-Teodoro; Hugh Perry; Neil Harrison; Wayne C. Drevets; Gayle M. Wittenberg; Declan Jones
Background Peripheral inflammation is often associated with major depressive disorder (MDD), and immunological biomarkers of depression remain a focus of investigation. Methods We used microarray data on whole blood from two independent case-control studies of MDD: the GlaxoSmithKline–High-Throughput Disease-specific target Identification Program [GSK-HiTDiP] study (113 patients and 57 healthy control subjects) and the Janssen–Brain Resource Company study (94 patients and 100 control subjects). Genome-wide differential gene expression analysis (18,863 probes) resulted in a p value for each gene in each study. A Bayesian method identified the largest p-value threshold (q = .025) associated with twice the number of genes differentially expressed in both studies compared with the number of coincidental case-control differences expected by chance. Results A total of 165 genes were differentially expressed in both studies with concordant direction of fold change. The 90 genes overexpressed (or UP genes) in MDD were significantly enriched for immune response to infection, were concentrated in a module of the gene coexpression network associated with innate immunity, and included clusters of genes with correlated expression in monocytes, monocyte-derived dendritic cells, and neutrophils. In contrast, the 75 genes underexpressed (or DOWN genes) in MDD were associated with the adaptive immune response and included clusters of genes with correlated expression in T cells, natural killer cells, and erythroblasts. Consistently, the MDD patients with overexpression of UP genes also had underexpression of DOWN genes (correlation > .70 in both studies). Conclusions MDD was replicably associated with proinflammatory activation of the peripheral innate immune system, coupled with relative inactivation of the adaptive immune system, indicating the potential of transcriptional biomarkers for immunological stratification of patients with depression.
The Annals of Applied Statistics | 2013
Gwenaël G.R. Leday; A.W. van der Vaart; W.N. van Wieringen; M.A. van de Wiel
DNA copy number and mRNA expression are widely used data types in cancer studies, which combined provide more insight than separately. Whereas in existing literature the form of the relationship between these two types of markers is fixed a priori, in this paper we model their association. We employ piecewise linear regression splines (PLRS), which combine good interpretation with sufficient flexibility to identify any plausible type of relationship. The specification of the model leads to estimation and model selection in a constrained, nonstandard setting. We provide methodology for testing the effect of DNA on mRNA and choosing the appropriate model. Furthermore, we present a novel approach to obtain reliable confidence bands for constrained PLRS, which incorporates model uncertainty. The procedures are applied to colorectal and breast cancer data. Common assumptions are found to be potentially misleading for biologically relevant genes. More flexible models may bring more insight in the interaction between the two markers.
Translational Psychiatry | 2017
J A Bell; Mika Kivimäki; E T Bullmore; Andrew Steptoe; Edward T. Bullmore; Petra E. Vértes; Rudolf N. Cardinal; Sylvia Richardson; Gwenaël G.R. Leday; Tom C. Freeman; David A. Hume; Tim Regan; Zhaozong Wu; Carmine M. Pariante; Annamaria Cattaneo; Patricia Zuszain; Alessandra Borsini; Robert Stewart; David Chandran; Livia A. Carvalho; Joshua A. Bell; Luis Souza-Teodoro; Hugh Perry; Neil A. Harrison; Wayne C. Drevets; Gayle Wittenberg; Yu Sun; Declan Jones; Shahid Khan; Annie Stylianou
Evidence on systemic inflammation as a risk factor for future depression is inconsistent, possibly due to a lack of regard for persistency of exposure. We examined whether being inflamed on multiple occasions increases risk of new depressive symptoms using prospective data from a population-based sample of adults aged 50 years or older (the English Longitudinal Study of Ageing). Participants with less than four of eight depressive symptoms in 2004/05 and 2008/09 based on the Eight-item Centre for Epidemiologic Studies Depression scale were analysed. The number of occasions with C-reactive protein ⩾3 mg l−1 over the same initial assessments (1 vs 0 occasion, and 2 vs 0 occasions) was examined in relation to change in depressive symptoms between 2008/09 and 2012/13 and odds of developing depressive symptomology (having more than or equal to four of eight symptoms) in 2012/13. In multivariable-adjusted regression models (n=2068), participants who were inflamed on 1 vs 0 occasion showed no increase in depressive symptoms nor raised odds of developing depressive symptomology; those inflamed on 2 vs 0 occasions showed a 0.10 (95% confidence intervals (CIs)=−0.07, 0.28) symptom increase and 1.60 (95% CI=1.00, 2.55) times higher odds. In further analyses, 2 vs 0 occasions of inflammation were associated with increased odds of developing depressive symptoms among women (odds ratio (OR)=2.75, 95% CI=1.53, 4.95), but not among men (OR=0.70, 95% CI=0.29, 1.68); P-for-sex interaction=0.035. In this cohort study of older adults, repeated but not transient exposure to systemic inflammation was associated with increased risk of future depressive symptoms among women; this subgroup finding requires confirmation of validity.
The Annals of Applied Statistics | 2017
Gwenaël G.R. Leday; Mathisca C. M. de Gunst; Gino B. Kpogbezan; Aad van der Vaart; Wessel N. van Wieringen; Mark A. van de Wiel
Reconstructing a gene network from high-throughput molecular data is an important but challenging task, as the number of parameters to estimate easily is much larger than the sample size. A conventional remedy is to regularize or penalize the model likelihood. In network models, this is often done locally in the neighbourhood of each node or gene. However, estimation of the many regularization parameters is often difficult and can result in large statistical uncertainties. In this paper we propose to combine local regularization with global shrinkage of the regularization parameters to borrow strength between genes and improve inference. We employ a simple Bayesian model with non-sparse, conjugate priors to facilitate the use of fast variational approximations to posteriors. We discuss empirical Bayes estimation of hyper-parameters of the priors, and propose a novel approach to rank-based posterior thresholding. Using extensive model- and data-based simulations, we demonstrate that the proposed inference strategy outperforms popular (sparse) methods, yields more stable edges, and is more reproducible. The proposed method, termed ShrinkNet, is then applied to Glioblastoma to investigate the interactions between genes associated with patient survival.
Archive | 2014
Susan P. Worner; Muriel Gevrey; Takayoshi Ikeda; Gwenaël G.R. Leday; Joel Pitt; Stefan Schliebs; Snjezana Soltic
Ecologists face rapidly accumulating environmental data form spatial studies and from large-scale field experiments such that many now specialize in information technology. Those scientists carry out interdisciplinary research in what is known as ecological informatics. Ecological informatics is defined as a discipline that brings together ecology and computer science to solve problems using biologically-inspired computation, information processing, and other computer science disciplines such as data management and visualization. Scientists working in the discipline have research interests that include ecological knowledge discovery, clustering, and forecasting, and simulation of ecological dynamics by individual-based or agent-based models, as well as hybrid models and artificial life. In this chapter, ecological informatics techniques are applied to answer questions about alien invasive species, in particular, species that pose a biosecurity threat in a terrestrial ecological setting. Biosecurity is defined as the protection of a regionʼs environment, flora and fauna, marine life, indigenous resources, and human and animal health. Because biological organisms can cause billions of dollars of impact in any country, good science, systems, and protocols that underpin a regulatory biosecurity system are required in order to facilitate international trade. The tools and techniques discussed in this chapter are designed to be used in a risk analysis procedure so that agencies in charge of biosecurity can prioritize scarce resources and effort and be better prepared to prevent unexpected incursions of dangerous invasive species. The methods are used to predict, (1) which species out of the many thousands might establish in a new area, (2) where those species might establish, and, (3) where they might spread over a realistic landscape so that their impact can be determined.
Biometrical Journal | 2017
Gino B. Kpogbezan; Aad van der Vaart; Wessel N. van Wieringen; Gwenaël G.R. Leday; Mark A. van de Wiel
Reconstruction of a high-dimensional network may benefit substantially from the inclusion of prior knowledge on the network topology. In the case of gene interaction networks such knowledge may come for instance from pathway repositories like KEGG, or be inferred from data of a pilot study. The Bayesian framework provides a natural means of including such prior knowledge. Based on a Bayesian Simultaneous Equation Model, we develop an appealing Empirical Bayes (EB) procedure that automatically assesses the agreement of the used prior knowledge with the data at hand. We use variational Bayes method for posterior densities approximation and compare its accuracy with that of Gibbs sampling strategy. Our method is computationally fast, and can outperform known competitors. In a simulation study, we show that accurate prior data can greatly improve the reconstruction of the network, but need not harm the reconstruction if wrong. We demonstrate the benefits of the method in an analysis of gene expression data from GEO. In particular, the edges of the recovered network have superior reproducibility (compared to that of competitors) over resampled versions of the data.