Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Leopold Parts is active.

Publication


Featured researches published by Leopold Parts.


Nature | 2007

Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures

Alexander Stark; Michael F. Lin; Pouya Kheradpour; Jakob Skou Pedersen; Leopold Parts; Joseph W. Carlson; Madeline A. Crosby; Matthew D. Rasmussen; Sushmita Roy; Ameya N. Deoras; J. Graham Ruby; Julius Brennecke; Harvard FlyBase curators; Berkeley Drosophila Genome; Emily Hodges; Angie S. Hinrichs; Anat Caspi; Benedict Paten; Seung-Won Park; Mira V. Han; Morgan L. Maeder; Benjamin J. Polansky; Bryanne E. Robson; Stein Aerts; Jacques van Helden; Bassem A. Hassan; Donald G. Gilbert; Deborah A. Eastman; Michael D. Rice; Michael Weir

Sequencing of multiple related species followed by comparative genomics analysis constitutes a powerful approach for the systematic understanding of any genome. Here, we use the genomes of 12 Drosophila species for the de novo discovery of functional elements in the fly. Each type of functional element shows characteristic patterns of change, or ‘evolutionary signatures’, dictated by its precise selective constraints. Such signatures enable recognition of new protein-coding genes and exons, spurious and incorrect gene annotations, and numerous unusual gene structures, including abundant stop-codon readthrough. Similarly, we predict non-protein-coding RNA genes and structures, and new microRNA (miRNA) genes. We provide evidence of miRNA processing and functionality from both hairpin arms and both DNA strands. We identify several classes of pre- and post-transcriptional regulatory motifs, and predict individual motif instances with high confidence. We also study how discovery power scales with the divergence and number of species compared, and we provide general guidelines for comparative studies.


Nature Genetics | 2012

Mapping cis- and trans-regulatory effects across multiple tissues in twins

Elin Grundberg; Kerrin S. Small; Åsa K. Hedman; Alexandra C. Nica; Alfonso Buil; Sarah Keildson; Jordana T. Bell; Yang T-P.; Eshwar Meduri; Amy Barrett; James Nisbett; Magdalena Sekowska; Alicja Wilk; Shin S-Y.; Daniel Glass; Mary E. Travers; Josine Min; S. M. Ring; Karen M Ho; Gudmar Thorleifsson; A. P. S. Kong; Unnur Thorsteindottir; Chrysanthi Ainali; Antigone S. Dimas; Neelam Hassanali; Catherine E. Ingle; David Knowles; Maria Krestyaninova; Christopher E. Lowe; P. Di Meglio

Sequence-based variation in gene expression is a key driver of disease risk. Common variants regulating expression in cis have been mapped in many expression quantitative trait locus (eQTL) studies, typically in single tissues from unrelated individuals. Here, we present a comprehensive analysis of gene expression across multiple tissues conducted in a large set of mono- and dizygotic twins that allows systematic dissection of genetic (cis and trans) and non-genetic effects on gene expression. Using identity-by-descent estimates, we show that at least 40% of the total heritable cis effect on expression cannot be accounted for by common cis variants, a finding that reveals the contribution of low-frequency and rare regulatory variants with respect to both transcriptional regulation and complex trait susceptibility. We show that a substantial proportion of gene expression heritability is trans to the structural gene, and we identify several replicating trans variants that act predominantly in a tissue-restricted manner and may regulate the transcription of many genes.


PLOS Genetics | 2012

Patterns of cis regulatory variation in diverse human populations

Barbara E. Stranger; Stephen B. Montgomery; Antigone S. Dimas; Leopold Parts; Oliver Stegle; Catherine E. Ingle; Magda Sekowska; George Davey Smith; David E. Evans; Maria Gutierrez-Arcelus; Alkes L. Price; Towfique Raj; James Nisbett; Alexandra C. Nica; Claude Beazley; Richard Durbin; Panos Deloukas; Emmanouil T. Dermitzakis

The genetic basis of gene expression variation has long been studied with the aim to understand the landscape of regulatory variants, but also more recently to assist in the interpretation and elucidation of disease signals. To date, many studies have looked in specific tissues and population-based samples, but there has been limited assessment of the degree of inter-population variability in regulatory variation. We analyzed genome-wide gene expression in lymphoblastoid cell lines from a total of 726 individuals from 8 global populations from the HapMap3 project and correlated gene expression levels with HapMap3 SNPs located in cis to the genes. We describe the influence of ancestry on gene expression levels within and between these diverse human populations and uncover a non-negligible impact on global patterns of gene expression. We further dissect the specific functional pathways differentiated between populations. We also identify 5,691 expression quantitative trait loci (eQTLs) after controlling for both non-genetic factors and population admixture and observe that half of the cis-eQTLs are replicated in one or more of the populations. We highlight patterns of eQTL-sharing between populations, which are partially determined by population genetic relatedness, and discover significant sharing of eQTL effects between Asians, European-admixed, and African subpopulations. Specifically, we observe that both the effect size and the direction of effect for eQTLs are highly conserved across populations. We observe an increasing proximity of eQTLs toward the transcription start site as sharing of eQTLs among populations increases, highlighting that variants close to TSS have stronger effects and therefore are more likely to be detected across a wider panel of populations. Together these results offer a unique picture and resource of the degree of differentiation among human populations in functional regulatory variation and provide an estimate for the transferability of complex trait variants across populations.


PLOS Genetics | 2011

The architecture of gene regulatory variation across multiple human tissues: the MuTHER study.

Alexandra C. Nica; Leopold Parts; Daniel Glass; James Nisbet; Amy Barrett; Magdalena Sekowska; Mary E. Travers; Simon Potter; Elin Grundberg; Kerrin S. Small; Åsa K. Hedman; Veronique Bataille; Jordana T. Bell; Gabriela Surdulescu; Antigone S. Dimas; Catherine E. Ingle; Frank O. Nestle; Paola Di Meglio; Josine L. Min; Alicja Wilk; Christopher J. Hammond; Neelam Hassanali; Tsun-Po Yang; Stephen B. Montgomery; Steve O'Rahilly; Cecilia M. Lindgren; Krina T. Zondervan; Nicole Soranzo; Inês Barroso; Richard Durbin

While there have been studies exploring regulatory variation in one or more tissues, the complexity of tissue-specificity in multiple primary tissues is not yet well understood. We explore in depth the role of cis-regulatory variation in three human tissues: lymphoblastoid cell lines (LCL), skin, and fat. The samples (156 LCL, 160 skin, 166 fat) were derived simultaneously from a subset of well-phenotyped healthy female twins of the MuTHER resource. We discover an abundance of cis-eQTLs in each tissue similar to previous estimates (858 or 4.7% of genes). In addition, we apply factor analysis (FA) to remove effects of latent variables, thus more than doubling the number of our discoveries (1,822 eQTL genes). The unique study design (Matched Co-Twin Analysis—MCTA) permits immediate replication of eQTLs using co-twins (93%–98%) and validation of the considerable gain in eQTL discovery after FA correction. We highlight the challenges of comparing eQTLs between tissues. After verifying previous significance threshold-based estimates of tissue-specificity, we show their limitations given their dependency on statistical power. We propose that continuous estimates of the proportion of tissue-shared signals and direct comparison of the magnitude of effect on the fold change in expression are essential properties that jointly provide a biologically realistic view of tissue-specificity. Under this framework we demonstrate that 30% of eQTLs are shared among the three tissues studied, while another 29% appear exclusively tissue-specific. However, even among the shared eQTLs, a substantial proportion (10%–20%) have significant differences in the magnitude of fold change between genotypic classes across tissues. Our results underline the need to account for the complexity of eQTL tissue-specificity in an effort to assess consequences of such variants for complex traits.


Genome Research | 2009

Simultaneous assay of every Salmonella Typhi gene using one million transposon mutants

Gemma C. Langridge; Minh-Duy Phan; Daniel J. Turner; Timothy T. Perkins; Leopold Parts; Jana K. Haase; Ian G. Charles; Duncan J. Maskell; Sarah E. Peters; Gordon Dougan; John Wain; Julian Parkhill; A. Keith Turner

Very high-throughput sequencing technologies need to be matched by high-throughput functional studies if we are to make full use of the current explosion in genome sequences. We have generated a very large bacterial mutant pool, consisting of an estimated 1.1 million transposon mutants and we have used genomic DNA from this mutant pool, and Illumina nucleotide sequencing to prime from the transposon and sequence into the adjacent target DNA. With this method, which we have called TraDIS (transposon directed insertion-site sequencing), we have been able to map 370,000 unique transposon insertion sites to the Salmonella enterica serovar Typhi chromosome. The unprecedented density and resolution of mapped insertion sites, an average of one every 13 base pairs, has allowed us to assay simultaneously every gene in the genome for essentiality and generate a genome-wide list of candidate essential genes. In addition, the semiquantitative nature of the assay allowed us to identify genes that are advantageous and those that are disadvantageous for growth under standard laboratory conditions. Comparison of the mutant pool following growth in the presence or absence of ox bile enabled every gene to be assayed for its contribution toward bile tolerance, a trait required of any enteric bacterium and for carriage of S. Typhi in the gall bladder. This screen validated our hypothesis that we can simultaneously assay every gene in the genome to identify niche-specific essential genes.


PLOS Computational Biology | 2010

A Bayesian framework to account for complex non-genetic factors in gene expression levels greatly increases power in eQTL studies.

Oliver Stegle; Leopold Parts; Richard Durbin; John Winn

Gene expression measurements are influenced by a wide range of factors, such as the state of the cell, experimental conditions and variants in the sequence of regulatory regions. To understand the effect of a variable of interest, such as the genotype of a locus, it is important to account for variation that is due to confounding causes. Here, we present VBQTL, a probabilistic approach for mapping expression quantitative trait loci (eQTLs) that jointly models contributions from genotype as well as known and hidden confounding factors. VBQTL is implemented within an efficient and flexible inference framework, making it fast and tractable on large-scale problems. We compare the performance of VBQTL with alternative methods for dealing with confounding variability on eQTL mapping datasets from simulations, yeast, mouse, and human. Employing Bayesian complexity control and joint modelling is shown to result in more precise estimates of the contribution of different confounding factors resulting in additional associations to measured transcript levels compared to alternative approaches. We present a threefold larger collection of cis eQTLs than previously found in a whole-genome eQTL scan of an outbred human population. Altogether, 27% of the tested probes show a significant genetic association in cis, and we validate that the additional eQTLs are likely to be real by replicating them in different sets of individuals. Our method is the next step in the analysis of high-dimensional phenotype data, and its application has revealed insights into genetic regulation of gene expression by demonstrating more abundant cis-acting eQTLs in human than previously shown. Our software is freely available online at http://www.sanger.ac.uk/resources/software/peer/.


Nature Protocols | 2012

Using probabilistic estimation of expression residuals (PEER) to obtain increased power and interpretability of gene expression analyses

Oliver Stegle; Leopold Parts; Matias Piipari; John Winn; Richard Durbin

We present PEER (probabilistic estimation of expression residuals), a software package implementing statistical models that improve the sensitivity and interpretability of genetic associations in population-scale expression data. This approach builds on factor analysis methods that infer broad variance components in the measurements. PEER takes as input transcript profiles and covariates from a set of individuals, and then outputs hidden factors that explain much of the expression variability. Optionally, these factors can be interpreted as pathway or transcription factor activations by providing prior information about which genes are involved in the pathway or targeted by the factor. The inferred factors are used in genetic association analyses. First, they are treated as additional covariates, and are included in the model to increase detection power for mapping expression traits. Second, they are analyzed as phenotypes themselves to understand the causes of global expression variability. PEER extends previous related surrogate variable models and can be implemented within hours on a desktop computer.


Genome Research | 2011

Revealing the genetic structure of a trait by sequencing a population under selection

Leopold Parts; Francisco A. Cubillos; Jonas Warringer; Kanika Jain; Francisco Salinas; Suzannah Bumpstead; Mikael Molin; Amin Zia; Jared T. Simpson; Michael A. Quail; Alan M. Moses; Edward J. Louis; Richard Durbin; Gianni Liti

One approach to understanding the genetic basis of traits is to study their pattern of inheritance among offspring of phenotypically different parents. Previously, such analysis has been limited by low mapping resolution, high labor costs, and large sample size requirements for detecting modest effects. Here, we present a novel approach to map trait loci using artificial selection. First, we generated populations of 10-100 million haploid and diploid segregants by crossing two budding yeast strains of different heat tolerance for up to 12 generations. We then subjected these large segregant pools to heat stress for up to 12 d, enriching for beneficial alleles. Finally, we sequenced total DNA from the pools before and during selection to measure the changes in parental allele frequency. We mapped 21 intervals with significant changes in genetic background in response to selection, which is several times more than found with traditional linkage methods. Nine of these regions contained two or fewer genes, yielding much higher resolution than previous genomic linkage studies. Multiple members of the RAS/cAMP signaling pathway were implicated, along with genes previously not annotated with heat stress response function. Surprisingly, at most selected loci, allele frequencies stopped changing before the end of the selection experiment, but alleles did not become fixed. Furthermore, we were able to detect the same set of trait loci in a population of diploid individuals with similar power and resolution, and observed primarily additive effects, similar to what is seen for complex trait genetics in other diploid organisms such as humans.


Molecular Systems Biology | 2016

Deep learning for computational biology.

Christof Angermueller; Tanel Pärnamaa; Leopold Parts; Oliver Stegle

Technological advances in genomics and imaging have led to an explosion of molecular and cellular profiling data from large numbers of samples. This rapid increase in biological data dimension and acquisition rate is challenging conventional analysis strategies. Modern machine learning methods, such as deep learning, promise to leverage very large data sets for finding hidden structure within them, and for making accurate predictions. In this review, we discuss applications of this new breed of analysis approaches in regulatory genomics and cellular imaging. We provide background of what deep learning is, and the settings in which it can be successfully applied to derive biological insights. In addition to presenting specific applications and providing tips for practical use, we also highlight possible pitfalls and limitations to guide computational biologists when and how to make the most use of this new technology.


Molecular Biology and Evolution | 2014

A high-definition view of functional genetic variation from natural yeast genomes

Anders Bergström; Jared T. Simpson; Francisco Salinas; Benjamin Barré; Leopold Parts; Amin Zia; Alex N. Nguyen Ba; Alan M. Moses; Edward J. Louis; Ville Mustonen; Jonas Warringer; Richard Durbin; Gianni Liti

The question of how genetic variation in a population influences phenotypic variation and evolution is of major importance in modern biology. Yet much is still unknown about the relative functional importance of different forms of genome variation and how they are shaped by evolutionary processes. Here we address these questions by population level sequencing of 42 strains from the budding yeast Saccharomyces cerevisiae and its closest relative S. paradoxus. We find that genome content variation, in the form of presence or absence as well as copy number of genetic material, is higher within S. cerevisiae than within S. paradoxus, despite genetic distances as measured in single-nucleotide polymorphisms being vastly smaller within the former species. This genome content variation, as well as loss-of-function variation in the form of premature stop codons and frameshifting indels, is heavily enriched in the subtelomeres, strongly reinforcing the relevance of these regions to functional evolution. Genes affected by these likely functional forms of variation are enriched for functions mediating interaction with the external environment (sugar transport and metabolism, flocculation, metal transport, and metabolism). Our results and analyses provide a comprehensive view of genomic diversity in budding yeast and expose surprising and pronounced differences between the variation within S. cerevisiae and that within S. paradoxus. We also believe that the sequence data and de novo assemblies will constitute a useful resource for further evolutionary and population genomics studies.

Collaboration


Dive into the Leopold Parts's collaboration.

Top Co-Authors

Avatar

Richard Durbin

Wellcome Trust Sanger Institute

View shared research outputs
Top Co-Authors

Avatar

Gianni Liti

University of Nice Sophia Antipolis

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Oliver Stegle

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Ville Mustonen

Wellcome Trust Sanger Institute

View shared research outputs
Top Co-Authors

Avatar

Francisco Salinas

University of Nice Sophia Antipolis

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge