Leszek P. Pryszcz
Pompeu Fabra University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Leszek P. Pryszcz.
PLOS ONE | 2013
Ferry Hagen; Paulo Cezar Ceresini; Itzhack Polacheck; Hansong Ma; Filip Van Nieuwerburgh; Toni Gabaldón; Sarah Kagan; E. Rhiannon Pursall; Hans L. Hoogveld; Leo van Iersel; Gunnar W. Klau; Steven Kelk; Leen Stougie; Karen H. Bartlett; Kerstin Voelz; Leszek P. Pryszcz; Elizabeth Castañeda; Márcia dos Santos Lazéra; Wieland Meyer; Dieter Deforce; Jacques F. Meis; Robin C. May; Corné H. W. Klaassen; Teun Boekhout
Over the past two decades, several fungal outbreaks have occurred, including the high-profile ‘Vancouver Island’ and ‘Pacific Northwest’ outbreaks, caused by Cryptococcus gattii, which has affected hundreds of otherwise healthy humans and animals. Over the same time period, C. gattii was the cause of several additional case clusters at localities outside of the tropical and subtropical climate zones where the species normally occurs. In every case, the causative agent belongs to a previously rare genotype of C. gattii called AFLP6/VGII, but the origin of the outbreak clades remains enigmatic. Here we used phylogenetic and recombination analyses, based on AFLP and multiple MLST datasets, and coalescence gene genealogy to demonstrate that these outbreaks have arisen from a highly-recombining C. gattii population in the native rainforest of Northern Brazil. Thus the modern virulent C. gattii AFLP6/VGII outbreak lineages derived from mating events in South America and then dispersed to temperate regions where they cause serious infections in humans and animals.
Nucleic Acids Research | 2011
Jaime Huerta-Cepas; Salvador Capella-Gutiérrez; Leszek P. Pryszcz; Ivan Denisov; Diego Kormes; Marina Marcet-Houben; Toni Gabaldón
The growing availability of complete genomic sequences from diverse species has brought about the need to scale up phylogenomic analyses, including the reconstruction of large collections of phylogenetic trees. Here, we present the third version of PhylomeDB (http://phylomeDB.org), a public database for genome-wide collections of gene phylogenies (phylomes). Currently, PhylomeDB is the largest phylogenetic repository and hosts 17 phylomes, comprising 416 093 trees and 165 840 alignments. It is also a major source for phylogeny-based orthology and paralogy predictions, covering about 5 million proteins in 717 fully-sequenced genomes. For each protein-coding gene in a seed genome, the database provides original and processed alignments, phylogenetic trees derived from various methods and phylogeny-based predictions of orthology and paralogy relationships. The new version of phylomeDB has been extended with novel data access and visualization features, including the possibility of programmatic access. Available seed species include model organisms such as human, yeast, Escherichia coli or Arabidopsis thaliana, but also alternative model species such as the human pathogen Candida albicans, or the pea aphid Acyrtosiphon pisum. Finally, PhylomeDB is currently being used by several genome sequencing projects that couple the genome annotation process with the reconstruction of the corresponding phylome, a strategy that provides relevant evolutionary insights.
Nucleic Acids Research | 2014
Jaime Huerta-Cepas; Salvador Capella-Gutiérrez; Leszek P. Pryszcz; Marina Marcet-Houben; Toni Gabaldón
Phylogenetic trees representing the evolutionary relationships of homologous genes are the entry point for many evolutionary analyses. For instance, the use of a phylogenetic tree can aid in the inference of orthology and paralogy relationships, and in the detection of relevant evolutionary events such as gene family expansions and contractions, horizontal gene transfer, recombination or incomplete lineage sorting. Similarly, given the plurality of evolutionary histories among genes encoded in a given genome, there is a need for the combined analysis of genome-wide collections of phylogenetic trees (phylomes). Here, we introduce a new release of PhylomeDB (http://phylomedb.org), a public repository of phylomes. Currently, PhylomeDB hosts 120 public phylomes, comprising >1.5 million maximum likelihood trees and multiple sequence alignments. In the current release, phylogenetic trees are annotated with taxonomic, protein-domain arrangement, functional and evolutionary information. PhylomeDB is also a major source for phylogeny-based predictions of orthology and paralogy, covering >10 million proteins across 1059 sequenced species. Here we describe newly implemented PhylomeDB features, and discuss a benchmark of the orthology predictions provided by the database, the impact of proteome updates and the use of the phylome approach in the analysis of newly sequenced genomes and transcriptomes.
Nucleic Acids Research | 2011
Leszek P. Pryszcz; Jaime Huerta-Cepas; Toni Gabaldón
Reliable prediction of orthology is central to comparative genomics. Approaches based on phylogenetic analyses closely resemble the original definition of orthology and paralogy and are known to be highly accurate. However, the large computational cost associated to these analyses is a limiting factor that often prevents its use at genomic scales. Recently, several projects have addressed the reconstruction of large collections of high-quality phylogenetic trees from which orthology and paralogy relationships can be inferred. This provides us with the opportunity to infer the evolutionary relationships of genes from multiple, independent, phylogenetic trees. Using such strategy, we combine phylogenetic information derived from different databases, to predict orthology and paralogy relationships for 4.1 million proteins in 829 fully sequenced genomes. We show that the number of independent sources from which a prediction is made, as well as the level of consistency across predictions, can be used as reliable confidence scores. A webserver has been developed to easily access these data (http://orthology.phylomedb.org), which provides users with a global repository of phylogeny-based orthology and paralogy predictions.
Nature Methods | 2016
Adrian M. Altenhoff; Brigitte Boeckmann; Salvador Capella-Gutiérrez; Daniel A. Dalquen; Todd DeLuca; Kristoffer Forslund; Jaime Huerta-Cepas; Benjamin Linard; Cecile Pereira; Leszek P. Pryszcz; Fabian Schreiber; Alan Wilter Sousa da Silva; Damian Szklarczyk; Clément-Marie Train; Peer Bork; Odile Lecompte; Christian von Mering; Ioannis Xenarios; Kimmen Sjölander; Lars Juhl Jensen; María Martín; Matthieu Muffato; Toni Gabaldón; Suzanna E. Lewis; Paul D. Thomas; Erik L. L. Sonnhammer; Christophe Dessimoz
Achieving high accuracy in orthology inference is essential for many comparative, evolutionary and functional genomic analyses, yet the true evolutionary history of genes is generally unknown and orthologs are used for very different applications across phyla, requiring different precision–recall trade-offs. As a result, it is difficult to assess the performance of orthology inference methods. Here, we present a community effort to establish standards and an automated web-based service to facilitate orthology benchmarking. Using this service, we characterize 15 well-established inference methods and resources on a battery of 20 different benchmarks. Standardized benchmarking provides a way for users to identify the most effective methods for the problem at hand, sets a minimum requirement for new tools and resources, and guides the development of more accurate orthology inference methods.
Genome Biology and Evolution | 2014
Leszek P. Pryszcz; Tibor Németh; Attila Gácser; Toni Gabaldón
The Candida parapsilosis species complex comprises a group of emerging human pathogens of varying virulence. This complex was recently subdivided into three different species: C. parapsilosis sensu stricto, C. metapsilosis, and C. orthopsilosis. Within the latter, at least two clearly distinct subspecies seem to be present among clinical isolates (Type 1 and Type 2). To gain insight into the genomic differences between these subspecies, we undertook the sequencing of a clinical isolate classified as Type 1 and compared it with the available sequence of a Type 2 clinical strain. Unexpectedly, the analysis of the newly sequenced strain revealed a highly heterozygous genome, which we show to be the consequence of a hybridization event between both identified subspecies. This implicitly suggests that C. orthopsilosis is able to mate, a so-far unanswered question. The resulting hybrid shows a chimeric genome that maintains a similar gene dosage from both parental lineages and displays ongoing loss of heterozygosity. Several of the differences found between the gene content in both strains relate to virulent-related families, with the hybrid strain presenting a higher copy number of genes coding for efflux pumps or secreted lipases. Remarkably, two clinical strains isolated from distant geographical locations (Texas and Singapore) are descendants of the same hybrid line, raising the intriguing possibility of a relationship between the hybridization event and the global spread of a virulent clone.
Nucleic Acids Research | 2016
Leszek P. Pryszcz; Toni Gabaldón
Many genomes display high levels of heterozygosity (i.e. presence of different alleles at the same loci in homologous chromosomes), being those of hybrid organisms an extreme such case. The assembly of highly heterozygous genomes from short sequencing reads is a challenging task because it is difficult to accurately recover the different haplotypes. When confronted with highly heterozygous genomes, the standard assembly process tends to collapse homozygous regions and reports heterozygous regions in alternative contigs. The boundaries between homozygous and heterozygous regions result in multiple assembly paths that are hard to resolve, which leads to highly fragmented assemblies with a total size larger than expected. This, in turn, causes numerous problems in downstream analyses such as fragmented gene models, wrong gene copy number, or broken synteny. To circumvent these caveats we have developed a pipeline that specifically deals with the assembly of heterozygous genomes by introducing a step to recognise and selectively remove alternative heterozygous contigs. We tested our pipeline on simulated and naturally-occurring heterozygous genomes and compared its accuracy to other existing tools. Our method is freely available at https://github.com/Gabaldonlab/redundans.
PLOS Genetics | 2015
Leszek P. Pryszcz; Tibor Németh; Ester Saus; Ewa Ksiezopolska; Eva Hegedűsová; Jozef Nosek; Kenneth H. Wolfe; Attila Gácser; Toni Gabaldón
Candida metapsilosis is a rarely-isolated, opportunistic pathogen that belongs to a clade of pathogenic yeasts known as the C. parapsilosis sensu lato species complex. To gain insight into the recent evolution of C. metapsilosis and the genetic basis of its virulence, we sequenced the genome of 11 clinical isolates from various locations, which we compared to each other and to the available genomes of the two remaining members of the complex: C. orthopsilosis and C. parapsilosis. Unexpectedly, we found compelling genomic evidence that C. metapsilosis is a highly heterozygous hybrid species, with all sequenced clinical strains resulting from the same past hybridization event involving two parental lineages that were approximately 4.5% divergent in sequence. This result indicates that the parental species are non-pathogenic, but that hybridization between them formed a new opportunistic pathogen, C. metapsilosis, that has achieved a worldwide distribution. We show that these hybrids are diploid and we identified strains carrying loci for both alternative mating types, which supports mating as the initial mechanism for hybrid formation. We trace the aftermath of this hybridization at the genomic level, and reconstruct the evolutionary relationships among the different strains. Recombination and introgression -resulting in loss of heterozygosis- between the two subgenomes have been rampant, and includes the partial overwriting of the MTLa mating locus in all strains. Collectively, our results shed light on the recent genomic evolution within the C. parapsilosis sensu lato complex, and argue for a re-definition of species within this clade, with at least five distinct homozygous lineages, some of which having the ability to form hybrids.
Genome Biology and Evolution | 2013
Leszek P. Pryszcz; Tibor Németh; Attila Gácser; Toni Gabaldón
Invasive candidiasis is the most commonly reported invasive fungal infection worldwide. Although Candida albicans remains the main cause, the incidence of emerging Candida species, such as C. parapsilosis is increasing. It has been postulated that C. parapsilosis clinical isolates result from a recent global expansion of a virulent clone. However, the availability of a single genome for this species has so far prevented testing this hypothesis at genomic scales. We present here the sequence of three additional strains from clinical and environmental samples. Our analyses reveal unexpected patterns of genomic variation, shared among distant strains, that argue against the clonal expansion hypothesis. All strains carry independent expansions involving an arsenite transporter homolog, pointing to the existence of directional selection in the environment, and independent origins of the two clinical isolates. Furthermore, we report the first evidence for the existence of recombination in this species. Altogether, our results shed new light onto the dynamics of genome evolution in C. parapsilosis.
Mitochondrion | 2012
Matus Valach; Leszek P. Pryszcz; Lubomir Tomaska; Attila Gácser; Toni Gabaldón; Jozef Nosek
Candida parapsilosis species complex includes three closely related species, namely C. parapsilosis (sensu stricto), C. orthopsilosis, and C. metapsilosis. Unlike most other yeast lineages, members of this species complex possess a linear mitochondrial genome. Yet, its circularized mutant form was identified in strains of C. orthopsilosis and C. metapsilosis. To investigate the underlying variability, we performed comparative analyses of the complete mitochondrial DNA sequences in a collection of strains. Our results demonstrate that in contrast to C. parapsilosis and C. metapsilosis, C. orthopsilosis exhibits remarkably high nucleotide diversity whose pattern is consistent with intraspecific genetic exchange.