Alejandro Sifrim
Wellcome Trust Sanger Institute
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Alejandro Sifrim.
The Lancet | 2015
Caroline F. Wright; Tomas Fitzgerald; Wendy D Jones; Stephen Clayton; Jeremy McRae; Margriet van Kogelenberg; Daniel A. King; Kirsty Ambridge; Daniel M Barrett; Tanya Bayzetinova; A. Paul Bevan; Eugene Bragin; Eleni A. Chatzimichali; Susan M. Gribble; Philip Jones; Netravathi Krishnappa; Laura E Mason; Ray Miller; Katherine I. Morley; Vijaya Parthiban; Elena Prigmore; Diana Rajan; Alejandro Sifrim; G. Jawahar Swaminathan; Adrian Tivey; Anna Middleton; Michael W. Parker; Nigel P. Carter; Jeffrey C. Barrett; David Fitzpatrick
Summary Background Human genome sequencing has transformed our understanding of genomic variation and its relevance to health and disease, and is now starting to enter clinical practice for the diagnosis of rare diseases. The question of whether and how some categories of genomic findings should be shared with individual research participants is currently a topic of international debate, and development of robust analytical workflows to identify and communicate clinically relevant variants is paramount. Methods The Deciphering Developmental Disorders (DDD) study has developed a UK-wide patient recruitment network involving over 180 clinicians across all 24 regional genetics services, and has performed genome-wide microarray and whole exome sequencing on children with undiagnosed developmental disorders and their parents. After data analysis, pertinent genomic variants were returned to individual research participants via their local clinical genetics team. Findings Around 80 000 genomic variants were identified from exome sequencing and microarray analysis in each individual, of which on average 400 were rare and predicted to be protein altering. By focusing only on de novo and segregating variants in known developmental disorder genes, we achieved a diagnostic yield of 27% among 1133 previously investigated yet undiagnosed children with developmental disorders, whilst minimising incidental findings. In families with developmentally normal parents, whole exome sequencing of the child and both parents resulted in a 10-fold reduction in the number of potential causal variants that needed clinical evaluation compared to sequencing only the child. Most diagnostic variants identified in known genes were novel and not present in current databases of known disease variation. Interpretation Implementation of a robust translational genomics workflow is achievable within a large-scale rare disease research study to allow feedback of potentially diagnostic findings to clinicians and research participants. Systematic recording of relevant clinical data, curation of a gene–phenotype knowledge base, and development of clinical decision support software are needed in addition to automated exclusion of almost all variants, which is crucial for scalable prioritisation and review of possible diagnostic variants. However, the resource requirements of development and maintenance of a clinical reporting system within a research setting are substantial. Funding Health Innovation Challenge Fund, a parallel funding partnership between the Wellcome Trust and the UK Department of Health.
Nature Genetics | 2012
Jeroen Van Houdt; Beata Nowakowska; Sérgio B. de Sousa; Barbera D. C. van Schaik; Eve Seuntjens; Nelson Avonce; Alejandro Sifrim; Omar A. Abdul-Rahman; Marie Jose H. van den Boogaard; Armand Bottani; Marco Castori; Valérie Cormier-Daire; Matthew A. Deardorff; Isabel Filges; Alan Fryer; Jean Pierre Fryns; Simone Gana; Livia Garavelli; Gabriele Gillessen-Kaesbach; Bryan D. Hall; Denise Horn; Danny Huylebroeck; Jakub Klapecki; Małgorzata Krajewska-Walasek; Alma Kuechler; Saskia M. Maas; Kay D. MacDermot; Shane McKee; Alex Magee; Stella A. de Man
Nicolaides-Baraitser syndrome (NBS) is characterized by sparse hair, distinctive facial morphology, distal-limb anomalies and intellectual disability. We sequenced the exomes of ten individuals with NBS and identified heterozygous variants in SMARCA2 in eight of them. Extended molecular screening identified nonsynonymous SMARCA2 mutations in 36 of 44 individuals with NBS; these mutations were confirmed to be de novo when parental samples were available. SMARCA2 encodes the core catalytic unit of the SWI/SNF ATP-dependent chromatin remodeling complex that is involved in the regulation of gene transcription. The mutations cluster within sequences that encode ultra-conserved motifs in the catalytic ATPase region of the protein. These alterations likely do not impair SWI/SNF complex assembly but may be associated with disrupted ATPase activity. The identification of SMARCA2 mutations in humans provides insight into the function of the Snf2 helicase family.
Nature Methods | 2013
Alejandro Sifrim; Dusan Popovic; Léon-Charles Tranchevent; Amin Ardeshirdavani; Ryo Sakai; Peter Konings; Joris Vermeesch; Jan Aerts; Bart De Moor; Yves Moreau
Massively parallel sequencing greatly facilitates the discovery of novel disease genes causing Mendelian and oligogenic disorders. However, many mutations are present in any individual genome, and identifying which ones are disease causing remains a largely open problem. We introduce eXtasy, an approach to prioritize nonsynonymous single-nucleotide variants (nSNVs) that substantially improves prediction of disease-causing variants in exome sequencing data by integrating variant impact prediction, haploinsufficiency prediction and phenotype-specific gene prioritization.
Nature Genetics | 2015
Nadia A. Akawi; Jeremy McRae; Morad Ansari; Meena Balasubramanian; Moira Blyth; Angela F. Brady; Stephen Clayton; Trevor Cole; Charu Deshpande; Tomas Fitzgerald; Nicola Foulds; Richard Francis; George C. Gabriel; Sebastian S. Gerety; Judith A. Goodship; Emma Hobson; Wendy D Jones; Shelagh Joss; Daniel A. King; Nikolai T. Klena; Ajith Kumar; Melissa Lees; Chris Lelliott; Jenny Lord; Dominic McMullan; Mary O'Regan; Deborah Osio; Virginia Piombo; Elena Prigmore; Diana Rajan
Discovery of most autosomal recessive disease-associated genes has involved analysis of large, often consanguineous multiplex families or small cohorts of unrelated individuals with a well-defined clinical condition. Discovery of new dominant causes of rare, genetically heterogeneous developmental disorders has been revolutionized by exome analysis of large cohorts of phenotypically diverse parent-offspring trios. Here we analyzed 4,125 families with diverse, rare and genetically heterogeneous developmental disorders and identified four new autosomal recessive disorders. These four disorders were identified by integrating Mendelian filtering (selecting probands with rare, biallelic and putatively damaging variants in the same gene) with statistical assessments of (i) the likelihood of sampling the observed genotypes from the general population and (ii) the phenotypic similarity of patients with recessive variants in the same candidate gene. This new paradigm promises to catalyze the discovery of novel recessive disorders, especially those with less consistent or nonspecific clinical presentations and those caused predominantly by compound heterozygous genotypes.
BMC Research Notes | 2011
Georgios A. Pavlopoulos; Sean D. Hooper; Alejandro Sifrim; Reinhard Schneider; Jan Aerts
BackgroundBiological processes such as metabolic pathways, gene regulation or protein-protein interactions are often represented as graphs in systems biology. The understanding of such networks, their analysis, and their visualization are today important challenges in life sciences. While a great variety of visualization tools that try to address most of these challenges already exists, only few of them succeed to bridge the gap between visualization and network analysis.FindingsMedusa is a powerful tool for visualization and clustering analysis of large-scale biological networks. It is highly interactive and it supports weighted and unweighted multi-edged directed and undirected graphs. It combines a variety of layouts and clustering methods for comprehensive views and advanced data analysis. Its main purpose is to integrate visualization and analysis of heterogeneous data from different sources into a single network.ConclusionsMedusa provides a concise visual tool, which is helpful for network analysis and interpretation. Medusa is offered both as a standalone application and as an applet written in Java. It can be found at: https://sites.google.com/site/medusa3visualization.
Genome Medicine | 2012
Alejandro Sifrim; Jeroen Van Houdt; Léon-Charles Tranchevent; Beata Nowakowska; Ryo Sakai; Georgios A. Pavlopoulos; Koen Devriendt; Joris Vermeesch; Yves Moreau; Jan Aerts
The increasing size and complexity of exome/genome sequencing data requires new tools for clinical geneticists to discover disease-causing variants. Bottlenecks in identifying the causative variation include poor cross-sample querying, constantly changing functional annotation and not considering existing knowledge concerning the phenotype. We describe a methodology that facilitates exploration of patient sequencing data towards identification of causal variants under different genetic hypotheses. Annotate-it facilitates handling, analysis and interpretation of high-throughput single nucleotide variant data. We demonstrate our strategy using three case studies. Annotate-it is freely available and test data are accessible to all users at http://www.annotate-it.org.
Biodata Mining | 2013
Georgios A. Pavlopoulos; Anastasis Oulas; Ernesto Iacucci; Alejandro Sifrim; Yves Moreau; Reinhard Schneider; Jan Aerts; Ioannis Iliopoulos
Elucidating the content of a DNA sequence is critical to deeper understand and decode the genetic information for any biological system. As next generation sequencing (NGS) techniques have become cheaper and more advanced in throughput over time, great innovations and breakthrough conclusions have been generated in various biological areas. Few of these areas, which get shaped by the new technological advances, involve evolution of species, microbial mapping, population genetics, genome-wide association studies (GWAs), comparative genomics, variant analysis, gene expression, gene regulation, epigenetics and personalized medicine. While NGS techniques stand as key players in modern biological research, the analysis and the interpretation of the vast amount of data that gets produced is a not an easy or a trivial task and still remains a great challenge in the field of bioinformatics. Therefore, efficient tools to cope with information overload, tackle the high complexity and provide meaningful visualizations to make the knowledge extraction easier are essential. In this article, we briefly refer to the sequencing methodologies and the available equipment to serve these analyses and we describe the data formats of the files which get produced by them. We conclude with a thorough review of tools developed to efficiently store, analyze and visualize such data with emphasis in structural variation analysis and comparative genomics. We finally comment on their functionality, strengths and weaknesses and we discuss how future applications could further develop in this field.
Nucleic Acids Research | 2016
Sarah Elshal; Léon-Charles Tranchevent; Alejandro Sifrim; Amin Ardeshirdavani; Jesse Davis; Yves Moreau
Disease-gene identification is a challenging process that has multiple applications within functional genomics and personalized medicine. Typically, this process involves both finding genes known to be associated with the disease (through literature search) and carrying out preliminary experiments or screens (e.g. linkage or association studies, copy number analyses, expression profiling) to determine a set of promising candidates for experimental validation. This requires extensive time and monetary resources. We describe Beegle, an online search and discovery engine that attempts to simplify this process by automating the typical approaches. It starts by mining the literature to quickly extract a set of genes known to be linked with a given query, then it integrates the learning methodology of Endeavour (a gene prioritization tool) to train a genomic model and rank a set of candidate genes to generate novel hypotheses. In a realistic evaluation setup, Beegle has an average recall of 84% in the top 100 returned genes as a search engine, which improves the discovery engine by 12.6% in the top 5% prioritized genes. Beegle is publicly available at http://beegle.esat.kuleuven.be/.
Nature | 2018
Ievgenia Pastushenko; Audrey Brisebarre; Alejandro Sifrim; Marco Fioramonti; Tatiana Revenco; Soufiane Boumahdi; Alexandra Van Keymeulen; Daniel Brown; Virginie Moers; Sophie Lemaire; Sarah De Clercq; Esmeralda Minguijón; Cédric Balsat; Youri Sokolow; Christine Dubois; Florian De Cock; Samuel Scozzaro; Federico Sopena; Angel Lanas; Nicky D’Haene; Isabelle Salmon; Jean-Christophe Marine; Thierry Voet; Panagiota A. Sotiropoulou; Cédric Blanpain
In cancer, the epithelial-to-mesenchymal transition (EMT) is associated with tumour stemness, metastasis and resistance to therapy. It has recently been proposed that, rather than being a binary process, EMT occurs through distinct intermediate states. However, there is no direct in vivo evidence for this idea. Here we screen a large panel of cell surface markers in skin and mammary primary tumours, and identify the existence of multiple tumour subpopulations associated with different EMT stages: from epithelial to completely mesenchymal states, passing through intermediate hybrid states. Although all EMT subpopulations presented similar tumour-propagating cell capacity, they displayed differences in cellular plasticity, invasiveness and metastatic potential. Their transcriptional and epigenetic landscapes identify the underlying gene regulatory networks, transcription factors and signalling pathways that control these different EMT transition states. Finally, these tumour subpopulations are localized in different niches that differentially regulate EMT transition states.Epithelial-to-mesenchymal transition in tumour cells occurs through distinct intermediate states, associated with different metastatic potential, cellular properties, gene expression, and chromatin landscape
Genetics in Medicine | 2018
Caroline F. Wright; Jeremy McRae; Stephen Clayton; Giuseppe Gallone; Stuart A. Aitken; Tomas Fitzgerald; Peter M. Jones; Elena Prigmore; Diana Rajan; Jenny Lord; Alejandro Sifrim; R Kelsell; Michael J. Parker; Jeffrey C. Barrett; FitzPatrick; Helen V. Firth
PurposeGiven the rapid pace of discovery in rare disease genomics, it is likely that improvements in diagnostic yield can be made by systematically reanalyzing previously generated genomic sequence data in light of new knowledge.MethodsWe tested this hypothesis in the United Kingdom–wide Deciphering Developmental Disorders study, where in 2014 we reported a diagnostic yield of 27% through whole-exome sequencing of 1,133 children with severe developmental disorders and their parents. We reanalyzed existing data using improved variant calling methodologies, novel variant detection algorithms, updated variant annotation, evidence-based filtering strategies, and newly discovered disease-associated genes.ResultsWe are now able to diagnose an additional 182 individuals, taking our overall diagnostic yield to 454/1,133 (40%), and another 43 (4%) have a finding of uncertain clinical significance. The majority of these new diagnoses are due to novel developmental disorder–associated genes discovered since our original publication.ConclusionThis study highlights the importance of coupling large-scale research with clinical practice, and of discussing the possibility of iterative reanalysis and recontact with patients and health professionals at an early stage. We estimate that implementing parent–offspring whole-exome sequencing as a first-line diagnostic test for developmental disorders would diagnose >50% of patients.