Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Hashem A. Shihab is active.

Publication


Featured researches published by Hashem A. Shihab.


Human Mutation | 2013

Predicting the Functional, Molecular, and Phenotypic Consequences of Amino Acid Substitutions using Hidden Markov Models

Hashem A. Shihab; Julian Gough; David Neil Cooper; Peter D. Stenson; Gary L. A. Barker; Keith J. Edwards; Ian N.M. Day; Tom R. Gaunt

The rate at which nonsynonymous single nucleotide polymorphisms (nsSNPs) are being identified in the human genome is increasing dramatically owing to advances in whole‐genome/whole‐exome sequencing technologies. Automated methods capable of accurately and reliably distinguishing between pathogenic and functionally neutral nsSNPs are therefore assuming ever‐increasing importance. Here, we describe the Functional Analysis Through Hidden Markov Models (FATHMM) software and server: a species‐independent method with optional species‐specific weightings for the prediction of the functional effects of protein missense variants. Using a model weighted for human mutations, we obtained performance accuracies that outperformed traditional prediction methods (i.e., SIFT, PolyPhen, and PANTHER) on two separate benchmarks. Furthermore, in one benchmark, we achieve performance accuracies that outperform current state‐of‐the‐art prediction methods (i.e., SNPs&GO and MutPred). We demonstrate that FATHMM can be efficiently applied to high‐throughput/large‐scale human and nonhuman genome sequencing projects with the added benefit of phenotypic outcome associations. To illustrate this, we evaluated nsSNPs in wheat (Triticum spp.) to identify some of the important genetic variants responsible for the phenotypic differences introduced by intense selection during domestication. A Web‐based implementation of FATHMM, including a high‐throughput batch facility and a downloadable standalone package, is available at http://fathmm.biocompute.org.uk.


Bioinformatics | 2015

An integrative approach to predicting the functional effects of non-coding and coding sequence variation.

Hashem A. Shihab; Mark F. Rogers; Julian Gough; Matthew Mort; David Neil Cooper; Ian N.M. Day; Tom R. Gaunt; Colin Campbell

Motivation: Technological advances have enabled the identification of an increasingly large spectrum of single nucleotide variants within the human genome, many of which may be associated with monogenic disease or complex traits. Here, we propose an integrative approach, named FATHMM-MKL, to predict the functional consequences of both coding and non-coding sequence variants. Our method utilizes various genomic annotations, which have recently become available, and learns to weight the significance of each component annotation source. Results: We show that our method outperforms current state-of-the-art algorithms, CADD and GWAVA, when predicting the functional consequences of non-coding variants. In addition, FATHMM-MKL is comparable to the best of these algorithms when predicting the impact of coding variants. The method includes a confidence measure to rank order predictions. Availability and implementation: The FATHMM-MKL webserver is available at: http://fathmm.biocompute.org.uk Contact: [email protected] or [email protected] or [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.


International Journal of Epidemiology | 2015

Maternal pre-pregnancy BMI and gestational weight gain, offspring DNA methylation and later offspring adiposity: findings from the Avon Longitudinal Study of Parents and Children

Gemma C. Sharp; Debbie A. Lawlor; Rebecca C Richmond; Abigail Fraser; Andrew J Simpkin; Matthew Suderman; Hashem A. Shihab; Oliver Lyttleton; Wendy L. McArdle; Susan M. Ring; Tom R. Gaunt; George Davey Smith; Caroline L Relton

Background: Evidence suggests that in utero exposure to undernutrition and overnutrition might affect adiposity in later life. Epigenetic modification is suggested as a plausible mediating mechanism. Methods: We used multivariable linear regression and a negative control design to examine offspring epigenome-wide DNA methylation in relation to maternal and offspring adiposity in 1018 participants. Results: Compared with neonatal offspring of normal weight mothers, 28 and 1621 CpG sites were differentially methylated in offspring of obese and underweight mothers, respectively [false discovert rate (FDR)-corrected P-value < 0.05), with no overlap in the sites that maternal obesity and underweight relate to. A positive association, where higher methylation is associated with a body mass index (BMI) outside the normal range, was seen at 78.6% of the sites associated with obesity and 87.9% of the sites associated with underweight. Associations of maternal obesity with offspring methylation were stronger than associations of paternal obesity, supporting an intrauterine mechanism. There were no consistent associations of gestational weight gain with offspring DNA methylation. In general, sites that were hypermethylated in association with maternal obesity or hypomethylated in association with maternal underweight tended to be positively associated with offspring adiposity, and sites hypomethylated in association with maternal obesity or hypermethylated in association with maternal underweight tended to be inversely associated with offspring adiposity. Conclusions: Our data suggest that both maternal obesity and, to a larger degree, underweight affect the neonatal epigenome via an intrauterine mechanism, but weight gain during pregnancy has little effect. We found some evidence that associations of maternal underweight with lower offspring adiposity and maternal obesity with greater offspring adiposity may be mediated via increased DNA methylation.


Bioinformatics | 2013

Predicting the functional consequences of cancer-associated amino acid substitutions

Hashem A. Shihab; Julian Gough; David Neil Cooper; Ian N.M. Day; Tom R. Gaunt

Motivation: The number of missense mutations being identified in cancer genomes has greatly increased as a consequence of technological advances and the reduced cost of whole-genome/whole-exome sequencing methods. However, a high proportion of the amino acid substitutions detected in cancer genomes have little or no effect on tumour progression (passenger mutations). Therefore, accurate automated methods capable of discriminating between driver (cancer-promoting) and passenger mutations are becoming increasingly important. In our previous work, we developed the Functional Analysis through Hidden Markov Models (FATHMM) software and, using a model weighted for inherited disease mutations, observed improved performances over alternative computational prediction algorithms. Here, we describe an adaptation of our original algorithm that incorporates a cancer-specific model to potentiate the functional analysis of driver mutations. Results: The performance of our algorithm was evaluated using two separate benchmarks. In our analysis, we observed improved performances when distinguishing between driver mutations and other germ line variants (both disease-causing and putatively neutral mutations). In addition, when discriminating between somatic driver and passenger mutations, we observed performances comparable with the leading computational prediction algorithms: SPF-Cancer and TransFIC. Availability and implementation: A web-based implementation of our cancer-specific model, including a downloadable stand-alone package, is available at http://fathmm.biocompute.org.uk. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.


Nature Communications | 2015

Whole-genome sequence-based analysis of thyroid function

Peter N. Taylor; Eleonora Porcu; Shelby Chew; Purdey J. Campbell; Michela Traglia; Suzanne J. Brown; Benjamin H. Mullin; Hashem A. Shihab; Josine Min; Klaudia Walter; Yasin Memari; Jie Huang; Michael R. Barnes; John Beilby; Pimphen Charoen; Petr Danecek; Frank Dudbridge; Vincenzo Forgetta; Celia M. T. Greenwood; Elin Grundberg; Andrew D. Johnson; Jennie Hui; Ee Mun Lim; Shane McCarthy; Dawn Muddyman; Vijay Panicker; John Perry; Jordana T. Bell; Wei Yuan; Caroline L Relton

Normal thyroid function is essential for health, but its genetic architecture remains poorly understood. Here, for the heritable thyroid traits thyrotropin (TSH) and free thyroxine (FT4), we analyse whole-genome sequence data from the UK10K project (N=2,287). Using additional whole-genome sequence and deeply imputed data sets, we report meta-analysis results for common variants (MAF≥1%) associated with TSH and FT4 (N=16,335). For TSH, we identify a novel variant in SYN2 (MAF=23.5%, P=6.15 × 10−9) and a new independent variant in PDE8B (MAF=10.4%, P=5.94 × 10−14). For FT4, we report a low-frequency variant near B4GALT6/SLC25A52 (MAF=3.2%, P=1.27 × 10−9) tagging a rare TTR variant (MAF=0.4%, P=2.14 × 10−11). All common variants explain ≥20% of the variance in TSH and FT4. Analysis of rare variants (MAF<1%) using sequence kernel association testing reveals a novel association with FT4 in NRG1. Our results demonstrate that increased coverage in whole-genome sequence association studies identifies novel variants associated with thyroid function.


Human Genomics | 2014

Ranking non-synonymous single nucleotide polymorphisms based on disease concepts

Hashem A. Shihab; Julian Gough; Matthew Mort; David Neil Cooper; Ian N.M. Day; Tom R. Gaunt

As the number of non-synonymous single nucleotide polymorphisms (nsSNPs) identified through whole-exome/whole-genome sequencing programs increases, researchers and clinicians are becoming increasingly reliant upon computational prediction algorithms designed to prioritize potential functional variants for further study. A large proportion of existing prediction algorithms are ‘disease agnostic’ but are nevertheless quite capable of predicting when a mutation is likely to be deleterious. However, most clinical and research applications of these algorithms relate to specific diseases and would therefore benefit from an approach that discriminates between functional variants specifically related to that disease from those which are not. In a whole-exome/whole-genome sequencing context, such an approach could substantially reduce the number of false positive candidate mutations. Here, we test this postulate by incorporating a disease-specific weighting scheme into the Functional Analysis through Hidden Markov Models (FATHMM) algorithm. When compared to traditional prediction algorithms, we observed an overall reduction in the number of false positives identified using a disease-specific approach to functional prediction across 17 distinct disease concepts/categories. Our results illustrate the potential benefits of making disease-specific predictions when prioritizing candidate variants in relation to specific diseases. A web-based implementation of our algorithm is available at http://fathmm.biocompute.org.uk.


Human Molecular Genetics | 2015

Mosaic structural variation in children with developmental disorders

Daniel A. King; Wendy D Jones; Yanick J. Crow; Anna F. Dominiczak; Nicola A. Foster; Tom R. Gaunt; Jade Harris; Stephen W. Hellens; Tessa Homfray; J A Innes; Elizabeth A. Jones; Shelagh Joss; Abhijit Kulkarni; Sahar Mansour; Andrew D. Morris; Michael J. Parker; David J. Porteous; Hashem A. Shihab; Blair H. Smith; Katrina Tatton-Brown; John Tolmie; Maciej Trzaskowski; Pradeep Vasudevan; Emma Wakeling; Michael Wright; Robert Plomin; Nicholas J. Timpson

Delineating the genetic causes of developmental disorders is an area of active investigation. Mosaic structural abnormalities, defined as copy number or loss of heterozygosity events that are large and present in only a subset of cells, have been detected in 0.2–1.0% of children ascertained for clinical genetic testing. However, the frequency among healthy children in the community is not well characterized, which, if known, could inform better interpretation of the pathogenic burden of this mutational category in children with developmental disorders. In a case–control analysis, we compared the rate of large-scale mosaicism between 1303 children with developmental disorders and 5094 children lacking developmental disorders, using an analytical pipeline we developed, and identified a substantial enrichment in cases (odds ratio = 39.4, P-value 1.073e − 6). A meta-analysis that included frequency estimates among an additional 7000 children with congenital diseases yielded an even stronger statistical enrichment (P-value 1.784e − 11). In addition, to maximize the detection of low-clonality events in probands, we applied a trio-based mosaic detection algorithm, which detected two additional events in probands, including an individual with genome-wide suspected chimerism. In total, we detected 12 structural mosaic abnormalities among 1303 children (0.9%). Given the burden of mosaicism detected in cases, we suspected that many of the events detected in probands were pathogenic. Scrutiny of the genotypic–phenotypic relationship of each detected variant assessed that the majority of events are very likely pathogenic. This work quantifies the burden of structural mosaicism as a cause of developmental disorders.


Diabetes | 2017

The role of DNA methylation in type 2 diabetes aetiology – using genotype as a causal anchor

Hannah R Elliott; Hashem A. Shihab; Gabrielle A. Lockett; John W. Holloway; Allan F. McRae; George Davey Smith; Susan M. Ring; Tom R. Gaunt; Caroline L Relton

Several studies have investigated the relationship between genetic variation and DNA methylation with respect to type 2 diabetes, but it is unknown if DNA methylation is a mediator in the disease pathway or if it is altered in response to disease state. This study uses genotypic information as a causal anchor to help decipher the likely role of DNA methylation measured in peripheral blood in the etiology of type 2 diabetes. Illumina HumanMethylation450 BeadChip data were generated on 1,018 young individuals from the Avon Longitudinal Study of Parents and Children (ALSPAC) cohort. In stage 1, 118 unique associations between published type 2 diabetes single nucleotide polymorphisms (SNPs) and genome-wide methylation (methylation quantitative trait loci [mQTLs]) were identified. In stage 2, a further 226 mQTLs were identified between 202 additional independent non–type 2 diabetes SNPs and CpGs identified in stage 1. Where possible, associations were replicated in independent cohorts of similar age. We discovered that around half of known type 2 diabetes SNPs are associated with variation in DNA methylation and postulated that methylation could either be on a causal pathway to future disease or could be a noncausal biomarker. For one locus (KCNQ1), we were able to provide further evidence that methylation is likely to be on the causal pathway to disease in later life.


Diabetic Medicine | 2017

Role of DNA Methylation in Type 2 Diabetes Etiology

Hannah R Elliott; Hashem A. Shihab; Gabrielle A. Lockett; John W. Holloway; Allan F. McRae; George Davey Smith; Susan M. Ring; Tom R. Gaunt; Caroline L Relton

Several studies have investigated the relationship between genetic variation and DNA methylation with respect to type 2 diabetes, but it is unknown if DNA methylation is a mediator in the disease pathway or if it is altered in response to disease state. This study uses genotypic information as a causal anchor to help decipher the likely role of DNA methylation measured in peripheral blood in the etiology of type 2 diabetes. Illumina HumanMethylation450 BeadChip data were generated on 1,018 young individuals from the Avon Longitudinal Study of Parents and Children (ALSPAC) cohort. In stage 1, 118 unique associations between published type 2 diabetes single nucleotide polymorphisms (SNPs) and genome-wide methylation (methylation quantitative trait loci [mQTLs]) were identified. In stage 2, a further 226 mQTLs were identified between 202 additional independent non–type 2 diabetes SNPs and CpGs identified in stage 1. Where possible, associations were replicated in independent cohorts of similar age. We discovered that around half of known type 2 diabetes SNPs are associated with variation in DNA methylation and postulated that methylation could either be on a causal pathway to future disease or could be a noncausal biomarker. For one locus (KCNQ1), we were able to provide further evidence that methylation is likely to be on the causal pathway to disease in later life.


Bioinformatics | 2018

FATHMM-XF: accurate prediction of pathogenic point mutations via extended features

Mark F. Rogers; Hashem A. Shihab; Matthew Mort; David Neil Cooper; Tom R. Gaunt; Colin Campbell

Summary We present FATHMM‐XF, a method for predicting pathogenic point mutations in the human genome. Drawing on an extensive feature set, FATHMM‐XF outperforms competitors on benchmark tests, particularly in non‐coding regions where the majority of pathogenic mutations are likely to be found. Availability and implementation The FATHMM‐XF web server is available at http://fathmm.biocompute.org.uk/fathmm‐xf/, and as tracks on the Genome Tolerance Browser: http://gtb.biocompute.org.uk. Predictions are provided for human genome version GRCh37/hg19. The data used for this project can be downloaded from: http://fathmm.biocompute.org.uk/fathmm‐xf/

Collaboration


Dive into the Hashem A. Shihab's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge