Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Eliseos J. Mucaki is active.

Publication


Featured researches published by Eliseos J. Mucaki.


Human Molecular Genetics | 2015

FANCM c.5791C>T nonsense mutation (rs144567652) induces exon skipping, affects DNA repair activity and is a familial breast cancer risk factor

Paolo Peterlongo; Irene Catucci; Mara Colombo; Laura Caleca; Eliseos J. Mucaki; Massimo Bogliolo; Maria Marín; Francesca Damiola; Loris Bernard; Valeria Pensotti; Sara Volorio; Valentina Dall'Olio; Alfons Meindl; Claus R. Bartram; Christian Sutter; Harald Surowy; Valérie Sornin; Marie Gabrielle Dondon; Séverine Eon-Marchais; Dominique Stoppa-Lyonnet; Nadine Andrieu; Olga M. Sinilnikova; Gillian Mitchell; Paul A. James; Ella R. Thompson; Marina Marchetti; Cristina Verzeroli; Carmen Tartari; Gabriele Lorenzo Capone; Anna Laura Putignano

Numerous genetic factors that influence breast cancer risk are known. However, approximately two-thirds of the overall familial risk remain unexplained. To determine whether some of the missing heritability is due to rare variants conferring high to moderate risk, we tested for an association between the c.5791C>T nonsense mutation (p.Arg1931*; rs144567652) in exon 22 of FANCM gene and breast cancer. An analysis of genotyping data from 8635 familial breast cancer cases and 6625 controls from different countries yielded an association between the c.5791C>T mutation and breast cancer risk [odds ratio (OR) = 3.93 (95% confidence interval (CI) = 1.28-12.11; P = 0.017)]. Moreover, we performed two meta-analyses of studies from countries with carriers in both cases and controls and of all available data. These analyses showed breast cancer associations with OR = 3.67 (95% CI = 1.04-12.87; P = 0.043) and OR = 3.33 (95% CI = 1.09-13.62; P = 0.032), respectively. Based on information theory-based prediction, we established that the mutation caused an out-of-frame deletion of exon 22, due to the creation of a binding site for the pre-mRNA processing protein hnRNP A1. Furthermore, genetic complementation analyses showed that the mutation influenced the DNA repair activity of the FANCM protein. In summary, we provide evidence for the first time showing that the common p.Arg1931* loss-of-function variant in FANCM is a risk factor for familial breast cancer.


Human Mutation | 2013

Prediction of Mutant mRNA Splice Isoforms by Information Theory‐Based Exon Definition

Eliseos J. Mucaki; Ben C. Shirley; Peter K. Rogan

Mutations that affect mRNA splicing often produce multiple mRNA isoforms, resulting in complex molecular phenotypes. Definition of an exon and its inclusion in mature mRNA relies on joint recognition of both acceptor and donor splice sites. This study predicts cryptic and exon‐skipping isoforms in mRNA produced by splicing mutations from the combined information contents (Ri, which measures binding‐site strength, in bits) and distribution of the splice sites defining these exons. The total information content of an exon (Ri,total) is the sum of the Ri values of its acceptor and donor splice sites, adjusted for the self‐information of the distance separating these sites, that is, the gap surprisal. Differences between total information contents of an exon (ΔRi,total) are predictive of the relative abundance of these exons in distinct processed mRNAs. Constraints on splice site and exon selection are used to eliminate nonconforming and poorly expressed isoforms. Molecular phenotypes are computed by the Automated Splice Site and Exon Definition Analysis (http://splice.uwo.ca) server. Predictions of splicing mutations were highly concordant (85.2%; n = 61) with published expression data. In silico exon definition analysis will contribute to streamlining assessment of abnormal and normal splice isoforms resulting from mutations.


F1000Research | 2014

Interpretation of mRNA splicing mutations in genetic disease: review of the literature and guidelines for information-theoretical analysis

Natasha G. Caminsky; Eliseos J. Mucaki; Peter K. Rogan

The interpretation of genomic variants has become one of the paramount challenges in the post-genome sequencing era. In this review we summarize nearly 20 years of research on the applications of information theory (IT) to interpret coding and non-coding mutations that alter mRNA splicing in rare and common diseases. We compile and summarize the spectrum of published variants analyzed by IT, to provide a broad perspective of the distribution of deleterious natural and cryptic splice site variants detected, as well as those affecting splicing regulatory sequences. Results for natural splice site mutations can be interrogated dynamically with Splicing Mutation Calculator, a companion software program that computes changes in information content for any splice site substitution, linked to corresponding publications containing these mutations. The accuracy of IT-based analysis was assessed in the context of experimentally validated mutations. Because splice site information quantifies binding affinity, IT-based analyses can discern the differences between variants that account for the observed reduced (leaky) versus abolished mRNA splicing. We extend this principle by comparing predicted mutations in natural, cryptic, and regulatory splice sites with observed deleterious phenotypic and benign effects. Our analysis of 1727 variants revealed a number of general principles useful for ensuring portability of these analyses and accurate input and interpretation of mutations. We offer guidelines for optimal use of IT software for interpretation of mRNA splicing mutations.


Human Mutation | 2011

Comprehensive prediction of mRNA splicing effects of BRCA1 and BRCA2 variants

Eliseos J. Mucaki; Peter Ainsworth; Peter K. Rogan

Variants of uncertain significance (VUS) in the BRCA1 and BRCA2 genes potentially affecting coding sequence as well as normal splicing activity have confounded predisposition testing in breast cancer. Here, we apply information theory to analyze BRCA1/2 mRNA splicing mutations categorized as VUS. The method was validated for 31 of 36 mutations known to cause missplicing in BRCA1/2 and all 26 that do not alter splicing. All single‐nucleotide variants in the Breast Cancer Information Resource (BIC; Breast Cancer Information Core Database; http://research.nhgri.nih.gov/bic; last access June 1, 2010) were then analyzed. Information analysis is similar in sensitivity to other predictive methods; however, the thermodynamic basis of the theory also enables splice‐site affinity to be determined accurately, which is important for assessing mutations that render natural splice sites partially functional and competition between cryptic and natural splice sites. We report 299 of 2,071 single‐nucleotide BIC mutations that are predicted to significantly weaken natural sites and/or strengthen cryptic splice sites, 171 of which are not designated as splicing mutations in the database. Splicing alterations are predicted for 68 of 690 BRCA1 and 60 of 958 BRCA2 mutations designated as VUS. These analyses should be useful in prioritizing suspected mutations for downstream expression studies and for predicting aberrantly spliced isoforms generated by these mutations. Hum Mutat 32:1–8, 2011.


Genomics, Proteomics & Bioinformatics | 2013

Interpretation, stratification and evidence for sequence variants affecting mRNA splicing in complete human genome sequences.

Ben C. Shirley; Eliseos J. Mucaki; Tyson Whitehead; Paul Igor Costea; Pelin Akan; Peter K. Rogan

Information theory-based methods have been shown to be sensitive and specific for predicting and quantifying the effects of non-coding mutations in Mendelian diseases. We present the Shannon pipeline software for genome-scale mutation analysis and provide evidence that the software predicts variants affecting mRNA splicing. Individual information contents (in bits) of reference and variant splice sites are compared and significant differences are annotated and prioritized. The software has been implemented for CLC-Bio Genomics platform. Annotation indicates the context of novel mutations as well as common and rare SNPs with splicing effects. Potential natural and cryptic mRNA splicing variants are identified, and null mutations are distinguished from leaky mutations. Mutations and rare SNPs were predicted in genomes of three cancer cell lines (U2OS, U251 and A431), which were supported by expression analyses. After filtering, tractable numbers of potentially deleterious variants are predicted by the software, suitable for further laboratory investigation. In these cell lines, novel functional variants comprised 6–17 inactivating mutations, 1–5 leaky mutations and 6–13 cryptic splicing mutations. Predicted effects were validated by RNA-seq analysis of the three aforementioned cancer cell lines, and expression microarray analysis of SNPs in HapMap cell lines.


Human Mutation | 2016

Prioritizing Variants in Complete Hereditary Breast and Ovarian Cancer Genes in Patients Lacking Known BRCA Mutations

Natasha G. Caminsky; Eliseos J. Mucaki; Ami M. Perri; Ruipeng Lu; Joan H. M. Knoll; Peter K. Rogan

BRCA1 and BRCA2 testing for hereditary breast and ovarian cancer (HBOC) does not identify all pathogenic variants. Sequencing of 20 complete genes in HBOC patients with uninformative test results (N = 287), including noncoding and flanking sequences of ATM, BARD1, BRCA1, BRCA2, CDH1, CHEK2, EPCAM, MLH1, MRE11A, MSH2, MSH6, MUTYH, NBN, PALB2, PMS2, PTEN, RAD51B, STK11, TP53, and XRCC2, identified 38,372 unique variants. We apply information theory (IT) to predict and prioritize noncoding variants of uncertain significance in regulatory, coding, and intronic regions based on changes in binding sites in these genes. Besides mRNA splicing, IT provides a common framework to evaluate potential affinity changes in transcription factor (TFBSs), splicing regulatory (SRBSs), and RNA‐binding protein (RBBSs) binding sites following mutation. We prioritized variants affecting the strengths of 10 splice sites (four natural, six cryptic), 148 SRBS, 36 TFBS, and 31 RBBS. Three variants were also prioritized based on their predicted effects on mRNA secondary (2°) structure and 17 for pseudoexon activation. Additionally, four frameshift, two in‐frame deletions, and five stop‐gain mutations were identified. When combined with pedigree information, complete gene sequence analysis can focus attention on a limited set of variants in a wide spectrum of functional mutation types for downstream functional and co‐segregation analysis.


Nucleic Acids Research | 2017

Discovery and validation of information theory-based transcription factor and cofactor binding site motifs

Ruipeng Lu; Eliseos J. Mucaki; Peter K. Rogan

Abstract Data from ChIP-seq experiments can derive the genome-wide binding specificities of transcription factors (TFs) and other regulatory proteins. We analyzed 765 ENCODE ChIP-seq peak datasets of 207 human TFs with a novel motif discovery pipeline based on recursive, thresholded entropy minimization. This approach, while obviating the need to compensate for skewed nucleotide composition, distinguishes true binding motifs from noise, quantifies the strengths of individual binding sites based on computed affinity and detects adjacent cofactor binding sites that coordinate with the targets of primary, immunoprecipitated TFs. We obtained contiguous and bipartite information theory-based position weight matrices (iPWMs) for 93 sequence-specific TFs, discovered 23 cofactor motifs for 127 TFs and revealed six high-confidence novel motifs. The reliability and accuracy of these iPWMs were determined via four independent validation methods, including the detection of experimentally proven binding sites, explanation of effects of characterized SNPs, comparison with previously published motifs and statistical analyses. We also predict previously unreported TF coregulatory interactions (e.g. TF complexes). These iPWMs constitute a powerful tool for predicting the effects of sequence variants in known binding sites, performing mutation analysis on regulatory SNPs and predicting previously unrecognized binding sites and target genes.


Breast Cancer Research and Treatment | 2017

Prevalence and spectrum of germline rare variants in BRCA1/2 and PALB2 among breast cancer cases in Sarawak, Malaysia

Xiaohong R. Yang; Beena C.R. Devi; Hyuna Sung; Jennifer Guida; Eliseos J. Mucaki; Yanzi Xiao; Ana F. Best; Lisa Garland; Yi Xie; Nan Hu; Maria Rodriguez-Herrera; Chaoyu Wang; Wen Luo; Belynda Hicks; Tieng Swee Tang; Karobi Moitra; Peter K. Rogan; Michael Dean

PurposeTo characterize the spectrum of germline mutations in BRCA1, BRCA2, and PALB2 in population-based unselected breast cancer cases in an Asian population.MethodsGermline DNA from 467 breast cancer patients in Sarawak General Hospital, Malaysia, where 93% of the breast cancer patients in Sarawak are treated, was sequenced for the entire coding region of BRCA1; BRCA2; PALB2; Exons 6, 7, and 8 of TP53; and Exons 7 and 8 of PTEN. Pathogenic variants included known pathogenic variants in ClinVar, loss of function variants, and variants that disrupt splice site.ResultsWe found 27 pathogenic variants (11 BRCA1, 10 BRCA2, 4 PALB2, and 2 TP53) in 34 patients, which gave a prevalence of germline mutations of 2.8, 3.23, and 0.86% for BRCA1, BRCA2, and PALB2, respectively. Compared to mutation non-carriers, BRCA1 mutation carriers were more likely to have an earlier age at onset, triple-negative subtype, and lower body mass index, whereas BRCA2 mutation carriers were more likely to have a positive family history. Mutation carrier cases had worse survival compared to non-carriers; however, the association was mostly driven by stage and tumor subtype. We also identified 19 variants of unknown significance, and some of them were predicted to alter splicing or transcription factor binding sites.ConclusionOur data provide insight into the genetics of breast cancer in this understudied group and suggest the need for modifying genetic testing guidelines for this population with a much younger age at diagnosis and more limited resources compared with Caucasian populations.


bioRxiv | 2017

Predicting Response to Platin Chemotherapy Agents with Biochemically-inspired Machine Learning

Eliseos J. Mucaki; Jonathan Z.L. Zhao; Daniel J. Lizotte; Peter K. Rogan

Selection of effective genes that accurately predict chemotherapy response could improve cancer outcomes. We compare optimized gene signatures for cisplatin, carboplatin, and oxaliplatin response, and respectively validate each with cancer patient data. Supervised support vector machine learning was used to derive gene sets whose expression was related to cell line GI50 values by backwards feature selection with cross-validation. Signatures at different GI50 thresholds distinguishing sensitivity from resistance contrast the contributions of genes at extreme vs. median thresholds. Ensembles of gene signatures at different thresholds are combined to reduce dependence on specific GI50 values for predicting drug response. The most accurate models for each platin are: cisplatin: BARD1, BCL2, BCL2L1, CDKN2C, FAAP24, FEN1, MAP3K1, MAPK13, MAPK3, NFKB1, NFKB2, SLC22A5, SLC31A2, TLR4, TWIST1; carboplatin: AKT1, EIF3K, ERCC1, GNGT1, GSR, MTHFR, NEDD4L, NLRP1, NRAS, RAF1, SGK1, TIGD1, TP53, VEGFB, VEGFC; oxaliplatin: BRAF, FCGR2A, IGF1, MSH2, NAGK, NFE2L2, NQO1, PANK3, SLC47A1, SLCO1B1, UGT1A1. TCGA bladder, ovarian and colorectal cancer patients were used to test cisplatin, carboplatin and oxaliplatin signatures (respectively), resulting in 71.0%, 60.2% and 54.5% accuracy in predicting disease recurrence and 59%, 61% and 72% accuracy in predicting remission. One cisplatin signature predicted 100% of recurrence in non-smoking bladder cancer patients (57% disease-free; N=19), and 79% recurrence in smokers (62% disease-free; N=35). This approach should be adaptable to other studies of chemotherapy response, independent of drug or cancer types.


bioRxiv | 2016

Discovery of Primary, Cofactor, and Novel Transcription Factor Binding Site Motifs by Recursive, Thresholded Entropy Minimization

Ruipeng Lu; Eliseos J. Mucaki; Peter K. Rogan

Data from ChIP-seq experiments can determine the genome-wide binding specificities of transcription factors (TFs) and other regulatory proteins. In the present study, we analyzed 745 ENCODE ChIP-seq peak datasets of 189 human TFs with a novel motif discovery method that is based on recursive, thresholded entropy minimization. This method is able to distinguish correct information models from noisy motifs, quantify the strengths of individual sites based on affinity, and detect adjacent cofactor binding sites that coordinate with primary TFs. We derived homogeneous and bipartite information models for 89 sequence-specific TFs, which enabled discovery of 24 cofactor motifs for 118 TFs, and revealed 6 high-confidence novel motifs. The reliability and accuracy of these models were determined via three independent quality control criteria, including the detection of experimentally proven binding sites, comparison with previously published motifs and statistical analyses. We also predict previously unreported TF cobinding interactions, and new components of known TF complexes. Because they are based on information theory, the derived models constitute a powerful tool for detecting and predicting the effects of variants in known binding sites, and predicting previously unrecognized binding sites and target genes.

Collaboration


Dive into the Eliseos J. Mucaki's collaboration.

Top Co-Authors

Avatar

Peter K. Rogan

University of Western Ontario

View shared research outputs
Top Co-Authors

Avatar

Ruipeng Lu

University of Western Ontario

View shared research outputs
Top Co-Authors

Avatar

Ben C. Shirley

University of Western Ontario

View shared research outputs
Top Co-Authors

Avatar

Natasha G. Caminsky

University of Western Ontario

View shared research outputs
Top Co-Authors

Avatar

Ami M. Perri

University of Western Ontario

View shared research outputs
Top Co-Authors

Avatar

Joan H. M. Knoll

University of Western Ontario

View shared research outputs
Top Co-Authors

Avatar

Jonathan Z.L. Zhao

University of Western Ontario

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge