Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Uday S. Evani is active.

Publication


Featured researches published by Uday S. Evani.


Science | 2013

Integrative annotation of variants from 1092 humans: application to cancer genomics.

Ekta Khurana; Yao Fu; Vincenza Colonna; Xinmeng Jasmine Mu; Hyun Min Kang; Tuuli Lappalainen; Andrea Sboner; Lucas Lochovsky; Jieming Chen; Arif Harmanci; Jishnu Das; Alexej Abyzov; Suganthi Balasubramanian; Kathryn Beal; Dimple Chakravarty; Daniel Challis; Yuan Chen; Declan Clarke; Laura Clarke; Fiona Cunningham; Uday S. Evani; Paul Flicek; Robert Fragoza; Erik Garrison; Richard A. Gibbs; Zeynep H. Gümüş; Javier Herrero; Naoki Kitabayashi; Yong Kong; Kasper Lage

Introduction Plummeting sequencing costs have led to a great increase in the number of personal genomes. Interpreting the large number of variants in them, particularly in noncoding regions, is a current challenge. This is especially the case for somatic variants in cancer genomes, a large proportion of which are noncoding. Prioritization of candidate noncoding cancer drivers based on patterns of selection. (Step 1) Filter somatic variants to exclude 1000 Genomes polymorphisms; (2) retain variants in noncoding annotations; (3) retain those in “sensitive” regions; (4) prioritize those disrupting a transcription-factor binding motif and (5) residing near the center of a biological network; (6) prioritize ones in annotation blocks mutated in multiple cancer samples. Methods We investigated patterns of selection in DNA elements from the ENCODE project using the full spectrum of variants from 1092 individuals in the 1000 Genomes Project (Phase 1), including single-nucleotide variants (SNVs), short insertions and deletions (indels), and structural variants (SVs). Although we analyzed broad functional annotations, such as all transcription-factor binding sites, we focused more on highly specific categories such as distal binding sites of factor ZNF274. The greater statistical power of the Phase 1 data set compared with earlier ones allowed us to differentiate the selective constraints on these categories. We also used connectivity information between elements from protein-protein-interaction and regulatory networks. We integrated all the information on selection to develop a workflow (FunSeq) to prioritize personal-genome variants on the basis of their deleterious impact. As a proof of principle, we experimentally validated and characterized a few candidate variants. Results We identified a specific subgroup of noncoding categories with almost as much selective constraint as coding genes: “ultrasensitive” regions. We also uncovered a number of clear patterns of selection. Elements more consistently active across tissues and both maternal and paternal alleles (in terms of allele-specific activity) are under stronger selection. Variants disruptive because of mechanistic effects on transcription-factor binding (i.e. “motif-breakers”) are selected against. Higher network connectivity (i.e. for hubs) is associated with higher constraint. Additionally, many hub promoters and regulatory elements show evidence of recent positive selection. Overall, indels and SVs follow the same pattern as SNVs; however, there are notable exceptions. For instance, enhancers are enriched for SVs formed by nonallelic homologous recombination. We integrated these patterns of selection into the FunSeq prioritization workflow and applied it to cancer variants, because they present a strong contrast to inherited polymorphisms. In particular, application to ~90 cancer genomes (breast, prostate and medulloblastoma) reveals nearly a hundred candidate noncoding drivers. Discussion Our approach can be readily used to prioritize variants in cancer and is immediately applicable in a precision-medicine context. It can be further improved by incorporation of larger-scale population sequencing, better annotations, and expression data from large cohorts. Identifying Important Identifiers Each of us has millions of sequence variations in our genomes. Signatures of purifying or negative selection should help identify which of those variations is functionally important. Khurana et al. (1235587) used sequence polymorphisms from 1092 humans across 14 populations to identify patterns of selection, especially in noncoding regulatory regions. Noncoding regions under very strong negative selection included binding sites of some chromatin and general transcription factors (TFs) and core motifs of some important TF families. Positive selection in TF binding sites tended to occur in network hub promoters. Many recurrent somatic cancer variants occurred in noncoding regulatory regions and thus might indicate mutations that drive cancer. Regions under strong selection in the human genome identify noncoding regulatory elements with possible roles in disease. Interpreting variants, especially noncoding ones, in the increasing number of personal genomes is challenging. We used patterns of polymorphisms in functionally annotated regions in 1092 humans to identify deleterious variants; then we experimentally validated candidates. We analyzed both coding and noncoding regions, with the former corroborating the latter. We found regions particularly sensitive to mutations (“ultrasensitive”) and variants that are disruptive because of mechanistic effects on transcription-factor binding (that is, “motif-breakers”). We also found variants in regions with higher network centrality tend to be deleterious. Insertions and deletions followed a similar pattern to single-nucleotide variants, with some notable exceptions (e.g., certain deletions and enhancers). On the basis of these patterns, we developed a computational tool (FunSeq), whose application to ~90 cancer genomes reveals nearly a hundred candidate noncoding drivers.


Aging Cell | 2012

Proteomic analysis of age-dependent changes in protein solubility identifies genes that modulate lifespan

Pedro Reis-Rodrigues; Gregg Czerwieniec; Theodore W. Peters; Uday S. Evani; Emily A. Gaman; Maithili C. Vantipalli; Sean D. Mooney; Bradford W. Gibson; Gordon J. Lithgow; Robert E. Hughes

While it is generally recognized that misfolding of specific proteins can cause late‐onset disease, the contribution of protein aggregation to the normal aging process is less well understood. To address this issue, a mass spectrometry‐based proteomic analysis was performed to identify proteins that adopt sodium dodecyl sulfate (SDS)‐insoluble conformations during aging in Caenorhabditis elegans. SDS‐insoluble proteins extracted from young and aged C. elegans were chemically labeled by isobaric tagging for relative and absolute quantification (iTRAQ) and identified by liquid chromatography and mass spectrometry. Two hundred and three proteins were identified as being significantly enriched in an SDS‐insoluble fraction in aged nematodes and were largely absent from a similar protein fraction in young nematodes. The SDS‐insoluble fraction in aged animals contains a diverse range of proteins including a large number of ribosomal proteins. Gene ontology analysis revealed highly significant enrichments for energy production and translation functions. Expression of genes encoding insoluble proteins observed in aged nematodes was knocked down using RNAi, and effects on lifespan were measured. 41% of genes tested were shown to extend lifespan after RNAi treatment, compared with 18% in a control group of genes. These data indicate that genes encoding proteins that become insoluble with age are enriched for modifiers of lifespan. This demonstrates that proteomic approaches can be used to identify genes that modify lifespan. Finally, these observations indicate that the accumulation of insoluble proteins with diverse functions may be a general feature of aging.


Human Mutation | 2010

In silico functional profiling of human disease‐associated and polymorphic amino acid substitutions

Matthew Mort; Uday S. Evani; Vidhya G. Krishnan; Kishore K. Kamati; Peter H. Baenziger; Angshuman Bagchi; Brandon Peters; Rakesh Sathyesh; Biao Li; Yanan Sun; Bin Xue; Nigam H. Shah; Maricel G. Kann; David Neil Cooper; Predrag Radivojac; Sean D. Mooney

An important challenge in translational bioinformatics is to understand how genetic variation gives rise to molecular changes at the protein level that can precipitate both monogenic and complex disease. To this end, we compiled datasets of human disease‐associated amino acid substitutions (AAS) in the contexts of inherited monogenic disease, complex disease, functional polymorphisms with no known disease association, and somatic mutations in cancer, and compared them with respect to predicted functional sites in proteins. Using the sequence homology‐based tool SIFT to estimate the proportion of deleterious AAS in each dataset, only complex disease AAS were found to be indistinguishable from neutral polymorphic AAS. Investigation of monogenic disease AAS predicted to be nondeleterious by SIFT were characterized by a significant enrichment for inherited AAS within solvent accessible residues, regions of intrinsic protein disorder, and an association with the loss or gain of various posttranslational modifications. Sites of structural and/or functional interest were therefore surmised to constitute useful additional features with which to identify the molecular disruptions caused by deleterious AAS. A range of bioinformatic tools, designed to predict structural and functional sites in protein sequences, were then employed to demonstrate that intrinsic biases exist in terms of the distribution of different types of human AAS with respect to specific structural, functional and pathological features. Our Web tool, designed to potentiate the functional profiling of novel AAS, has been made available at http://profile.mutdb.org/. Hum Mutat 31:1–12, 2010.


BMC Genomics | 2012

Atlas2 Cloud: a framework for personal genome analysis in the cloud

Uday S. Evani; Danny Challis; Jin Yu; Andrew R. Jackson; Sameer Paithankar; Matthew N. Bainbridge; Adinarayana Jakkamsetti; Peter Pham; Cristian Coarfa; Aleksandar Milosavljevic; Fuli Yu

BackgroundUntil recently, sequencing has primarily been carried out in large genome centers which have invested heavily in developing the computational infrastructure that enables genomic sequence analysis. The recent advancements in next generation sequencing (NGS) have led to a wide dissemination of sequencing technologies and data, to highly diverse research groups. It is expected that clinical sequencing will become part of diagnostic routines shortly. However, limited accessibility to computational infrastructure and high quality bioinformatic tools, and the demand for personnel skilled in data analysis and interpretation remains a serious bottleneck. To this end, the cloud computing and Software-as-a-Service (SaaS) technologies can help address these issues.ResultsWe successfully enabled the Atlas2 Cloud pipeline for personal genome analysis on two different cloud service platforms: a community cloud via the Genboree Workbench, and a commercial cloud via the Amazon Web Services using Software-as-a-Service model. We report a case study of personal genome analysis using our Atlas2 Genboree pipeline. We also outline a detailed cost structure for running Atlas2 Amazon on whole exome capture data, providing cost projections in terms of storage, compute and I/O when running Atlas2 Amazon on a large data set.ConclusionsWe find that providing a web interface and an optimized pipeline clearly facilitates usage of cloud computing for personal genome analysis, but for it to be routinely used for large scale projects there needs to be a paradigm shift in the way we develop tools, in standard operating procedures, and in funding mechanisms.


Methods of Molecular Biology | 2010

Bioinformatic tools for identifying disease gene and SNP candidates

Sean D. Mooney; Vidhya G. Krishnan; Uday S. Evani

As databases of genome data continue to grow, our understanding of the functional elements of the genome grows as well. Many genetic changes in the genome have now been discovered and characterized, including both disease-causing mutations and neutral polymorphisms. In addition to experimental approaches to characterize specific variants, over the past decade, there has been intense bioinformatic research to understand the molecular effects of these genetic changes. In addition to genomic experimental assays, the bioinformatic efforts have focused on two general areas. First, researchers have annotated genetic variation data with molecular features that are likely to affect function. Second, statistical methods have been developed to predict mutations that are likely to have a molecular effect. In this protocol manuscript, methods for understanding the molecular functions of single nucleotide polymorphisms (SNPs) and mutations are reviewed and described. The intent of this chapter is to provide an introduction to the online tools that are both easy to use and useful.


Molecular Biology of the Cell | 2012

Tor1 regulates protein solubility in Saccharomyces cerevisiae

Theodore W. Peters; Matthew J. Rardin; Gregg Czerwieniec; Uday S. Evani; Pedro Reis-Rodrigues; Gordon J. Lithgow; Sean D. Mooney; Bradford W. Gibson; Robert E. Hughes

The transition of proteins targeted for autophagic degradation from the soluble to the insoluble phase is regulated in an ATG1-independent mechanism by TORC1. This process is likely a critical mechanism for maintaining protein homeostasis when challenged with proteomic stress.


BMC Bioinformatics | 2013

STOP using just GO: a multi-ontology hypothesis generation tool for high throughput experimentation

Tobias Wittkop; Emily TerAvest; Uday S. Evani; K M Fleisch; Ari E. Berman; Corey Powell; Nigam H. Shah; Sean D. Mooney

BackgroundGene Ontology (GO) enrichment analysis remains one of the most common methods for hypothesis generation from high throughput datasets. However, we believe that researchers strive to test other hypotheses that fall outside of GO. Here, we developed and evaluated a tool for hypothesis generation from gene or protein lists using ontological concepts present in manually curated text that describes those genes and proteins.ResultsAs a consequence we have developed the method Statistical Tracking of Ontological Phrases (STOP) that expands the realm of testable hypotheses in gene set enrichment analyses by integrating automated annotations of genes to terms from over 200 biomedical ontologies. While not as precise as manually curated terms, we find that the additional enriched concepts have value when coupled with traditional enrichment analyses using curated terms.ConclusionMultiple ontologies have been developed for gene and protein annotation, by using a dataset of both manually curated GO terms and automatically recognized concepts from curated text we can expand the realm of hypotheses that can be discovered. The web application STOP is available at http://mooneygroup.org/stop/.


BMC Genomics | 2015

The distribution and mutagenesis of short coding INDELs from 1,128 whole exomes

Danny Challis; Lilian Antunes; Erik Garrison; Eric Banks; Uday S. Evani; Donna M. Muzny; Ryan Poplin; Richard A. Gibbs; Gabor T. Marth; Fuli Yu

BackgroundIdentifying insertion/deletion polymorphisms (INDELs) with high confidence has been intrinsically challenging in short-read sequencing data. Here we report our approach for improving INDEL calling accuracy by using a machine learning algorithm to combine call sets generated with three independent methods, and by leveraging the strengths of each individual pipeline. Utilizing this approach, we generated a consensus exome INDEL call set from a large dataset generated by the 1000 Genomes Project (1000G), maximizing both the sensitivity and the specificity of the calls.ResultsThis consensus exome INDEL call set features 7,210 INDELs, from 1,128 individuals across 13 populations included in the 1000 Genomes Phase 1 dataset, with a false discovery rate (FDR) of about 7.0%.ConclusionsIn our study we further characterize the patterns and distributions of these exonic INDELs with respect to density, allele length, and site frequency spectrum, as well as the potential mutagenic mechanisms of coding INDELs in humans.


international conference on bioinformatics | 2011

Enabling Atlas2 personal genome analysis on the cloud

Uday S. Evani; Danny Challis; Jin Yu; Andrew R. Jackson; Sameer Paithankar; Matthew N. Bainbridge; Cris tian Coarfa; Aleksandar Milosavljevic; Fuli Yu

Until recently, sequencing has primarily been carried out in large genome centers who also invested heavily in developing the computational infrastructure to enable post sequencing analysis. The recent advancements in sequencing technologies have lead to a wide dissemination of sequencing and we are now seeing many sequencing projects being undertaken in small laboratories. However, the limited accessibility to the computational infrastructure and high quality bioinformatic tools needed to enable analysis remains a serious road-block. The cloud computing and Software-as-a-Service (SaaS) technologies can help address this barrier. We deploy the Atlas2 Cloud Pipeline for personal genome analysis via the Genboree Workbench using software-as-a-service model. We report on a successful case study of personal genome analysis using this pipeline.


american medical informatics association annual symposium | 2010

An Ontology-Neutral Framework for Enrichment Analysis

Rob Tirrell; Uday S. Evani; Ari E. Berman; Sean D. Mooney; Mark A. Musen; Nigam H. Shah

Collaboration


Dive into the Uday S. Evani's collaboration.

Top Co-Authors

Avatar

Sean D. Mooney

University of Washington

View shared research outputs
Top Co-Authors

Avatar

Danny Challis

Baylor College of Medicine

View shared research outputs
Top Co-Authors

Avatar

Fuli Yu

Baylor College of Medicine

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Andrew R. Jackson

Baylor College of Medicine

View shared research outputs
Top Co-Authors

Avatar

Ari E. Berman

Buck Institute for Research on Aging

View shared research outputs
Top Co-Authors

Avatar

Bradford W. Gibson

Buck Institute for Research on Aging

View shared research outputs
Top Co-Authors

Avatar

Gordon J. Lithgow

Buck Institute for Research on Aging

View shared research outputs
Top Co-Authors

Avatar

Gregg Czerwieniec

Buck Institute for Research on Aging

View shared research outputs
Researchain Logo
Decentralizing Knowledge