Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Michal Twik is active.

Publication


Featured researches published by Michal Twik.


Current protocols in human genetics | 2016

The GeneCards Suite: From Gene Data Mining to Disease Genome Sequence Analyses.

Gil Stelzer; Naomi Rosen; Inbar Plaschkes; Shahar Zimmerman; Michal Twik; Simon Fishilevich; Tsippi Iny Stein; Ron Nudel; Iris Lieder; Yaron Mazor; Sergey Kaplan; Dvir Dahary; David Warshawsky; Yaron Guan-Golan; Asher Kohn; Noa Rappaport; Marilyn Safran; Doron Lancet

GeneCards, the human gene compendium, enables researchers to effectively navigate and inter‐relate the wide universe of human genes, diseases, variants, proteins, cells, and biological pathways. Our recently launched Version 4 has a revamped infrastructure facilitating faster data updates, better‐targeted data queries, and friendlier user experience. It also provides a stronger foundation for the GeneCards suite of companion databases and analysis tools. Improved data unification includes gene‐disease links via MalaCards and merged biological pathways via PathCards, as well as drug information and proteome expression. VarElect, another suite member, is a phenotype prioritizer for next‐generation sequencing, leveraging the GeneCards and MalaCards knowledgebase. It automatically infers direct and indirect scored associations between hundreds or even thousands of variant‐containing genes and disease phenotype terms. VarElects capabilities, either independently or within TGex, our comprehensive variant analysis pipeline, help prepare for the challenge of clinical projects that involve thousands of exome/genome NGS analyses.


Nucleic Acids Research | 2017

MalaCards: an amalgamated human disease compendium with diverse clinical and genetic annotation and structured search

Noa Rappaport; Michal Twik; Inbar Plaschkes; Ron Nudel; Tsippi Iny Stein; Jacob Levitt; Moran Gershoni; C. Paul Morrey; Marilyn Safran; Doron Lancet

The MalaCards human disease database (http://www.malacards.org/) is an integrated compendium of annotated diseases mined from 68 data sources. MalaCards has a web card for each of ∼20 000 disease entries, in six global categories. It portrays a broad array of annotation topics in 15 sections, including Summaries, Symptoms, Anatomical Context, Drugs, Genetic Tests, Variations and Publications. The Aliases and Classifications section reflects an algorithm for disease name integration across often-conflicting sources, providing effective annotation consolidation. A central feature is a balanced Genes section, with scores reflecting the strength of disease-gene associations. This is accompanied by other gene-related disease information such as pathways, mouse phenotypes and GO-terms, stemming from MalaCards’ affiliation with the GeneCards Suite of databases. MalaCards’ capacity to inter-link information from complementary sources, along with its elaborate search function, relational database infrastructure and convenient data dumps, allows it to tackle its rich disease annotation landscape, and facilitates systems analyses and genome sequence interpretation. MalaCards adopts a ‘flat’ disease-card approach, but each card is mapped to popular hierarchical ontologies (e.g. International Classification of Diseases, Human Phenotype Ontology and Unified Medical Language System) and also contains information about multi-level relations among diseases, thereby providing an optimal tool for disease representation and scrutiny.


Current protocols in human genetics | 2014

MalaCards: A Comprehensive Automatically‐Mined Database of Human Diseases

Noa Rappaport; Michal Twik; Noam Nativ; Gil Stelzer; Iris Bahir; Tsippi Iny Stein; Marilyn Safran; Doron Lancet

Systems medicine provides insights into mechanisms of human diseases, and expedites the development of better diagnostics and drugs. To facilitate such strategies, we initiated MalaCards, a compendium of human diseases and their annotations, integrating and often remodeling information from 64 data sources. MalaCards employs, among others, the proven automatic data‐mining strategies established in the construction of GeneCards, our widely used compendium of human genes. The development of MalaCards poses many algorithmic challenges, such as disease name unification, integrated classification, gene‐disease association, and disease‐targeted expression analysis. MalaCards displays a Web card for each of >19,000 human diseases, with 17 sections, including textual summaries, related diseases, related genes, genetic variations and tests, and relevant publications. Also included are a powerful search engine and a variety of categorized disease lists. This unit describes two basic protocols to search and browse MalaCards effectively. Curr. Protoc. Bioinform. 47:1.24.1‐1.24.19.


BMC Genomics | 2016

VarElect: the phenotype-based variation prioritizer of the GeneCards Suite

Gil Stelzer; Inbar Plaschkes; Danit Oz-Levi; Anna Alkelai; Tsviya Olender; Shahar Zimmerman; Michal Twik; Frida Belinky; Simon Fishilevich; Ron Nudel; Yaron Guan-Golan; David Warshawsky; Dvir Dahary; Asher Kohn; Yaron Mazor; Sergey Kaplan; Tsippi Iny Stein; Hagit N. Baris; Noa Rappaport; Marilyn Safran; Doron Lancet

BackgroundNext generation sequencing (NGS) provides a key technology for deciphering the genetic underpinnings of human diseases. Typical NGS analyses of a patient depict tens of thousands non-reference coding variants, but only one or very few are expected to be significant for the relevant disorder. In a filtering stage, one employs family segregation, rarity in the population, predicted protein impact and evolutionary conservation as a means for shortening the variation list. However, narrowing down further towards culprit disease genes usually entails laborious seeking of gene-phenotype relationships, consulting numerous separate databases. Thus, a major challenge is to transition from the few hundred shortlisted genes to the most viable disease-causing candidates.ResultsWe describe a novel tool, VarElect (http://ve.genecards.org), a comprehensive phenotype-dependent variant/gene prioritizer, based on the widely-used GeneCards, which helps rapidly identify causal mutations with extensive evidence. The GeneCards suite offers an effective and speedy alternative, whereby >120 gene-centric automatically-mined data sources are jointly available for the task. VarElect cashes on this wealth of information, as well as on GeneCards’ powerful free-text Boolean search and scoring capabilities, proficiently matching variant-containing genes to submitted disease/symptom keywords. The tool also leverages the rich disease and pathway information of MalaCards, the human disease database, and PathCards, the unified pathway (SuperPaths) database, both within the GeneCards Suite. The VarElect algorithm infers direct as well as indirect links between genes and phenotypes, the latter benefitting from GeneCards’ diverse gene-to-gene data links in GenesLikeMe. Finally, our tool offers an extensive gene-phenotype evidence portrayal (“MiniCards”) and hyperlinks to the parent databases.ConclusionsWe demonstrate that VarElect compares favorably with several often-used NGS phenotyping tools, thus providing a robust facility for ranking genes, pointing out their likelihood to be related to a patient’s disease. VarElect’s capacity to automatically process numerous NGS cases, either in stand-alone format or in VCF-analyzer mode (TGex and VarAnnot), is indispensable for emerging clinical projects that involve thousands of whole exome/genome NGS analyses.


Database | 2017

GeneHancer: genome-wide integration of enhancers and target genes in GeneCards

Simon Fishilevich; Ron Nudel; Noa Rappaport; Rotem Hadar; Inbar Plaschkes; Tsippi Iny Stein; Naomi Rosen; Asher Kohn; Michal Twik; Marilyn Safran; Doron Lancet; Dana Cohen

Abstract A major challenge in understanding gene regulation is the unequivocal identification of enhancer elements and uncovering their connections to genes. We present GeneHancer, a novel database of human enhancers and their inferred target genes, in the framework of GeneCards. First, we integrated a total of 434 000 reported enhancers from four different genome-wide databases: the Encyclopedia of DNA Elements (ENCODE), the Ensembl regulatory build, the functional annotation of the mammalian genome (FANTOM) project and the VISTA Enhancer Browser. Employing an integration algorithm that aims to remove redundancy, GeneHancer portrays 285 000 integrated candidate enhancers (covering 12.4% of the genome), 94 000 of which are derived from more than one source, and each assigned an annotation-derived confidence score. GeneHancer subsequently links enhancers to genes, using: tissue co-expression correlation between genes and enhancer RNAs, as well as enhancer-targeted transcription factor genes; expression quantitative trait loci for variants within enhancers; and capture Hi-C, a promoter-specific genome conformation assay. The individual scores based on each of these four methods, along with gene–enhancer genomic distances, form the basis for GeneHancer’s combinatorial likelihood-based scores for enhancer–gene pairing. Finally, we define ‘elite’ enhancer–gene relations reflecting both a high-likelihood enhancer definition and a strong enhancer–gene association. GeneHancer predictions are fully integrated in the widely used GeneCards Suite, whereby candidate enhancers and their annotations are displayed on every relevant GeneCard. This assists in the mapping of non-coding variants to enhancers, and via the linked genes, forms a basis for variant–phenotype interpretation of whole-genome sequences in health and disease. Database URL: http://www.genecards.org/


Biomedical Engineering Online | 2017

Rational confederation of genes and diseases: NGS interpretation via GeneCards, MalaCards and VarElect

Noa Rappaport; Simon Fishilevich; Ron Nudel; Michal Twik; Frida Belinky; Inbar Plaschkes; Tsippi Iny Stein; Dana Cohen; Danit Oz-Levi; Marilyn Safran; Doron Lancet

BackgroundA key challenge in the realm of human disease research is next generation sequencing (NGS) interpretation, whereby identified filtered variant-harboring genes are associated with a patient’s disease phenotypes. This necessitates bioinformatics tools linked to comprehensive knowledgebases. The GeneCards suite databases, which include GeneCards (human genes), MalaCards (human diseases) and PathCards (human pathways) together with additional tools, are presented with the focus on MalaCards utility for NGS interpretation as well as for large scale bioinformatic analyses.ResultsVarElect, our NGS interpretation tool, leverages the broad information in the GeneCards suite databases. MalaCards algorithms unify disease-related terms and annotations from 69 sources. Further, MalaCards defines hierarchical relatedness—aliases, disease families, a related diseases network, categories and ontological classifications. GeneCards and MalaCards delineate and share a multi-tiered, scored gene-disease network, with stringency levels, including the definition of elite status—high quality gene-disease pairs, coming from manually curated trustworthy sources, that includes 4500 genes for 8000 diseases. This unique resource is key to NGS interpretation by VarElect. VarElect, a comprehensive search tool that helps infer both direct and indirect links between genes and user-supplied disease/phenotype terms, is robustly strengthened by the information found in MalaCards. The indirect mode benefits from GeneCards’ diverse gene-to-gene relationships, including SuperPaths—integrated biological pathways from 12 information sources. We are currently adding an important information layer in the form of “disease SuperPaths”, generated from the gene-disease matrix by an algorithm similar to that previously employed for biological pathway unification. This allows the discovery of novel gene-disease and disease–disease relationships. The advent of whole genome sequencing necessitates capacities to go beyond protein coding genes. GeneCards is highly useful in this respect, as it also addresses 101,976 non-protein-coding RNA genes. In a more recent development, we are currently adding an inclusive map of regulatory elements and their inferred target genes, generated by integration from 4 resources.ConclusionsMalaCards provides a rich big-data scaffold for in silico biomedical discovery within the gene-disease universe. VarElect, which depends significantly on both GeneCards and MalaCards power, is a potent tool for supporting the interpretation of wet-lab experiments, notably NGS analyses of disease. The GeneCards suite has thus transcended its 2-decade role in biomedical research, maturing into a key player in clinical investigation.


international conference on bioinformatics | 2016

Integrated Identification of Disease-Gene Links and their Utility in Next-Generation Sequencing Interpretation

Noa Rappaport; Michal Twik; Ron Nudel; Inbar Plaschkes; Tsippi Iny Stein; Danit Oz-Levi; Simon Fishilevich; Marilyn Safran; Doron Lancet

The study of human diseases is at the core of present-day biological research. It is an interdisciplinary effort encompassing genomics, bioinformatics, systems biology, and systems medicine. Currently, many efforts are being made to elucidate the genetic underpinnings of human diseases. A consequence thereof is that many different sources use different nomenclatures, definitions, and classifications. Furthermore, the identification of gene-disease links, in addition to being challenging in its own right, is also affected by this lack of convention. We addressed both of these issues when creating MalaCards (www.malacards.org), an integrated and unified database of human diseases and their annotations, which capitalizes on information from the GeneCards database (www.genecards.org) [1-2]. GeneCards has annotations relevant to various characteristics of genes, which can be used as a discovery platform for identifying gene-disease links [3-4]. At the heart of MalaCards is a consolidated gene-disease matrix based on nine sources, some manually curated and others text-mined. A scoring algorithm prioritizes the list of disease-associated genes based on the strength of the evidence from each source. Figure 1 shows the frequencies of gene-disease links across the GeneCards gene categories. The gene-disease matrix can be used in the interpretation of Next Generation Sequencing (NGS) data, whereby identified filtered variant-harboring genes are associated with a patients disease keywords. VarElect (varelect.genecards.org) [5], the GeneCards suites NGS interpretation tool, leverages MalaCards and GeneCards to infer direct and/or indirect keyword-gene links. Our tools can thus facilitate biomedical research of both basic-scientific and clinical relevance.


Database | 2016

ORDB, HORDE, ODORactor and other on-line knowledge resources of olfactory receptor-odorant interactions

Luis N. Marenco; Rixin Wang; Robert A. McDougal; Tsviya Olender; Michal Twik; Elspeth A. Bruford; Xinyi Liu; Jian Zhang; Doron Lancet; Gordon M. Shepherd; Chiquito J. Crasto

We present here an exploration of the evolution of three well-established, web-based resources dedicated to the dissemination of information related to olfactory receptors (ORs) and their functional ligands, odorants. These resources are: the Olfactory Receptor Database (ORDB), the Human Olfactory Data Explorer (HORDE) and ODORactor. ORDB is a repository of genomic and proteomic information related to ORs and other chemosensory receptors, such as taste and pheromone receptors. Three companion databases closely integrated with ORDB are OdorDB, ORModelDB and OdorMapDB; these resources are part of the SenseLab suite of databases (http://senselab.med.yale.edu). HORDE (http://genome.weizmann.ac.il/horde/) is a semi-automatically populated database of the OR repertoires of human and several mammals. ODORactor (http://mdl.shsmu.edu.cn/ODORactor/) provides information related to OR-odorant interactions from the perspective of the odorant. All three resources are connected to each other via web-links. Database URL: http://senselab.med.yale.edu; http://genome.weizmann.ac.il/horde/; http://mdl.shsmu.edu.cn/ODORactor/


Current protocols in human genetics | 2014

MalaCards: A Comprehensive Automatically-Mined Database of Human Diseases: MalaCards: Comprehensive Database of Human Diseases

Noa Rappaport; Michal Twik; Noam Nativ; Gil Stelzer; Iris Bahir; Tsippi Iny Stein; Marilyn Safran; Doron Lancet

Systems medicine provides insights into mechanisms of human diseases, and expedites the development of better diagnostics and drugs. To facilitate such strategies, we initiated MalaCards, a compendium of human diseases and their annotations, integrating and often remodeling information from 64 data sources. MalaCards employs, among others, the proven automatic data‐mining strategies established in the construction of GeneCards, our widely used compendium of human genes. The development of MalaCards poses many algorithmic challenges, such as disease name unification, integrated classification, gene‐disease association, and disease‐targeted expression analysis. MalaCards displays a Web card for each of >19,000 human diseases, with 17 sections, including textual summaries, related diseases, related genes, genetic variations and tests, and relevant publications. Also included are a powerful search engine and a variety of categorized disease lists. This unit describes two basic protocols to search and browse MalaCards effectively. Curr. Protoc. Bioinform. 47:1.24.1‐1.24.19.


Current protocols in human genetics | 2014

UNIT 1.24 MalaCards: A Comprehensive Automatically-Mined Database of Human Diseases

Noa Rappaport; Michal Twik; Noam Nativ; Gil Stelzer; Iris Bahir; Tsippi Iny Stein; Marilyn Safran; Doron Lancet

Systems medicine provides insights into mechanisms of human diseases, and expedites the development of better diagnostics and drugs. To facilitate such strategies, we initiated MalaCards, a compendium of human diseases and their annotations, integrating and often remodeling information from 64 data sources. MalaCards employs, among others, the proven automatic data‐mining strategies established in the construction of GeneCards, our widely used compendium of human genes. The development of MalaCards poses many algorithmic challenges, such as disease name unification, integrated classification, gene‐disease association, and disease‐targeted expression analysis. MalaCards displays a Web card for each of >19,000 human diseases, with 17 sections, including textual summaries, related diseases, related genes, genetic variations and tests, and relevant publications. Also included are a powerful search engine and a variety of categorized disease lists. This unit describes two basic protocols to search and browse MalaCards effectively. Curr. Protoc. Bioinform. 47:1.24.1‐1.24.19.

Collaboration


Dive into the Michal Twik's collaboration.

Top Co-Authors

Avatar

Doron Lancet

Weizmann Institute of Science

View shared research outputs
Top Co-Authors

Avatar

Marilyn Safran

Weizmann Institute of Science

View shared research outputs
Top Co-Authors

Avatar

Noa Rappaport

Weizmann Institute of Science

View shared research outputs
Top Co-Authors

Avatar

Tsippi Iny Stein

Weizmann Institute of Science

View shared research outputs
Top Co-Authors

Avatar

Gil Stelzer

Weizmann Institute of Science

View shared research outputs
Top Co-Authors

Avatar

Inbar Plaschkes

Weizmann Institute of Science

View shared research outputs
Top Co-Authors

Avatar

Ron Nudel

Weizmann Institute of Science

View shared research outputs
Top Co-Authors

Avatar

Simon Fishilevich

Weizmann Institute of Science

View shared research outputs
Top Co-Authors

Avatar

Iris Bahir

Weizmann Institute of Science

View shared research outputs
Top Co-Authors

Avatar

Noam Nativ

Weizmann Institute of Science

View shared research outputs
Researchain Logo
Decentralizing Knowledge