Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Misha Kapushesky is active.

Publication


Featured researches published by Misha Kapushesky.


Nucleic Acids Research | 2003

ArrayExpress—a public repository for microarray gene expression data at the EBI

Helen Parkinson; Ugis Sarkans; Mohammadreza Shojatalab; Niran Abeygunawardena; Sergio Contrino; Richard M. R. Coulson; Anna Farne; Gonzalo Garcia Lara; Ele Holloway; Misha Kapushesky; P. Lilja; Gaurab Mukherjee; Ahmet Oezcimen; Tim F. Rayner; Philippe Rocca-Serra; Anjan Sharma; Susanna-Assunta Sansone; Alvis Brazma

ArrayExpress is a public repository for microarray data that supports the MIAME (Minimum Informa-tion About a Microarray Experiment) requirements and stores well-annotated raw and normalized data. As of November 2004, ArrayExpress contains data from ∼12 000 hybridizations covering 35 species. Data can be submitted online or directly from local databases or LIMS in a standard format, and password-protected access to prepublication data is provided for reviewers and authors. The data can be retrieved by accession number or queried by vari-ous parameters such as species, author and array platform. A facility to query experiments by gene and sample properties is provided for a growing subset of curated data that is loaded in to the ArrayExpress data warehouse. Data can be visualized and analysed using Expression Profiler, the integrated data analysis tool. ArrayExpress is available at http://www.ebi.ac.uk/arrayexpress.


Nucleic Acids Research | 2007

ArrayExpress—a public database of microarray experiments and gene expression profiles

Helen Parkinson; Misha Kapushesky; Mohammadreza Shojatalab; Niran Abeygunawardena; Richard M. R. Coulson; Anna Farne; Ele Holloway; Nikolay Kolesnykov; P. Lilja; Margus Lukk; Roby Mani; Tim F. Rayner; Anjan Sharma; E. William; Ugis Sarkans; Alvis Brazma

ArrayExpress is a public database for high throughput functional genomics data. ArrayExpress consists of two parts—the ArrayExpress Repository, which is a MIAME supportive public archive of microarray data, and the ArrayExpress Data Warehouse, which is a database of gene expression profiles selected from the repository and consistently re-annotated. Archived experiments can be queried by experiment attributes, such as keywords, species, array platform, authors, journals or accession numbers. Gene expression profiles can be queried by gene names and properties, such as Gene Ontology terms and gene expression profiles can be visualized. ArrayExpress is a rapidly growing database, currently it contains data from >50 000 hybridizations and >1 500 000 individual expression profiles. ArrayExpress supports community standards, including MIAME, MAGE-ML and more recently the proposal for a spreadsheet based data exchange format: MAGE-TAB. Availability: .


Nucleic Acids Research | 2009

ArrayExpress update—from an archive of functional genomics experiments to the atlas of gene expression

Helen E. Parkinson; Misha Kapushesky; Nikolay Kolesnikov; Gabriella Rustici; Mohammadreza Shojatalab; Niran Abeygunawardena; Hugo Bérubé; Miroslaw Dylag; Ibrahim Emam; Anna Farne; Ele Holloway; Margus Lukk; James P. Malone; Roby Mani; Ekaterina Pilicheva; Tim F. Rayner; Faisal Ibne Rezwan; Anjan Sharma; Eleanor Williams; Xiangqun Zheng Bradley; Tomasz Adamusiak; Marco Brandizi; Tony Burdett; Richard M. R. Coulson; Maria Krestyaninova; Pavel Kurnosov; Eamonn Maguire; Sudeshna Guha Neogi; Philippe Rocca-Serra; Susanna-Assunta Sansone

ArrayExpress http://www.ebi.ac.uk/arrayexpress consists of three components: the ArrayExpress Repository—a public archive of functional genomics experiments and supporting data, the ArrayExpress Warehouse—a database of gene expression profiles and other bio-measurements and the ArrayExpress Atlas—a new summary database and meta-analytical tool of ranked gene expression across multiple experiments and different biological conditions. The Repository contains data from over 6000 experiments comprising approximately 200 000 assays, and the database doubles in size every 15 months. The majority of the data are array based, but other data types are included, most recently—ultra high-throughput sequencing transcriptomics and epigenetic data. The Warehouse and Atlas allow users to query for differentially expressed genes by gene names and properties, experimental conditions and sample properties, or a combination of both. In this update, we describe the ArrayExpress developments over the last two years.


Nature Biotechnology | 2010

A global map of human gene expression

Margus Lukk; Misha Kapushesky; Janne Nikkilä; Helen Parkinson; Angela Goncalves; Wolfgang Huber; Esko Ukkonen; Alvis Brazma

To the Editor Although there is only one human genome sequence, different genes are expressed in many different cell types and tissues, as well as in different developmental stages or diseases. The structure of this ‘expression space’ is still largely unknown, as most transcriptomics experiments focus on sampling small regions. We have constructed a global gene expression map by integrating microarray data from 5,372 human samples representing 369 different cell and tissue types, disease states and cell lines. These have been compiled in an online resource (http://www.ebi.ac.uk/gxa/array/U133A) that allows the user to search for a gene of interest and find the conditions in which it is over- or underexpressed, or, conversely, to find which genes are over- or underexpressed in a particular condition. An analysis of the structure of the expression space reveals that it can be described by a small number of distinct expression profile classes and that the first three principal components of this space have biological interpretations. The hematopoietic system, solid tissues and incompletely differentiated cell types are arranged on the first principal axis; cell lines, neoplastic samples and nonneoplastic primary tissue–derived samples are on the second principal axis; and nervous system is separated from the rest of the samples on the third axis. We also show below that most cell lines cluster together rather than with their tissues of origin.


Bioinformatics | 2010

Modeling sample variables with an Experimental Factor Ontology

James Malone; Ele Holloway; Tomasz Adamusiak; Misha Kapushesky; Jie Zheng; Nikolay Kolesnikov; Anna Zhukova; Alvis Brazma; Helen Parkinson

Motivation: Describing biological sample variables with ontologies is complex due to the cross-domain nature of experiments. Ontologies provide annotation solutions; however, for cross-domain investigations, multiple ontologies are needed to represent the data. These are subject to rapid change, are often not interoperable and present complexities that are a barrier to biological resource users. Results: We present the Experimental Factor Ontology, designed to meet cross-domain, application focused use cases for gene expression data. We describe our methodology and open source tools used to create the ontology. These include tools for creating ontology mappings, ontology views, detecting ontology changes and using ontologies in interfaces to enhance querying. The application of reference ontologies to data is a key problem, and this work presents guidelines on how community ontologies can be presented in an application ontology in a data-driven way. Availability: http://www.ebi.ac.uk/efo Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.


Nucleic Acids Research | 2010

Gene Expression Atlas at the European Bioinformatics Institute

Misha Kapushesky; Ibrahim Emam; Ele Holloway; Pavel Kurnosov; Andrey Zorin; James Malone; Gabriella Rustici; Eleanor Williams; Helen Parkinson; Alvis Brazma

The Gene Expression Atlas (http://www.ebi.ac.uk/gxa) is an added-value database providing information about gene expression in different cell types, organism parts, developmental stages, disease states, sample treatments and other biological/experimental conditions. The content of this database derives from curation, re-annotation and statistical analysis of selected data from the ArrayExpress Archive of Functional Genomics Data. A simple interface allows the user to query for differential gene expression either (i) by gene names or attributes such as Gene Ontology terms, or (ii) by biological conditions, e.g. diseases, organism parts or cell types. The gene queries return the conditions where expression has been reported, while condition queries return which genes are reported to be expressed in these conditions. A combination of both query types is possible. The query results are ranked using various statistical measures and by how many independent studies in the database show the particular gene-condition association. Currently, the database contains information about more than 200 000 genes from nine species and almost 4500 biological conditions studied in over 30 000 assays from over 1000 independent studies.


Nucleic Acids Research | 2012

Gene Expression Atlas update—a value-added database of microarray and sequencing-based functional genomics experiments

Misha Kapushesky; Tomasz Adamusiak; Tony Burdett; Aedín C. Culhane; Anna Farne; Alexey Filippov; Ele Holloway; Andrey Klebanov; Nataliya Kryvych; Natalja Kurbatova; Pavel Kurnosov; James P. Malone; Olga Melnichuk; Robert Petryszak; Nikolay Pultsin; Gabriella Rustici; Andrew Tikhonov; Ravensara S. Travillian; Eleanor Williams; Andrey Zorin; Helen E. Parkinson; Alvis Brazma

Gene Expression Atlas (http://www.ebi.ac.uk/gxa) is an added-value database providing information about gene expression in different cell types, organism parts, developmental stages, disease states, sample treatments and other biological/experimental conditions. The content of this database derives from curation, re-annotation and statistical analysis of selected data from the ArrayExpress Archive and the European Nucleotide Archive. A simple interface allows the user to query for differential gene expression either by gene names or attributes or by biological conditions, e.g. diseases, organism parts or cell types. Since our previous report we made 20 monthly releases and, as of Release 11.08 (August 2011), the database supports 19 species, which contains expression data measured for 19 014 biological conditions in 136 551 assays from 5598 independent studies.


Nucleic Acids Research | 2004

Expression Profiler: next generation—an online platform for analysis of microarray data

Misha Kapushesky; Patrick Kemmeren; Aedín C. Culhane; Steffen Durinck; Jan Ihmels; Christine Körner; Meelis Kull; Aurora Torrente; Ugis Sarkans; Jaak Vilo; Alvis Brazma

Expression Profiler (EP, http://www.ebi.ac.uk/expressionprofiler) is a web-based platform for microarray gene expression and other functional genomics-related data analysis. The new architecture, Expression Profiler: next generation (EP:NG), modularizes the original design and allows individual analysis-task-related components to be developed by different groups and yet still seamlessly to work together and share the same user interface look and feel. Data analysis components for gene expression data preprocessing, missing value imputation, filtering, clustering methods, visualization, significant gene finding, between group analysis and other statistical components are available from the EBI (European Bioinformatics Institute) web site. The web-based design of Expression Profiler supports data sharing and collaborative analysis in a secure environment. Developed tools are integrated with the microarray gene expression database ArrayExpress and form the exploratory analytical front-end to those data. EP:NG is an open-source project, encouraging broad distribution and further extensions from the scientific community.


Nucleic Acids Research | 2012

GeneSigDB: a manually curated database and resource for analysis of gene expression signatures

Aedín C. Culhane; Markus S. Schröder; Razvan Sultana; Shaita C. Picard; Enzo N. Martinelli; Caroline Kelly; Benjamin Haibe-Kains; Misha Kapushesky; Anne-Alyssa St Pierre; William Flahive; Kermshlise C. Picard; Daniel Gusenleitner; Gerald Papenhausen; Niall O'Connor; Mick Correll; John Quackenbush

GeneSigDB (http://www.genesigdb.org or http://compbio.dfci.harvard.edu/genesigdb/) is a database of gene signatures that have been extracted and manually curated from the published literature. It provides a standardized resource of published prognostic, diagnostic and other gene signatures of cancer and related disease to the community so they can compare the predictive power of gene signatures or use these in gene set enrichment analysis. Since GeneSigDB release 1.0, we have expanded from 575 to 3515 gene signatures, which were collected and transcribed from 1604 published articles largely focused on gene expression in cancer, stem cells, immune cells, development and lung disease. We have made substantial upgrades to the GeneSigDB website to improve accessibility and usability, including adding a tag cloud browse function, facetted navigation and a ‘basket’ feature to store genes or gene signatures of interest. Users can analyze GeneSigDB gene signatures, or upload their own gene list, to identify gene signatures with significant gene overlap and results can be viewed on a dynamic editable heatmap that can be downloaded as a publication quality image. All data in GeneSigDB can be downloaded in numerous formats including .gmt file format for gene set enrichment analysis or as a R/Bioconductor data file. GeneSigDB is available from http://www.genesigdb.org.


Proceedings of the National Academy of Sciences of the United States of America | 2013

Reversion of a fungal genetic code alteration links proteome instability with genomic and phenotypic diversification

Ana R. Bezerra; João Simões; Wanseon Lee; Johan Rung; Tobias Weil; Ivo Gut; Marta Gut; Mònica Bayés; Lisa Rizzetto; Duccio Cavalieri; Gloria Giovannini; Silvia Bozza; Luigina Romani; Misha Kapushesky; Gabriela R. Moura; Manuel A. S. Santos

Many fungi restructured their proteomes through incorporation of serine (Ser) at thousands of protein sites coded by the leucine (Leu) CUG codon. How these fungi survived this potentially lethal genetic code alteration and its relevance for their biology are not understood. Interestingly, the human pathogen Candida albicans maintains variable Ser and Leu incorporation levels at CUG sites, suggesting that this atypical codon assignment flexibility provided an effective mechanism to alter the genetic code. To test this hypothesis, we have engineered C. albicans strains to misincorporate increasing levels of Leu at protein CUG sites. Tolerance to the misincorporations was very high, and one strain accommodated the complete reversion of CUG identity from Ser back to Leu. Increasing levels of Leu misincorporation decreased growth rate, but production of phenotypic diversity on a phenotypic array probing various metabolic networks, drug resistance, and host immune cell responses was impressive. Genome resequencing revealed an increasing number of genotype changes at polymorphic sites compared with the control strain, and 80% of Leu misincorporation resulted in complete loss of heterozygosity in a large region of chromosome V. The data unveil unanticipated links between gene translational fidelity, proteome instability and variability, genome diversification, and adaptive phenotypic diversity. They also explain the high heterozygosity of the C. albicans genome and open the door to produce microorganisms with genetic code alterations for basic and applied research.

Collaboration


Dive into the Misha Kapushesky's collaboration.

Top Co-Authors

Avatar

Alvis Brazma

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar

Ele Holloway

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar

Helen Parkinson

Swiss Institute of Bioinformatics

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Ugis Sarkans

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar

Niran Abeygunawardena

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar

Anna Farne

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar

Mohammadreza Shojatalab

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Anjan Sharma

European Bioinformatics Institute

View shared research outputs
Researchain Logo
Decentralizing Knowledge