Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Matthew E. Holford is active.

Publication


Featured researches published by Matthew E. Holford.


Nature Genetics | 2009

A transcriptome atlas of rice cell types uncovers cellular, functional and developmental hierarchies

Yuling Jiao; S. Lori Tausta; Neeru Gandotra; Ning Sun; Tie Liu; Nicole K. Clay; Teresa Ceserani; Meiqin Chen; Ligeng Ma; Matthew E. Holford; Hui-yong Zhang; Hongyu Zhao; Xing Wang Deng; Timothy Nelson

The functions of the plant body rely on interactions among distinct and nonequivalent cell types. The comparison of transcriptomes from different cell types should expose the transcriptional networks that underlie cellular attributes and contributions. Using laser microdissection and microarray profiling, we have produced a cell type transcriptome atlas that includes 40 cell types from rice (Oryza sativa) shoot, root and germinating seed at several developmental stages, providing patterns of cell specificity for individual genes and gene classes. Cell type comparisons uncovered previously unrecognized properties, including cell-specific promoter motifs and coexpressed cognate binding factor candidates, interaction partner candidates and hormone response centers. We inferred developmental regulatory hierarchies of gene expression in specific cell types by comparison of several stages within root, shoot and embryo.


Bioinformatics | 2006

Pathway analysis using random forests classification and regression

Herbert Pang; Aiping Lin; Matthew E. Holford; Bradley E. Enerson; Bin Lu; Michael P. Lawton; Eugenia Floyd; Hongyu Zhao

MOTIVATION Although numerous methods have been developed to better capture biological information from microarray data, commonly used single gene-based methods neglect interactions among genes and leave room for other novel approaches. For example, most classification and regression methods for microarray data are based on the whole set of genes and have not made use of pathway information. Pathway-based analysis in microarray studies may lead to more informative and relevant knowledge for biological researchers. RESULTS In this paper, we describe a pathway-based classification and regression method using Random Forests to analyze gene expression data. The proposed methods allow researchers to rank important pathways from externally available databases, discover important genes, find pathway-based outlying cases and make full use of a continuous outcome variable in the regression setting. We also compared Random Forests with other machine learning methods using several datasets and found that Random Forests classification error rates were either the lowest or the second-lowest. By combining pathway information and novel statistical methods, this procedure represents a promising computational strategy in dissecting pathways and can provide biological insight into the study of microarray data. AVAILABILITY Source code written in R is available from http://bioinformatics.med.yale.edu/pathway-analysis/rf.htm.


BMC Bioinformatics | 2003

PathMAPA: a tool for displaying gene expression and performing statistical tests on metabolic pathways at multiple levels for Arabidopsis

Deyun Pan; Ning Sun; Kei-Hoi Cheung; Zhong Guan; Ligeng Ma; Matthew E. Holford; Xing Wang Deng; Hongyu Zhao

BackgroundTo date, many genomic and pathway-related tools and databases have been developed to analyze microarray data. In published web-based applications to date, however, complex pathways have been displayed with static image files that may not be up-to-date or are time-consuming to rebuild. In addition, gene expression analyses focus on individual probes and genes with little or no consideration of pathways. These approaches reveal little information about pathways that are key to a full understanding of the building blocks of biological systems. Therefore, there is a need to provide useful tools that can generate pathways without manually building images and allow gene expression data to be integrated and analyzed at pathway levels for such experimental organisms as Arabidopsis.ResultsWe have developed PathMAPA, a web-based application written in Java that can be easily accessed over the Internet. An Oracle database is used to store, query, and manipulate the large amounts of data that are involved. PathMAPA allows its users to (i) upload and populate microarray data into a database; (ii) integrate gene expression with enzymes of the pathways; (iii) generate pathway diagrams without building image files manually; (iv) visualize gene expressions for each pathway at enzyme, locus, and probe levels; and (v) perform statistical tests at pathway, enzyme and gene levels. PathMAPA can be used to examine Arabidopsis thaliana gene expression patterns associated with metabolic pathways.ConclusionPathMAPA provides two unique features for the gene expression analysis of Arabidopsis thaliana: (i) automatic generation of pathways associated with gene expression and (ii) statistical tests at pathway level. The first feature allows for the periodical updating of genomic data for pathways, while the second feature can provide insight into how treatments affect relevant pathways for the selected experiment(s).


Bioinformatics | 2005

VitaPad: visualization tools for the analysis of pathway data

Matthew E. Holford; Naixin Li; Prakash M. Nadkarni; Hongyu Zhao

MOTIVATION Packages that support the creation of pathway diagrams are limited by their inability to be readily extended to new classes of pathway-related data. RESULTS VitaPad is a cross-platform application that enables users to create and modify biological pathway diagrams and incorporate microarray data with them. It improves on existing software in the following areas: (i) It can create diagrams dynamically through graph layout algorithms. (ii) It is open-source and uses an open XML format to store data, allowing for easy extension or integration with other tools. (iii) It features a cutting-edge user interface with intuitive controls, high-resolution graphics and fully customizable appearance. AVAILABILITY http://bioinformatics.med.yale.edu CONTACTS [email protected]; [email protected].


Bioinformatics | 2010

Using semantic web rules to reason on an ontology of pseudogenes

Matthew E. Holford; Ekta Khurana; Kei-Hoi Cheung; Mark Gerstein

Motivation: Recent years have seen the development of a wide range of biomedical ontologies. Notable among these is Sequence Ontology (SO) which offers a rich hierarchy of terms and relationships that can be used to annotate genomic data. Well-designed formal ontologies allow data to be reasoned upon in a consistent and logically sound way and can lead to the discovery of new relationships. The Semantic Web Rules Language (SWRL) augments the capabilities of a reasoner by allowing the creation of conditional rules. To date, however, formal reasoning, especially the use of SWRL rules, has not been widely used in biomedicine. Results: We have built a knowledge base of human pseudogenes, extending the existing SO framework to incorporate additional attributes. In particular, we have defined the relationships between pseudogenes and segmental duplications. We then created a series of logical rules using SWRL to answer research questions and to annotate our pseudogenes appropriately. Finally, we were left with a knowledge base which could be queried to discover information about human pseudogene evolution. Availability: The fully populated knowledge base described in this document is available for download from http://ontology.pseudogene.org. A SPARQL endpoint from which to query the dataset is also available at this location. Contact: [email protected]; [email protected]


BMC Bioinformatics | 2011

A semantic web framework to integrate cancer omics data with biological knowledge

Matthew E. Holford; James P. McCusker; Kei-Hoi Cheung; Michael Krauthammer

BackgroundThe RDF triple provides a simple linguistic means of describing limitless types of information. Triples can be flexibly combined into a unified data source we call a semantic model. Semantic models open new possibilities for the integration of variegated biological data. We use Semantic Web technology to explicate high throughput clinical data in the context of fundamental biological knowledge. We have extended Corvus, a data warehouse which provides a uniform interface to various forms of Omics data, by providing a SPARQL endpoint. With the querying and reasoning tools made possible by the Semantic Web, we were able to explore quantitative semantic models retrieved from Corvus in the light of systematic biological knowledge.ResultsFor this paper, we merged semantic models containing genomic, transcriptomic and epigenomic data from melanoma samples with two semantic models of functional data - one containing Gene Ontology (GO) data, the other, regulatory networks constructed from transcription factor binding information. These two semantic models were created in an ad hoc manner but support a common interface for integration with the quantitative semantic models. Such combined semantic models allow us to pose significant translational medicine questions. Here, we study the interplay between a cells molecular state and its response to anti-cancer therapy by exploring the resistance of cancer cells to Decitabine, a demethylating agent.ConclusionsWe were able to generate a testable hypothesis to explain how Decitabine fights cancer - namely, that it targets apoptosis-related gene promoters predominantly in Decitabine-sensitive cell lines, thus conveying its cytotoxic effect by activating the apoptosis pathway. Our research provides a framework whereby similar hypotheses can be developed easily.


Briefings in Bioinformatics | 2009

Approaches to neuroscience data integration

Kei-Hoi Cheung; Ernest Lim; Matthias Samwald; Huajun Chen; Luis N. Marenco; Matthew E. Holford; Thomas M. Morse; Pradeep G. Mutalik; Gordon M. Shepherd; Perry L. Miller

As the number of neuroscience databases increases, the need for neuroscience data integration grows. This paper reviews and compares several approaches, including the Neuroscience Database Gateway (NDG), Neuroscience Information Framework (NIF) and Entrez Neuron, which enable neuroscience database annotation and integration. These approaches cover a range of activities spanning from registry, discovery and integration of a wide variety of neuroscience data sources. They also provide different user interfaces for browsing, querying and displaying query results. In Entrez Neuron, for example, four different facets or tree views (neuron, neuronal property, gene and drug) are used to hierarchically organize concepts that can be used for querying a collection of ontologies. The facets are also used to define the structure of the query results.


Cancer Informatics | 2009

Semantic Web-Based Integration of Cancer Pathways and Allele Frequency Data

Matthew E. Holford; Haseena Rajeevan; Hongyu Zhao; Kenneth K. Kidd; Kei-Hoi Cheung

We demonstrate the use of Semantic Web technology to integrate the ALFRED allele frequency database and the Starpath pathway resource. The linking of population-specific genotype data with cancer-related pathway data is potentially useful given the growing interest in personalized medicine and the exploitation of pathway knowledge for cancer drug discovery. We model our data using the Web Ontology Language (OWL), drawing upon ideas from existing standard formats BioPAX for pathway data and PML for allele frequency data. We store our data within an Oracle database, using Oracle Semantic Technologies. We then query the data using Oracles rule-based inference engine and SPARQL-like RDF query language. The ability to perform queries across the domains of population genetics and pathways offers the potential to answer a number of cancer-related research questions. Among the possibilities is the ability to identify genetic variants which are associated with cancer pathways and whose frequency varies significantly between ethnic groups. This sort of information could be useful for designing clinical studies and for providing background data in personalized medicine. It could also assist with the interpretation of genetic analysis results such as those from genome-wide association studies.


Bioinformatics | 2015

Mutadelic: mutation analysis using description logic inferencing capabilities

Matthew E. Holford; Michael Krauthammer

MOTIVATION As next generation sequencing gains a foothold in clinical genetics, there is a need for annotation tools to characterize increasing amounts of patient variant data for identifying clinically relevant mutations. While existing informatics tools provide efficient bulk variant annotations, they often generate excess information that may limit their scalability. RESULTS We propose an alternative solution based on description logic inferencing to generate workflows that produce only those annotations that will contribute to the interpretation of each variant. Workflows are dynamically generated using a novel abductive reasoning framework called a basic framework for abductive workflow generation (AbFab). Criteria for identifying disease-causing variants in Mendelian blood disorders were identified and implemented as AbFab services. A web application was built allowing users to run workflows generated from the criteria to analyze genomic variants. Significant variants are flagged and explanations provided for why they match or fail to match the criteria. AVAILABILITY AND IMPLEMENTATION The Mutadelic web application is available for use at http://krauthammerlab.med.yale.edu/mutadelic. CONTACT [email protected]. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.


Nature Precedings | 2010

Analysis of Cancer Omics Data In A Semantic Web Framework

Matthew E. Holford; James P. McCusker; Kei-Hoi Cheung; Michael Krauthammer

Collaboration


Dive into the Matthew E. Holford's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

James P. McCusker

Rensselaer Polytechnic Institute

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Matthias Samwald

Medical University of Vienna

View shared research outputs
Researchain Logo
Decentralizing Knowledge