John Watkinson
Columbia University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by John Watkinson.
Bioinformatics | 2010
Aris Floratos; Kenneth Smith; Zhou Ji; John Watkinson
SUMMARY geWorkbench (genomics Workbench) is an open source Java desktop application that provides access to an integrated suite of tools for the analysis and visualization of data from a wide range of genomics domains (gene expression, sequence, protein structure and systems biology). More than 70 distinct plug-in modules are currently available implementing both classical analyses (several variants of clustering, classification, homology detection, etc.) as well as state of the art algorithms for the reverse engineering of regulatory networks and for protein structure prediction, among many others. geWorkbench leverages standards-based middleware technologies to provide seamless access to remote data, annotation and computational servers, thus, enabling researchers with limited local resources to benefit from available public infrastructure. AVAILABILITY The project site (http://www.geworkbench.org) includes links to self-extracting installers for most operating system (OS) platforms as well as instructions for building the application from scratch using the source code [which is freely available from the projects SVN (subversion) repository]. geWorkbench support is available through the end-user and developer forums of the caBIG Molecular Analysis Tools Knowledge Center, https://cabig-kc.nci.nih.gov/Molecular/forums/
Annals of the New York Academy of Sciences | 2009
John Watkinson; Kuo-ching Liang; Xiadong Wang; Tian Zheng; Dimitris Anastassiou
This paper describes the technique designated best performer in the 2nd conference on Dialogue for Reverse Engineering Assessments and Methods (DREAM2) Challenge 5 (unsigned genome‐scale network prediction from blinded microarray data). Existing algorithms use the pairwise correlations of the expression levels of genes, which provide valuable but insufficient information for the inference of regulatory interactions. Here we present a computational approach based on the recently developed context likelihood of related (CLR) algorithm, extracting additional complementary information using the information theoretic measure of synergy and assigning a score to each ordered pair of genes measuring the degree of confidence that the first gene regulates the second. When tested on a set of publicly available Escherichia coli gene‐expression data with known assumed ground truth, the synergy augmented CLR (SA‐CLR) algorithm had significantly improved prediction performance when compared to CLR. There is also enhanced potential for biological discovery as a result of the identification of the most likely synergistic partner genes involved in the interactions.
Journal of Computational Biology | 2011
Hoon Kim; John Watkinson; Dimitris Anastassiou
Analysis of large gene expression data sets in the presence and absence of a phenotype can lead to the selection of a group of genes serving as biomarkers jointly predicting the phenotype. Among gene selection methods, filter methods derived from ranked individual genes have been widely used in existing products for diagnosis and prognosis. Univariate filter approaches selecting genes individually, although computationally efficient, often ignore gene interactions inherent in the biological data. On the other hand, multivariate approaches selecting gene subsets are known to have a higher risk of selecting spurious gene subsets due to the overfitting of the vast number of gene subsets evaluated. Here we propose a framework of statistical significance tests for multivariate feature selection that can reduce the risk of selecting spurious gene subsets. Using three existing data sets, we show that our proposed approach is an essential step to identify such a gene set that is generated by a significant interaction of its members, even improving classification performance when compared to established approaches. This technique can be applied for the discovery of robust biomarkers for medical diagnosis.
Bioinformatics | 2009
John Watkinson; Dimitris Anastassiou
Summary:We present a visualization tool applied on genome-wide association data, revealing disease-associated haplotypes, epistatically interacting loci, as well as providing visual signatures of multivariate correlations of genetic markers with respect to a phenotype. Availability:Freely available on the web at: http://www.ee.columbia.edu/~anastas/sdplots Contact:[email protected] Supplementary information: Supplementary data are available at Bioinformatics online.
BMC Medical Genomics | 2010
Hoon Kim; John Watkinson; Vinay Varadan; Dimitris Anastassiou
BMC Systems Biology | 2008
John Watkinson; Xiaodong Wang; Tian Zheng; Dimitris Anastassiou
Human Genetics | 2011
Yee Him Cheung; John Watkinson; Dimitris Anastassiou
Archive | 2011
Dimitris Anastassiou; John Watkinson; Hoon Kim
BMC Genetics | 2010
Alexandros Iliadis; John Watkinson; Dimitris Anastassiou; Xiaodong Wang
Archive | 2008
Dimitris Anastassiou; John Watkinson