Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Charles A. Phillips is active.

Publication


Featured researches published by Charles A. Phillips.


PLOS ONE | 2012

Genetic Dissection of Acute Ethanol Responsive Gene Networks in Prefrontal Cortex: Functional and Mechanistic Implications

Aaron R. Wolen; Charles A. Phillips; Michael A. Langston; Alex H. Putman; Paul J. Vorster; Nathan A. Bruce; Timothy P. York; Robert W. Williams; Michael F. Miles

Background Individual differences in initial sensitivity to ethanol are strongly related to the heritable risk of alcoholism in humans. To elucidate key molecular networks that modulate ethanol sensitivity we performed the first systems genetics analysis of ethanol-responsive gene expression in brain regions of the mesocorticolimbic reward circuit (prefrontal cortex, nucleus accumbens, and ventral midbrain) across a highly diverse family of 27 isogenic mouse strains (BXD panel) before and after treatment with ethanol. Results Acute ethanol altered the expression of ∼2,750 genes in one or more regions and 400 transcripts were jointly modulated in all three. Ethanol-responsive gene networks were extracted with a powerful graph theoretical method that efficiently summarized ethanols effects. These networks correlated with acute behavioral responses to ethanol and other drugs of abuse. As predicted, networks were heavily populated by genes controlling synaptic transmission and neuroplasticity. Several of the most densely interconnected network hubs, including Kcnma1 and Gsk3β, are known to influence behavioral or physiological responses to ethanol, validating our overall approach. Other major hub genes like Grm3, Pten and Nrg3 represent novel targets of ethanol effects. Networks were under strong genetic control by variants that we mapped to a small number of chromosomal loci. Using a novel combination of genetic, bioinformatic and network-based approaches, we identified high priority cis-regulatory candidate genes, including Scn1b, Gria1, Sncb and Nell2. Conclusions The ethanol-responsive gene networks identified here represent a previously uncharacterized intermediate phenotype between DNA variation and ethanol sensitivity in mice. Networks involved in synaptic transmission were strongly regulated by ethanol and could contribute to behavioral plasticity seen with chronic ethanol. Our novel finding that hub genes and a small number of loci exert major influence over the ethanol response of gene networks could have important implications for future studies regarding the mechanisms and treatment of alcohol use disorders.


hawaii international conference on system sciences | 2014

On finding bicliques in bipartite graphs: a novel algorithm and its application to the integration of diverse biological data types

Yun Zhang; Charles A. Phillips; Gary L. Rogers; Erich J. Baker; Elissa J. Chesler; Michael A. Langston

BackgroundIntegrating and analyzing heterogeneous genome-scale data is a huge algorithmic challenge for modern systems biology. Bipartite graphs can be useful for representing relationships across pairs of disparate data types, with the interpretation of these relationships accomplished through an enumeration of maximal bicliques. Most previously-known techniques are generally ill-suited to this foundational task, because they are relatively inefficient and without effective scaling. In this paper, a powerful new algorithm is described that produces all maximal bicliques in a bipartite graph. Unlike most previous approaches, the new method neither places undue restrictions on its input nor inflates the problem size. Efficiency is achieved through an innovative exploitation of bipartite graph structure, and through computational reductions that rapidly eliminate non-maximal candidates from the search space. An iterative selection of vertices for consideration based on non-decreasing common neighborhood sizes boosts efficiency and leads to more balanced recursion trees.ResultsThe new technique is implemented and compared to previously published approaches from graph theory and data mining. Formal time and space bounds are derived. Experiments are performed on both random graphs and graphs constructed from functional genomics data. It is shown that the new method substantially outperforms the best previous alternatives.ConclusionsThe new method is streamlined, efficient, and particularly well-suited to the study of huge and diverse biological data. A robust implementation has been incorporated into GeneWeaver, an online tool for integrating and analyzing functional genomics experiments, available at http://geneweaver.org. The enormous increase in scalability it provides empowers users to study complex and previously unassailable gene-set associations between genes and their biological functions in a hierarchical fashion and on a genome-wide scale. This practical computational resource is adaptable to almost any applications environment in which bipartite graphs can be used to model relationships between pairs of heterogeneous entities.


BMC Bioinformatics | 2012

The maximum clique enumeration problem: algorithms, applications, and implementations

John D. Eblen; Charles A. Phillips; Gary L. Rogers; Michael A. Langston

BackgroundThe maximum clique enumeration (MCE) problem asks that we identify all maximum cliques in a finite, simple graph. MCE is closely related to two other well-known and widely-studied problems: the maximum clique optimization problem, which asks us to determine the size of a largest clique, and the maximal clique enumeration problem, which asks that we compile a listing of all maximal cliques. Naturally, these three problems are NP-hard, given that they subsume the classic version of the NP-complete clique decision problem. MCE can be solved in principle with standard enumeration methods due to Bron, Kerbosch, Kose and others. Unfortunately, these techniques are ill-suited to graphs encountered in our applications. We must solve MCE on instances deeply seeded in data mining and computational biology, where high-throughput data capture often creates graphs of extreme size and density. MCE can also be solved in principle using more modern algorithms based in part on vertex cover and the theory of fixed-parameter tractability (FPT). While FPT is an improvement, these algorithms too can fail to scale sufficiently well as the sizes and densities of our datasets grow.ResultsAn extensive testbed of benchmark graphs are created using publicly available transcriptomic datasets from the Gene Expression Omnibus (GEO). Empirical testing reveals crucial but latent features of such high-throughput biological data. In turn, it is shown that these features distinguish real data from random data intended to reproduce salient topological features. In particular, with real data there tends to be an unusually high degree of maximum clique overlap. Armed with this knowledge, novel decomposition strategies are tuned to the data and coupled with the best FPT MCE implementations.ConclusionsSeveral algorithmic improvements to MCE are made which progressively decrease the run time on graphs in the testbed. Frequently the final runtime improvement is several orders of magnitude. As a result, instances which were once prohibitively time-consuming to solve are brought into the domain of realistic feasibility.


Frontiers in Microbiology | 2017

Robust inference of genetic exchange communities from microbial genomes using TF-IDF

Yingnan Cong; Yao-ban Chan; Charles A. Phillips; Michael A. Langston; Mark A. Ragan

Bacteria and archaea can exchange genetic material across lineages through processes of lateral genetic transfer (LGT). Collectively, these exchange relationships can be modeled as a network and analyzed using concepts from graph theory. In particular, densely connected regions within an LGT network have been defined as genetic exchange communities (GECs). However, it has been problematic to construct networks in which edges solely represent LGT. Here we apply term frequency-inverse document frequency (TF-IDF), an alignment-free method originating from document analysis, to infer regions of lateral origin in bacterial genomes. We examine four empirical datasets of different size (number of genomes) and phyletic breadth, varying a key parameter (word length k) within bounds established in previous work. We map the inferred lateral regions to genes in recipient genomes, and construct networks in which the nodes are groups of genomes, and the edges natively represent LGT. We then extract maximum and maximal cliques (i.e., GECs) from these graphs, and identify nodes that belong to GECs across a wide range of k. Most surviving lateral transfer has happened within these GECs. Using Gene Ontology enrichment tests we demonstrate that biological processes associated with metabolism, regulation and transport are often over-represented among the genes affected by LGT within these communities. These enrichments are largely robust to change of k.


Developmental Biology | 2015

CbGRiTS: cerebellar gene regulation in time and space.

Thomas Ha; Douglas J. Swanson; Matt Larouche; Randy Glenn; Dave Weeden; Peter Zhang; Kristin M. Hamre; Michael A. Langston; Charles A. Phillips; Mingzhou Song; Zhengyu Ouyang; Elissa J. Chesler; Suman Duvvurru; Roumyana Yordanova; Yan Cui; Kate Campbell; Greg Ricker; Carey Phillips; Ramin Homayouni; Dan Goldowitz

The mammalian CNS is one of the most complex biological systems to understand at the molecular level. The temporal information from time series transcriptome analysis can serve as a potent source of associative information between developmental processes and regulatory genes. Here, we introduce a new transcriptome database called, Cerebellar Gene Regulation in Time and Space (CbGRiTS). This dataset is populated with transcriptome data across embryonic and postnatal development from two standard mouse strains, C57BL/6J and DBA/2J, several recombinant inbred lines and cerebellar mutant strains. Users can evaluate expression profiles across cerebellar development in a deep time series with graphical interfaces for data exploration and link-out to anatomical expression databases. We present three analytical approaches that take advantage of specific aspects of the time series for transcriptome analysis. We demonstrate the use of CbGRiTS dataset as a community resource to explore patterns of gene expression and develop hypotheses concerning gene regulatory networks in brain development.


BMC Bioinformatics | 2010

Serendipitous discoveries in microarray analysis

Sally R. Ellingson; Charles A. Phillips; Randy Glenn; Douglas J. Swanson; Thomas Ha; Dan Goldowitz; Michael A. Langston

Background Scientists are capable of performing very large scale gene expression experiments with current microarray technologies. In order to find significance in the expression data, it is common to use clustering algorithms to group genes with similar expression patterns. Clusters will often contain related genes, such as co-regulated genes or genes in the same biological pathway. It is too expensive and time consuming to test all of the relationships found in large scale microarray experiments. There are many bioinformatics tools that can be used to infer the significance of microarray experiments and cluster analysis.


international symposium on bioinformatics research and applications | 2011

The maximum clique enumeration problem: algorithms, applications and implementations

John D. Eblen; Charles A. Phillips; Gary L. Rogers; Michael A. Langston

Algorithms are designed, analyzed and implemented for the maximum clique enumeration (MCE) problem, which asks that we identify all maximum cliques in a finite, simple graph. MCE is closely related to two other well-known and widely-studied problems: the maximum clique optimization problem, which asks us to determine the size of a largest clique, and the maximal clique enumeration problem, which asks that we compile a listing of all maximal cliques. Naturally, these three problems are NP-hard, given that they subsume the classic version of the NP-complete clique decision problem. MCE can be solved in principle with standard enumeration methods due to Bron, Kerbosch, Kose and others. Unfortunately, these techniques are ill-suited to graphs encountered in our applications. We must solve MCE on instances deeply seeded in data mining and computational biology, where high-throughput data capture often creates graphs of extreme size and density. MCE can also be solved in principle using more modern algorithms based in part on vertex cover and the theory of fixed-parameter tractability (FPT). While FPT is an improvement, these algorithms too can fail to scale sufficiently well as the sizes and densities of our datasets grow. An extensive testbed of benchmark MCE instances is devised, based on applications in transcriptomic data analysis. Empirical testing reveals crucial but latent features of such high-throughput biological data. In turn, it is shown that these features distinguish real data from random data intended to reproduce salient topological features. In particular, with real data there tends to be an unusually high degree of maximum clique overlap. Armed with this knowledge, novel decomposition strategies are tuned to the data and coupled with the best FPT MCE implementations. It is demonstrated that the resultant run times are frequently reduced by several orders of magnitude, and that instances once prohibitively time-consuming to solve are now often brought into the domain of realistic feasibility.


International Journal of Computational Biology and Drug Design | 2014

Differential Shannon entropy and differential coefficient of variation: alternatives and augmentations to differential expression in the search for disease-related genes.

Kai Wang; Charles A. Phillips; Gary L. Rogers; Fredrik Barrenäs; Mikael Benson; Michael A. Langston

Differential expression has been a standard tool for analysing case-control transcriptomic data since the advent of microarray technology. It has proved invaluable in characterising the molecular mechanisms of disease. Nevertheless, the expression profile of a gene across samples can be perturbed in ways that leave the expression level unaltered, while a biological effect is nonetheless present. This paper describes and analyses differential Shannon entropy and differential coefficient of variation, two alternate techniques for identifying genes of interest. Ontological analysis across 16 human disease datasets demonstrates that these alternatives are effective at identifying disease-related genes not found by mere differential expression alone. Because the two alternate techniques are based on somewhat different mathematical formulations, they tend to produce somewhat different gene lists. Moreover, each may pinpoint genes completely overlooked by the other. Thus, measures of entropy and variation can be used to replace or better yet augment standard differential expression computations.


acs/ieee international conference on computer systems and applications | 2009

Using out-of-core techniques to produce exact solutions to the maximum clique problem on extremely large graphs

Gary L. Rogers; Andy D. Perkins; Charles A. Phillips; John D. Eblen; Faisal N. Abu-Khzam; Michael A. Langston

Practical methods are presented for computing exact solutions to the maximum clique problem on graphs that are too large to fit within core memory. These methods use a combination of in-core and out-of-core techniques, recursively dissecting large graphs into manageable components. A global solution to the maximum clique problem is derived from local solutions generated for each of the individual components. Parallelizing the search within these components is instrumental in improving the running times of the algorithms.


BMC Research Notes | 2015

EntropyExplorer: an R package for computing and comparing differential Shannon entropy, differential coefficient of variation and differential expression

Kai Wang; Charles A. Phillips; Arnold M. Saxton; Michael A. Langston

AbstractBackgroundDifferential Shannon entropy (DSE) and differential coefficient of variation (DCV) are effective metrics for the study of gene expression data. They can serve to augment differential expression (DE), and be applied in numerous settings whenever one seeks to measure differences in variability rather than mere differences in magnitude. A general purpose, easily accessible tool for DSE and DCV would help make these two metrics available to data scientists. Automated p value computations would additionally be useful, and are often easier to interpret than raw test statistic values alone.ResultsEntropyExplorer is an R package for calculating DSE, DCV and DE. It also computes corresponding p values for each metric. All features are available through a single R function call. Based on extensive investigations in the literature, the Fligner-Killeen test was chosen to compute DCV p values. No standard method was found to be appropriate for DSE, and so permutation testing is used to calculate DSE p values.ConclusionsEntropyExplorer provides a convenient resource for calculating DSE, DCV, DE and associated p values. The package, along with its source code and reference manual, are freely available from the CRAN public repository at http://cran.r-project.org/web/packages/EntropyExplorer/index.html.

Collaboration


Dive into the Charles A. Phillips's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Elissa J. Chesler

University of Tennessee Health Science Center

View shared research outputs
Top Co-Authors

Avatar

Kai Wang

University of Tennessee

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge