Salim A. Chowdhury
Carnegie Mellon University
Publications
Featured research published by Salim A. Chowdhury.
American Journal of Pathology | 2012
Kerstin Heselmeyer-Haddad; Lissa Y. Berroa Garcia; Amanda Bradley; Clarymar Ortiz-Melendez; Woei-Jyh Lee; Rebecca Christensen; Sheila A. Prindiville; Kathleen A. Calzone; Peter W. Soballe; Yue Hu; Salim A. Chowdhury; Russell Schwartz; Alejandro A. Schäffer; Thomas Ried
Ductal carcinoma in situ (DCIS) is a precursor lesion of invasive ductal carcinoma (IDC) of the breast. To understand the dynamics of genomic alterations in this progression, we used four multicolor fluorescence in situ hybridization probe panels consisting of the oncogenes COX2, MYC, HER2, CCND1, and ZNF217 and the tumor suppressor genes DBC2, CDH1, and TP53 to visualize copy number changes in 13 cases of synchronous DCIS and IDC based on single-cell analyses. The DCIS had a lower degree of chromosomal instability than the IDC. Despite enormous intercellular heterogeneity in DCIS and IDC, we observed signal patterns consistent with a nonrandom distribution of genomic imbalances. CDH1 was most commonly lost, and gain of MYC emerged during progression from DCIS to IDC. Four of 13 DCISs showed identical clonal imbalances in the IDCs. Six cases revealed a switch, and in four of those, the IDC had acquired a gain of MYC. In one case, the major clone in the IDC was one of several clones in the DCIS, and in another case, the major clone in the DCIS became one of the two major clones in the IDC. Despite considerable chromosomal instability, in most cases the evolution from DCIS to IDC is determined by recurrent patterns of genomic imbalances, consistent with a biological continuum.
Research in Computational Molecular Biology | 2010
Salim A. Chowdhury; Rod K. Nibbe; Mark R. Chance; Mehmet Koyutürk
Emerging research demonstrates the potential of protein-protein interaction (PPI) networks in uncovering the mechanistic bases of cancers, through identification of interacting proteins that are coordinately dysregulated in tumorigenic and metastatic samples. When used as features for classification, such coordinately dysregulated subnetworks improve diagnosis and prognosis of cancer considerably over single-gene markers. However, existing methods formulate coordination between multiple genes through additive representation of their expression profiles and utilize greedy heuristics to identify dysregulated subnetworks, which may not be well suited to the potentially combinatorial nature of coordinate dysregulation. Here, we propose a combinatorial formulation of coordinate dysregulation and decompose the resulting objective function to cast the problem as one of identifying subnetwork state functions that are indicative of phenotype. Based on this formulation, we show that coordinate dysregulation of larger subnetworks can be bounded using simple statistics on smaller subnetworks. We then use these bounds to devise an efficient algorithm, Crane, that can search the subnetwork space more effectively than simple greedy algorithms. Comprehensive cross-classification experiments show that subnetworks identified by Crane significantly outperform those identified by greedy algorithms in predicting metastasis of colorectal cancer (CRC).
Wiley Interdisciplinary Reviews: Systems Biology and Medicine | 2011
Rod K. Nibbe; Salim A. Chowdhury; Mehmet Koyutürk; Rob M. Ewing; Mark R. Chance
The main goal of systems medicine is to provide predictive models of the patho‐physiology of complex diseases as well as define healthy states. The reason is clear: we hope accurate models will ultimately lead to more specific and sensitive markers of disease that will help clinicians better stratify their patient populations and optimize treatment plans. In addition, we expect that these models will define novel targets for combating disease. However, for many complex diseases, particularly at the clinical level, it is becoming increasingly clear that one or a few genomic variations alone (e.g., simple models) cannot adequately explain the multiple phenotypes related to disease states, or the variable risks that attend disease progression. We suggest that models that account for the activities of many interacting proteins will explain a wider range of variability inherent in these phenotypes. These models, which encompass protein interaction networks dysregulated for specific diseases and specific patient sub‐populations, will be constructed by integrating protein interaction data with multiple types of other relevant cellular information. Protein interaction databases are thus playing an increasingly important role in systems biology approaches to the study of disease. They present us with a static, but highly functional view of the cellular state, and thus give us a better understanding of not only the normal phenotype, but also the overall disease phenotype at the level of the whole organism when certain interactions become dysregulated. WIREs Syst Biol Med 2011, 3:357–367. DOI: 10.1002/wsbm.121
Journal of Computational Biology | 2011
Salim A. Chowdhury; Rod K. Nibbe; Mark R. Chance; Mehmet Koyutürk
Emerging research demonstrates the potential of protein-protein interaction (PPI) networks in uncovering the mechanistic bases of cancers, through identification of interacting proteins that are coordinately dysregulated in tumorigenic and metastatic samples. When used as features for classification, such coordinately dysregulated subnetworks improve diagnosis and prognosis of cancer considerably over single-gene markers. However, existing methods formulate coordination between multiple genes through additive representation of their expression profiles and utilize fast heuristics to identify dysregulated subnetworks, which may not be well suited to the potentially combinatorial nature of coordinate dysregulation. Here, we propose a combinatorial formulation of coordinate dysregulation and decompose the resulting objective function to cast the problem as one of identifying subnetwork state functions that are indicative of phenotype. Based on this formulation, we show that coordinate dysregulation of larger subnetworks can be bounded using simple statistics on smaller subnetworks. We then use these bounds to devise an efficient algorithm, Crane, that can search the subnetwork space more effectively than existing algorithms. Comprehensive cross-classification experiments show that subnetworks identified by Crane outperform those identified by additive algorithms in predicting metastasis of colorectal cancer (CRC).
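The contrast between an additive and a combinatorial view of coordinate dysregulation can be illustrated with a toy example. The sketch below is not the Crane algorithm; it only assumes binarized expression states, a two-gene subnetwork, and mutual information with the phenotype as the score. The gene names, the data, and the helper `mutual_information` are all hypothetical.

```python
import math
from collections import Counter

def mutual_information(xs, ys):
    """Mutual information (in bits) between two discrete sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    px = Counter(xs)
    py = Counter(ys)
    mi = 0.0
    for (x, y), count in joint.items():
        pxy = count / n
        mi += pxy * math.log2(pxy / ((px[x] / n) * (py[y] / n)))
    return mi

# Hypothetical binarized expression (1 = high, 0 = low) of two interacting
# genes across 8 samples. The phenotype is 1 only when gene_a is high AND
# gene_b is low, i.e. it depends on one specific joint state.
gene_a = [0, 0, 1, 1, 0, 0, 1, 1]
gene_b = [0, 1, 0, 1, 0, 1, 0, 1]
phenotype = [int(a == 1 and b == 0) for a, b in zip(gene_a, gene_b)]

# Additive view: the subnetwork is summarized by the sum of its genes,
# so the states (high, low) and (low, high) become indistinguishable.
additive = [a + b for a, b in zip(gene_a, gene_b)]

# Combinatorial view: the subnetwork is summarized by its joint state.
combinatorial = list(zip(gene_a, gene_b))

mi_additive = mutual_information(additive, phenotype)
mi_combinatorial = mutual_information(combinatorial, phenotype)
print(f"additive: {mi_additive:.3f} bits, combinatorial: {mi_combinatorial:.3f} bits")
```

In this toy setting the joint state carries all of the phenotype information, while the summed profile loses part of it because two different joint states collapse to the same sum.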
PLOS Computational Biology | 2013
Vishal N. Patel; Giridharan Gokulrangan; Salim A. Chowdhury; Yanwen Chen; Andrew E. Sloan; Mehmet Koyutürk; Jill S. Barnholtz-Sloan; Mark R. Chance
To determine a molecular basis for prognostic differences in glioblastoma multiforme (GBM), we employed a combinatorial network analysis framework to exhaustively search for molecular patterns in protein-protein interaction (PPI) networks. We identified a dysregulated molecular signature distinguishing short-term (survival<225 days) from long-term (survival>635 days) survivors of GBM using whole genome expression data from The Cancer Genome Atlas (TCGA). A 50-gene subnetwork signature achieved 80% prediction accuracy when tested against an independent gene expression dataset. Functional annotations for the subnetwork signature included “protein kinase cascade,” “IκB kinase/NFκB cascade,” and “regulation of programmed cell death” – all of which were not significant in signatures of existing subtypes. Finally, we used label-free proteomics to examine how our subnetwork signature predicted protein level expression differences in an independent GBM cohort of 16 patients. We found that the genes discovered using network biology had a higher probability of dysregulated protein expression than either genes exhibiting individual differential expression or genes derived from known GBM subtypes. In particular, the long-term survivor subtype was characterized by increased protein expression of DNM1 and MAPK1 and decreased expression of HSPA9, PSMD3, and CANX. Overall, we demonstrate that the combinatorial analysis of gene expression data constrained by PPIs outlines an approach for the discovery of robust and translatable molecular signatures in GBM.
Bioinformatics | 2013
Salim A. Chowdhury; Stanley E. Shackney; Kerstin Heselmeyer-Haddad; Thomas Ried; Alejandro A. Schäffer; Russell Schwartz
Motivation: Development and progression of solid tumors can be attributed to a process of mutations, which typically includes changes in the number of copies of genes or genomic regions. Although comparisons of cells within single tumors show extensive heterogeneity, recurring features of their evolutionary process may be discerned by comparing multiple regions or cells of a tumor. A useful source of data for studying likely progression of individual tumors is fluorescence in situ hybridization (FISH), which allows one to count copy numbers of several genes in hundreds of single cells. Novel algorithms for interpreting such data phylogenetically are needed, however, to reconstruct likely evolutionary trajectories from states of single cells and facilitate analysis of tumor evolution. Results: In this article, we develop phylogenetic methods to infer likely models of tumor progression using FISH copy number data and apply them to a study of FISH data from two cancer types. Statistical analyses of topological characteristics of the tree-based model provide insights into likely tumor progression pathways consistent with the prior literature. Furthermore, tree statistics from the resulting phylogenies can be used as features for prediction methods. This results in improved accuracy, relative to unstructured gene copy number data, at predicting tumor state and future metastasis. Availability: Source code for software that does FISH tree building (FISHtrees) and the data on cervical and breast cancer examined here are available at ftp://ftp.ncbi.nlm.nih.gov/pub/FISHtrees. Supplementary information: Supplementary data are available at Bioinformatics online.
PLOS Computational Biology | 2014
Salim A. Chowdhury; Stanley E. Shackney; Kerstin Heselmeyer-Haddad; Thomas Ried; Alejandro A. Schäffer; Russell Schwartz
We present methods to construct phylogenetic models of tumor progression at the cellular level that include copy number changes at the scale of single genes, entire chromosomes, and the whole genome. The methods are designed for data collected by fluorescence in situ hybridization (FISH), an experimental technique especially well suited to characterizing intratumor heterogeneity using counts of probes to genetic regions frequently gained or lost in tumor development. Here, we develop new provably optimal methods for computing an edit distance between the copy number states of two cells given evolution by copy number changes of single probes, all probes on a chromosome, or all probes in the genome. We then apply this theory to develop a practical heuristic algorithm, implemented in publicly available software, for inferring tumor phylogenies on data from potentially hundreds of single cells by this evolutionary model. We demonstrate and validate the methods on simulated data and published FISH data from cervical cancers and breast cancers. Our computational experiments show that the new model and algorithm lead to more parsimonious trees than prior methods for single-tumor phylogenetics and to improved performance on various classification tasks, such as distinguishing primary tumors from metastases obtained from the same patient population.
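To make the three event scales concrete, the following sketch computes an edit distance between two copy number states by brute-force breadth-first search. It is an illustration only and is not the provably optimal method or the heuristic described in the abstract. The event model is an assumption: a single probe gains or loses one copy, all probes on one chromosome gain or lose one copy, or the whole genome doubles; a probe at zero copies stays at zero; and copy numbers are capped at `max_copies`. The probe-to-chromosome assignment is hypothetical.

```python
from collections import deque

def neighbors(state, chromosomes):
    """Yield every state reachable from `state` by one assumed event."""
    n = len(state)

    def bump(indices, delta):
        new = list(state)
        for i in indices:
            if new[i] > 0:  # assumption: a fully lost probe cannot be regained
                new[i] += delta
        return tuple(new)

    # Single-probe gain or loss.
    for i in range(n):
        yield bump([i], +1)
        yield bump([i], -1)
    # Gain or loss of every probe on one chromosome.
    for chrom in set(chromosomes):
        on_chrom = [i for i in range(n) if chromosomes[i] == chrom]
        yield bump(on_chrom, +1)
        yield bump(on_chrom, -1)
    # Whole-genome duplication.
    yield tuple(2 * copies for copies in state)

def edit_distance(source, target, chromosomes, max_copies=8):
    """Minimum number of events from source to target, or None if unreachable."""
    if source == target:
        return 0
    seen = {source}
    queue = deque([(source, 0)])
    while queue:
        state, dist = queue.popleft()
        for nxt in neighbors(state, chromosomes):
            if nxt in seen or max(nxt) > max_copies:
                continue
            if nxt == target:
                return dist + 1
            seen.add(nxt)
            queue.append((nxt, dist + 1))
    return None

# Hypothetical panel: two probes on one chromosome, one probe on another.
chroms = ["chrA", "chrA", "chrB"]
diploid = (2, 2, 2)
d_duplication = edit_distance(diploid, (4, 4, 4), chroms)
d_chromosome = edit_distance(diploid, (3, 3, 2), chroms)
d_mixed = edit_distance(diploid, (5, 5, 4), chroms)
d_lost = edit_distance((0, 2, 2), (1, 2, 2), chroms)
```

Under these assumptions the distance is directed, since a genome duplication has no single-event inverse, and the search is only practical for a handful of probes because the state space grows exponentially with panel size.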
American Journal of Pathology | 2014
Kerstin Heselmeyer-Haddad; Lissa Y. Berroa Garcia; Amanda Bradley; Leanora S. Hernandez; Yue Hu; Jens K. Habermann; Christoph Dumke; Christoph Thorns; Sven Perner; Ekaterina Pestova; Catherine Burke; Salim A. Chowdhury; Russell Schwartz; Alejandro A. Schäffer; Pamela L. Paris; Thomas Ried
Gauging the risk of developing progressive disease is a major challenge in prostate cancer patient management. We used genetic markers to understand genomic alteration dynamics during disease progression. By using a novel, advanced, multicolor fluorescence in situ hybridization approach, we enumerated copy numbers of six genes previously identified by array comparative genomic hybridization to be involved in aggressive prostate cancer [TBL1XR1, CTTNBP2, MYC (alias c-myc), PTEN, MEN1, and PDGFB] in six nonrecurrent and seven recurrent radical prostatectomy cases. An ERG break-apart probe to detect TMPRSS2-ERG fusions was included. Subsequent hybridization of probe panels and cell relocation resulted in signal counts for all probes in each individual cell analyzed. Differences in the degree of chromosomal and genomic instability (ie, tumor heterogeneity) or the percentage of cells with TMPRSS2-ERG fusion between samples with or without progression were not observed. Tumors from patients that progressed had more chromosomal gains and losses, and showed a higher degree of selection for a predominant clonal pattern. PTEN loss was the most frequent aberration in progressers (57%), followed by TBL1XR1 gain (29%). MYC gain was observed in one progresser, which was the only lesion with an ERG gain, but no TMPRSS2-ERG fusion. According to our results, a probe set consisting of PTEN, MYC, and TBL1XR1 would detect progressers with 86% sensitivity and 100% specificity. This will be evaluated further in larger studies.
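The percentages reported above can be checked against the cohort sizes with a short calculation. The cohort sizes (seven recurrent, six nonrecurrent) come from the abstract; the per-marker and per-probe-set counts below are assumptions inferred from the rounded percentages, not figures stated in the text.

```python
def sensitivity(true_pos, false_neg):
    """Fraction of progressers that the probe set flags."""
    return true_pos / (true_pos + false_neg)

def specificity(true_neg, false_pos):
    """Fraction of non-progressers that the probe set does not flag."""
    return true_neg / (true_neg + false_pos)

n_recurrent = 7      # from the abstract
n_nonrecurrent = 6   # from the abstract

# Assumed counts, inferred from the rounded percentages in the abstract.
flagged_recurrent = 6      # 6 of 7 is about 86%
flagged_nonrecurrent = 0   # 100% specificity implies no false positives
pten_loss_cases = 4        # 4 of 7 is about 57%
tbl1xr1_gain_cases = 2     # 2 of 7 is about 29%

sens = sensitivity(flagged_recurrent, n_recurrent - flagged_recurrent)
spec = specificity(n_nonrecurrent - flagged_nonrecurrent, flagged_nonrecurrent)
pten_loss = pten_loss_cases / n_recurrent
tbl1xr1_gain = tbl1xr1_gain_cases / n_recurrent

print(f"sensitivity {sens:.0%}, specificity {spec:.0%}")
print(f"PTEN loss {pten_loss:.0%}, TBL1XR1 gain {tbl1xr1_gain:.0%}")
```

With only 13 cases, each patient shifts sensitivity by about 14 percentage points, which is consistent with the authors' note that the probe set needs evaluation in larger studies.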
Bioinformatics | 2015
Salim A. Chowdhury; E. Michael Gertz; Darawalee Wangsa; Kerstin Heselmeyer-Haddad; Thomas Ried; Alejandro A. Schäffer; Russell Schwartz
Motivation: Phylogenetic algorithms have begun to see widespread use in cancer research to reconstruct processes of evolution in tumor progression. Developing reliable phylogenies for tumor data requires quantitative models of cancer evolution that include the unusual genetic mechanisms by which tumors evolve, such as chromosome abnormalities, and allow for heterogeneity between tumor types and individual patients. Previous work on inferring phylogenies of single tumors by copy number evolution assumed models of uniform rates of genomic gain and loss across different genomic sites and scales, a substantial oversimplification necessitated by a lack of algorithms and quantitative parameters for fitting to more realistic tumor evolution models. Results: We propose a framework for inferring models of tumor progression from single-cell gene copy number data, including variable rates for different gain and loss events. We propose a new algorithm for identification of most parsimonious combinations of single gene and single chromosome events. We extend it via dynamic programming to include genome duplications. We implement an expectation maximization (EM)-like method to estimate mutation-specific and tumor-specific event rates concurrently with tree reconstruction. Application of our algorithms to real cervical cancer data identifies key genomic events in disease progression consistent with prior literature. Classification experiments on cervical and tongue cancer datasets lead to improved prediction accuracy for the metastasis of primary cervical cancers and for tongue cancer survival. Availability and implementation: Our software (FISHtrees) and two datasets are available at ftp://ftp.ncbi.nlm.nih.gov/pub/FISHtrees. Supplementary information: Supplementary data are available at Bioinformatics online.
Pacific Symposium on Biocomputing | 2013
Martin Renqiang Min; Salim A. Chowdhury; Yanjun Qi; Alex Stewart; Rachel Ostroff
Disrupted or abnormal biological processes responsible for cancers often quantitatively manifest as disrupted additive and multiplicative interactions of gene/protein expressions correlating with cancer progression. However, the examination of all possible combinatorial interactions between gene features in most case-control studies with limited training data is computationally infeasible. In this paper, we propose a practically feasible data integration approach, QUIRE (QUadratic Interactions among infoRmative fEatures), to identify discriminative complex interactions among informative gene features for cancer diagnosis and biomarker discovery directly based on patient blood samples. QUIRE works in two stages, where it first identifies functionally relevant gene groups for the disease with the help of gene functional annotations and available physical protein interactions, then it explores the combinatorial relationships among the genes from the selected informative groups. Based on our private experimentally generated data from patient blood samples using a novel SOMAmer (Slow Off-rate Modified Aptamer) technology, we apply QUIRE to cancer diagnosis and biomarker discovery for Renal Cell Carcinoma (RCC) and Ovarian Cancer (OVC). To further demonstrate the general applicability of our approach, we also apply QUIRE to a publicly available Colorectal Cancer (CRC) dataset that can be used to prioritize our SOMAmer design. Our experimental results show that QUIRE identifies gene-gene interactions that can better identify the different cancer stages of samples, as compared to other state-of-the-art feature selection methods. A literature survey shows that many of the interactions identified by QUIRE play important roles in the development of cancer.
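The second stage described above, exploring pairwise interactions among genes from already selected groups, can be sketched as follows. This is not the QUIRE model: it simply assumes centered expression values, builds product features for every gene pair, and ranks pairs by the difference in mean product between cases and controls. The gene names, data, and function names are hypothetical, and the first stage (choosing informative gene groups from annotations and protein interactions) is not shown.

```python
from itertools import combinations

def quadratic_features(sample, genes):
    """Map one sample (dict: gene -> expression) to pairwise product features."""
    return {(g, h): sample[g] * sample[h] for g, h in combinations(genes, 2)}

def mean(values):
    return sum(values) / len(values)

def rank_interactions(samples, labels, genes):
    """Rank gene pairs by |mean product in cases - mean product in controls|."""
    features = [quadratic_features(s, genes) for s in samples]
    scores = {}
    for pair in combinations(genes, 2):
        cases = [f[pair] for f, y in zip(features, labels) if y == 1]
        controls = [f[pair] for f, y in zip(features, labels) if y == 0]
        scores[pair] = abs(mean(cases) - mean(controls))
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)

# Hypothetical centered expression: G1 and G2 agree in cases and disagree in
# controls, so neither gene differs on average between the two classes.
genes = ["G1", "G2", "G3"]
samples = [
    {"G1": 1, "G2": 1, "G3": 1},
    {"G1": -1, "G2": -1, "G3": -1},
    {"G1": 1, "G2": 1, "G3": -1},
    {"G1": -1, "G2": -1, "G3": 1},
    {"G1": 1, "G2": -1, "G3": 1},
    {"G1": -1, "G2": 1, "G3": -1},
    {"G1": 1, "G2": -1, "G3": -1},
    {"G1": -1, "G2": 1, "G3": 1},
]
labels = [1, 1, 1, 1, 0, 0, 0, 0]

ranked = rank_interactions(samples, labels, genes)
top_pair, top_score = ranked[0]
```

Restricting the pairs to genes from pre-selected groups is what keeps this tractable: the number of product features grows quadratically with the number of genes considered.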
