Chen-Hsiang Yeang
Academia Sinica
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Chen-Hsiang Yeang.
Proceedings of the National Academy of Sciences of the United States of America | 2001
Sridhar Ramaswamy; Pablo Tamayo; Ryan Rifkin; Sayan Mukherjee; Chen-Hsiang Yeang; Michael Angelo; Christine Ladd; Michael R. Reich; Eva Latulippe; Jill P. Mesirov; Tomaso Poggio; William L. Gerald; Massimo Loda; Eric S. Lander; Todd R. Golub
The optimal treatment of patients with cancer depends on establishing accurate diagnoses by using a complex combination of clinical and histopathological data. In some instances, this task is difficult or impossible because of atypical clinical presentation or histopathology. To determine whether the diagnosis of multiple common adult malignancies could be achieved purely by molecular classification, we subjected 218 tumor samples, spanning 14 common tumor types, and 90 normal tissue samples to oligonucleotide microarray gene expression analysis. The expression levels of 16,063 genes and expressed sequence tags were used to evaluate the accuracy of a multiclass classifier based on a support vector machine algorithm. Overall classification accuracy was 78%, far exceeding the accuracy of random classification (9%). Poorly differentiated cancers resulted in low-confidence predictions and could not be accurately classified according to their tissue of origin, indicating that they are molecularly distinct entities with dramatically different gene expression patterns compared with their well differentiated counterparts. Taken together, these results demonstrate the feasibility of accurate, multiclass molecular cancer classification and suggest a strategy for future clinical implementation of molecular cancer diagnostics.
Journal of Computational Biology | 2004
Chen-Hsiang Yeang; Trey Ideker; Tommi S. Jaakkola
We develop a new framework for inferring models of transcriptional regulation. The models, which we call physical network models, are annotated molecular interaction graphs. The attributes in the model correspond to verifiable properties of the underlying biological system such as the existence of protein-protein and protein-DNA interactions, the directionality of signal transduction in protein-protein interactions, as well as signs of the immediate effects of these interactions. Possible configurations of these variables are constrained by the available data sources. Some of the data sources, such as factor-binding data, involve measurements that are directly tied to the variables in the model. Other sources, such as gene knock-outs, are functional in nature and provide only indirect evidence about the variables. We associate each observed knock-out effect in the deletion mutant data with a set of causal paths (molecular cascades) that could in principle explain the effect, resulting in aggregate constraints about the physical variables in the model. The most likely settings of all the variables, specifying the most likely graph annotations, are found by a recursive application of the max-product algorithm. By testing our approach on datasets related to the pheromone response pathway in S. cerevisiae, we demonstrate that the resulting model is consistent with previous studies about the pathway. Moreover, we successfully predict gene knock-out effects with a high degree of accuracy in a cross-validation setting. When applying this approach genome-wide, we extract submodels consistent with previous studies. The approach can be readily extended to other data sources or to facilitate automated experimental design.
The FASEB Journal | 2008
Chen-Hsiang Yeang; Frank McCormick; Arnold J. Levine
Cancer is a complex process in which the abnormalities of many genes appear to be involved. The combinatorial patterns of gene mutations may reveal the functional relations between genes and pathways in tumorigenesis as well as identify targets for treatment. We examined the patterns of somatic mutations of cancers from Catalog of Somatic Mutations in Cancer (COSMIC), a large‐scale database curated by the Wellcome Trust Sanger Institute. The frequently mutated genes are well‐known oncogenes and tumor suppressors that are involved in generic processes of cell‐cycle control, signal transduction, and stress responses. These “signatures” of gene mutations are heterogeneous when the cancers from different tissues are compared. Mutations in genes functioning in different pathways can occur in the same cancer (i.e., co‐occur), whereas those in genes functioning in the same pathway are rarely mutated in the same sample. This observation supports the view of tumorigenesis as derived from a process like Darwinian evolution. However, certain combinatorial mutational patterns violate these simple rules and demonstrate tissue‐specific variations. For instance, mutations of genes in the Ras and Wnt pathways tend to co‐occur in the large intestine but are mutually exclusive in cancers of the pancreas. The relationships between mutations in different samples of a cancer can also reveal the temporal orders of mutational events. In addition, the observed mutational patterns suggest candidates of new cosequencing targets that can either reveal novel patterns or validate the predictions deduced from existing patterns. These combinatorial mutational patterns provide guiding information for the ongoing cancer genome projects.—Yeang, C‐H., McCormick, F., Levine, A. Combinatorial patterns of somatic gene mutations in cancer. FASEB J. 22, 2605–2622 (2008)
Genome Biology | 2005
Chen-Hsiang Yeang; H. Craig Mak; Scott McCuine; Christopher T. Workman; Tommi S. Jaakkola; Trey Ideker
As genome-scale measurements lead to increasingly complex models of gene regulation, systematic approaches are needed to validate and refine these models. Towards this goal, we describe an automated procedure for prioritizing genetic perturbations in order to discriminate optimally between alternative models of a gene-regulatory network. Using this procedure, we evaluate 38 candidate regulatory networks in yeast and perform four high-priority gene knockout experiments. The refined networks support previously unknown regulatory mechanisms downstream of SOK2 and SWI4.
Siam Review | 2003
Ryan Rifkin; Sayan Mukherjee; Pablo Tamayo; Sridhar Ramaswamy; Chen-Hsiang Yeang; Michael Angelo; Michael R. Reich; Tomaso Poggio; Eric S. Lander; Todd R. Golub; Jill P. Mesirov
A treating composition and method for treatment of shock and/or stress in animals. The composition comprises, in a preferred form, equal volume amounts of solutions of sodium acetate and sodium propionate. It may be administered orally, intravenously, subcutaneously, etc. The preferred dosage level is from about 0.25 cc. per pound of body weight to about 0.5 cc. per pound of body weight.
Proceedings of the National Academy of Sciences of the United States of America | 2012
Robert A. Beckman; Gunter S. Schemmann; Chen-Hsiang Yeang
Cancers are heterogeneous and genetically unstable. Current practice of personalized medicine tailors therapy to heterogeneity between cancers of the same organ type. However, it does not yet systematically address heterogeneity at the single-cell level within a single individual’s cancer or the dynamic nature of cancer due to genetic and epigenetic change as well as transient functional changes. We have developed a mathematical model of personalized cancer therapy incorporating genetic evolutionary dynamics and single-cell heterogeneity, and have examined simulated clinical outcomes. Analyses of an illustrative case and a virtual clinical trial of over 3 million evaluable “patients” demonstrate that augmented (and sometimes counterintuitive) nonstandard personalized medicine strategies may lead to superior patient outcomes compared with the current personalized medicine approach. Current personalized medicine matches therapy to a tumor molecular profile at diagnosis and at tumor relapse or progression, generally focusing on the average, static, and current properties of the sample. Nonstandard strategies also consider minor subclones, dynamics, and predicted future tumor states. Our methods allow systematic study and evaluation of nonstandard personalized medicine strategies. These findings may, in turn, suggest global adjustments and enhancements to translational oncology research paradigms.
PLOS Computational Biology | 2009
Charles J. Vaske; Carrie D. House; Truong Luu; Bryan Frank; Chen-Hsiang Yeang; Norman H. Lee; Joshua M. Stuart
Complex phenotypes such as the transformation of a normal population of cells into cancerous tissue result from a series of molecular triggers gone awry. We describe a method that searches for a genetic network consistent with expression changes observed under the knock-down of a set of genes that share a common role in the cell, such as a disease phenotype. The method extends the Nested Effects Model of Markowetz et al. (2005) by using a probabilistic factor graph to search for a network representing interactions among these silenced genes. The method also expands the network by attaching new genes at specific downstream points, providing candidates for subsequent perturbations to further characterize the pathway. We investigated an extension provided by the factor graph approach in which the model distinguishes between inhibitory and stimulatory interactions. We found that the extension yielded significant improvements in recovering the structure of simulated and Saccharomyces cerevisae networks. We applied the approach to discover a signaling network among genes involved in a human colon cancer cell invasiveness pathway. The method predicts several genes with new roles in the invasiveness process. We knocked down two genes identified by our approach and found that both knock-downs produce loss of invasive potential in a colon cancer cell line. Nested effects models may be a powerful tool for inferring regulatory connections and genes that operate in normal and disease-related processes.
Nucleic Acids Research | 2013
Nardnisa Sintupisut; Pei-Ling Liu; Chen-Hsiang Yeang
Glioblastoma multiforme (GBM) is the most common and malignant primary brain tumor in adults. Decades of investigations and the recent effort of the Cancer Genome Atlas (TCGA) project have mapped many molecular alterations in GBM cells. Alterations on DNAs may dysregulate gene expressions and drive malignancy of tumors. It is thus important to uncover causal and statistical dependency between ‘effector’ molecular aberrations and ‘target’ gene expressions in GBMs. A rich collection of prior studies attempted to combine copy number variation (CNV) and mRNA expression data. However, systematic methods to integrate multiple types of cancer genomic data—gene mutations, single nucleotide polymorphisms, CNVs, DNA methylations, mRNA and microRNA expressions and clinical information—are relatively scarce. We proposed an algorithm to build ‘association modules’ linking effector molecular aberrations and target gene expressions and applied the module-finding algorithm to the integrated TCGA GBM data sets. The inferred association modules were validated by six tests using external information and datasets of central nervous system tumors: (i) indication of prognostic effects among patients; (ii) coherence of target gene expressions; (iii) retention of effector–target associations in external data sets; (iv) recurrence of effector molecular aberrations in GBM; (v) functional enrichment of target genes; and (vi) co-citations between effectors and targets. Modules associated with well-known molecular aberrations of GBM—such as chromosome 7 amplifications, chromosome 10 deletions, EGFR and NF1 mutations—passed the majority of the validation tests. Furthermore, several modules associated with less well-reported molecular aberrations—such as chromosome 11 CNVs, CD40, PLXNB1 and GSTM1 methylations, and mir-21 expressions—were also validated by external information. In particular, modules constituting trans-acting effects with chromosome 11 CNVs and cis-acting effects with chromosome 10 CNVs manifested strong negative and positive associations with survival times in brain tumors. By aligning the information of association modules with the established GBM subclasses based on transcription or methylation levels, we found each subclass possessed multiple concurrent molecular aberrations. Furthermore, the joint molecular characteristics derived from 16 association modules had prognostic power not explained away by the strong biomarker of CpG island methylator phenotypes. Functional and survival analyses indicated that immune/inflammatory responses and epithelial-mesenchymal transitions were among the most important determining processes of prognosis. Finally, we demonstrated that certain molecular aberrations uniquely recurred in GBM but were relatively rare in non-GBM glioma cells. These results justify the utility of an integrative analysis on cancer genomes and provide testable characterizations of driver aberration events in GBM.
Stem cell reports | 2014
I-Ying Lin; Feng-Lan Chiu; Chen-Hsiang Yeang; Hsin-Fu Chen; Ching-Yu Chuang; Shii-Yi Yang; Pei-Shan Hou; Nardnisa Sintupisut; Hong-Nerng Ho; Hung-Chih Kuo; Kuo-I Lin
Summary The mechanisms of transcriptional regulation underlying human primordial germ cell (PGC) differentiation are largely unknown. The transcriptional repressor Prdm1/Blimp-1 is known to play a critical role in controlling germ cell specification in mice. Here, we show that PRDM1 is expressed in developing human gonads and contributes to the determination of germline versus neural fate in early development. We show that knockdown of PRDM1 in human embryonic stem cells (hESCs) impairs germline potential and upregulates neural genes. Conversely, ectopic expression of PRDM1 in hESCs promotes the generation of cells that exhibit phenotypic and transcriptomic features of early PGCs. Furthermore, PRDM1 suppresses transcription of SOX2. Overexpression of SOX2 in hESCs under conditions favoring germline differentiation skews cell fate from the germline to the neural lineage. Collectively, our results demonstrate that PRDM1 serves as a molecular switch to modulate the divergence of neural or germline fates through repression of SOX2 during human development.
bioinformatics and bioengineering | 2003
Chen-Hsiang Yeang; Tommi S. Jaakkola
We develop a method for integrating time series expression profiles and factor-gene binding data to quantify dynamic aspects of gene regulation. We estimate latencies for transcription activation by explaining time correlations between gene expression profiles through available factor-gene binding information. The resulting aligned expression profiles are subsequently clustered and again combined with binding information to determine groups or subgroups of co-regulated genes. The predictions derived from this approach are consistent with existing results. Our analysis also provides several hypotheses not implicated in previous studies.