Is this you? Create Your Porfile

Yan Kou

Icahn School of Medicine at Mount Sinai

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Yan Kou is active.

Explore More

Publication

Featured researches published by Yan Kou.

Nature | 2012

Patterns and rates of exonic de novo mutations in autism spectrum disorders

Benjamin M. Neale; Yan Kou; Li Liu; Avi Ma'ayan; Kaitlin E. Samocha; Aniko Sabo; Chiao-Feng Lin; Christine Stevens; Li-San Wang; Vladimir Makarov; Pazi Penchas Polak; Seungtai Yoon; Jared Maguire; Emily L. Crawford; Nicholas G. Campbell; Evan T. Geller; Otto Valladares; Chad Shafer; Han Liu; Tuo Zhao; Guiqing Cai; Jayon Lihm; Ruth Dannenfelser; Omar Jabado; Zuleyma Peralta; Uma Nagaswamy; Donna M. Muzny; Jeffrey G. Reid; Irene Newsham; Yuanqing Wu

Autism spectrum disorders (ASD) are believed to have genetic and environmental origins, yet in only a modest fraction of individuals can specific causes be identified. To identify further genetic risk factors, here we assess the role of de novo mutations in ASD by sequencing the exomes of ASD cases and their parents (n = 175 trios). Fewer than half of the cases (46.3%) carry a missense or nonsense de novo variant, and the overall rate of mutation is only modestly higher than the expected rate. In contrast, the proteins encoded by genes that harboured de novo missense or nonsense mutations showed a higher degree of connectivity among themselves and to previous ASD genes as indexed by protein-protein interaction screens. The small increase in the rate of de novo events, when taken together with the protein interaction results, are consistent with an important but limited role for de novo point mutations in ASD, similar to that documented for de novo copy number variants. Genetic models incorporating these data indicate that most of the observed de novo events are unconnected to ASD; those that do confer risk are distributed across many genes and are incompletely penetrant (that is, not necessarily sufficient for disease). Our results support polygenic models in which spontaneous coding mutations in any of a large number of genes increases risk by 5- to 20-fold. Despite the challenge posed by such models, results from de novo events and a large parallel case–control study provide strong evidence in favour of CHD8 and KATNAL2 as genuine autism risk factors.

BMC Bioinformatics | 2013

Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool

Edward Y. Chen; Christopher M. Tan; Yan Kou; Qiaonan Duan; Zichen Wang; Gabriela Vaz Meirelles; Neil R. Clark; Avi Ma’ayan

BackgroundSystem-wide profiling of genes and proteins in mammalian cells produce lists of differentially expressed genes/proteins that need to be further analyzed for their collective functions in order to extract new knowledge. Once unbiased lists of genes or proteins are generated from such experiments, these lists are used as input for computing enrichment with existing lists created from prior knowledge organized into gene-set libraries. While many enrichment analysis tools and gene-set libraries databases have been developed, there is still room for improvement.ResultsHere, we present Enrichr, an integrative web-based and mobile software application that includes new gene-set libraries, an alternative approach to rank enriched terms, and various interactive visualization approaches to display enrichment results using the JavaScript library, Data Driven Documents (D3). The software can also be embedded into any tool that performs gene list analysis. We applied Enrichr to analyze nine cancer cell lines by comparing their enrichment signatures to the enrichment signatures of matched normal tissues. We observed a common pattern of up regulation of the polycomb group PRC2 and enrichment for the histone mark H3K27me3 in many cancer cell lines, as well as alterations in Toll-like receptor and interlukin signaling in K562 cells when compared with normal myeloid CD33+ cells. Such analyses provide global visualization of critical differences between normal tissues and cancer cell lines but can be applied to many other scenarios.ConclusionsEnrichr is an easy to use intuitive enrichment analysis web-based tool providing various types of visualization summaries of collective functions of gene lists. Enrichr is open source and freely available online at: http://amp.pharm.mssm.edu/Enrichr.

Science | 2016

Fetal liver hematopoietic stem cell niches associate with portal vessels

Jalal A. Khan; Avital Mendelson; Yuya Kunisaki; Alexander Birbrair; Yan Kou; Anna Arnal-Estapé; Sandra Pinho; Paul Ciero; Fumio Nakahara; Avi Ma'ayan; Aviv Bergman; Miriam Merad; Paul S. Frenette

How HSCs populate the fetal liver Hematopoietic stem cells (HSCs) undergo dramatic expansion in the fetal liver before migrating to their definitive site in the bone marrow. Khan et al. identify portal vessel–associated Nestin+NG2+ pericytes as critical HSC niche components (see the Perspective by Cabezas-Wallscheid and Trumpp). The portal vessel niche and HSCs expand according to fractal geometries, suggesting that niche cells—rather than factors expressed by the niche—drive HSC proliferation. After birth, arterial portal vessels transform into portal veins, and lose Nestin+NG2+ pericytes. When this happens, the niche is lost and HSCs migrate away from the neonatal liver. Science, this issue p. 176; see also p. 126 A blood vessel–associated microenvironment supports the expansion of hematopoietic stem cells in the fetal liver. [Also see Perspective by Cabezas-Wallscheid and Trumpp] Whereas the cellular basis of the hematopoietic stem cell (HSC) niche in the bone marrow has been characterized, the nature of the fetal liver niche is not yet elucidated. We show that Nestin+NG2+ pericytes associate with portal vessels, forming a niche promoting HSC expansion. Nestin+NG2+ cells and HSCs scale during development with the fractal branching patterns of portal vessels, tributaries of the umbilical vein. After closure of the umbilical inlet at birth, portal vessels undergo a transition from Neuropilin-1+Ephrin-B2+ artery to EphB4+ vein phenotype, associated with a loss of periportal Nestin+NG2+ cells and emigration of HSCs away from portal vessels. These data support a model in which HSCs are titrated against a periportal vascular niche with a fractal-like organization enabled by placental circulation.

Molecular Autism | 2014

DAWN: a framework to identify autism genes and subnetworks using gene expression and genetics

Li Liu; Jing Lei; Stephan J. Sanders; Arthur Jeremy Willsey; Yan Kou; Abdullah Ercument Cicek; Lambertus Klei; Cong Lu; Xin He; Mingfeng Li; Rebecca A. Muhle; Avi Ma'ayan; James P. Noonan; Nenad Sestan; Kathryn McFadden; Matthew W. State; Joseph D. Buxbaum; Bernie Devlin; Kathryn Roeder

BackgroundDe novo loss-of-function (dnLoF) mutations are found twofold more often in autism spectrum disorder (ASD) probands than their unaffected siblings. Multiple independent dnLoF mutations in the same gene implicate the gene in risk and hence provide a systematic, albeit arduous, path forward for ASD genetics. It is likely that using additional non-genetic data will enhance the ability to identify ASD genes.MethodsTo accelerate the search for ASD genes, we developed a novel algorithm, DAWN, to model two kinds of data: rare variations from exome sequencing and gene co-expression in the mid-fetal prefrontal and motor-somatosensory neocortex, a critical nexus for risk. The algorithm casts the ensemble data as a hidden Markov random field in which the graph structure is determined by gene co-expression and it combines these interrelationships with node-specific observations, namely gene identity, expression, genetic data and the estimated effect on risk.ResultsUsing currently available genetic data and a specific developmental time period for gene co-expression, DAWN identified 127 genes that plausibly affect risk, and a set of likely ASD subnetworks. Validation experiments making use of published targeted resequencing results demonstrate its efficacy in reliably predicting ASD genes. DAWN also successfully predicts known ASD genes, not included in the genetic data used to create the model.ConclusionsValidation studies demonstrate that DAWN is effective in predicting ASD genes and subnetworks by leveraging genetic and gene expression data. The findings reported here implicate neurite extension and neuronal arborization as risks for ASD. Using DAWN on emerging ASD sequence data and gene expression data from other brain regions and tissues would likely identify novel ASD genes. DAWN can also be used for other complex disorders to identify genes and subnetworks in those disorders.

Database | 2013

ESCAPE: database for integrating high-content published data collected from human and mouse embryonic stem cells.

Huilei Xu; Caroline Baroukh; Ruth Dannenfelser; Edward Y. Chen; Christopher M. Tan; Yan Kou; Yujin E. Kim; Ihor R. Lemischka; Avi Ma'ayan

High content studies that profile mouse and human embryonic stem cells (m/hESCs) using various genome-wide technologies such as transcriptomics and proteomics are constantly being published. However, efforts to integrate such data to obtain a global view of the molecular circuitry in m/hESCs are lagging behind. Here, we present an m/hESC-centered database called Embryonic Stem Cell Atlas from Pluripotency Evidence integrating data from many recent diverse high-throughput studies including chromatin immunoprecipitation followed by deep sequencing, genome-wide inhibitory RNA screens, gene expression microarrays or RNA-seq after knockdown (KD) or overexpression of critical factors, immunoprecipitation followed by mass spectrometry proteomics and phosphoproteomics. The database provides web-based interactive search and visualization tools that can be used to build subnetworks and to identify known and novel regulatory interactions across various regulatory layers. The web-interface also includes tools to predict the effects of combinatorial KDs by additive effects controlled by sliders, or through simulation software implemented in MATLAB. Overall, the Embryonic Stem Cell Atlas from Pluripotency Evidence database is a comprehensive resource for the stem cell systems biology community. Database URL: http://www.maayanlab.net/ESCAPE

BMC Bioinformatics | 2014

The characteristic direction: a geometrical approach to identify differentially expressed genes

Neil R. Clark; Kevin Hu; Axel S Feldmann; Yan Kou; Edward Y. Chen; Qiaonan Duan; Avi Ma’ayan

BackgroundIdentifying differentially expressed genes (DEG) is a fundamental step in studies that perform genome wide expression profiling. Typically, DEG are identified by univariate approaches such as Significance Analysis of Microarrays (SAM) or Linear Models for Microarray Data (LIMMA) for processing cDNA microarrays, and differential gene expression analysis based on the negative binomial distribution (DESeq) or Empirical analysis of Digital Gene Expression data in R (edgeR) for RNA-seq profiling.ResultsHere we present a new geometrical multivariate approach to identify DEG called the Characteristic Direction. We demonstrate that the Characteristic Direction method is significantly more sensitive than existing methods for identifying DEG in the context of transcription factor (TF) and drug perturbation responses over a large number of microarray experiments. We also benchmarked the Characteristic Direction method using synthetic data, as well as RNA-Seq data. A large collection of microarray expression data from TF perturbations (73 experiments) and drug perturbations (130 experiments) extracted from the Gene Expression Omnibus (GEO), as well as an RNA-Seq study that profiled genome-wide gene expression and STAT3 DNA binding in two subtypes of diffuse large B-cell Lymphoma, were used for benchmarking the method using real data. ChIP-Seq data identifying DNA binding sites of the perturbed TFs, as well as known drug targets of the perturbing drugs, were used as prior knowledge silver-standard for validation. In all cases the Characteristic Direction DEG calling method outperformed other methods. We find that when drugs are applied to cells in various contexts, the proteins that interact with the drug-targets are differentially expressed and more of the corresponding genes are discovered by the Characteristic Direction method. In addition, we show that the Characteristic Direction conceptualization can be used to perform improved gene set enrichment analyses when compared with the gene-set enrichment analysis (GSEA) and the hypergeometric test.ConclusionsThe application of the Characteristic Direction method may shed new light on relevant biological mechanisms that would have remained undiscovered by the current state-of-the-art DEG methods. The method is freely accessible via various open source code implementations using four popular programming languages: R, Python, MATLAB and Mathematica, all available at: http://www.maayanlab.net/CD.

Nature Communications | 2014

Histone H3.3 and its proteolytically processed form drive a cellular senescence programme

Luis F. Duarte; Andrew J. Young; Zichen Wang; Hsan-Au Wu; Taniya Panda; Yan Kou; Avnish Kapoor; Dan Hasson; Nicholas R. Mills; Avi Ma'ayan; Masashi Narita; Emily Bernstein

The process of cellular senescence generates a repressive chromatin environment, however, the role of histone variants and histone proteolytic cleavage in senescence remains unclear. Using models of oncogene-induced and replicative senescence, here we report novel histone H3 tail cleavage events mediated by the protease Cathepsin L. We find that cleaved forms of H3 are nucleosomal and the histone variant H3.3 is the preferred cleaved form of H3. Ectopic expression of H3.3 and its cleavage product (H3.3cs1), which lacks the first twenty-one amino acids of the H3 tail, is sufficient to induce senescence. Further, H3.3cs1 chromatin incorporation is mediated by the HUCA histone chaperone complex. Genome-wide transcriptional profiling revealed that H3.3cs1 facilitates transcriptional silencing of cell cycle regulators including RB/E2F target genes, likely via the permanent removal of H3K4me3. Collectively, our study identifies histone H3.3 and its proteolytically processed forms as key regulators of cellular senescence.

Nature Communications | 2016

Extraction and analysis of signatures from the Gene Expression Omnibus by the crowd.

Zichen Wang; Caroline D. Monteiro; Kathleen M. Jagodnik; Nicolas F. Fernandez; Gregory W. Gundersen; Andrew D. Rouillard; Sherry L. Jenkins; Axel S Feldmann; Kevin Hu; Michael G. McDermott; Qiaonan Duan; Neil R. Clark; Matthew R. Jones; Yan Kou; Troy Goff; Holly Woodland; Fabio M R. Amaral; Gregory L. Szeto; Oliver Fuchs; Sophia Miryam Schüssler-Fiorenza Rose; Shvetank Sharma; Uwe Schwartz; Xabier Bengoetxea Bausela; Maciej Szymkiewicz; Vasileios Maroulis; Anton Salykin; Carolina M. Barra; Candice D. Kruth; Nicholas J. Bongio; Vaibhav Mathur

Gene expression data are accumulating exponentially in public repositories. Reanalysis and integration of themed collections from these studies may provide new insights, but requires further human curation. Here we report a crowdsourcing project to annotate and reanalyse a large number of gene expression profiles from Gene Expression Omnibus (GEO). Through a massive open online course on Coursera, over 70 participants from over 25 countries identify and annotate 2,460 single-gene perturbation signatures, 839 disease versus normal signatures, and 906 drug perturbation signatures. All these signatures are unique and are manually validated for quality. Global analysis of these signatures confirms known associations and identifies novel associations between genes, diseases and drugs. The manually curated signatures are used as a training set to develop classifiers for extracting similar signatures from the entire GEO repository. We develop a web portal to serve these signatures for query, download and visualization.

American Journal of Medical Genetics Part C-seminars in Medical Genetics | 2012

Network- and attribute-based classifiers can prioritize genes and pathways for autism spectrum disorders and intellectual disability†‡

Yan Kou; Catalina Betancur; Huilei Xu; Joseph D. Buxbaum; Avi Ma'ayan

Autism spectrum disorders (ASD) are a group of related neurodevelopmental disorders with significant combined prevalence (∼1%) and high heritability. Dozens of individually rare genes and loci associated with high‐risk for ASD have been identified, which overlap extensively with genes for intellectual disability (ID). However, studies indicate that there may be hundreds of genes that remain to be identified. The advent of inexpensive massively parallel nucleotide sequencing can reveal the genetic underpinnings of heritable complex diseases, including ASD and ID. However, whole exome sequencing (WES) and whole genome sequencing (WGS) provides an embarrassment of riches, where many candidate variants emerge. It has been argued that genetic variation for ASD and ID will cluster in genes involved in distinct pathways and protein complexes. For this reason, computational methods that prioritize candidate genes based on additional functional information such as protein–protein interactions or association with specific canonical or empirical pathways, or other attributes, can be useful. In this study we applied several supervised learning approaches to prioritize ASD or ID disease gene candidates based on curated lists of known ASD and ID disease genes. We implemented two network‐based classifiers and one attribute‐based classifier to show that we can rank and classify known, and predict new, genes for these neurodevelopmental disorders. We also show that ID and ASD share common pathways that perturb an overlapping synaptic regulatory subnetwork. We also show that features relating to neuronal phenotypes in mouse knockouts can help in classifying neurodevelopmental genes. Our methods can be applied broadly to other diseases helping in prioritizing newly identified genetic variation that emerge from disease gene discovery based on WES and WGS.

PLOS Genetics | 2015

A Systems Approach Identifies Essential FOXO3 Functions at Key Steps of Terminal Erythropoiesis

Raymond Liang; Genís Campreciós; Yan Kou; Kathleen E. McGrath; Roberta B. Nowak; Seana C. Catherman; Carolina L. Bigarella; Pauline Rimmele; Xin Zhang; Merlin Nithya Gnanapragasam; James J. Bieker; Dmitri Papatsenko; Avi Ma’ayan; Emery H. Bresnick; Velia M. Fowler; James Palis; Saghi Ghaffari

Circulating red blood cells (RBCs) are essential for tissue oxygenation and homeostasis. Defective terminal erythropoiesis contributes to decreased generation of RBCs in many disorders. Specifically, ineffective nuclear expulsion (enucleation) during terminal maturation is an obstacle to therapeutic RBC production in vitro. To obtain mechanistic insights into terminal erythropoiesis we focused on FOXO3, a transcription factor implicated in erythroid disorders. Using an integrated computational and experimental systems biology approach, we show that FOXO3 is essential for the correct temporal gene expression during terminal erythropoiesis. We demonstrate that the FOXO3-dependent genetic network has critical physiological functions at key steps of terminal erythropoiesis including enucleation and mitochondrial clearance processes. FOXO3 loss deregulated transcription of genes implicated in cell polarity, nucleosome assembly and DNA packaging-related processes and compromised erythroid enucleation. Using high-resolution confocal microscopy and imaging flow cytometry we show that cell polarization is impaired leading to multilobulated Foxo3 -/- erythroblasts defective in nuclear expulsion. Ectopic FOXO3 expression rescued Foxo3 -/- erythroblast enucleation-related gene transcription, enucleation defects and terminal maturation. Remarkably, FOXO3 ectopic expression increased wild type erythroblast maturation and enucleation suggesting that enhancing FOXO3 activity may improve RBCs production. Altogether these studies uncover FOXO3 as a novel regulator of erythroblast enucleation and terminal maturation suggesting FOXO3 modulation might be therapeutic in disorders with defective erythroid maturation.

Explore More