Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Huihuang Yan is active.

Publication


Featured researches published by Huihuang Yan.


BMC Genomics | 2014

CAP-miRSeq: a comprehensive analysis pipeline for microRNA sequencing data

Zhifu Sun; Jared M. Evans; Aditya Bhagwate; Sumit Middha; Matthew A Bockol; Huihuang Yan; Jean Pierre A Kocher

BackgroundmiRNAs play a key role in normal physiology and various diseases. miRNA profiling through next generation sequencing (miRNA-seq) has become the main platform for biological research and biomarker discovery. However, analyzing miRNA sequencing data is challenging as it needs significant amount of computational resources and bioinformatics expertise. Several web based analytical tools have been developed but they are limited to processing one or a pair of samples at time and are not suitable for a large scale study. Lack of flexibility and reliability of these web applications are also common issues.ResultsWe developed a Comprehensive Analysis Pipeline for microRNA Sequencing data (CAP-miRSeq) that integrates read pre-processing, alignment, mature/precursor/novel miRNA detection and quantification, data visualization, variant detection in miRNA coding region, and more flexible differential expression analysis between experimental conditions. According to computational infrastructure, users can install the package locally or deploy it in Amazon Cloud to run samples sequentially or in parallel for a large number of samples for speedy analyses. In either case, summary and expression reports for all samples are generated for easier quality assessment and downstream analyses. Using well characterized data, we demonstrated the pipeline’s superior performances, flexibility, and practical use in research and biomarker discovery.ConclusionsCAP-miRSeq is a powerful and flexible tool for users to process and analyze miRNA-seq data scalable from a few to hundreds of samples. The results are presented in the convenient way for investigators or analysts to conduct further investigation and discovery.


Nucleic Acids Research | 2014

MACE: model based analysis of ChIP-exo

Liguo Wang; Junsheng Chen; Chen Wang; Liis Uusküla-Reimand; Kaifu Chen; Alejandra Medina-Rivera; Edwin J. Young; Michael T. Zimmermann; Huihuang Yan; Zhifu Sun; Yuji Zhang; Stephen T. Wu; Haojie Huang; Michael D. Wilson; Jean Pierre A Kocher; Wei Li

Understanding the role of a given transcription factor (TF) in regulating gene expression requires precise mapping of its binding sites in the genome. Chromatin immunoprecipitation-exo, an emerging technique using λ exonuclease to digest TF unbound DNA after ChIP, is designed to reveal transcription factor binding site (TFBS) boundaries with near-single nucleotide resolution. Although ChIP-exo promises deeper insights into transcription regulation, no dedicated bioinformatics tool exists to leverage its advantages. Most ChIP-seq and ChIP-chip analytic methods are not tailored for ChIP-exo, and thus cannot take full advantage of high-resolution ChIP-exo data. Here we describe a novel analysis framework, termed MACE (model-based analysis of ChIP-exo) dedicated to ChIP-exo data analysis. The MACE workflow consists of four steps: (i) sequencing data normalization and bias correction; (ii) signal consolidation and noise reduction; (iii) single-nucleotide resolution border peak detection using the Chebyshev Inequality and (iv) border matching using the Gale-Shapley stable matching algorithm. When applied to published human CTCF, yeast Reb1 and our own mouse ONECUT1/HNF6 ChIP-exo data, MACE is able to define TFBSs with high sensitivity, specificity and spatial resolution, as evidenced by multiple criteria including motif enrichment, sequence conservation, direct sequence pileup, nucleosome positioning and open chromatin states. In addition, we show that the fundamental advance of MACE is the identification of two boundaries of a TFBS with high resolution, whereas other methods only report a single location of the same event. The two boundaries help elucidate the in vivo binding structure of a given TF, e.g. whether the TF may bind as dimers or in a complex with other co-factors.


Oncogene | 2016

High-resolution profiling of histone h3 lysine 36 trimethylation in metastatic renal cell carcinoma

Thai H. Ho; In Young Park; Hao Zhao; Pan Tong; Mia D. Champion; Huihuang Yan; Federico A. Monzon; Anh Hoang; Pheroze Tamboli; Alexander S. Parker; Richard W. Joseph; Wei Qiao; Karl Dykema; Nizar M. Tannir; Erik P. Castle; R. Nunez-Nateras; Bin Tean Teh; Jing Wang; Cheryl L. Walker; Mien Chie Hung; Eric Jonasch

Mutations in SETD2, a histone H3 lysine trimethyltransferase, have been identified in clear cell renal cell carcinoma (ccRCC); however it is unclear if loss of SETD2 function alters the genomic distribution of histone 3 lysine 36 trimethylation (H3K36me3) in ccRCC. Furthermore, published epigenomic profiles are not specific to H3K36me3 or metastatic tumors. To determine if progressive SETD2 and H3K36me3 dysregulation occurs in metastatic tumors, H3K36me3, SETD2 copy number (CN) or SETD2 mRNA abundance was assessed in two independent cohorts: metastatic ccRCC (n=71) and the Cancer Genome Atlas Kidney Renal Clear Cell Carcinoma data set (n=413). Although SETD2 CN loss occurs with high frequency (>90%), H3K36me3 is not significantly impacted by monoallelic loss of SETD2. H3K36me3-positive nuclei were reduced an average of ~20% in primary ccRCC (90% positive nuclei in uninvolved vs 70% positive nuclei in ccRCC) and reduced by ~60% in metastases (90% positive in uninvolved kidney vs 30% positive in metastases) (P<0.001). To define a kidney-specific H3K36me3 profile, we generated genome-wide H3K36me3 profiles from four cytoreductive nephrectomies and SETD2 isogenic renal cell carcinoma (RCC) cell lines using chromatin immunoprecipitation coupled with high-throughput DNA sequencing and RNA sequencing. SETD2 loss of methyltransferase activity leads to regional alterations of H3K36me3 associated with aberrant RNA splicing in a SETD2 mutant RCC and SETD2 knockout cell line. These data suggest that during progression of ccRCC, a decline in H3K36me3 is observed in distant metastases, and regional H3K36me3 alterations influence alternative splicing in ccRCC.


BMC Bioinformatics | 2014

HiChIP: a high-throughput pipeline for integrative analysis of ChIP-Seq data

Huihuang Yan; Jared M. Evans; Mike Kalmbach; Raymond Moore; Sumit Middha; Stanislav Luban; Liguo Wang; Aditya Bhagwate; Ying Li; Zhifu Sun; Xianfeng Chen; Jean-Pierre A. Kocher

BackgroundChromatin immunoprecipitation (ChIP) followed by next-generation sequencing (ChIP-Seq) has been widely used to identify genomic loci of transcription factor (TF) binding and histone modifications. ChIP-Seq data analysis involves multiple steps from read mapping and peak calling to data integration and interpretation. It remains challenging and time-consuming to process large amounts of ChIP-Seq data derived from different antibodies or experimental designs using the same approach. To address this challenge, there is a need for a comprehensive analysis pipeline with flexible settings to accelerate the utilization of this powerful technology in epigenetics research.ResultsWe have developed a highly integrative pipeline, termed HiChIP for systematic analysis of ChIP-Seq data. HiChIP incorporates several open source software packages selected based on internal assessments and published comparisons. It also includes a set of tools developed in-house. This workflow enables the analysis of both paired-end and single-end ChIP-Seq reads, with or without replicates for the characterization and annotation of both punctate and diffuse binding sites. The main functionality of HiChIP includes: (a) read quality checking; (b) read mapping and filtering; (c) peak calling and peak consistency analysis; and (d) result visualization. In addition, this pipeline contains modules for generating binding profiles over selected genomic features, de novo motif finding from transcription factor (TF) binding sites and functional annotation of peak associated genes.ConclusionsHiChIP is a comprehensive analysis pipeline that can be configured to analyze ChIP-Seq data derived from varying antibodies and experiment designs. Using public ChIP-Seq data we demonstrate that HiChIP is a fast and reliable pipeline for processing large amounts of ChIP-Seq data.


Cell Reports | 2016

Retinoblastoma Binding Protein 4 Modulates Temozolomide Sensitivity in Glioblastoma by Regulating DNA Repair Proteins

Gaspar J. Kitange; Ann C. Mladek; Mark A. Schroeder; Jenny C. Pokorny; Brett L. Carlson; Yuji Zhang; Asha Nair; Jeong Heon Lee; Huihuang Yan; Paul A. Decker; Zhiguo Zhang; Jann N. Sarkaria

Here we provide evidence that RBBP4 modulates temozolomide (TMZ) sensitivity through coordinate regulation of two key DNA repair genes critical for recovery from TMZ-induced DNA damage: methylguanine-DNA-methyltransferase (MGMT) and RAD51. Disruption of RBBP4 enhanced TMZ sensitivity, induced synthetic lethality to PARP inhibition, and increased DNA damage signaling in response to TMZ. Moreover, RBBP4 silencing enhanced TMZ-induced H2AX phosphorylation and apoptosis in GBM cells. Intriguingly, RBBP4 knockdown suppressed the expression of MGMT, RAD51, and other genes in association with decreased promoter H3K9 acetylation (H3K9Ac) and increased H3K9 tri-methylation (H3K9me3). Consistent with these data, RBBP4 interacts with CBP/p300 to form a chromatin-modifying complex that binds within the promoter of MGMT, RAD51, and perhaps other genes. Globally, RBBP4 positively and negatively regulates genes involved in critical cellular functions including tumorigenesis. The RBBP4/CBP/p300 complex may provide an interesting target for developing therapy-sensitizing strategies for GBM and other tumors.


Journal of Biological Chemistry | 2016

Enhancer of Zeste Homolog 2 Inhibition Stimulates Bone Formation and Mitigates Bone Loss Caused by Ovariectomy in Skeletally Mature Mice.

Amel Dudakovic; Emily T. Camilleri; Scott M. Riester; Christopher R. Paradise; Martina Gluscevic; Tom O'Toole; Roman Thaler; Jared M. Evans; Huihuang Yan; Malayannan Subramaniam; John R. Hawse; Gary S. Stein; Martin A. Montecino; Meghan E. McGee-Lawrence; Jennifer J. Westendorf; Andre J. van Wijnen

Perturbations in skeletal development and bone degeneration may result in reduced bone mass and quality, leading to greater fracture risk. Bone loss is mitigated by bone protective therapies, but there is a clinical need for new bone-anabolic agents. Previous work has demonstrated that Ezh2 (enhancer of zeste homolog 2), a histone 3 lysine 27 (H3K27) methyltransferase, suppressed differentiation of osteogenic progenitors. Here, we investigated whether inhibition of Ezh2 can be leveraged for bone stimulatory applications. Pharmacologic inhibition and siRNA knockdown of Ezh2 enhanced osteogenic commitment of MC3T3 preosteoblasts. Next generation RNA sequencing of mRNAs and real time quantitative PCR profiling established that Ezh2 inactivation promotes expression of bone-related gene regulators and extracellular matrix proteins. Mechanistically, enhanced gene expression was linked to decreased H3K27 trimethylation (H3K27me3) near transcriptional start sites in genome-wide sequencing of chromatin immunoprecipitations assays. Administration of an Ezh2 inhibitor modestly increases bone density parameters of adult mice. Furthermore, Ezh2 inhibition also alleviated bone loss in an estrogen-deficient mammalian model for osteoporosis. Ezh2 inhibition enhanced expression of Wnt10b and Pth1r and increased the BMP-dependent phosphorylation of Smad1/5. Thus, these data suggest that inhibition of Ezh2 promotes paracrine signaling in osteoblasts and has bone-anabolic and osteoprotective potential in adults.


BMC Bioinformatics | 2016

Impact of post-alignment processing in variant discovery from whole exome data

Shulan Tian; Huihuang Yan; Michael T Kalmbach; Susan L. Slager

BackgroundGATK Best Practices workflows are widely used in large-scale sequencing projects and recommend post-alignment processing before variant calling. Two key post-processing steps include the computationally intensive local realignment around known INDELs and base quality score recalibration (BQSR). Both have been shown to reduce erroneous calls; however, the findings are mainly supported by the analytical pipeline that incorporates BWA and GATK UnifiedGenotyper. It is not known whether there is any benefit of post-processing and to what extent the benefit might be for pipelines implementing other methods, especially given that both mappers and callers are typically updated. Moreover, because sequencing platforms are upgraded regularly and the new platforms provide better estimations of read quality scores, the need for post-processing is also unknown. Finally, some regions in the human genome show high sequence divergence from the reference genome; it is unclear whether there is benefit from post-processing in these regions.ResultsWe used both simulated and NA12878 exome data to comprehensively assess the impact of post-processing for five or six popular mappers together with five callers. Focusing on chromosome 6p21.3, which is a region of high sequence divergence harboring the human leukocyte antigen (HLA) system, we found that local realignment had little or no impact on SNP calling, but increased sensitivity was observed in INDEL calling for the Stampy + GATK UnifiedGenotyper pipeline. No or only a modest effect of local realignment was detected on the three haplotype-based callers and no evidence of effect on Novoalign. BQSR had virtually negligible effect on INDEL calling and generally reduced sensitivity for SNP calling that depended on caller, coverage and level of divergence. Specifically, for SAMtools and FreeBayes calling in the regions with low divergence, BQSR reduced the SNP calling sensitivity but improved the precision when the coverage is insufficient. However, in regions of high divergence (e.g., the HLA region), BQSR reduced the sensitivity of both callers with little gain in precision rate. For the other three callers, BQSR reduced the sensitivity without increasing the precision rate regardless of coverage and divergence level.ConclusionsWe demonstrated that the gain from post-processing is not universal; rather, it depends on mapper and caller combination, and the benefit is influenced further by sequencing depth and divergence level. Our analysis highlights the importance of considering these key factors in deciding to apply the computationally intensive post-processing to Illumina exome data.


Epigenomics | 2016

ChIP-seq in studying epigenetic mechanisms of disease and promoting precision medicine: progresses and future directions

Huihuang Yan; Shulan Tian; Susan L. Slager; Zhifu Sun

Chromatin immunoprecipitation followed by sequencing (ChIP-seq) is widely used for mapping histone modifications, histone proteins, chromatin regulators, transcription factors and other DNA-binding proteins. It has played a significant role in our understanding of disease mechanisms and in exploring epigenetic changes for potential clinical applications. However, the conventional protocol requires large amounts of starting material and does not quantify the actual occupancy, limiting its applications in clinical settings. Herein we summarize the latest progresses in utilizing ChIP-seq to link epigenetic alterations to disease initiation and progression, and the implications in precision medicine. We provide an update on the newly developed ChIP-seq protocols, especially those suitable for scare clinical samples. Technical and analytical challenges are outlined together with recommendations for improvement. Finally, future directions in expediting ChIP-seq use in clinic are discussed.


Briefings in Bioinformatics | 2018

Comparative analysis of de novo assemblers for variation discovery in personal genomes

Shulan Tian; Huihuang Yan; Eric W. Klee; Michael T Kalmbach; Susan L. Slager

&NA; Current variant discovery approaches often rely on an initial read mapping to the reference sequence. Their effectiveness is limited by the presence of gaps, potential misassemblies, regions of duplicates with a high‐sequence similarity and regions of high‐sequence divergence in the reference. Also, mapping‐based approaches are less sensitive to large INDELs and complex variations and provide little phase information in personal genomes. A few de novo assemblers have been developed to identify variants through direct variant calling from the assembly graph, micro‐assembly and whole‐genome assembly, but mainly for whole‐genome sequencing (WGS) data. We developed SGVar, a de novo assembly workflow for haplotype‐based variant discovery from whole‐exome sequencing (WES) data. Using simulated human exome data, we compared SGVar with five variation‐aware de novo assemblers and with BWA‐MEM together with three haplotype‐ or local de novo assembly‐based callers. SGVar outperforms the other assemblers in sensitivity and tolerance of sequencing errors. We recapitulated the findings on whole‐genome and exome data from a Utah residents with Northern and Western European ancestry (CEU) trio, showing that SGVar had high sensitivity both in the highly divergent human leukocyte antigen (HLA) region and in non‐HLA regions of chromosome 6. In particular, SGVar is robust to sequencing error, k‐mer selection, divergence level and coverage depth. Unlike mapping‐based approaches, SGVar is capable of resolving long‐range phase and identifying large INDELs from WES, more prominently from WGS. We conclude that SGVar represents an ideal platform for WES‐based variant discovery in highly divergent regions and across the whole genome.


Handbook of Epigenetics (Second edition)#R##N#The New Molecular and Medical Genetics | 2017

Analyses of Genome-Wide Histone Modifications in the Mammalian Genome

Shulan Tian; Susan L. Slager; Krutika Satish Gaonkar; Huihuang Yan

Abstract Epigenetic mechanisms regulate the transcriptional programs in response to developmental and environmental cues through DNA and histone modifications, histone variants, chromatin regulators, as well as other epigenetic layers and their interactions. There are at least 18 types of histone modifications discovered so far, mainly methylation, acetylation, ubiquitination, and phosphorylation, that occur predominantly on the histone tails. Chromatin immunoprecipitation and sequencing has enabled the genome-wide mapping of histone modifications, histone variants, and chromatin regulators from over 100 human cell types, even at the level of single cells. In this chapter, we summarize the functions and global distributions of major histone modifications, histone variants, and chromatin regulators, depicting those that are mainly located in cis-regulatory regions and gene bodies. We illustrate how histone marks can be used to identify epigenetic abnormalities in human disease. Finally, we describe major challenges in this field and highlight key areas for future studies.

Collaboration


Dive into the Huihuang Yan's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge