Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Shanrong Zhao is active.

Publication


Featured researches published by Shanrong Zhao.


BMC Genomics | 2016

QuickRNASeq lifts large-scale RNA-seq data analyses to the next level of automation and interactive visualization.

Shanrong Zhao; Li Xi; Jie Quan; Hualin Xi; Ying Zhang; David von Schack; Michael Vincent; Baohong Zhang

BackgroundRNA sequencing (RNA-seq), a next-generation sequencing technique for transcriptome profiling, is being increasingly used, in part driven by the decreasing cost of sequencing. Nevertheless, the analysis of the massive amounts of data generated by large-scale RNA-seq remains a challenge. Multiple algorithms pertinent to basic analyses have been developed, and there is an increasing need to automate the use of these tools so as to obtain results in an efficient and user friendly manner. Increased automation and improved visualization of the results will help make the results and findings of the analyses readily available to experimental scientists.ResultsBy combing the best open source tools developed for RNA-seq data analyses and the most advanced web 2.0 technologies, we have implemented QuickRNASeq, a pipeline for large-scale RNA-seq data analyses and visualization. The QuickRNASeq workflow consists of three main steps. In Step #1, each individual sample is processed, including mapping RNA-seq reads to a reference genome, counting the numbers of mapped reads, quality control of the aligned reads, and SNP (single nucleotide polymorphism) calling. Step #1 is computationally intensive, and can be processed in parallel. In Step #2, the results from individual samples are merged, and an integrated and interactive project report is generated. All analyses results in the report are accessible via a single HTML entry webpage. Step #3 is the data interpretation and presentation step. The rich visualization features implemented here allow end users to interactively explore the results of RNA-seq data analyses, and to gain more insights into RNA-seq datasets. In addition, we used a real world dataset to demonstrate the simplicity and efficiency of QuickRNASeq in RNA-seq data analyses and interactive visualizations. The seamless integration of automated capabilites with interactive visualizations in QuickRNASeq is not available in other published RNA-seq pipelines.ConclusionThe high degree of automation and interactivity in QuickRNASeq leads to a substantial reduction in the time and effort required prior to further downstream analyses and interpretation of the analyses findings. QuickRNASeq advances primary RNA-seq data analyses to the next level of automation, and is mature for public release and adoption.


BMC Genomics | 2017

Evaluation and comparison of computational tools for RNA-seq isoform quantification

Chi Zhang; Baohong Zhang; Lih-Ling Lin; Shanrong Zhao

BackgroundAlternatively spliced transcript isoforms are commonly observed in higher eukaryotes. The expression levels of these isoforms are key for understanding normal functions in healthy tissues and the progression of disease states. However, accurate quantification of expression at the transcript level is limited with current RNA-seq technologies because of, for example, limited read length and the cost of deep sequencing.ResultsA large number of tools have been developed to tackle this problem, and we performed a comprehensive evaluation of these tools using both experimental and simulated RNA-seq datasets. We found that recently developed alignment-free tools are both fast and accurate. The accuracy of all methods was mainly influenced by the complexity of gene structures and caution must be taken when interpreting quantification results for short transcripts. Using TP53 gene simulation, we discovered that both sequencing depth and the relative abundance of different isoforms affect quantification accuracyConclusionsOur comprehensive evaluation helps data analysts to make informed choice when selecting computational tools for isoform quantification.


BMC Bioinformatics | 2017

QuickMIRSeq: a pipeline for quick and accurate quantification of both known miRNAs and isomiRs by jointly processing multiple samples from microRNA sequencing

Shanrong Zhao; William Gordon; Sarah Du; Chi Zhang; Wen He; Li Xi; Sachin Mathur; Michael J. Agostino; Theresa Paradis; David von Schack; Michael Vincent; Baohong Zhang

BackgroundGenome-wide miRNA expression data can be used to study miRNA dysregulation comprehensively. Although many open-source tools for microRNA (miRNA)-seq data analyses are available, challenges remain in accurate miRNA quantification from large-scale miRNA-seq dataset. We implemented a pipeline called QuickMIRSeq for accurate quantification of known miRNAs and miRNA isoforms (isomiRs) from multiple samples simultaneously.ResultsQuickMIRSeq considers the unique nature of miRNAs and combines many important features into its implementation. First, it takes advantage of high redundancy of miRNA reads and introduces joint mapping of multiple samples to reduce computational time. Second, it incorporates the strand information in the alignment step for more accurate quantification. Third, reads potentially arising from background noise are filtered out to improve the reliability of miRNA detection. Fourth, sequences aligned to miRNAs with mismatches are remapped to a reference genome to further reduce false positives. Finally, QuickMIRSeq generates a rich set of QC metrics and publication-ready plots.ConclusionsThe rich visualization features implemented allow end users to interactively explore the results and gain more insights into miRNA-seq data analyses. The high degree of automation and interactivity in QuickMIRSeq leads to a substantial reduction in the time and effort required for miRNA-seq data analysis.


PLOS ONE | 2017

High resolution molecular and histological analysis of renal disease progression in ZSF1 fa/faCP rats, a model of type 2 diabetic nephropathy

Ken Dower; Shanrong Zhao; Franklin J Schlerman; Leigh Savary; Gabriela Campanholle; Bryce G. Johnson; Li Xi; Vuong Nguyen; Yutian Zhan; Matthew P. Lech; Ju Wang; Qing Nie; Morten A. Karsdal; Federica Genovese; Germaine Boucher; Thomas P. Brown; Baohong Zhang; Bruce L. Homer; Robert V. Martinez

ZSF1 rats exhibit spontaneous nephropathy secondary to obesity, hypertension, and diabetes, and have gained interest as a model system with potentially high translational value to progressive human disease. To thoroughly characterize this model, and to better understand how closely it recapitulates human disease, we performed a high resolution longitudinal analysis of renal disease progression in ZSF1 rats spanning from early disease to end stage renal disease. Analyses included metabolic endpoints, renal histology and ultrastructure, evaluation of a urinary biomarker of fibrosis, and transcriptome analysis of glomerular-enriched tissue over the course of disease. Our findings support the translational value of the ZSF1 rat model, and are provided here to assist researchers in the determination of the model’s suitability for testing a particular mechanism of interest, the design of therapeutic intervention studies, and the identification of new targets and biomarkers for type 2 diabetic nephropathy.


Journal of Next Generation Sequencing & Applications | 2016

Bioinformatics Tools for RNA-seq Gene and Isoform Quantification

Chi Zhang; Baohong Zhang; Michael Vincent; Shanrong Zhao

In recent years, RNA-seq has emerged as a powerful technology in estimation of gene or transcript expression. ‘Union-exon’ and transcript based approaches are widely used in gene quantification. The ‘Union-exon’ based approach is simple, but it does not distinguish between isoforms when multiple alternatively spliced transcripts are expressed from the same gene. Because a gene is expressed in one or more transcript isoforms, the transcript based approach is more biologically meaningful than the ‘union exon’-based approach. However, estimating the expression of individual isoform is intrinsically challenging because different isoforms of a gene usually have a high proportion of genomic overlap. Recently, a number of tools have been developed for RNA-seq isoform quantification. We review those methods and their main features to serve as guidance for users to choose suitable tools for their specific projects.


Scientific Reports | 2018

Evaluation of two main RNA-seq approaches for gene quantification in clinical RNA sequencing: polyA+ selection versus rRNA depletion

Shanrong Zhao; Ying Zhang; Ramya Gamini; Baohong Zhang; David von Schack

To allow efficient transcript/gene detection, highly abundant ribosomal RNAs (rRNA) are generally removed from total RNA either by positive polyA+ selection or by rRNA depletion (negative selection) before sequencing. Comparisons between the two methods have been carried out by various groups, but the assessments have relied largely on non-clinical samples. In this study, we evaluated these two RNA sequencing approaches using human blood and colon tissue samples. Our analyses showed that rRNA depletion captured more unique transcriptome features, whereas polyA+ selection outperformed rRNA depletion with higher exonic coverage and better accuracy of gene quantification. For blood- and colon-derived RNAs, we found that 220% and 50% more reads, respectively, would have to be sequenced to achieve the same level of exonic coverage in the rRNA depletion method compared with the polyA+ selection method. Therefore, in most cases we strongly recommend polyA+ selection over rRNA depletion for gene quantification in clinical RNA sequencing. Our evaluation revealed that a small number of lncRNAs and small RNAs made up a large fraction of the reads in the rRNA depletion RNA sequencing data. Thus, we recommend that these RNAs are specifically depleted to improve the sequencing depth of the remaining RNAs.


Archive | 2016

Bioinformatics for RNA‐Seq Data Analysis

Shanrong Zhao; Baohong Zhang; Ying Zhang; Sarah Du William Gordon; Theresa Paradis; Michael Vincent; David von Schack

While RNA sequencing (RNA‐seq) has become increasingly popular for transcrip‐ tome profiling, the analysis of the massive amount of data generated by large‐scale RNA‐seq still remains a challenge. RNA‐seq data analyses typically consist of (1) accurate mapping of millions of short sequencing reads to a reference genome, including the identification of splicing events; (2) quantifying expression levels of genes, transcripts, and exons; (3) differential analysis of gene expression among different biological conditions; and (4) biological interpretation of differentially expressed genes. Despite the fact that multiple algorithms pertinent to basic analyses have been developed, there are still a variety of unresolved questions. In this chapter, we review the main tools and algorithms currently available for RNA‐seq data analyses, and our goal is to help RNA‐seq data analysts to make an informed choice of tools in practical RNA‐seq data analysis. In the meantime, RNA‐seq is evolving rapidly, and newer sequencing technologies are briefly introduced, including stranded RNA‐seq, targeted RNA‐seq, and single‐cell RNA‐seq.


Scientific Reports | 2018

Computational identification and validation of alternative splicing in ZSF1 rat RNA-seq data, a preclinical model for type 2 diabetic nephropathy

Chi Zhang; Ken Dower; Baohong Zhang; Robert V. Martinez; Lih-Ling Lin; Shanrong Zhao

Obese ZSF1 rats exhibit spontaneous time-dependent diabetic nephropathy and are considered to be a highly relevant animal model of progressive human diabetic kidney disease. We previously identified gene expression changes between disease and control animals across six time points from 12 to 41 weeks. In this study, the same data were analysed at the isoform and exon levels to reveal additional disease mechanisms that may be governed by alternative splicing. Our analyses identified alternative splicing patterns in genes that may be implicated in disease pathogenesis (such as Shc1, Serpinc1, Epb4.1l5, and Il-33), which would have been overlooked in standard gene-level analysis. The alternatively spliced genes were enriched in pathways related to cell adhesion, cell–cell interactions/junctions, and cytoskeleton signalling, whereas the differentially expressed genes were enriched in pathways related to immune response, G protein-coupled receptor, and cAMP signalling. Our findings indicate that additional mechanistic insights can be gained from exon- and isoform-level data analyses over standard gene-level analysis. Considering alternative splicing is poorly conserved between rodents and humans, it is noted that this work is not translational, but the point holds true that additional insights can be gained from alternative splicing analysis of RNA-seq data.


BMC Genomics | 2015

A comprehensive evaluation of ensembl, RefSeq, and UCSC annotations in the context of RNA-seq read mapping and gene quantification

Shanrong Zhao; Baohong Zhang


BMC Genomics | 2015

Comparison of stranded and non-stranded RNA-seq transcriptome profiling and investigation of gene overlap.

Shanrong Zhao; Ying Zhang; William Gordon; Jie Quan; Hualin Xi; Sarah Du; David von Schack; Baohong Zhang

Collaboration


Dive into the Shanrong Zhao's collaboration.

Researchain Logo
Decentralizing Knowledge