Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Yongbing Zhao is active.

Publication


Featured researches published by Yongbing Zhao.


Bioinformatics | 2012

PGAP: Pan-genomes analysis pipeline

Yongbing Zhao; Jiayan Wu; Junhui Yang; Shixiang Sun; Jingfa Xiao; Jun Yu

Summary: With the rapid development of DNA sequencing technology, increasing bacteria genome data enable the biologists to dig the evolutionary and genetic information of prokaryotic species from pan-genome sight. Therefore, the high-efficiency pipelines for pan-genome analysis are mostly needed. We have developed a new pan-genome analysis pipeline (PGAP), which can perform five analytic functions with only one command, including cluster analysis of functional genes, pan-genome profile analysis, genetic variation analysis of functional genes, species evolution analysis and function enrichment analysis of gene clusters. PGAPs performance has been evaluated on 11 Streptococcus pyogenes strains. Availability:PGAP is developed with Perl script on the Linux Platform and the package is freely available from http://pgap.sf.net. Contact: [email protected]; [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.


Bioinformatics | 2014

PanGP: A tool for quickly analyzing bacterial pan-genome profile

Yongbing Zhao; Xinmiao Jia; Junhui Yang; Yunchao Ling; Zhang Zhang; Jun Yu; Jiayan Wu; Jingfa Xiao

Summary: Pan-genome analyses have shed light on the dynamics and evolution of bacterial genome from the point of population. The explosive growth of bacterial genome sequence also brought an extremely big challenge to pan-genome profile analysis. We developed a tool, named PanGP, to complete pan-genome profile analysis for large-scale strains efficiently. PanGP has integrated two sampling algorithms, totally random (TR) and distance guide (DG). The DG algorithm drew sample strain combinations on the basis of genome diversity of bacterial population. The performance of these two algorithms have been evaluated on four bacteria populations with strain numbers varying from 30 to 200, and the DG algorithm exhibited overwhelming advantage on accuracy and stability than the TR algorithm. Availability: PanGP was developed with a user-friendly graphic interface and it was available at http://PanGP.big.ac.cn. Contact: [email protected] or [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.


Frontiers in Plant Science | 2015

The complete chloroplast genome provides insight into the evolution and polymorphism of Panax ginseng.

Yongbing Zhao; Jinlong Yin; HaiYan Guo; Yuyu Zhang; Wen-Fa Xiao; Chen Ming Sun; Jiayan Wu; Xiaobo Qu; Jun-Han Yu; Xumin Wang; Jingfa Xiao

Panax ginseng C.A. Meyer (P. ginseng) is an important medicinal plant and is often used in traditional Chinese medicine. With next generation sequencing (NGS) technology, we determined the complete chloroplast genome sequences for four Chinese P. ginseng strains, which are Damaya (DMY), Ermaya (EMY), Gaolishen (GLS), and Yeshanshen (YSS). The total chloroplast genome sequence length for DMY, EMY, and GLS was 156,354 bp, while that for YSS was 156,355 bp. Comparative genomic analysis of the chloroplast genome sequences indicate that gene content, GC content, and gene order in DMY are quite similar to its relative species, and nucleotide sequence diversity of inverted repeat region (IR) is lower than that of its counterparts, large single copy region (LSC) and small single copy region (SSC). A comparison among these four P. ginseng strains revealed that the chloroplast genome sequences of DMY, EMY, and GLS were identical and YSS had a 1-bp insertion at base 5472. To further study the heterogeneity in chloroplast genome during domestication, high-resolution reads were mapped to the genome sequences to investigate the differences at the minor allele level; 208 minor allele sites with minor allele frequencies (MAF) of ≥0.05 were identified. The polymorphism site numbers per kb of chloroplast genome sequence for DMY, EMY, GLS, and YSS were 0.74, 0.59, 0.97, and 1.23, respectively. All the minor allele sites located in LSC and IR regions, and the four strains showed the same variation types (substitution base or indel) at all identified polymorphism sites. Comparison results of heterogeneity in the chloroplast genome sequences showed that the minor allele sites on the chloroplast genome were undergoing purifying selection to adapt to changing environment during domestication process. A study of P. ginseng chloroplast genome with particular focus on minor allele sites would aid in investigating the dynamics on the chloroplast genomes and different P. ginseng strains typing.


PLOS ONE | 2012

Pan-genomic analysis provides insights into the genomic variation and evolution of Salmonella Paratyphi A.

Weili Liang; Yongbing Zhao; Chunxia Chen; Xiaoying Cui; Jun Yu; Jingfa Xiao; Biao Kan

Salmonella Paratyphi A (S. Paratyphi A) is a highly adapted, human-specific pathogen that causes paratyphoid fever. Cases of paratyphoid fever have recently been increasing, and the disease is becoming a major public health concern, especially in Eastern and Southern Asia. To investigate the genomic variation and evolution of S. Paratyphi A, a pan-genomic analysis was performed on five newly sequenced S. Paratyphi A strains and two other reference strains. A whole genome comparison revealed that the seven genomes are collinear and that their organization is highly conserved. The high rate of substitutions in part of the core genome indicates that there are frequent homologous recombination events. Based on the changes in the pan-genome size and cluster number (both in the core functional genes and core pseudogenes), it can be inferred that the sharply increasing number of pseudogene clusters may have strong correlation with the inactivation of functional genes, and indicates that the S. Paratyphi A genome is being degraded.


BMC Genomics | 2014

VCGDB: a dynamic genome database of the Chinese population

Yunchao Ling; Zhong Jin; Mingming Su; Jun Zhong; Yongbing Zhao; Jun Yu; Jiayan Wu; Jingfa Xiao

BackgroundThe data released by the 1000 Genomes Project contain an increasing number of genome sequences from different nations and populations with a large number of genetic variations. As a result, the focus of human genome studies is changing from single and static to complex and dynamic. The currently available human reference genome (GRCh37) is based on sequencing data from 13 anonymous Caucasian volunteers, which might limit the scope of genomics, transcriptomics, epigenetics, and genome wide association studies.DescriptionWe used the massive amount of sequencing data published by the 1000 Genomes Project Consortium to construct the Virtual Chinese Genome Database (VCGDB), a dynamic genome database of the Chinese population based on the whole genome sequencing data of 194 individuals. VCGDB provides dynamic genomic information, which contains 35 million single nucleotide variations (SNVs), 0.5 million insertions/deletions (indels), and 29 million rare variations, together with genomic annotation information. VCGDB also provides a highly interactive user-friendly virtual Chinese genome browser (VCGBrowser) with functions like seamless zooming and real-time searching. In addition, we have established three population-specific consensus Chinese reference genomes that are compatible with mainstream alignment software.ConclusionsVCGDB offers a feasible strategy for processing big data to keep pace with the biological data explosion by providing a robust resource for genomics studies; in particular, studies aimed at finding regions of the genome associated with diseases.


Nucleic Acids Research | 2018

MethBank 3.0: a database of DNA methylomes across a variety of species

Rujiao Li; Fang Liang; Mengwei Li; Dong Zou; Shixiang Sun; Yongbing Zhao; Wenming Zhao; Yiming Bao; Jingfa Xiao; Zhang Zhang

Abstract MethBank (http://bigd.big.ac.cn/methbank) is a database that integrates high-quality DNA methylomes across a variety of species and provides an interactive browser for visualization of methylation data. Here, we present an updated implementation of MethBank (version 3.0) by incorporating more DNA methylomes from multiple species and equipping with more enhanced functionalities for data annotation and more friendly web interfaces for data presentation, search and visualization. MethBank 3.0 features large-scale integration of high-quality methylomes, involving 34 consensus reference methylomes derived from a large number of human samples, 336 single-base resolution methylomes from different developmental stages and/or tissues of five plants, and 18 single-base resolution methylomes from gametes and early embryos at multiple stages of two animals. Additionally, it is enhanced by improving the functionalities for data annotation, which accordingly enables systematic identification of methylation sites closely associated with age, sites with constant methylation levels across different ages, differentially methylated promoters, age-specific differentially methylated cytosines/regions, and methylated CpG islands. Moreover, MethBank provides tools to estimate human methylation age online and to identify differentially methylated promoters, respectively. Taken together, MethBank is upgraded with significant improvements and advances over the previous version, which is of great help for deciphering DNA methylation regulatory mechanisms for epigenetic studies.


Frontiers in Microbiology | 2018

PGAweb: A Web Server for Bacterial Pan-Genome Analysis

Xinyu Chen; Yadong Zhang; Zhewen Zhang; Yongbing Zhao; Chen Sun; Ming Yang; Jinyue Wang; Qian Liu; Baohua Zhang; Meili Chen; Jun Yu; Jiayan Wu; Zhong Jin; Jingfa Xiao

An astronomical increase in microbial genome data in recent years has led to strong demand for bioinformatic tools for pan-genome analysis within and across species. Here, we present PGAweb, a user-friendly, web-based tool for bacterial pan-genome analysis, which is composed of two main pan-genome analysis modules, PGAP and PGAP-X. PGAweb provides key interactive and customizable functions that include orthologous clustering, pan-genome profiling, sequence variation and evolution analysis, and functional classification. PGAweb presents features of genomic structural dynamics and sequence diversity with different visualization methods that are helpful for intuitively understanding the dynamics and evolution of bacterial genomes. PGAweb has an intuitive interface with one-click setting of parameters and is freely available at http://PGAweb.vlcc.cn/.


Vacuum | 2011

Deposition, structure and hardness of Ti-Cu-N hard films prepared by pulse biased arc ion plating

X. Q. Wang; Yongbing Zhao; Baohai Yu; Jinquan Xiao; F. Q. Li


BMC Genomics | 2018

PGAP-X: extension on pan-genome analysis pipeline

Yongbing Zhao; Chen Sun; Dongyu Zhao; Yadong Zhang; Yang You; Xinmiao Jia; Junhui Yang; Lingping Wang; Jinyue Wang; Haohuan Fu; Yu Kang; Fei Chen; Jun Yu; Jiayan Wu; Jingfa Xiao


Iet Systems Biology | 2014

Systematic study on G-protein couple receptor prototypes: did they really evolve from prokaryotic genes?

Zaichao Zhang; Zhong Jin; Yongbing Zhao; Zhewen Zhang; Rujiao Li; Jingfa Xiao; Jiayan Wu

Collaboration


Dive into the Yongbing Zhao's collaboration.

Top Co-Authors

Avatar

Jingfa Xiao

Beijing Institute of Genomics

View shared research outputs
Top Co-Authors

Avatar

Jiayan Wu

Beijing Institute of Genomics

View shared research outputs
Top Co-Authors

Avatar

Jun Yu

Beijing Institute of Genomics

View shared research outputs
Top Co-Authors

Avatar

Yunchao Ling

Chinese Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar

Zhong Jin

Chinese Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar

Junhui Yang

Chinese Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar

Mingming Su

Chinese Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar

Zhewen Zhang

Chinese Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar

Chen Sun

Beijing Institute of Genomics

View shared research outputs
Top Co-Authors

Avatar

Jinyue Wang

Chinese Academy of Sciences

View shared research outputs
Researchain Logo
Decentralizing Knowledge