Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Yupeng Wang is active.

Publication


Featured researches published by Yupeng Wang.


Nucleic Acids Research | 2012

MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity

Yupeng Wang; Haibao Tang; Jeremy D. DeBarry; Xu-fei Tan; Jingping Li; Xiyin Wang; Tae-Ho Lee; Huizhe Jin; Barry S. Marler; Hui Guo; Jessica C. Kissinger; Andrew H. Paterson

MCScan is an algorithm able to scan multiple genomes or subgenomes in order to identify putative homologous chromosomal regions, and align these regions using genes as anchors. The MCScanX toolkit implements an adjusted MCScan algorithm for detection of synteny and collinearity that extends the original software by incorporating 14 utility programs for visualization of results and additional downstream analyses. Applications of MCScanX to several sequenced plant genomes and gene families are shown as examples. MCScanX can be used to effectively analyze chromosome structural changes, and reveal the history of gene family expansions that might contribute to the adaptation of lineages and taxa. An integrated view of various modes of gene duplication can supplement the traditional gene tree analysis in specific families. The source code and documentation of MCScanX are freely available at http://chibba.pgml.uga.edu/mcscan2/.


Nature Communications | 2014

The Brassica oleracea genome reveals the asymmetrical evolution of polyploid genomes

Shengyi Liu; Xinhua Yang; Chaobo Tong; David Edwards; Isobel A. P. Parkin; Meixia Zhao; Jianxin Ma; Jingyin Yu; Shunmou Huang; Xiyin Wang; Wang J; Kun Lu; Zhiyuan Fang; Ian Bancroft; Tae-Jin Yang; Qiong Hu; Xinfa Wang; Zhen Yue; Haojie Li; Linfeng Yang; Jian Wu; Qing Zhou; Wanxin Wang; Graham J. King; J. Chris Pires; Changxin Lu; Zhangyan Wu; Perumal Sampath; Zhuo Wang; Hui Guo

Polyploidization has provided much genetic variation for plant adaptive evolution, but the mechanisms by which the molecular evolution of polyploid genomes establishes genetic architecture underlying species differentiation are unclear. Brassica is an ideal model to increase knowledge of polyploid evolution. Here we describe a draft genome sequence of Brassica oleracea, comparing it with that of its sister species B. rapa to reveal numerous chromosome rearrangements and asymmetrical gene loss in duplicated genomic blocks, asymmetrical amplification of transposable elements, differential gene co-retention for specific pathways and variation in gene expression, including alternative splicing, among a large number of paralogous and orthologous genes. Genes related to the production of anticancer phytochemicals and morphological variations illustrate consequences of genome duplication and gene divergence, imparting biochemical and morphological variation to B. oleracea. This study provides insights into Brassica genome evolution and will underpin research into the many important crops in this genus.


Theoretical and Applied Genetics | 2006

A comparative linkage map of oilseed rape and its use for QTL analysis of seed oil and erucic acid content.

D. Qiu; Colin Morgan; Jiaqin Shi; Yan Long; Jie Liu; Ruiyuan Li; X. Zhuang; Yupeng Wang; Xu Tan; E. Dietrich; T. Weihmann; C. Everett; S. Vanstraelen; Paul Beckett; Fiona Fraser; Martin Trick; S. Barnes; J. Wilmer; Renate Schmidt; Jingping Li; D. Li; J. Meng; Ian Bancroft

We have developed a new DH mapping population for oilseed rape, named TNDH, using genetically and phenotypically diverse parental lines. We used the population in the construction of a high stringency genetic linkage map, consisting of 277 loci, for use in quantitative genetic analysis. A proportion of the markers had been used previously in the construction of linkage maps for Brassica species, thus permitting the alignment of maps. The map includes 68 newly developed Sequence Tagged Site (STS) markers targeted to the homologues of defined genes of A. thaliana. The use of these markers permits the alignment of our linkage map with the A. thaliana genome sequence. An additional 74 loci (31 newly developed STS markers and 43 loci defined by SSR and RFLP markers that had previously been used in published linkage maps) were added to the map. These markers increased the resolution of alignment of the newly constructed linkage map with existing Brassica linkage maps and the A. thaliana genome sequence. We conducted field trials with the TNDH population at two sites, and over 2xa0years, and identified reproducible QTL for seed oil content and erucic acid content. The results provide new insights into the genetic control of seed oil and erucic acid content in oilseed rape, and demonstrate the utility of the linkage map and population.


Annals of the New York Academy of Sciences | 2012

Genome and gene duplications and gene expression divergence: a view from plants

Yupeng Wang; Xiyin Wang; Andrew H. Paterson

With many plant genomes sequenced, it is now clear that one distinguishing feature of angiosperm (flowering plant) genomes is their high frequency of whole‐genome duplication. Single‐gene duplication is also widespread in angiosperm genomes. Following various mechanisms of gene duplication, genes are often retained or lost in a biased manner, which has suggested recent models for gene family evolution, such as functional buffering and the gene balance hypothesis in addition to now‐classical models, including neofunctionalization and subfunctionalization. Evolutionary consequences of gene duplication, often studied through analyzing gene expression divergence, have enhanced understanding of the biological significance of different mechanisms of gene duplication.


PLOS ONE | 2011

Modes of Gene Duplication Contribute Differently to Genetic Novelty and Redundancy, but Show Parallels across Divergent Angiosperms

Yupeng Wang; Xiyin Wang; Haibao Tang; Xu Tan; Stephen P. Ficklin; F. Alex Feltus; Andrew H. Paterson

Background Both single gene and whole genome duplications (WGD) have recurred in angiosperm evolution. However, the evolutionary effects of different modes of gene duplication, especially regarding their contributions to genetic novelty or redundancy, have been inadequately explored. Results In Arabidopsis thaliana and Oryza sativa (rice), species that deeply sample botanical diversity and for which expression data are available from a wide range of tissues and physiological conditions, we have compared expression divergence between genes duplicated by six different mechanisms (WGD, tandem, proximal, DNA based transposed, retrotransposed and dispersed), and between positional orthologs. Both neo-functionalization and genetic redundancy appear to contribute to retention of duplicate genes. Genes resulting from WGD and tandem duplications diverge slowest in both coding sequences and gene expression, and contribute most to genetic redundancy, while other duplication modes contribute more to evolutionary novelty. WGD duplicates may more frequently be retained due to dosage amplification, while inferred transposon mediated gene duplications tend to reduce gene expression levels. The extent of expression divergence between duplicates is discernibly related to duplication modes, different WGD events, amino acid divergence, and putatively neutral divergence (time), but the contribution of each factor is heterogeneous among duplication modes. Gene loss may retard inter-species expression divergence. Members of different gene families may have non-random patterns of origin that are similar in Arabidopsis and rice, suggesting the action of pan-taxon principles of molecular evolution. Conclusion Gene duplication modes differ in contribution to genetic novelty and redundancy, but show some parallels in taxa separated by hundreds of millions of years of evolution.


New Phytologist | 2013

Gene body methylation shows distinct patterns associated with different gene origins and duplication modes and has a heterogeneous relationship with gene expression in Oryza sativa (rice)

Yupeng Wang; Xiyin Wang; Tae-Ho Lee; Shahid Mansoor; Andrew H. Paterson

Whole-genome duplication (WGD) has been recurring and single-gene duplication is also widespread in angiosperms. Recent whole-genome DNA methylation maps indicate that gene body methylation (i.e. of coding regions) has a functional role. However, whether gene body methylation is related to gene origins and duplication modes has yet to be reported. In rice (Oryza sativa), we computed a body methylation level (proportion of methylated CpG within coding regions) for each gene in five tissues. Body methylation levels follow a bimodal distribution, but show distinct patterns associated with transposable element-related genes; WGD, tandem, proximal and transposed duplicates; and singleton genes. For pairs of duplicated genes, divergence in body methylation levels increases with physical distance and synonymous (Ks) substitution rates, and WGDs show lower divergence than single-gene duplications of similar Ks levels. Intermediate body methylation tends to be associated with high levels of gene expression, whereas heavy body methylation is associated with lower levels of gene expression. The biological trends revealed here are consistent across five rice tissues, indicating that genes of different origins and duplication modes have distinct body methylation patterns, and body methylation has a heterogeneous relationship with gene expression and may be related to survivorship of duplicated genes.


BMC Genomics | 2013

Different patterns of gene structure divergence following gene duplication in Arabidopsis

Yupeng Wang; Xu Tan; Andrew H. Paterson

BackgroundDivergence in gene structure following gene duplication is not well understood. Gene duplication can occur via whole-genome duplication (WGD) and single-gene duplications including tandem, proximal and transposed duplications. Different modes of gene duplication may be associated with different types, levels, and patterns of structural divergence.ResultsIn Arabidopsis thaliana, we denote levels of structural divergence between duplicated genes by differences in coding-region lengths and average exon lengths, and the number of insertions/deletions (indels) and maximum indel length in their protein sequence alignment. Among recent duplicates of different modes, transposed duplicates diverge most dramatically in gene structure. In transposed duplications, parental loci tend to have longer coding-regions and exons, and smaller numbers of indels and maximum indel lengths than transposed loci, reflecting biased structural changes in transposed duplications. Structural divergence increases with evolutionary time for WGDs, but not transposed duplications, possibly because of biased gene losses following transposed duplications. Structural divergence has heterogeneous relationships with nucleotide substitution rates, but is consistently positively correlated with gene expression divergence. The NBS-LRR gene family shows higher-than-average levels of structural divergence.ConclusionsOur study suggests that structural divergence between duplicated genes is greatly affected by the mechanisms of gene duplication and may be not proportional to evolutionary time, and that certain gene families are under selection on rapid evolution of gene structure.


Biomarker Insights | 2010

LSOSS: Detection of Cancer Outlier Differential Gene Expression

Yupeng Wang; Romdhane Rekaya

Detection of differential gene expression using microarray technology has received considerable interest in cancer research studies. Recently, many researchers discovered that oncogenes may be activated in some but not all samples in a given disease group. The existing statistical tools for detecting differentially expressed genes in a subset of the disease group mainly include cancer outlier profile analysis (COPA), outlier sum (OS), outlier robust t-statistic (ORT) and maximum ordered subset t-statistics (MOST). In this study, another approach named Least Sum of Ordered Subset Square t-statistic (LSOSS) is proposed. The results of our simulation studies indicated that LSOSS often has more power than previous statistical methods. When applied to real human breast and prostate cancer data sets, LSOSS was competitive in terms of the biological relevance of top ranked genes. Furthermore, a modified hierarchical clustering method was developed to classify the heterogeneous gene activation patterns of human breast cancer samples based on the significant genes detected by LSOSS. Three classes of gene activation patterns, which correspond to estrogen receptor (ER)+, ER– and a mixture of ER+ and ER–, were detected and each class was assigned a different gene signature.


Bioinformatics | 2013

MCScanX-transposed

Yupeng Wang; Jingping Li; Andrew H. Paterson

SUMMARYnGene duplication occurs via different modes such as segmental and single-gene duplications. Transposed gene duplication, a specific form of single-gene duplication, copies a gene from an ancestral chromosomal location to a novel location. MCScanX is a toolkit for detection and evolutionary analysis of gene colinearity. We have developed MCScanX-transposed, a software package to detect transposed gene duplications that occurred within different epochs, based on execution of MCScanX within and between related genomes. MCScanX-transposed can be also used for integrative analysis of gene duplication modes for a genome and to annotate a gene family of interest with gene duplication modes.nnnAVAILABILITYnMCScanX-transposed is freely available at http://chibba.pgml.uga.edu/mcscan2/transposed/nnnSUPPLEMENTARY INFORMATIONnSupplementary data are available at Bioinformatics online.


PLOS ONE | 2016

Large-Scale Gene Relocations following an Ancient Genome Triplication Associated with the Diversification of Core Eudicots

Yupeng Wang; Stephen P. Ficklin; Xiyin Wang; F. Alex Feltus; Andrew H. Paterson

Different modes of gene duplication including whole-genome duplication (WGD), and tandem, proximal and dispersed duplications are widespread in angiosperm genomes. Small-scale, stochastic gene relocations and transposed gene duplications are widely accepted to be the primary mechanisms for the creation of dispersed duplicates. However, here we show that most surviving ancient dispersed duplicates in core eudicots originated from large-scale gene relocations within a narrow window of time following a genome triplication (γ) event that occurred in the stem lineage of core eudicots. We name these surviving ancient dispersed duplicates as relocated γ duplicates. In Arabidopsis thaliana, relocated γ, WGD and single-gene duplicates have distinct features with regard to gene functions, essentiality, and protein interactions. Relative to γ duplicates, relocated γ duplicates have higher non-synonymous substitution rates, but comparable levels of expression and regulation divergence. Thus, relocated γ duplicates should be distinguished from WGD and single-gene duplicates for evolutionary investigations. Our results suggest large-scale gene relocations following the γ event were associated with the diversification of core eudicots.

Collaboration


Dive into the Yupeng Wang's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

R. Rekaya

University of Georgia

View shared research outputs
Top Co-Authors

Avatar

Xinyu Liu

University of Georgia

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Stephen P. Ficklin

Washington State University

View shared research outputs
Top Co-Authors

Avatar

Haibao Tang

Fujian Agriculture and Forestry University

View shared research outputs
Top Co-Authors

Avatar

Wang J

Chinese Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar

Chaobo Tong

Crops Research Institute

View shared research outputs
Researchain Logo
Decentralizing Knowledge