Qishan Wang
Shanghai Jiao Tong University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Qishan Wang.
Bioinformatics | 2012
Alexander E. Lipka; Feng Tian; Qishan Wang; Jason A. Peiffer; Meng Li; Peter J. Bradbury; Michael A. Gore; Edward S. Buckler; Zhiwu Zhang
SUMMARY Software programs that conduct genome-wide association studies and genomic prediction and selection need to use methodologies that maximize statistical power, provide high prediction accuracy and run in a computationally efficient manner. We developed an R package called Genome Association and Prediction Integrated Tool (GAPIT) that implements advanced statistical methods including the compressed mixed linear model (CMLM) and CMLM-based genomic prediction and selection. The GAPIT package can handle large datasets in excess of 10 000 individuals and 1 million single-nucleotide polymorphisms with minimal computational time, while providing user-friendly access and concise tables and graphs to interpret results. AVAILABILITY http://www.maizegenetics.net/GAPIT. CONTACT [email protected] SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
PLOS ONE | 2013
Qiang Chen; Yufang Ma; Yumei Yang; Zhenliang Chen; Rongrong Liao; Xiaoxian Xie; Zhen Wang; Pengfei He; Yingying Tu; Xiangzhe Zhang; Changsuo Yang; Hongjie Yang; Fuqing Yu; Youmin Zheng; Zhiwu Zhang; Qishan Wang; Yuchun Pan
Next-generation sequencing (NGS) approaches are widely used in genome-wide genetic marker discovery and genotyping. However, current NGS approaches are not easy to apply to general outbred populations (human and some major farm animals) for SNP identification because of the high level of heterogeneity and phase ambiguity in the haplotype. Here, we reported a new method for SNP genotyping, called genotyping by genome reducing and sequencing (GGRS) to genotype outbred species. Through an improved procedure for library preparation and a marker discovery and genotyping pipeline, the GGRS approach can genotype outbred species cost-effectively and high-reproducibly. We also evaluated the efficiency and accuracy of our approach for high-density SNP discovery and genotyping in a large genome pig species (2.8 Gb), for which more than 70,000 single nucleotide polymorphisms (SNPs) can be identified for an expenditure of only
PLOS ONE | 2014
Qishan Wang; Feng Tian; Yuchun Pan; Edward S. Buckler; Zhiwu Zhang
80 (USD)/sample.
Genetica | 2011
Qishan Wang; Minghui Wang; Xiangzhe Zhang; Boji Hao; S. K. Kaushik; Yuchun Pan
Genome-Wide Association Studies shed light on the identification of genes underlying human diseases and agriculturally important traits. This potential has been shadowed by false positive findings. The Mixed Linear Model (MLM) method is flexible enough to simultaneously incorporate population structure and cryptic relationships to reduce false positives. However, its intensive computational burden is prohibitive in practice, especially for large samples. The newly developed algorithm, FaST-LMM, solved the computational problem, but requires that the number of SNPs be less than the number of individuals to derive a rank-reduced relationship. This restriction potentially leads to less statistical power when compared to using all SNPs. We developed a method to extract a small subset of SNPs and use them in FaST-LMM. This method not only retains the computational advantage of FaST-LMM, but also remarkably increases statistical power even when compared to using the entire set of SNPs. We named the method SUPER (Settlement of MLM Under Progressively Exclusive Relationship) and made it available within an implementation of the GAPIT software package.
The Plant Genome | 2016
You Tang; Xiaolei Liu; Jiabo Wang; Meng Li; Qishan Wang; Feng Tian; Zhongbin Su; Yuchun Pan; Di Liu; Alexander E. Lipka; Edward S. Buckler; Zhiwu Zhang
The Arabidopsis thaliana WRKY proteins are characterized by a sequence of 60 amino acids including WRKY domain. It is well established that these proteins are involved in the regulation of various physiological programs unique to plants including pathogen defense, senescence and response to environmental stresses, which attracts attention of the scientific community as to how this family might have evolved. We tried to satisfy this curiosity and analyze reasons for duplications of these gene sequences leading to their diversified gene actions. The WRKY sequences available in Arabidopsis thaliana were used to evaluate selection pressure following duplication events. A phylogenetic tree was constructed and the WRKY family was divided into five sub-families. After that, tests were conducted to decide whether positive or purified selection played key role in these events. Our results suggest that purifying selection played major role during the evolution of this family. Some amino acid changes were also detected in specific branches of phylogeny suggesting that relaxed constraints might also have contributed to functional divergence among sub-families. Sites relaxed from purifying selection were identified and mapped onto the structural and functional regions of the WRKY1 protein. These analyses will enhance our understanding of the precise role played by natural selection to create functional diversity in WRKY family.
BMC Evolutionary Biology | 2009
Minghui Wang; Xiangzhe Zhang; Hongbo Zhao; Qishan Wang; Yuchun Pan
Genome‐wide association study Genomic prediction Simulation and experimental design
PLOS ONE | 2014
Yumei Yang; Qishan Wang; Qiang Chen; Rongrong Liao; Xiangzhe Zhang; Hongjie Yang; Youmin Zheng; Zhiwu Zhang; Yuchun Pan
BackgroundForkhead box, class O (FoxO) belongs to the large family of forkhead transcription factors that are characterized by a conserved forkhead box DNA-binding domain. To date, the FoxO group has four mammalian members: FoxO1, FoxO3a, FoxO4 and FoxO6, which are orthologs of DAF16, an insulin-responsive transcription factor involved in regulating longevity of worms and flies. The degree of homology between these four members is high, especially in the forkhead domain, which contains the DNA-binding interface. Yet, mouse FoxO knockouts have revealed that each FoxO gene has its unique role in the physiological process. Whether the functional divergences are primarily due to adaptive selection pressure or relaxed selective constraint remains an open question. As such, this study aims to address the evolutionary mode of FoxO, which may lead to the functional divergence.ResultsSequence similarity searches have performed in genome and scaffold data to identify homologues of FoxO in vertebrates. Phylogenetic analysis was used to characterize the family evolutionary history by identifying two duplications early in vertebrate evolution. To determine the mode of evolution in vertebrates, we performed a rigorous statistical analysis with FoxO gene sequences, including relative rate ratio tests, branch-specific dN/dSratio tests, site-specific dN/dSratio tests, branch-site dN/dSratio tests and clade level amino acid conservation/variation patterns analysis. Our results suggest that FoxO is constrained by strong purifying selection except four sites in FoxO6, which have undergone positive Darwinian selection. The functional divergence in this family is best explained by either relaxed purifying selection or positive selection.ConclusionWe present a phylogeny describing the evolutionary history of the FoxO gene family and show that the genes have evolved through duplications followed by purifying selection except for four sites in FoxO6 fixed by positive selection lie mostly within the non-conserved optimal PKB motif in the C-terminal part. Relaxed selection may play important roles in the process of functional differentiation evolved through gene duplications as well.
Reproductive Biology and Endocrinology | 2010
Kan He; Hongbo Zhao; Qishan Wang; Yuchun Pan
We report a novel algorithm, iBLUP, to impute missing genotypes by simultaneously and comprehensively using identity by descent and linkage disequilibrium information. The simulation studies showed that the algorithm exhibited drastically tolerance to high missing rate, especially for rare variants than other common imputation methods, e.g. BEAGLE and fastPHASE. At a missing rate of 70%, the accuracy of BEAGLE and fastPHASE dropped to 0.82 and 0.74 respectively while iBLUP retained an accuracy of 0.95. For minor allele, the accuracy of BEAGLE and fastPHASE decreased to −0.1 and 0.03, while iBLUP still had an accuracy of 0.61.We implemented the algorithm in a publicly available software package also named iBLUP. The application of iBLUP for processing real sequencing data in an outbred pig population was demonstrated.
Animal Genetics | 2015
Zhen Wang; Qiang Chen; Yumei Yang; Rongrong Liao; Jianjun Zhao; Zhiyuan Zhang; Zi-Jiang Chen; Xiangzhe Zhang; M. Xue; Hongjie Yang; Y. Zheng; Qishan Wang; Y. Pan
BackgroundPre-implantation development is a crucial step in successful implantation and pregnancy in mammals. It has been studied in depth, but mostly in laboratory animal models. Less is known about the regulatory mechanism involved in the pre-implantation development in humans and about the comparative aspects.MethodsHere, we employed the microarray datasets from the public database library of GEO and applied comparative analysis of genome wide temporal gene expression data based on statistical analysis and functional annotation for both mouse and human, demonstrating the discordance between the regulatory mechanisms of both mouse and human pre-implantation development.ResultsThere were differences between mouse and human pre-implantation development both in the global gene expression pattern and in the expression changes of individual genes at each stage, including different major transient waves of transcription profiles and some stage-specific genes and significantly related pathways. There also appeared to be different functional changes from one stage to another between mouse and human.ConclusionsThe analysis presented here lead to interesting and complementary conclusions that the regulatory mechanism of human pre-implantation development is not completely the same as the mouse. Not as the fact that 1-cell to 2-cell stage is important for mouse pre-implantation development, the 4-cell stage and 8-cell stage are both essential for human. Unlike in mouse, of which most of pathways found were related to energy, RNA and protein metabolism, the identified pathways in human were mostly disease-related and associated with human pre-implantation embryonic development. All of these suggest that a further comparative analysis should be required for applying the result of mouse expression data to human research or therapy, particularly in pre-implantation developments. Our study provides several potential targets of genes and pathways for studying the regulatory mechanism of human pre-implantation development using mouse model.
Mammalian Genome | 2013
Kan He; Qishan Wang; Zhen Wang; Yuchun Pan
Summary The Chinese indigenous pig breeds in the Taihu Lake region are the most prolific pig breeds in the world. In this study, we investigated the genetic diversity and population structure of six breeds, including Meishan, Erhualian, Mi, Fengjing, Shawutou and Jiaxing Black, in this region using whole‐genome SNP data. A high SNP with proportions of polymorphic markers ranging from 0.925 to 0.995 was exhibited by the Chinese indigenous pigs in the Taihu Lake region. The allelic richness and expected heterozygosity also were calculated and indicated that the genetic diversity of the Meishan breed was the greatest, whereas that of the Fengjing breed was the lowest. The genetic differentiation, as indicated by the fixation index, exhibited an overall mean of 0.149. Both neighbor‐joining tree and principal components analysis were able to distinguish the breeds from each other, but structure analysis indicated that the Mi and Erhualian breeds exhibited similar major signals of admixture. With this genome‐wide comprehensive survey of the genetic diversity and population structure of the indigenous Chinese pigs in the Taihu Lake region, we confirmed the rationality of the current breed classification of the pigs in this region.