Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Shuanglin Zhang is active.

Publication


Featured researches published by Shuanglin Zhang.


BMC Bioinformatics | 2004

Joint analysis of two microarray gene-expression data sets to select lung adenocarcinoma marker genes

Hongying Jiang; Youping Deng; Huann Sheng Chen; Lin Tao; Qiuying Sha; Jun Chen; Chung-Jui Tsai; Shuanglin Zhang

BackgroundDue to the high cost and low reproducibility of many microarray experiments, it is not surprising to find a limited number of patient samples in each study, and very few common identified marker genes among different studies involving patients with the same disease. Therefore, it is of great interest and challenge to merge data sets from multiple studies to increase the sample size, which may in turn increase the power of statistical inferences. In this study, we combined two lung cancer studies using micorarray GeneChip®, employed two gene shaving methods and a two-step survival test to identify genes with expression patterns that can distinguish diseased from normal samples, and to indicate patient survival, respectively.ResultsIn addition to common data transformation and normalization procedures, we applied a distribution transformation method to integrate the two data sets. Gene shaving (GS) methods based on Random Forests (RF) and Fishers Linear Discrimination (FLD) were then applied separately to the joint data set for cancer gene selection. The two methods discovered 13 and 10 marker genes (5 in common), respectively, with expression patterns differentiating diseased from normal samples. Among these marker genes, 8 and 7 were found to be cancer-related in other published reports. Furthermore, based on these marker genes, the classifiers we built from one data set predicted the other data set with more than 98% accuracy. Using the univariate Cox proportional hazard regression model, the expression patterns of 36 genes were found to be significantly correlated with patient survival (p < 0.05). Twenty-six of these 36 genes were reported as survival-related genes from the literature, including 7 known tumor-suppressor genes and 9 oncogenes. Additional principal component regression analysis further reduced the gene list from 36 to 16.ConclusionThis study provided a valuable method of integrating microarray data sets with different origins, and new methods of selecting a minimum number of marker genes to aid in cancer diagnosis. After careful data integration, the classification method developed from one data set can be applied to the other with high prediction accuracy.


American Journal of Human Genetics | 2000

Transmission/disequilibrium tests using multiple tightly linked markers.

Hongyu Zhao; Shuanglin Zhang; Kathleen R. Merikangas; Mátyás Trixler; Dieter B. Wildenauer; Fengzhu Sun; Kenneth K. Kidd

Transmission/disequilibrium tests have attracted much attention in genetic studies of complex traits because (a) their power to detect genes having small to moderate effects may be greater than that of other linkage methods and (b) they are robust against population stratification. Highly polymorphic markers have become available throughout the human genome, and many such markers can be studied within short physical distances. Studies using multiple tightly linked markers are more informative than those using single markers. However, such information has not been fully utilized by existing statistical methods, resulting in possibly substantial loss of information in the identification of genes underlying complex traits. In this article, we propose novel statistical methods to analyze multiple tightly linked markers. Simulation studies comparing our methods versus existing methods suggest that our methods are more powerful. Finally, we apply the proposed methods to study genetic linkage between the dopamine D2 receptor locus and alcoholism.


American Journal of Human Genetics | 2003

Transmission/Disequilibrium Test Based on Haplotype Sharing for Tightly Linked Markers

Shuanglin Zhang; Qiuying Sha; Huann Sheng Chen; Jianping Dong; Renfang Jiang

Studies using haplotypes of multiple tightly linked markers are more informative than those using a single marker. However, studies based on multimarker haplotypes have some difficulties. First, if we consider each haplotype as an allele and use the conventional single-marker transmission/disequilibrium test (TDT), then the rapid increase in the degrees of freedom with an increasing number of markers means that the statistical power of the conventional tests will be low. Second, the parental haplotypes cannot always be unambiguously reconstructed. In the present article, we propose a haplotype-sharing TDT (HS-TDT) for linkage or association between a disease-susceptibility locus and a chromosome region in which several tightly linked markers have been typed. This method is applicable to both quantitative traits and qualitative traits. It is applicable to any size of nuclear family, with or without ambiguous phase information, and it is applicable to any number of alleles at each of the markers. The degrees of freedom (in a broad sense) of the test increase linearly as the number of markers considered increases but do not increase as the number of alleles at the markers increases. Our simulation results show that the HS-TDT has the correct type I error rate in structured populations and that, in most cases, the power of HS-TDT is higher than the power of the existing single-marker TDTs and haplotype-based TDTs.


Annals of Human Genetics | 2003

Qualitative semi-parametric test for genetic associations in case-control designs under structured populations.

Huann Sheng Chen; Xiaofeng Zhu; Hongyu Zhao; Shuanglin Zhang

Recently, statistical methods have been proposed using genomic markers to control for population stratification in genetic association studies. However, these methods either have unacceptable low power when population stratification becomes strong or cannot control for population stratification well under admixture population models. In this paper, we propose a semiparametric association test to detect genetic association between a candidate marker and a qualitative trait of interest in case‐control designs. The performanceof the test is compared to other existing methods through simulations. The results show that our method gives correct type I error rate both under discrete population models and admixture population models, and our method is robust to the extent of the population stratification. In most of the cases we considered, our method has higher power and, in some cases, substantially higher power than that of existing methods.


Human Genetics | 2006

A classical likelihood based approach for admixture mapping using EM algorithm

Xiaofeng Zhu; Shuanglin Zhang; Hua Tang; Richard S. Cooper

Several disease-mapping methods have been proposed recently, which use the information generated by recent admixture of populations from historically distinct geographic origins. These methods include both classic likelihood and Bayesian approaches. In this study we directly maximize the likelihood function from the hidden Markov Model for admixture mapping using the EM algorithm, allowing for uncertainty in model parameters, such as the allele frequencies in the parental populations. We determined the robustness of the proposed method by examining the ancestral allele frequency estimate and individual marker-location specific ancestry when the data were generated by different population admixture models and no learning sample was used. The proposed method outperforms a widely used Bayesian MCMC strategy for data generated from various population admixture models. The multipoint information content for ancestry was derived based on the map provided by Smith et al. (2004) and the associated statistical power was calculated. We examined the distribution of admixture LD across the genome for both real and simulated data and established a threshold for genome wide significance applicable to admixture mapping studies. The software ADMIXPROGRAM for performing admixture mapping is available from authors.


American Journal of Human Genetics | 2001

Quantitative Similarity-Based Association Tests Using Population Samples

Shuanglin Zhang; Hongyu Zhao

Although genetic association studies using unrelated individuals may be subject to bias caused by population stratification, alternative methods that are robust to population stratification, such as family-based association designs, may be less powerful. Furthermore, it is often more feasible and less expensive to collect unrelated individuals. Recently, several statistical methods have been proposed for case-control association tests in a structured population; these methods may be robust to population stratification. In the present study, we propose a quantitative similarity-based association test (QSAT) to identify association between a candidate marker and a quantitative trait of interest, through use of unrelated individuals. For the QSAT, we first determine whether two individuals are from the same subpopulation or from different subpopulations, using genotype data at a set of independent markers. We then perform an association test between the candidate marker and the quantitative trait, through incorporation of such information. Simulation results based on either coalescent models or empirical population genetics data show that the QSAT has a correct type I error rate in the presence of population stratification and that the power of the QSAT is higher than that of family-based association designs.


Genetic Epidemiology | 2012

Two Adaptive Weighting Methods to Test for Rare Variant Associations in Family‐Based Designs

Shurong Fang; Qiuying Sha; Shuanglin Zhang

Although next‐generation DNA sequencing technologies have made rare variant association studies feasible and affordable, the development of powerful statistical methods for rare variant association studies is still under way. Most of the existing methods for rare variant association studies compare the number of rare mutations in a group of rare variants (in a gene or a pathway) between cases and controls. However, these methods assume that all causal variants are risk to diseases. Recently, several methods that are robust to the direction and magnitude of effects of causal variants have been proposed. However, they are applicable to unrelated individuals only, whereas family data have been shown to improve power to detect rare variants. In this article, we propose two adaptive weighting methods for rare variant association studies based on family data for quantitative traits. Using extensive simulation studies, we evaluate and compare our proposed methods with two methods based on the weights proposed by Madsen and Browning. Our results show that both proposed methods are robust to population stratification, robust to the direction and magnitude of the effects of causal variants, and more powerful than the methods using weights suggested by Madsen and Browning, especially when both risk and protective variants are present. Genet. Epidemiol. 36:499‐507, 2012.


Genetic Epidemiology | 2001

Test of association for quantitative traits in general pedigrees: the quantitative pedigree disequilibrium test.

Shuanglin Zhang; Kui Zhang; Jinming Li; Fengzhu Sun; Hongyu Zhao

Many statistical methods have been proposed in recent years to test for genetic linkage and association between genetic markers and traits of interest through unrelated nuclear families. However, most of these methods are not valid tests of association in the presence of linkage when some of the nuclear families are related. As a result, related nuclear families in large pedigrees cannot be included in a single analysis to test for linkage disequilibrium. Recently, Martin et al. [Am J Hum Genet 67:146–54, 2000] proposed the pedigree disequilibrium test (PDT) to test for linkage and association in general pedigrees for qualitative traits. In this article, we develop a similar quantitative pedigree disequilibrium test (QPDT) to test for linkage and association in general pedigrees for quantitative traits. We apply both the PDT and the QPDT to analyze the sequence data from the seven candidate genes in the simulated data sets in the Genetic Analysis Workshop 12.


Annals of Human Genetics | 2006

A Combinatorial Searching Method for Detecting a Set of Interacting Loci Associated with Complex Traits

Qiuying Sha; Xiaofeng Zhu; Yijun Zuo; Richard S. Cooper; Shuanglin Zhang

Complex diseases are presumed to be the results of the interaction of several genes and environmental factors, with each gene only having a small effect on the disease. Mapping complex disease genes therefore becomes one of the greatest challenges facing geneticists. Most current approaches of association studies essentially evaluate one marker or one gene (haplotype approach) at a time. These approaches ignore the possibility that effects of multilocus functional genetic units may play a larger role than a single‐locus effect in determining trait variability. In this article, we propose a Combinatorial Searching Method (CSM) to detect a set of interacting loci (may be unlinked) that predicts the complex trait. In the application of the CSM, a simple filter is used to filter all the possible locus‐sets and retain the candidate locus‐sets, then a new objective function based on the cross‐validation and partitions of the multi‐locus genotypes is proposed to evaluate the retained locus‐sets. The locus‐set with the largest value of the objective function is the final locus‐set and a permutation procedure is performed to evaluate the overall p‐value of the test for association between the final locus‐set and the trait. The performance of the method is evaluated by simulation studies as well as by being applied to a real data set. The simulation studies show that the CSM has reasonable power to detect high‐order interactions. When the CSM is applied to a real data set to detect the locus‐set (among the 13 loci in the ACE gene) that predicts systolic blood pressure (SBP) or diastolic blood pressure (DBP), we found that a four‐locus gene‐gene interaction model best predicts SBP with an overall p‐value = 0.033, and similarly a two‐locus gene‐gene interaction model best predicts DBP with an overall p‐value = 0.045.


Annals of Human Genetics | 2005

Tests of association between quantitative traits and haplotypes in a reduced-dimensional space.

Qiuying Sha; Jianping Dong; Renfang Jiang; Shuanglin Zhang

Candidate gene association tests are currently performed using several intragenic SNPs simultaneously, by testing SNP haplotype or genotype effects in multifactorial diseases or traits. The number of haplotypes drastically increases with an increase in the number of typed SNPs. As a result, large numbers of haplotypes will introduce large degrees of freedom in haplotype‐based tests, and thus limit the power of the tests.

Collaboration


Dive into the Shuanglin Zhang's collaboration.

Top Co-Authors

Avatar

Qiuying Sha

Michigan Technological University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Huann Sheng Chen

National Institutes of Health

View shared research outputs
Top Co-Authors

Avatar

Xiaofeng Zhu

Loyola University Chicago

View shared research outputs
Top Co-Authors

Avatar

Zhaogong Zhang

Michigan Technological University

View shared research outputs
Top Co-Authors

Avatar

Jianping Dong

Michigan Technological University

View shared research outputs
Top Co-Authors

Avatar

Kui Zhang

University of Alabama at Birmingham

View shared research outputs
Top Co-Authors

Avatar

Renfang Jiang

Michigan Technological University

View shared research outputs
Top Co-Authors

Avatar

Xuexia Wang

University of Wisconsin–Milwaukee

View shared research outputs
Top Co-Authors

Avatar

Zhenchuan Wang

Michigan Technological University

View shared research outputs
Researchain Logo
Decentralizing Knowledge