Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Xiaojun Di is active.

Publication


Featured researches published by Xiaojun Di.


Nature Biotechnology | 2003

Large-scale genotyping of complex DNA

Giulia C. Kennedy; Hajime Matsuzaki; Shoulian Dong; Wei-Min Liu; Jing Huang; Guoying Liu; Xing Su; Manqiu Cao; Wenwei Chen; Jane Zhang; Weiwei Liu; Geoffrey Yang; Xiaojun Di; Thomas B. Ryder; Zhijun He; Urvashi Surti; Michael S. Phillips; Michael T. Boyce-Jacino; Stephen P. A. Fodor; Keith W. Jones

Genetic studies aimed at understanding the molecular basis of complex human phenotypes require the genotyping of many thousands of single-nucleotide polymorphisms (SNPs) across large numbers of individuals. Public efforts have so far identified over two million common human SNPs; however, the scoring of these SNPs is labor-intensive and requires a substantial amount of automation. Here we describe a simple but effective approach, termed whole-genome sampling analysis (WGSA), for genotyping thousands of SNPs simultaneously in a complex DNA sample without locus-specific primers or automation. Our method amplifies highly reproducible fractions of the genome across multiple DNA samples and calls genotypes at >99% accuracy. We rapidly genotyped 14,548 SNPs in three different human populations and identified a subset of them with significant allele frequency differences between groups. We also determined the ancestral allele for 8,386 SNPs by genotyping chimpanzee and gorilla DNA. WGSA is highly scaleable and enables the creation of ultrahigh density SNP maps for use in genetic studies.


Nature Methods | 2004

Genotyping over 100,000 SNPs on a pair of oligonucleotide arrays

Hajime Matsuzaki; Shoulian Dong; Halina Loi; Xiaojun Di; Guoying Liu; Earl Hubbell; Jane Law; Tam Berntsen; Monica Chadha; Henry Hui; Geoffrey Yang; Giulia C. Kennedy; Teresa Webster; Simon Cawley; P. Sean Walsh; Keith W. Jones; Stephen P. A. Fodor; Rui Mei

We present a genotyping method for simultaneously scoring 116,204 SNPs using oligonucleotide arrays. At call rates >99%, reproducibility is >99.97% and accuracy, as measured by inheritance in trios and concordance with the HapMap Project, is >99.7%. Average intermarker distance is 23.6 kb, and 92% of the genome is within 100 kb of a SNP marker. Average heterozygosity is 0.30, with 105,511 SNPs having minor allele frequencies >5%.


Bioinformatics | 2003

Algorithms for large-scale genotyping microarrays.

Wei-min Liu; Xiaojun Di; Geoffrey Yang; Hajime Matsuzaki; Jing Huang; Rui Mei; Thomas B. Ryder; Teresa A. Webster; Shoulian Dong; Guoying Liu; Keith W. Jones; Giulia C. Kennedy; David Kulp

MOTIVATION Analysis of many thousands of single nucleotide polymorphisms (SNPs) across whole genome is crucial to efficiently map disease genes and understanding susceptibility to diseases, drug efficacy and side effects for different populations and individuals. High density oligonucleotide microarrays provide the possibility for such analysis with reasonable cost. Such analysis requires accurate, reliable methods for feature extraction, classification, statistical modeling and filtering. RESULTS We propose the modified partitioning around medoids as a classification method for relative allele signals. We use the average silhouette width, separation and other quantities as quality measures for genotyping classification. We form robust statistical models based on the classification results and use these models to make genotype calls and calculate quality measures of calls. We apply our algorithms to several different genotyping microarrays. We use reference types, informative Mendelian relationship in families, and leave-one-out cross validation to verify our results. The concordance rates with the single base extension reference types are 99.36% for the SNPs on autosomes and 99.64% for the SNPs on sex chromosomes. The concordance of the leave-one-out test is over 99.5% and is 99.9% higher for AA, AB and BB cells. We also provide a method to determine the gender of a sample based on the heterozygous call rate of SNPs on the X chromosome. See http://www.affymetrix.com for further information. The microarray data will also be available from the Affymetrix web site. AVAILABILITY The algorithms will be available commercially in the Affymetrix software package.


Bioinformatics | 2007

LdCompare: rapid computation of single- and multiple-marker r2 and genetic coverage

K. Hao; Xiaojun Di; Simon Cawley

UNLABELLED The scale of genetic-variation datasets has increased enormously and the linkage equilibrium (LD) structure of these polymorphisms, particularly in whole-genome association studies, is of great interest. The significant computational complexity of calculating single- and multiple-marker correlations at a genome-wide scale remains challenging. We have developed a program that efficiently characterizes whole-genome LD structure on large number of SNPs in terms of single- and multiple-marker correlations. AVAILABILITY LdCompare is licensed under the GNU General Public License (GPL). Source code, documentation, testing datasets and precompiled executables are available for download at: http://www.affymetrix.com/support/developer/tools/devnettools.affx


Microarrays : optical technologies and informatics. Conference | 2001

Rank-based algorithms for anlaysis of microarrays

Wei-Min Liu; Rui Mei; Daniel M. Bartell; Xiaojun Di; Teresa Webster; Tom Ryder

Analysis of microarray data often involves extracting information from raw intensities of spots of cells and making certain calls. Rank-based algorithms are powerful tools to provide probability values of hypothesis tests, especially when the distribution of the intensities is unknown. For our current gene expression arrays, a gene is detected by a set of probe pairs consisting of perfect match and mismatch cells. The one-sided upper-tail Wilcoxons signed rank test is used in our algorithms for absolute calls (whether a gene is detected or not), as well as comparative calls (whether a gene is increasing or decreasing or no significant change in a sample compared with another sample). We also test the possibility to use only perfect match cells to make calls. This paper focuses on absolute calls. We have developed error analysis methods and software tools that allow us to compare the accuracy of the calls in the presence or absence of mismatch cells at different target concentrations. The usage of nonparametric rank-based tests is not limited to absolute and comparative calls of gene expression chips. They can also be applied to other oligonucleotide microarrays for genotyping and mutation detection, as well as spotted arrays.


Bioinformatics | 2002

Analysis of high density expression microarrays with signed-rank call algorithms

Wei-Min Liu; Rui Mei; Xiaojun Di; Thomas B. Ryder; Earl Hubbell; S. Dee; Teresa Webster; Christina A. Harrington; Ming-Hsiu Ho; J. Baid; S. P. Smeekens


Genome Research | 2004

Parallel genotyping of over 10,000 SNPs using a one-primer assay on a high-density oligonucleotide array

Hajime Matsuzaki; Halina Loi; Shoulian Dong; Ya-Yu Tsai; Joy Fang; Jane Law; Xiaojun Di; Wei-Min Liu; Geoffrey Yang; Guoying Liu; Jing Huang; Giulia C. Kennedy; Thomas B. Ryder; Gregory Marcus; P. Sean Walsh; Mark D. Shriver; Jennifer M. Puck; Keith W. Jones; Rui Mei


Bioinformatics | 2005

Dynamic model based algorithms for screening and genotyping over 100K SNPs on oligonucleotide microarrays

Xiaojun Di; Hajime Matsuzaki; Teresa Webster; Earl Hubbell; Guoying Liu; Shoulian Dong; Dan Bartell; Jing Huang; Richard Chiles; Geoffrey Yang; Mei-Mei Shen; David Kulp; Giulia C. Kennedy; Rui Mei; Keith W. Jones; Simon Cawley


Archive | 2004

Computer software products for analyzing genotyping

Wei-Min Liu; Xiaojun Di; Geoffrey Yang


Archive | 2006

Methods and computer software products for analyzing genotyping data

Wei-Min Liu; Xiaojun Di; Geoffrey Yang; Giulia C. Kennedy

Collaboration


Dive into the Xiaojun Di's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge