Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Manqiu Cao is active.

Publication


Featured researches published by Manqiu Cao.


Nature Biotechnology | 2003

Large-scale genotyping of complex DNA

Giulia C. Kennedy; Hajime Matsuzaki; Shoulian Dong; Wei-Min Liu; Jing Huang; Guoying Liu; Xing Su; Manqiu Cao; Wenwei Chen; Jane Zhang; Weiwei Liu; Geoffrey Yang; Xiaojun Di; Thomas B. Ryder; Zhijun He; Urvashi Surti; Michael S. Phillips; Michael T. Boyce-Jacino; Stephen P. A. Fodor; Keith W. Jones

Genetic studies aimed at understanding the molecular basis of complex human phenotypes require the genotyping of many thousands of single-nucleotide polymorphisms (SNPs) across large numbers of individuals. Public efforts have so far identified over two million common human SNPs; however, the scoring of these SNPs is labor-intensive and requires a substantial amount of automation. Here we describe a simple but effective approach, termed whole-genome sampling analysis (WGSA), for genotyping thousands of SNPs simultaneously in a complex DNA sample without locus-specific primers or automation. Our method amplifies highly reproducible fractions of the genome across multiple DNA samples and calls genotypes at >99% accuracy. We rapidly genotyped 14,548 SNPs in three different human populations and identified a subset of them with significant allele frequency differences between groups. We also determined the ancestral allele for 8,386 SNPs by genotyping chimpanzee and gorilla DNA. WGSA is highly scaleable and enables the creation of ultrahigh density SNP maps for use in genetic studies.


American Journal of Human Genetics | 2006

Oligonucleotide Microarray Analysis of Genomic Imbalance in Children with Mental Retardation

Jeffrey M. Friedman; Agnes Baross; Allen Delaney; Adrian Ally; Laura Arbour; Jennifer Asano; Dione K. Bailey; Sarah Barber; Patricia Birch; Mabel Brown-John; Manqiu Cao; Susanna Chan; David L. Charest; Noushin Farnoud; Nicole Fernandes; Stephane Flibotte; Anne Go; William T. Gibson; Robert A. Holt; Steven J.M. Jones; Giulia C. Kennedy; Martin Krzywinski; Sylvie Langlois; Haiyan I. Li; Barbara McGillivray; Tarun Nayar; Trevor J. Pugh; Evica Rajcan-Separovic; Jacqueline E. Schein; Angelique Schnerch

The cause of mental retardation in one-third to one-half of all affected individuals is unknown. Microscopically detectable chromosomal abnormalities are the most frequently recognized cause, but gain or loss of chromosomal segments that are too small to be seen by conventional cytogenetic analysis has been found to be another important cause. Array-based methods offer a practical means of performing a high-resolution survey of the entire genome for submicroscopic copy-number variants. We studied 100 children with idiopathic mental retardation and normal results of standard chromosomal analysis, by use of whole-genome sampling analysis with Affymetrix GeneChip Human Mapping 100K arrays. We found de novo deletions as small as 178 kb in eight cases, de novo duplications as small as 1.1 Mb in two cases, and unsuspected mosaic trisomy 9 in another case. This technology can detect at least twice as many potentially pathogenic de novo copy-number variants as conventional cytogenetic analysis can in people with mental retardation.


American Journal of Human Genetics | 2004

Whole-Genome Scan, in a Complex Disease, Using 11,245 Single-Nucleotide Polymorphisms: Comparison with Microsatellites

Sally John; Neil Shephard; Guoying Liu; Eleftheria Zeggini; Manqiu Cao; Wenwei Chen; Nisha Vasavda; Tracy Mills; Anne Barton; Anne Hinks; Steve Eyre; Keith W. Jones; William Ollier; A J Silman; Neil James Gibson; Jane Worthington; Giulia C. Kennedy

Despite the theoretical evidence of the utility of single-nucleotide polymorphisms (SNPs) for linkage analysis, no whole-genome scans of a complex disease have yet been published to directly compare SNPs with microsatellites. Here, we describe a whole-genome screen of 157 families with multiple cases of rheumatoid arthritis (RA), performed using 11,245 genomewide SNPs. The results were compared with those from a 10-cM microsatellite scan in the same cohort. The SNP analysis detected HLA*DRB1, the major RA susceptibility locus (P=.00004), with a linkage interval of 31 cM, compared with a 50-cM linkage interval detected by the microsatellite scan. In addition, four loci were detected at a nominal significance level (P<.05) in the SNP linkage analysis; these were not observed in the microsatellite scan. We demonstrate that variation in information content was the main factor contributing to observed differences in the two scans, with the SNPs providing significantly higher information content than the microsatellites. Reducing the number of SNPs in the marker set to 3,300 (1-cM spacing) caused several loci to drop below nominal significance levels, suggesting that decreases in information content can have significant effects on linkage results. In contrast, differences in maps employed in the analysis, the low detectable rate of genotyping error, and the presence of moderate linkage disequilibrium between markers did not significantly affect the results. We have demonstrated the utility of a dense SNP map for performing linkage analysis in a late-age-at-onset disease, where DNA from parents is not always available. The high SNP density allows loci to be defined more precisely and provides a partial scaffold for association studies, substantially reducing the resource requirement for gene-mapping studies.


American Journal of Human Genetics | 2005

High-Resolution Identification of Chromosomal Abnormalities Using Oligonucleotide Arrays Containing 116,204 SNPs

Howard R. Slater; Dione K. Bailey; Hua Ren; Manqiu Cao; Katrina M. Bell; Steven Nasioulas; Robert Henke; K.H. Andy Choo; Giulia C. Kennedy

Mutation of the human genome ranges from single base-pair changes to whole-chromosome aneuploidy. Karyotyping, fluorescence in situ hybridization, and comparative genome hybridization are currently used to detect chromosome abnormalities of clinical significance. These methods, although powerful, suffer from limitations in speed, ease of use, and resolution, and they do not detect copy-neutral chromosomal aberrations--for example, uniparental disomy (UPD). We have developed a high-throughput approach for assessment of DNA copy-number changes, through use of high-density synthetic oligonucleotide arrays containing 116,204 single-nucleotide polymorphisms, spaced at an average distance of 23.6 kb across the genome. Using this approach, we analyzed samples that failed conventional karyotypic analysis, and we detected amplifications and deletions across a wide range of sizes (1.3-145.9 Mb), identified chromosomes containing anonymous chromatin, and used genotype data to determine the molecular origin of two cases of UPD. Furthermore, our data provided independent confirmation for a case that had been misinterpreted by karyotype analysis. The high resolution of our approach provides more-precise breakpoint mapping, which allows subtle phenotypic heterogeneity to be distinguished at a molecular level. The accurate genotype information provided on these arrays enables the identification of copy-neutral loss-of-heterozygosity events, and the minimal requirement of DNA (250 ng per array) allows rapid analysis of samples without the need for cell culture. This technology overcomes many limitations currently encountered in routine clinical diagnostic laboratories tasked with accurate and rapid diagnosis of chromosomal abnormalities.


BMC Genetics | 2005

Description of the data from the Collaborative Study on the Genetics of Alcoholism (COGA) and single-nucleotide polymorphism genotyping for Genetic Analysis Workshop 14

Howard J. Edenberg; Laura J. Bierut; Paul Boyce; Manqiu Cao; Simon Cawley; Richard Chiles; Kimberly F. Doheny; Mark Hansen; Tony Hinrichs; Kevin A. Jones; Mark Kelleher; Giulia C. Kennedy; Guoying Liu; Gregory Marcus; Celeste McBride; Sarah S. Murray; Arnold Oliphant; James Pettengill; Bernice Porjesz; Elizabeth W. Pugh; John P. Rice; Stu Shannon; Rhoberta Steeke; Jay A. Tischfield; Ya Yu Tsai; Chun Zhang; Henri Begleiter

The data provided to the Genetic Analysis Workshop 14 (GAW 14) was the result of a collaboration among several different groups, catalyzed by Elizabeth Pugh from The Center for Inherited Disease Research (CIDR) and the organizers of GAW 14, Jean MacCluer and Laura Almasy. The DNA, phenotypic characterization, and microsatellite genomic survey were provided by the Collaborative Study on the Genetics of Alcoholism (COGA), a nine-site national collaboration funded by the National Institute of Alcohol and Alcoholism (NIAAA) and the National Institute of Drug Abuse (NIDA) with the overarching goal of identifying and characterizing genes that affect the susceptibility to develop alcohol dependence and related phenotypes. CIDR, Affymetrix, and Illumina provided single-nucleotide polymorphism genotyping of a large subset of the COGA subjects. This article briefly describes the dataset that was provided.


BMC Bioinformatics | 2007

Assessment of algorithms for high throughput detection of genomic copy number variation in oligonucleotide microarray data

Agnes Baross; Allen Delaney; H. Irene Li; Tarun Nayar; Stephane Flibotte; Hong Qian; Susanna Y. Chan; Jennifer Asano; Adrian Ally; Manqiu Cao; Patricia Birch; Mabel Brown-John; Nicole Fernandes; Anne Go; Giulia C. Kennedy; Sylvie Langlois; Patrice Eydoux; Jeffrey M. Friedman; Marco A. Marra

BackgroundGenomic deletions and duplications are important in the pathogenesis of diseases, such as cancer and mental retardation, and have recently been shown to occur frequently in unaffected individuals as polymorphisms. Affymetrix GeneChip whole genome sampling analysis (WGSA) combined with 100 K single nucleotide polymorphism (SNP) genotyping arrays is one of several microarray-based approaches that are now being used to detect such structural genomic changes. The popularity of this technology and its associated open source data format have resulted in the development of an increasing number of software packages for the analysis of copy number changes using these SNP arrays.ResultsWe evaluated four publicly available software packages for high throughput copy number analysis using synthetic and empirical 100 K SNP array data sets, the latter obtained from 107 mental retardation (MR) patients and their unaffected parents and siblings. We evaluated the software with regards to overall suitability for high-throughput 100 K SNP array data analysis, as well as effectiveness of normalization, scaling with various reference sets and feature extraction, as well as true and false positive rates of genomic copy number variant (CNV) detection.ConclusionWe observed considerable variation among the numbers and types of candidate CNVs detected by different analysis approaches, and found that multiple programs were needed to find all real aberrations in our test set. The frequency of false positive deletions was substantial, but could be greatly reduced by using the SNP genotype information to confirm loss of heterozygosity.


BMC Genetics | 2005

A genome-wide linkage analysis of alcoholism on microsatellite and single-nucleotide polymorphism data, using alcohol dependence phenotypes and electroencephalogram measures

Chun Zhang; Simon Cawley; Guoying Liu; Manqiu Cao; Harley Gorrell; Giulia C. Kennedy

The Collaborative Study on the Genetics of Alcoholism (COGA) is a large-scale family study designed to identify genes that affect the risk for alcoholism and alcohol-related phenotypes. We performed genome-wide linkage analyses on the COGA data made available to participants in the Genetic Analysis Workshop 14 (GAW 14). The dataset comprised 1,350 participants from 143 families. The samples were analyzed on three technologies: microsatellites spaced at 10 cM, Affymetrix GeneChip® Human Mapping 10 K Array (HMA10K) and Illumina SNP-based Linkage III Panel. We used ALDX1 and ALDX2, the COGA definitions of alcohol dependence, as well as electrophysiological measures TTTH1 and ECB21 to detect alcoholism susceptibility loci. Many chromosomal regions were found to be significant for each of the phenotypes at a p-value of 0.05. The most significant region for ALDX1 is on chromosome 7, with a maximum LOD score of 2.25 for Affymetrix SNPs, 1.97 for Illumina SNPs, and 1.72 for microsatellites. The same regions on chromosome 7 (96–106 cM) and 10 (149–176 cM) were found to be significant for both ALDX1 and ALDX2. A region on chromosome 7 (112–153 cM) and a region on chromosome 6 (169–185 cM) were identified as the most significant regions for TTTH1 and ECB21, respectively. We also performed linkage analysis on denser maps of markers by combining the SNPs datasets from Affymetrix and Illumina. Adding the microsatellite data to the combined SNP dataset improved the results only marginally. The results indicated that SNPs outperform microsatellites with the densest marker sets performing the best.


BMC Pulmonary Medicine | 2017

Analytical performance of Envisia: a genomic classifier for usual interstitial pneumonia

Yoonha Choi; Jiayi Lu; Zhanzhi Hu; Daniel G. Pankratz; Huimin Jiang; Manqiu Cao; Cristina Marchisano; Jennifer Huiras; Grazyna M. Fedorowicz; Mei G. Wong; Jessica R. Anderson; Edward Y. Tom; Joshua Babiarz; Urooj Imtiaz; Neil M. Barth; P. Sean Walsh; Giulia C. Kennedy; Jing Huang

BackgroundClinical guidelines specify that diagnosis of interstitial pulmonary fibrosis (IPF) requires identification of usual interstitial pneumonia (UIP) pattern. While UIP can be identified by high resolution CT of the chest, the results are often inconclusive, making surgical lung biopsy necessary to reach a definitive diagnosis (Raghu et al., Am J Respir Crit Care Med 183(6):788–824, 2011). The Envisia genomic classifier differentiates UIP from non-UIP pathology in transbronchial biopsies (TBB), potentially allowing patients to avoid an invasive procedure (Brown et al., Am J Respir Crit Care Med 195:A6792, 2017). To ensure patient safety and efficacy, a laboratory developed test (LDT) must meet strict regulatory requirements for accuracy, reproducibility and robustness. The analytical characteristics of the Envisia test are assessed and reported here.MethodsThe Envisia test utilizes total RNA extracted from TBB samples to perform Next Generation RNA Sequencing. The gene count data from 190 genes are then input to the Envisia genomic classifier, a machine learning algorithm, to output either a UIP or non-UIP classification result. We characterized the stability of RNA in TBBs during collection and shipment, and evaluated input RNA mass and proportions on the limit of detection of UIP. We evaluated potentially interfering substances such as blood and genomic DNA. Intra-run, inter-run, and inter-laboratory reproducibility of test results were also characterized.ResultsRNA content within TBBs preserved in RNAprotect is stable for up to 14 days with no detectable change in RNA quality. The Envisia test is tolerant to variation in RNA input (5 to 30 ng), with no impact on classifier results. The Envisia test can tolerate dilution of non-UIP and UIP classification signals at the RNA level by up to 60% and 20%, respectively. Analytical specificity studies utilizing UIP and non-UIP samples mixed with genomic DNA (up to 30% relative input) demonstrated no impact to classifier results. The Envisia test tolerates up to 22% of blood contamination, well beyond the level observed in TBBs. The test is reproducible from RNA extraction through to Envisia test result (standard deviation of 0.20 for Envisia classification scores on > 7-unit scale).ConclusionsThe Envisia test demonstrates the robust analytical performance required of an LDT. Envisia can be used to inform the diagnoses of patients with suspected IPF.


American Journal of Human Genetics | 2006

Reply to Wirtenberger et al.

Howard R. Slater; Dione K. Bailey; Hua Ren; Manqiu Cao; Katrina M. Bell; Steven Nasioulas; Robert Henke; K.H. Andy Choo; Giulia C. Kennedy

To the Editor: Wirtenberger et al. (2006) analyzed the SNP content of 82 large (median length 157 kb) common copy-number polymorphisms (CNPs), selected from the Database of Genomic Variations, and determined the number of SNPs included in the GeneChip Mapping 100K arrays (Affymetrix). The data they presented showed that the density of these SNPs within the CNPs is lower than would be expected, with 52.4% of CNPs having no SNP coverage (median length 120 kb) and only 8.5% having a SNP density equal to or higher than the overall mean intermarker density for all SNPs on the array. As suggested by Wirtenberger et al. (2006), the underlying reason for this low density of Mapping 100K SNPs in their selected CNPs is the selection criteria used for SNPs on the array. The SNP selection criteria for the Mapping 100K arrays select strongly but not completely against SNPs in segmental duplications. The selection is based on genotyping accuracy, Mendelian inheritance, Hardy-Weinberg equilibrium, robustness, and reproducibility—all of which are characteristics likely to give poor results in genotyping SNPs that are located in CNPs. Despite a selective bias against SNPs in CNPs, some SNPs on the Mapping 100K arrays are able to provide CNP information. For example, as Wirtenberger et al. (2006) indicate, 14.6% of the CNP regions contained more than four of the SNPs on the array. Even with modifications in SNP selection, the current algorithm implemented in CNAT (Affymetrix) would still need to be modified, because it compares copynumber data from the test sample with data from a large pool of normal reference individuals, thereby decreasing the likelihood of detecting CNPs. Future advances in SNP selection, algorithm development, and density will be required to identify frequent CNPs by use of SNP arrays. For investigation of CNPs, the advice of Wirtenberger et al. (2006) to be aware of the limitation of Mapping 100K microarrays is sound. However, it is worth remembering that we (Slater et al. [2005]) describe their use for detection of clinically significant chromosome abnormalities. Exclusion of SNPs within common CNPs is arguably an advantage in the diagnostic scenario when virtually nothing is currently known of the clinical significance of these CNPs.


Bioinformatics | 2006

A whole genome long-range haplotype (WGLRH) test for detecting imprints of positive selection in human populations

Chun Zhang; Dione K. Bailey; Tarif Awad; Guoying Liu; Guoliang Xing; Manqiu Cao; Venu Valmeekam; Jacques Retief; Hajime Matsuzaki; Margaret Taub; Mark Seielstad; Giulia C. Kennedy

Collaboration


Dive into the Manqiu Cao's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge