Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Michael H. Shapero is active.

Publication


Featured researches published by Michael H. Shapero.


Nature | 2006

Global variation in copy number in the human genome

Richard Redon; Shumpei Ishikawa; Karen R. Fitch; Lars Feuk; George H. Perry; T. Daniel Andrews; Heike Fiegler; Michael H. Shapero; Andrew R. Carson; Wenwei Chen; Eun Kyung Cho; Stephanie Dallaire; Jennifer L. Freeman; Juan R. González; Mònica Gratacòs; Jing Huang; Dimitrios Kalaitzopoulos; Daisuke Komura; Jeffrey R. MacDonald; Christian R. Marshall; Rui Mei; Lyndal Montgomery; Keunihiro Nishimura; Kohji Okamura; Fan Shen; Martin J. Somerville; Joelle Tchinda; Armand Valsesia; Cara Woodwark; Fengtang Yang

Copy number variation (CNV) of DNA sequences is functionally significant but has yet to be fully ascertained. We have constructed a first-generation CNV map of the human genome through the study of 270 individuals from four populations with ancestry in Europe, Africa or Asia (the HapMap collection). DNA from these individuals was screened for CNV using two complementary technologies: single-nucleotide polymorphism (SNP) genotyping arrays, and clone-based comparative genomic hybridization. A total of 1,447 copy number variable regions (CNVRs), which can encompass overlapping or adjacent gains or losses, covering 360 megabases (12% of the genome) were identified in these populations. These CNVRs contained hundreds of genes, disease loci, functional elements and segmental duplications. Notably, the CNVRs encompassed more nucleotide content per genome than SNPs, underscoring the importance of CNV in genetic diversity and evolution. The data obtained delineate linkage disequilibrium patterns for many CNVs, and reveal marked variation in copy number among populations. We also demonstrate the utility of this resource for genetic disease studies.


Nature Genetics | 2008

Integrated detection and population-genetic analysis of SNPs and copy number variation

Steven A. McCarroll; Finny Kuruvilla; Joshua M. Korn; Simon Cawley; James Nemesh; Alec Wysoker; Michael H. Shapero; Paul I. W. de Bakker; Julian Maller; Andrew Kirby; Amanda L. Elliott; Melissa Parkin; Earl Hubbell; Teresa Webster; Rui Mei; James Veitch; Patrick J Collins; Robert E. Handsaker; Steve Lincoln; Marcia M. Nizzari; John E. Blume; Keith W. Jones; Rich Rava; Mark J. Daly; Stacey Gabriel; David Altshuler

Dissecting the genetic basis of disease risk requires measuring all forms of genetic variation, including SNPs and copy number variants (CNVs), and is enabled by accurate maps of their locations, frequencies and population-genetic properties. We designed a hybrid genotyping array (Affymetrix SNP 6.0) to simultaneously measure 906,600 SNPs and copy number at 1.8 million genomic locations. By characterizing 270 HapMap samples, we developed a map of human CNV (at 2-kb breakpoint resolution) informed by integer genotypes for 1,320 copy number polymorphisms (CNPs) that segregate at an allele frequency >1%. More than 80% of the sequence in previously reported CNV regions fell outside our estimated CNV boundaries, indicating that large (>100 kb) CNVs affect much less of the genome than initially reported. Approximately 80% of observed copy number differences between pairs of individuals were due to common CNPs with an allele frequency >5%, and more than 99% derived from inheritance rather than new mutation. Most common, diallelic CNPs were in strong linkage disequilibrium with SNPs, and most low-frequency CNVs segregated on specific SNP haplotypes.


Human Genomics | 2004

Whole genome DNA copy number changes identified by high density oligonucleotide arrays

Jing Huang; Wen Wei; Jane Zhang; Guoying Liu; Graham R. Bignell; Michael R. Stratton; P. Andrew Futreal; Richard Wooster; Keith W. Jones; Michael H. Shapero

Changes in DNA copy number are one of the hallmarks of the genetic instability common to most human cancers. Previous micro-array-based methods have been used to identify chromosomal gains and losses; however, they are unable to genotype alleles at the level of single nucleotide polymorphisms (SNPs). Here we describe a novel algorithm that uses a recently developed high-density oligonucleotide array-based SNP genotyping method, whole genome sampling analysis (WGSA), to identify genome-wide chromosomal gains and losses at high resolution. WGSA simultaneously genotypes over 10,000 SNPs by allele-specific hybridisation to perfect match (PM) and mismatch (MM) probes synthesised on a single array. The copy number algorithm jointly uses PM intensity and discrimination ratios between paired PM and MM intensity values to identify and estimate genetic copy number changes. Values from an experimental sample are compared with SNP-specific distributions derived from a reference set containing over 100 normal individuals to gain statistical power. Genomic regions with statistically significant copy number changes can be identified using both single point analysis and contiguous point analysis of SNP intensities. We identified multiple regions of amplification and deletion using a panel of human breast cancer cell lines. We verified these results using an independent method based on quantitative polymerase chain reaction and found that our approach is both sensitive and specific and can tolerate samples which contain a mixture of both tumour and normal DNA. In addition, by using known allele frequencies from the reference set, statistically significant genomic intervals can be identified containing contiguous stretches of homozygous markers, potentially allowing the detection of regions undergoing loss of heterozygosity (LOH) without the need for a matched normal control sample. The coupling of LOH analysis, via SNP genotyping, with copy number estimations using a single array provides additional insight into the structure of genomic alterations. With mean and median inter-SNP euchromatin distances of 244 kilobases (kb) and 119 kb, respectively, this method affords a resolution that is not easily achievable with non-oligonucleotide-based experimental approaches.


Genomics | 2011

Next generation genome-wide association tool: Design and coverage of a high-throughput European-optimized SNP array

Thomas J. Hoffmann; Mark N. Kvale; Stephanie Hesselson; Yiping Zhan; Christine Aquino; Yang Cao; Simon Cawley; Elaine Chung; Sheryl Connell; Jasmin Eshragh; Marcia Ewing; Jeremy Gollub; Mary Henderson; Earl Hubbell; Carlos Iribarren; Jay Kaufman; Richard Lao; Yontao Lu; Dana Ludwig; Gurpreet K. Mathauda; William B. McGuire; Gangwu Mei; Sunita Miles; Matthew M. Purdy; Charles P. Quesenberry; Dilrini Ranatunga; Sarah Rowell; Marianne Sadler; Michael H. Shapero; Ling Shen

The success of genome-wide association studies has paralleled the development of efficient genotyping technologies. We describe the development of a next-generation microarray based on the new highly-efficient Affymetrix Axiom genotyping technology that we are using to genotype individuals of European ancestry from the Kaiser Permanente Research Program on Genes, Environment and Health (RPGEH). The array contains 674,517 SNPs, and provides excellent genome-wide as well as gene-based and candidate-SNP coverage. Coverage was calculated using an approach based on imputation and cross validation. Preliminary results for the first 80,301 saliva-derived DNA samples from the RPGEH demonstrate very high quality genotypes, with sample success rates above 94% and over 98% of successful samples having SNP call rates exceeding 98%. At steady state, we have produced 462 million genotypes per week for each Axiom system. The new array provides a valuable addition to the repertoire of tools for large scale genome-wide association studies.


Genomics | 2011

Design and coverage of high throughput genotyping arrays optimized for individuals of East Asian, African American, and Latino race/ethnicity using imputation and a novel hybrid SNP selection algorithm.

Thomas J. Hoffmann; Yiping Zhan; Mark N. Kvale; Stephanie Hesselson; Jeremy Gollub; Carlos Iribarren; Yontao Lu; Gangwu Mei; Matthew M. Purdy; Charles P. Quesenberry; Sarah Rowell; Michael H. Shapero; David Smethurst; Carol P. Somkin; Stephen K. Van Den Eeden; Larry Walter; Teresa Webster; Rachel A. Whitmer; Andrea Finn; Catherine Schaefer; Pui-Yan Kwok; Neil Risch

Four custom Axiom genotyping arrays were designed for a genome-wide association (GWA) study of 100,000 participants from the Kaiser Permanente Research Program on Genes, Environment and Health. The array optimized for individuals of European race/ethnicity was previously described. Here we detail the development of three additional microarrays optimized for individuals of East Asian, African American, and Latino race/ethnicity. For these arrays, we decreased redundancy of high-performing SNPs to increase SNP capacity. The East Asian array was designed using greedy pairwise SNP selection. However, removing SNPs from the target set based on imputation coverage is more efficient than pairwise tagging. Therefore, we developed a novel hybrid SNP selection method for the African American and Latino arrays utilizing rounds of greedy pairwise SNP selection, followed by removal from the target set of SNPs covered by imputation. The arrays provide excellent genome-wide coverage and are valuable additions for large-scale GWA studies.


BMC Bioinformatics | 2006

CARAT: A novel method for allelic detection of DNA copy number changes using high density oligonucleotide arrays

Jing-Jing Huang; Wen Wei; Joyce Chen; Jane Zhang; Guoying Liu; Xiaojun Di; Rui Mei; Shumpei Ishikawa; Hiroyuki Aburatani; Keith W. Jones; Michael H. Shapero

BackgroundDNA copy number alterations are one of the main characteristics of the cancer cell karyotype and can contribute to the complex phenotype of these cells. These alterations can lead to gains in cellular oncogenes as well as losses in tumor suppressor genes and can span small intervals as well as involve entire chromosomes. The ability to accurately detect these changes is central to understanding how they impact the biology of the cell.ResultsWe describe a novel algorithm called CARAT (C opy Number A nalysis with R egression A nd T ree) that uses probe intensity information to infer copy number in an allele-specific manner from high density DNA oligonuceotide arrays designed to genotype over 100, 000 SNPs. Total and allele-specific copy number estimations using CARAT are independently evaluated for a subset of SNPs using quantitative PCR and allelic TaqMan reactions with several human breast cancer cell lines. The sensitivity and specificity of the algorithm are characterized using DNA samples containing differing numbers of X chromosomes as well as a test set of normal individuals. Results from the algorithm show a high degree of agreement with results from independent verification methods.ConclusionOverall, CARAT automatically detects regions with copy number variations and assigns a significance score to each alteration as well as generating allele-specific output. When coupled with SNP genotype calls from the same array, CARAT provides additional detail into the structure of genome wide alterations that can contribute to allelic imbalance.


Methods of Molecular Biology | 2010

DMET™ Microarray Technology for Pharmacogenomics-Based Personalized Medicine

James K. Burmester; Marina Sedova; Michael H. Shapero; Elaine Mansfield

Human genome sequence variation in the form of single nucleotide polymorphisms (SNPs) as well as more complex structural variation such as insertions, duplications, and deletions underlies each individuals response to drugs and thus the likelihood of experiencing an adverse drug reaction. The ongoing challenge of the field of pharmacogenetics is to further understand the relationship between genetic variation and differential drug responses, with the overarching goal being that this will lead to improvements in both the safety and efficacy of drugs. The Affymetrix DMET Plus Premier Pack (DMET stands for Drug Metabolizing Enzymes and Transporters) enables highly multiplexed genotyping of known polymorphisms in Absorption, Distribution, Metabolism, and Elimination (ADME)-related genes on a single array. The DMET Plus Panel interrogates markers in 225 genes that have documented functional significance in phase I and phase II drug metabolism enzymes as well as drug transporters. The power of the DMET Assay has previously been demonstrated with regard to several different drugs including warfarin and clopidogrel. In a research study using an earlier four-color version of the assay, it was demonstrated that warfarin dosing can be influenced by a cytochrome P450 (CYP) 4F2 variant. Additionally, the assay has been used to demonstrate that CYP2C19 variants with decreased enzyme activity led to lower levels of the active clopidogrel metabolite, resulting in a decreased inhibition of platelets and a higher rate of cardiovascular events when compared to noncarriers of the DNA variant. Thus, highly multiplexed SNP genotyping focused on ADME-related polymorphisms should enable research into development of safer drugs with greater efficacy.


Genetics | 2015

Genotyping Informatics and Quality Control for 100,000 Subjects in the Genetic Epidemiology Research on Adult Health and Aging (GERA) Cohort

Mark N. Kvale; Stephanie Hesselson; Thomas J. Hoffmann; Yang Cao; David Chan; Sheryl Connell; Lisa A. Croen; Brad Dispensa; Jasmin Eshragh; Andrea Finn; Jeremy Gollub; Carlos Iribarren; Eric Jorgenson; Lawrence H. Kushi; Richard Lao; Yontao Lu; Dana Ludwig; Gurpreet K. Mathauda; William B. McGuire; Gangwu Mei; Sunita Miles; Michael Mittman; Mohini Patil; Charles P. Quesenberry; Dilrini Ranatunga; Sarah Rowell; Marianne Sadler; Lori C. Sakoda; Michael H. Shapero; Ling Shen

The Kaiser Permanente (KP) Research Program on Genes, Environment and Health (RPGEH), in collaboration with the University of California—San Francisco, undertook genome-wide genotyping of >100,000 subjects that constitute the Genetic Epidemiology Research on Adult Health and Aging (GERA) cohort. The project, which generated >70 billion genotypes, represents the first large-scale use of the Affymetrix Axiom Genotyping Solution. Because genotyping took place over a short 14-month period, creating a near-real-time analysis pipeline for experimental assay quality control and final optimized analyses was critical. Because of the multi-ethnic nature of the cohort, four different ethnic-specific arrays were employed to enhance genome-wide coverage. All assays were performed on DNA extracted from saliva samples. To improve sample call rates and significantly increase genotype concordance, we partitioned the cohort into disjoint packages of plates with similar assay contexts. Using strict QC criteria, the overall genotyping success rate was 103,067 of 109,837 samples assayed (93.8%), with a range of 92.1–95.4% for the four different arrays. Similarly, the SNP genotyping success rate ranged from 98.1 to 99.4% across the four arrays, the variation depending mostly on how many SNPs were included as single copy vs. double copy on a particular array. The high quality and large scale of genotype data created on this cohort, in conjunction with comprehensive longitudinal data from the KP electronic health records of participants, will enable a broad range of highly powered genome-wide association studies on a diversity of traits and conditions.


Human Molecular Genetics | 2010

Population-genetic nature of copy number variations in the human genome

Mamoru Kato; Takahisa Kawaguchi; Shumpei Ishikawa; Takayoshi Umeda; Reiichiro Nakamichi; Michael H. Shapero; Keith W. Jones; Yusuke Nakamura; Hiroyuki Aburatani; Tatsuhiko Tsunoda

Copy number variations (CNVs) are universal genetic variations, and their association with disease has been increasingly recognized. We designed high-density microarrays for CNVs, and detected 3000–4000 CNVs (4–6% of the genomic sequence) per population that included CNVs previously missed because of smaller sizes and residing in segmental duplications. The patterns of CNVs across individuals were surprisingly simple at the kilo-base scale, suggesting the applicability of a simple genetic analysis for these genetic loci. We utilized the probabilistic theory to determine integer copy numbers of CNVs and employed a recently developed phasing tool to estimate the population frequencies of integer copy number alleles and CNV–SNP haplotypes. The results showed a tendency toward a lower frequency of CNV alleles and that most of our CNVs were explained only by zero-, one- and two-copy alleles. Using the estimated population frequencies, we found several CNV regions with exceptionally high population differentiation. Investigation of CNV–SNP linkage disequilibrium (LD) for 500–900 bi- and multi-allelic CNVs per population revealed that previous conflicting reports on bi-allelic LD were unexpectedly consistent and explained by an LD increase correlated with deletion-allele frequencies. Typically, the bi-allelic LD was lower than SNP–SNP LD, whereas the multi-allelic LD was somewhat stronger than the bi-allelic LD. After further investigation of tag SNPs for CNVs, we conclude that the customary tagging strategy for disease association studies can be applicable for common deletion CNVs, but direct interrogation is needed for other types of CNVs.


BMC Genetics | 2008

Improved detection of global copy number variation using high density, non-polymorphic oligonucleotide probes

Fan Shen; Jing Huang; Karen R. Fitch; Vivi Truong; Andrew Kirby; Wenwei Chen; Jane Zhang; Guoying Liu; Steven A. McCarroll; Keith W. Jones; Michael H. Shapero

BackgroundDNA sequence diversity within the human genome may be more greatly affected by copy number variations (CNVs) than single nucleotide polymorphisms (SNPs). Although the importance of CNVs in genome wide association studies (GWAS) is becoming widely accepted, the optimal methods for identifying these variants are still under evaluation. We have previously reported a comprehensive view of CNVs in the HapMap DNA collection using high density 500 K EA (Early Access) SNP genotyping arrays which revealed greater than 1,000 CNVs ranging in size from 1 kb to over 3 Mb. Although the arrays used most commonly for GWAS predominantly interrogate SNPs, CNV identification and detection does not necessarily require the use of DNA probes centered on polymorphic nucleotides and may even be hindered by the dependence on a successful SNP genotyping assay.ResultsIn this study, we have designed and evaluated a high density array predicated on the use of non-polymorphic oligonucleotide probes for CNV detection. This approach effectively uncouples copy number detection from SNP genotyping and thus has the potential to significantly improve probe coverage for genome-wide CNV identification. This array, in conjunction with PCR-based, complexity-reduced DNA target, queries over 1.3 M independent NspI restriction enzyme fragments in the 200 bp to 1100 bp size range, which is a several fold increase in marker density as compared to the 500 K EA array. In addition, a novel algorithm was developed and validated to extract CNV regions and boundaries.ConclusionUsing a well-characterized pair of DNA samples, close to 200 CNVs were identified, of which nearly 50% appear novel yet were independently validated using quantitative PCR. The results indicate that non-polymorphic probes provide a robust approach for CNV identification, and the increasing precision of CNV boundary delineation should allow a more complete analysis of their genomic organization.

Collaboration


Dive into the Michael H. Shapero's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Shumpei Ishikawa

Tokyo Medical and Dental University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge