Alexander Wait Zaranek
Harvard University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Alexander Wait Zaranek.
Science | 2010
Radoje Drmanac; Andrew Sparks; Matthew J. Callow; Aaron L. Halpern; Norman L. Burns; Bahram Ghaffarzadeh Kermani; Paolo Carnevali; Igor Nazarenko; Geoffrey B. Nilsen; George Yeung; Fredrik Dahl; Andres Fernandez; Bryan Staker; Krishna Pant; Jonathan Baccash; Adam P. Borcherding; Anushka Brownley; Ryan Cedeno; Linsu Chen; Dan Chernikoff; Alex Cheung; Razvan Chirita; Benjamin Curson; Jessica Ebert; Coleen R. Hacker; Robert Hartlage; Brian Hauser; Steve Huang; Yuan Jiang; Vitali Karpinchyk
Toward
The Lancet | 2010
Euan A. Ashley; Atul J. Butte; Matthew T. Wheeler; Rong Chen; Teri E. Klein; Frederick E. Dewey; Joel T. Dudley; Kelly E. Ormond; Aleksandra Pavlovic; Alexander A. Morgan; Dmitry Pushkarev; Norma F. Neff; Louanne Hudgins; Li Gong; Laura M. Hodges; Dorit S. Berlin; Caroline F. Thorn; Joan M. Hebert; Mark Woon; Hersh Sagreiya; Ryan Whaley; Joshua W. Knowles; Michael F. Chou; Joseph V. Thakuria; Abraham M. Rosenbaum; Alexander Wait Zaranek; George M. Church; Henry T. Greely; Stephen R. Quake; Russ B. Altman
1000 Genomes The ability to generate human genome sequence data that is complete, accurate, and inexpensive is a necessary prerequisite to perform genome-wide disease association studies. Drmanac et al. (p. 78, published online 5 November) present a technique advancing toward this goal. The method uses Type IIS endonucleases to incorporate short oligonucleotides within a set of randomly sheared circularized DNA. DNA polymerase then generates concatenated copies of the circular oligonucleotides leading to formation of compact but very long oligonucleotides which are then sequenced by ligation. The relatively low cost of this technology, which shows a low error rate, advances sequencing closer to the goal of the
Nature | 2012
Brock A. Peters; Bahram Ghaffarzadeh Kermani; Andrew Sparks; Oleg Alferov; Peter Hong; Andrei Alexeev; Yuan Jiang; Fredrik Dahl; Y. Tom Tang; Juergen Haas; Kimberly Robasky; Alexander Wait Zaranek; Je-Hyuk Lee; Madeleine Ball; Joseph E. Peterson; Helena Perazich; George Yeung; Jia Liu; Linsu Chen; Michael Kennemer; Kaliprasad Pothuraju; Karel Konvicka; Mike Tsoupko-Sitnikov; Krishna Pant; Jessica Ebert; Geoffrey B. Nilsen; Jonathan Baccash; Aaron L. Halpern; George M. Church; Radoje Drmanac
1000 genome. A low-cost sequencing technique advances us closer to the goal of the
Proceedings of the National Academy of Sciences of the United States of America | 2012
Madeleine Ball; Joseph V. Thakuria; Alexander Wait Zaranek; Tom Clegg; Abraham M. Rosenbaum; Xiaodi Wu; Misha Angrist; Jong Bhak; Jason Bobe; Matthew J. Callow; Carlos Cano; Michael F. Chou; Wendy K. Chung; Shawn M. Douglas; Preston W. Estep; Athurva Gore; Peter J. Hulick; Alberto Labarga; Je-Hyuk Lee; Jeantine E. Lunshof; Byung Chul Kim; Jong-Il Kim; Zhe Li; Michael F. Murray; Geoffrey B. Nilsen; Brock A. Peters; Anugraha M. Raman; Hugh Y. Rienhoff; Kimberly Robasky; Matthew T. Wheeler
1000 human genome. Genome sequencing of large numbers of individuals promises to advance the understanding, treatment, and prevention of human diseases, among other applications. We describe a genome sequencing platform that achieves efficient imaging and low reagent consumption with combinatorial probe anchor ligation chemistry to independently assay each base from patterned nanoarrays of self-assembling DNA nanoballs. We sequenced three human genomes with this platform, generating an average of 45- to 87-fold coverage per genome and identifying 3.2 to 4.5 million sequence variants per genome. Validation of one genome data set demonstrates a sequence accuracy of about 1 false variant per 100 kilobases. The high accuracy, affordable cost of
PLOS Genetics | 2011
Frederick E. Dewey; Rong Chen; Sergio Cordero; Kelly E. Ormond; Colleen Caleshu; Konrad J. Karczewski; Michelle Whirl-Carrillo; Matthew T. Wheeler; Joel T. Dudley; Jake K. Byrnes; Omar E. Cornejo; Joshua W. Knowles; Mark Woon; Li Gong; Caroline F. Thorn; Joan M. Hebert; Emidio Capriotti; Sean P. David; Aleksandra Pavlovic; Anne West; Joseph V. Thakuria; Madeleine Ball; Alexander Wait Zaranek; Heidi L. Rehm; George M. Church; John West; Carlos Bustamante; Michael Snyder; Russ B. Altman; Teri E. Klein
4400 for sequencing consumables, and scalability of this platform enable complete human genome sequencing for the detection of rare variants in large-scale genetic studies.
Scientific Data | 2016
Justin M. Zook; David N. Catoe; Jennifer H. McDaniel; Lindsay Vang; Noah Spies; Arend Sidow; Ziming Weng; Yuling Liu; Christopher E. Mason; Noah Alexander; Elizabeth Henaff; Alexa B. R. McIntyre; Dhruva Chandramohan; Feng Chen; Erich Jaeger; Ali Moshrefi; Khoa Pham; William Stedman; Tiffany Liang; Michael Saghbini; Zeljko Dzakula; Alex Hastie; Han Cao; Gintaras Deikus; Eric E. Schadt; Robert Sebra; Ali Bashir; Rebecca Truty; Christopher C. Chang; Natali Gulbahce
BACKGROUND The cost of genomic information has fallen steeply, but the clinical translation of genetic risk estimates remains unclear. We aimed to undertake an integrated analysis of a complete human genome in a clinical context. METHODS We assessed a patient with a family history of vascular disease and early sudden death. Clinical assessment included analysis of this patients full genome sequence, risk prediction for coronary artery disease, screening for causes of sudden cardiac death, and genetic counselling. Genetic analysis included the development of novel methods for the integration of whole genome and clinical risk. Disease and risk analysis focused on prediction of genetic risk of variants associated with mendelian disease, recognised drug responses, and pathogenicity for novel variants. We queried disease-specific mutation databases and pharmacogenomics databases to identify genes and mutations with known associations with disease and drug response. We estimated post-test probabilities of disease by applying likelihood ratios derived from integration of multiple common variants to age-appropriate and sex-appropriate pre-test probabilities. We also accounted for gene-environment interactions and conditionally dependent risks. FINDINGS Analysis of 2.6 million single nucleotide polymorphisms and 752 copy number variations showed increased genetic risk for myocardial infarction, type 2 diabetes, and some cancers. We discovered rare variants in three genes that are clinically associated with sudden cardiac death-TMEM43, DSP, and MYBPC3. A variant in LPA was consistent with a family history of coronary artery disease. The patient had a heterozygous null mutation in CYP2C19 suggesting probable clopidogrel resistance, several variants associated with a positive response to lipid-lowering therapy, and variants in CYP4F2 and VKORC1 that suggest he might have a low initial dosing requirement for warfarin. Many variants of uncertain importance were reported. INTERPRETATION Although challenges remain, our results suggest that whole-genome sequencing can yield useful and clinically relevant information for individual patients. FUNDING National Institute of General Medical Sciences; National Heart, Lung And Blood Institute; National Human Genome Research Institute; Howard Hughes Medical Institute; National Library of Medicine, Lucile Packard Foundation for Childrens Health; Hewlett Packard Foundation; Breetwor Family Foundation.
Bioinformatics | 2009
Nava Whiteford; Tom Skelly; Christina Curtis; Matthew E. Ritchie; Andrea Löhr; Alexander Wait Zaranek; Irina I. Abnizova; Clive Gavin Brown
Recent advances in whole-genome sequencing have brought the vision of personal genomics and genomic medicine closer to reality. However, current methods lack clinical accuracy and the ability to describe the context (haplotypes) in which genome variants co-occur in a cost-effective manner. Here we describe a low-cost DNA sequencing and haplotyping process, long fragment read (LFR) technology, which is similar to sequencing long single DNA molecules without cloning or separation of metaphase chromosomes. In this study, ten LFR libraries were made using only ∼100 picograms of human DNA per sample. Up to 97% of the heterozygous single nucleotide variants were assembled into long haplotype contigs. Removal of false positive single nucleotide variants not phased by multiple LFR haplotypes resulted in a final genome error rate of 1 in 10 megabases. Cost-effective and accurate genome sequencing and haplotyping from 10–20 human cells, as demonstrated here, will enable comprehensive genetic studies and diverse clinical applications.
Genome Research | 2009
Jin Billy Li; Yuan Gao; John Aach; Kun Zhang; Gregory V. Kryukov; Bin Xie; Annika Ahlford; Jung-Ki Yoon; Abraham M. Rosenbaum; Alexander Wait Zaranek; Emily LeProust; Shamil R. Sunyaev; George M. Church
Rapid advances in DNA sequencing promise to enable new diagnostics and individualized therapies. Achieving personalized medicine, however, will require extensive research on highly reidentifiable, integrated datasets of genomic and health information. To assist with this, participants in the Personal Genome Project choose to forgo privacy via our institutional review board- approved “open consent” process. The contribution of public data and samples facilitates both scientific discovery and standardization of methods. We present our findings after enrollment of more than 1,800 participants, including whole-genome sequencing of 10 pilot participant genomes (the PGP-10). We introduce the Genome-Environment-Trait Evidence (GET-Evidence) system. This tool automatically processes genomes and prioritizes both published and novel variants for interpretation. In the process of reviewing the presumed healthy PGP-10 genomes, we find numerous literature references implying serious disease. Although it is sometimes impossible to rule out a late-onset effect, stringent evidence requirements can address the high rate of incidental findings. To that end we develop a peer production system for recording and organizing variant evaluations according to standard evidence guidelines, creating a public forum for reaching consensus on interpretation of clinically relevant variants. Genome analysis becomes a two-step process: using a prioritized list to record variant evaluations, then automatically sorting reviewed variants using these annotations. Genome data, health and trait information, participant samples, and variant interpretations are all shared in the public domain—we invite others to review our results using our participant samples and contribute to our interpretations. We offer our public resource and methods to further personalized medical research.
Genome Medicine | 2014
Madeleine Ball; Jason Bobe; Michael F. Chou; Tom Clegg; Preston W. Estep; Jeantine E. Lunshof; Ward Vandewege; Alexander Wait Zaranek; George M. Church
Whole-genome sequencing harbors unprecedented potential for characterization of individual and family genetic variation. Here, we develop a novel synthetic human reference sequence that is ethnically concordant and use it for the analysis of genomes from a nuclear family with history of familial thrombophilia. We demonstrate that the use of the major allele reference sequence results in improved genotype accuracy for disease-associated variant loci. We infer recombination sites to the lowest median resolution demonstrated to date (<1,000 base pairs). We use family inheritance state analysis to control sequencing error and inform family-wide haplotype phasing, allowing quantification of genome-wide compound heterozygosity. We develop a sequence-based methodology for Human Leukocyte Antigen typing that contributes to disease risk prediction. Finally, we advance methods for analysis of disease and pharmacogenomic risk across the coding and non-coding genome that incorporate phased variant data. We show these methods are capable of identifying multigenic risk for inherited thrombophilia and informing the appropriate pharmacological therapy. These ethnicity-specific, family-based approaches to interpretation of genetic variation are emblematic of the next generation of genetic risk assessment using whole-genome sequencing.
PLOS Genetics | 2010
Alexander Wait Zaranek; Erez Y. Levanon; Tomer Zecharia; Tom Clegg; George M. Church
The Genome in a Bottle Consortium, hosted by the National Institute of Standards and Technology (NIST) is creating reference materials and data for human genome sequencing, as well as methods for genome comparison and benchmarking. Here, we describe a large, diverse set of sequencing data for seven human genomes; five are current or candidate NIST Reference Materials. The pilot genome, NA12878, has been released as NIST RM 8398. We also describe data from two Personal Genome Project trios, one of Ashkenazim Jewish ancestry and one of Chinese ancestry. The data come from 12 technologies: BioNano Genomics, Complete Genomics paired-end and LFR, Ion Proton exome, Oxford Nanopore, Pacific Biosciences, SOLiD, 10X Genomics GemCode WGS, and Illumina exome and WGS paired-end, mate-pair, and synthetic long reads. Cell lines, DNA, and data from these individuals are publicly available. Therefore, we expect these data to be useful for revealing novel information about the human genome and improving sequencing technologies, SNP, indel, and structural variant calling, and de novo assembly.