Karen Beeson
J. Craig Venter Institute
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Karen Beeson.
PLOS Biology | 2007
Douglas B. Rusch; Aaron L. Halpern; Granger Sutton; Karla B. Heidelberg; Shannon J. Williamson; Shibu Yooseph; Dongying Wu; Jonathan A. Eisen; Jeff Hoffman; Karin A. Remington; Karen Beeson; Bao Duc Tran; Hamilton O. Smith; Holly Baden-Tillson; Clare Stewart; Joyce Thorpe; Jason Freeman; Cynthia Andrews-Pfannkoch; Joseph E. Venter; Kelvin Li; Saul Kravitz; John F. Heidelberg; Terry Utterback; Yu-Hui Rogers; Luisa I. Falcón; Valeria Souza; Germán Bonilla-Rosso; Luis E. Eguiarte; David M. Karl; Shubha Sathyendranath
The worlds oceans contain a complex mixture of micro-organisms that are for the most part, uncharacterized both genetically and biochemically. We report here a metagenomic study of the marine planktonic microbiota in which surface (mostly marine) water samples were analyzed as part of the Sorcerer II Global Ocean Sampling expedition. These samples, collected across a several-thousand km transect from the North Atlantic through the Panama Canal and ending in the South Pacific yielded an extensive dataset consisting of 7.7 million sequencing reads (6.3 billion bp). Though a few major microbial clades dominate the planktonic marine niche, the dataset contains great diversity with 85% of the assembled sequence and 57% of the unassembled data being unique at a 98% sequence identity cutoff. Using the metadata associated with each sample and sequencing library, we developed new comparative genomic and assembly methods. One comparative genomic method, termed “fragment recruitment,” addressed questions of genome structure, evolution, and taxonomic or phylogenetic diversity, as well as the biochemical diversity of genes and gene families. A second method, termed “extreme assembly,” made possible the assembly and reconstruction of large segments of abundant but clearly nonclonal organisms. Within all abundant populations analyzed, we found extensive intra-ribotype diversity in several forms: (1) extensive sequence variation within orthologous regions throughout a given genome; despite coverage of individual ribotypes approaching 500-fold, most individual sequencing reads are unique; (2) numerous changes in gene content some with direct adaptive implications; and (3) hypervariable genomic islands that are too variable to assemble. The intra-ribotype diversity is organized into genetically isolated populations that have overlapping but independent distributions, implying distinct environmental preference. We present novel methods for measuring the genomic similarity between metagenomic samples and show how they may be grouped into several community types. Specific functional adaptations can be identified both within individual ribotypes and across the entire community, including proteorhodopsin spectral tuning and the presence or absence of the phosphate-binding gene PstS.
PLOS Biology | 2007
Samuel Levy; Granger Sutton; Pauline C. Ng; Lars Feuk; Aaron L. Halpern; Brian Walenz; Nelson Axelrod; Jiaqi Huang; Ewen F. Kirkness; Gennady Denisov; Yuan Lin; Jeffrey R. MacDonald; Andy Wing Chun Pang; Mary Shago; Timothy B. Stockwell; Alexia Tsiamouri; Vineet Bafna; Vikas Bansal; Saul Kravitz; Dana Busam; Karen Beeson; Tina McIntosh; Karin A. Remington; Josep F. Abril; John Gill; Jon Borman; Yu-Hui Rogers; Marvin Frazier; Stephen W. Scherer; Robert L. Strausberg
Presented here is a genome sequence of an individual human. It was produced from ∼32 million random DNA fragments, sequenced by Sanger dideoxy technology and assembled into 4,528 scaffolds, comprising 2,810 million bases (Mb) of contiguous sequence with approximately 7.5-fold coverage for any given region. We developed a modified version of the Celera assembler to facilitate the identification and comparison of alternate alleles within this individual diploid genome. Comparison of this genome and the National Center for Biotechnology Information human reference assembly revealed more than 4.1 million DNA variants, encompassing 12.3 Mb. These variants (of which 1,288,319 were novel) included 3,213,401 single nucleotide polymorphisms (SNPs), 53,823 block substitutions (2–206 bp), 292,102 heterozygous insertion/deletion events (indels)(1–571 bp), 559,473 homozygous indels (1–82,711 bp), 90 inversions, as well as numerous segmental duplications and copy number variation regions. Non-SNP DNA variation accounts for 22% of all events identified in the donor, however they involve 74% of all variant bases. This suggests an important role for non-SNP genetic alterations in defining the diploid genome structure. Moreover, 44% of genes were heterozygous for one or more variants. Using a novel haplotype assembly strategy, we were able to span 1.5 Gb of genome sequence in segments >200 kb, providing further precision to the diploid nature of the genome. These data depict a definitive molecular portrait of a diploid human genome that provides a starting point for future genome comparisons and enables an era of individualized genomic information.
PLOS Genetics | 2007
Yann Marcy; Thomas Ishoey; Roger S. Lasken; Timothy B. Stockwell; Brian Walenz; Aaron L. Halpern; Karen Beeson; Susanne M. D. Goldberg; Stephen R. Quake
Since only a small fraction of environmental bacteria are amenable to laboratory culture, there is great interest in genomic sequencing directly from single cells. Sufficient DNA for sequencing can be obtained from one cell by the Multiple Displacement Amplification (MDA) method, thereby eliminating the need to develop culture methods. Here we used a microfluidic device to isolate individual Escherichia coli and amplify genomic DNA by MDA in 60-nl reactions. Our results confirm a report that reduced MDA reaction volume lowers nonspecific synthesis that can result from contaminant DNA templates and unfavourable interaction between primers. The quality of the genome amplification was assessed by qPCR and compared favourably to single-cell amplifications performed in standard 50-μl volumes. Amplification bias was greatly reduced in nanoliter volumes, thereby providing a more even representation of all sequences. Single-cell amplicons from both microliter and nanoliter volumes provided high-quality sequence data by high-throughput pyrosequencing, thereby demonstrating a straightforward route to sequencing genomes from single cells.
Nature | 2010
Shibu Yooseph; Kenneth H. Nealson; Douglas B. Rusch; John P. McCrow; Christopher L. Dupont; Maria Kim; Justin Johnson; Robert Montgomery; Steve Ferriera; Karen Beeson; Shannon J. Williamson; Andrey Tovchigrechko; Andrew E. Allen; Lisa Zeigler; Granger Sutton; Eric Eisenstadt; Yu-Hui Rogers; Robert Friedman; Marvin Frazier; J. Craig Venter
The understanding of marine microbial ecology and metabolism has been hampered by the paucity of sequenced reference genomes. To this end, we report the sequencing of 137 diverse marine isolates collected from around the world. We analysed these sequences, along with previously published marine prokaryotic genomes, in the context of marine metagenomic data, to gain insights into the ecology of the surface ocean prokaryotic picoplankton (0.1–3.0 μm size range). The results suggest that the sequenced genomes define two microbial groups: one composed of only a few taxa that are nearly always abundant in picoplanktonic communities, and the other consisting of many microbial taxa that are rarely abundant. The genomic content of the second group suggests that these microbes are capable of slow growth and survival in energy-limited environments, and rapid growth in energy-rich environments. By contrast, the abundant and cosmopolitan picoplanktonic prokaryotes for which there is genomic representation have smaller genomes, are probably capable of only slow growth and seem to be relatively unable to sense or rapidly acclimate to energy-rich conditions. Their genomic features also lead us to propose that one method used to avoid predation by viruses and/or bacterivores is by means of slow growth and the maintenance of low biomass.
Environmental Microbiology | 2008
Allison K. Shaw; Aaron L. Halpern; Karen Beeson; Bao Tran; J. Craig Venter; Jennifer B. H. Martiny
The study of microbial diversity patterns is hampered by the enormous diversity of microbial communities and the lack of resources to sample them exhaustively. For many questions about richness and evenness, however, one only needs to know the relative order of diversity among samples rather than total diversity. We used 16S libraries from the Global Ocean Survey to investigate the ability of 10 diversity statistics (including rarefaction, non-parametric, parametric, curve extrapolation and diversity indices) to assess the relative diversity of six aquatic bacterial communities. Overall, we found that the statistics yielded remarkably similar rankings of the samples for a given sequence similarity cut-off. This correspondence, despite the different underlying assumptions of the statistics, suggests that diversity statistics are a useful tool for ranking samples of microbial diversity. In addition, sequence similarity cut-off influenced the diversity ranking of the samples, demonstrating that diversity statistics can also be used to detect differences in phylogenetic structure among microbial communities. Finally, a subsampling analysis suggests that further sequencing from these particular clone libraries would not have substantially changed the richness rankings of the samples.
Developmental Cell | 2009
Gaetano Gargiulo; Samuel Levy; Gabriele Bucci; Mauro Romanenghi; Lorenzo Fornasari; Karen Beeson; Susanne M. D. Goldberg; Matteo Cesaroni; Marco Ballarini; Fabio Santoro; Natalie Bezman; Gianmaria Frigè; Philip D. Gregory; Michael C. Holmes; Robert L. Strausberg; Pier Giuseppe Pelicci; Fyodor D. Urnov; Saverio Minucci
It is well established that epigenetic modulation of genome accessibility in chromatin occurs during biological processes. Here we describe a method based on restriction enzymes and next-generation sequencing for identifying accessible DNA elements using a small amount of starting material, and use it to examine myeloid differentiation of primary human CD34+ cells. The accessibility of several classes of cis-regulatory elements was a predictive marker of in vivo DNA binding by transcription factors, and was associated with distinct patterns of histone posttranslational modifications. We also mapped large chromosomal domains with differential accessibility in progenitors and maturing cells. Accessibility became restricted during differentiation, correlating with a decreased number of expressed genes and loss of regulatory potential. Our data suggest that a permissive chromatin structure in multipotent cells is progressively and selectively closed during differentiation, and illustrate the use of our method for the identification of functional cis-regulatory elements.
BMC Bioinformatics | 2008
Kelvin Li; Anushka Brownley; Timothy B. Stockwell; Karen Beeson; Tina McIntosh; Dana Busam; Steve Ferriera; Sean Murphy; Samuel Levy
BackgroundPolymerase chain reaction (PCR) is used in directed sequencing for the discovery of novel polymorphisms. As the first step in PCR directed sequencing, effective PCR primer design is crucial for obtaining high-quality sequence data for target regions. Since current computational primer design tools are not fully tuned with stable underlying laboratory protocols, researchers may still be forced to iteratively optimize protocols for failed amplifications after the primers have been ordered. Furthermore, potentially identifiable factors which contribute to PCR failures have yet to be elucidated. This inefficient approach to primer design is further intensified in a high-throughput laboratory, where hundreds of genes may be targeted in one experiment.ResultsWe have developed a fully integrated computational PCR primer design pipeline that plays a key role in our high-throughput directed sequencing pipeline. Investigators may specify target regions defined through a rich set of descriptors, such as Ensembl accessions and arbitrary genomic coordinates. Primer pairs are then selected computationally to produce a minimal amplicon set capable of tiling across the specified target regions. As part of the tiling process, primer pairs are computationally screened to meet the criteria for success with one of two PCR amplification protocols. In the process of improving our sequencing success rate, which currently exceeds 95% for exons, we have discovered novel and accurate computational methods capable of identifying primers that may lead to PCR failures. We reveal the laboratory protocols and their associated, empirically determined computational parameters, as well as describe the novel computational methods which may benefit others in future primer design research.ConclusionThe high-throughput PCR primer design pipeline has been very successful in providing the basis for high-quality directed sequencing results and for minimizing costs associated with labor and reprocessing. The modular architecture of the primer design software has made it possible to readily integrate additional primer critique tests based on iterative feedback from the laboratory. As a result, the primer design software, coupled with the laboratory protocols, serves as a powerful tool for low and high-throughput primer design to enable successful directed sequencing.
PLOS ONE | 2015
Eva K.F. Chan; Rae-Anne Hardie; Desiree C. Petersen; Karen Beeson; Riana Bornman; Andrew B. Smith; Vanessa M. Hayes
The oldest extant human maternal lineages include mitochondrial haplogroups L0d and L0k found in the southern African click-speaking forager peoples broadly classified as Khoesan. Profiling these early mitochondrial lineages allows for better understanding of modern human evolution. In this study, we profile 77 new early-diverged complete mitochondrial genomes and sub-classify another 105 L0d/L0k individuals from southern Africa. We use this data to refine basal phylogenetic divergence, coalescence times and Khoesan prehistory. Our results confirm L0d as the earliest diverged lineage (∼172 kya, 95%CI: 149–199 kya), followed by L0k (∼159 kya, 95%CI: 136–183 kya) and a new lineage we name L0g (∼94 kya, 95%CI: 72–116 kya). We identify two new L0d1 subclades we name L0d1d and L0d1c4/L0d1e, and estimate L0d2 and L0d1 divergence at ∼93 kya (95%CI:76–112 kya). We concur the earliest emerging L0d1’2 sublineage L0d1b (∼49 kya, 95%CI:37–58 kya) is widely distributed across southern Africa. Concomitantly, we find the most recent sublineage L0d2a (∼17 kya, 95%CI:10–27 kya) to be equally common. While we agree that lineages L0d1c and L0k1a are restricted to contemporary inland Khoesan populations, our observed predominance of L0d2a and L0d1a in non-Khoesan populations suggests a once independent coastal Khoesan prehistory. The distribution of early-diverged human maternal lineages within contemporary southern Africans suggests a rich history of human existence prior to any archaeological evidence of migration into the region. For the first time, we provide a genetic-based evidence for significant modern human evolution in southern Africa at the time of the Last Glacial Maximum at between ∼21–17 kya, coinciding with the emergence of major lineages L0d1a, L0d2b, L0d2d and L0d2a.
computational systems bioinformatics | 2006
Marvin E. Frazier; Douglas B. Rusch; Aaron L. Halpern; Karla B. Heidelberg; Granger Sutton; Shannon J. Williamson; Shibu Yooseph; Dongying Wu; Jonathan A. Eisen; Jeff Hoffman; Charles H. Howard; Cyrus Foote; Brooke A. Dill; Karin A. Remington; Karen Beeson; Bao Tran; Hamilton O. Smith; Holly Baden-Tillson; Clare Stewart; Joyce Thorpe; Jason Freemen; Cindy Pfannkoch; Joseph E. Venter; John F. Heidelberg; Terry Utterback; Yu-Hui Rogers; Shaojie Zhang; Vineet Bafna; Luisa I. Falcón; Valeria Souza
Marvin E. Frazier,Douglas B. Rusch, Aaron L. Halpern, Karla B. Heidelberg, Granger Sutton, Shannon Williamson, Shibu Yooseph, Dongying Wu, Jonathan A. Eisen, Jeff Hoffman, Charles H. Howard, Cyrus Foote, Brooke A. Dill, Karin Remington, Karen Beeson, Bao Tran, Hamilton Smith, Holly Baden-Tillson, Clare Stewart, Joyce Thorpe, Jason Freemen, Cindy Pfannkoch, Joseph E. Venter, John Heidelberg, Terry Utterback, Yu-Hui Rogers, Shaojie Zhang, Vineet Bafna, Luisa Falcon, Valeria Souza,German Bonilla, Luis E. Eguiarte , David M. Karl, Ken Nealson, Shubha Sathyendranath, Trevor Platt, Eldredge Bermingham, Victor Gallardo, Giselle Tamayo, Robert Friedman, Robert Strausberg, J. Craig Venter 1 J. Craig Venter Institute, Rockville, Maryland, United States Of America 2 The Institute For Genomic Research, Rockville, Maryland, United States Of America 3 Department of Computer Science, University of California San Diego 4 Instituto de Ecologia Dept. Ecologia Evolutiva, National Autonomous University of Mexico Mexico City, 04510 Distrito Federal, Mexico 5 University of Hawaii, Honolulu, United States of America 6 Dept. of Earth Sciences, University of Southern California, Los Angeles, California, United States of America 7 Dalhousie University, Halifax, Nova Scotia, Canada 8 Smithsonian Tropical Research Institute, Balboa, Ancon, Republic of Panama 9 University of Concepcion, Concepcion, Chile 10 University of Costa Rica, San Pedro, San Jose, Republic of Costa Rica
Genome Biology | 2009
Olivier Harismendy; Pauline C. Ng; Robert L. Strausberg; Xiaoyun Wang; Timothy B. Stockwell; Karen Beeson; Nicholas J. Schork; Sarah S. Murray; Eric J. Topol; Samuel Levy; Kelly A. Frazer