Sarah K. Young | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Sarah K. Young is active.

Explore More

Publication

Featured researches published by Sarah K. Young.

PLOS ONE | 2014

Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement.

Bruce J. Walker; Thomas Abeel; Terrance Shea; Margaret Priest; Amr Abouelliel; Sharadha Sakthikumar; Christina A. Cuomo; Qiandong Zeng; Jennifer R. Wortman; Sarah K. Young; Ashlee M. Earl

Advances in modern sequencing technologies allow us to generate sufficient data to analyze hundreds of bacterial genomes from a single machine in a single day. This potential for sequencing massive numbers of genomes calls for fully automated methods to produce high-quality assemblies and variant calls. We introduce Pilon, a fully automated, all-in-one tool for correcting draft assemblies and calling sequence variants of multiple sizes, including very large insertions and deletions. Pilon works with many types of sequence data, but is particularly strong when supplied with paired end data from two Illumina libraries with small e.g., 180 bp and large e.g., 3–5 Kb inserts. Pilon significantly improves draft genome assemblies by correcting bases, fixing mis-assemblies and filling gaps. For both haploid and diploid genomes, Pilon produces more contiguous genomes with fewer errors, enabling identification of more biologically relevant genes. Furthermore, Pilon identifies small variants with high accuracy as compared to state-of-the-art tools and is unique in its ability to accurately identify large sequence variants including duplications and resolve large insertions. Pilon is being used to improve the assemblies of thousands of new genomes and to identify variants from thousands of clinically relevant bacterial strains. Pilon is freely available as open source software.

Hepatology | 2008

Naturally Occurring Dominant Resistance Mutations to Hepatitis C Virus Protease and Polymerase Inhibitors in Treatment-Naïve Patients

Thomas Kuntzen; Joerg Timm; Andrew Berical; Niall J. Lennon; Aaron M. Berlin; Sarah K. Young; Bongshin Lee; David Heckerman; Jonathan M. Carlson; Laura L. Reyor; Marianna Kleyman; Cory McMahon; Christopher Birch; Julian Schulze zur Wiesch; Timothy Ledlie; Michael Koehrsen; Chinnappa D. Kodira; Andrew Roberts; Georg M. Lauer; Hugo R. Rosen; Florian Bihl; Andreas Cerny; Ulrich Spengler; Zhimin Liu; Arthur Y. Kim; Yanming Xing; Arne Schneidewind; Margaret A. Madey; Jaquelyn F. Fleckenstein; Vicki Park

Resistance mutations to hepatitis C virus (HCV) nonstructural protein 3 (NS3) protease inhibitors in <1% of the viral quasispecies may still allow >1000‐fold viral load reductions upon treatment, consistent with their reported reduced replicative fitness in vitro. Recently, however, an R155K protease mutation was reported as the dominant quasispecies in a treatment‐naïve individual, raising concerns about possible full drug resistance. To investigate the prevalence of dominant resistance mutations against specifically targeted antiviral therapy for HCV (STAT‐C) in the population, we analyzed HCV genome sequences from 507 treatment‐naïve patients infected with HCV genotype 1 from the United States, Germany, and Switzerland. Phylogenetic sequence analysis and viral load data were used to identify the possible spread of replication‐competent, drug‐resistant viral strains in the population and to infer the consequences of these mutations upon viral replication in vivo. Mutations described to confer resistance to the protease inhibitors Telaprevir, BILN2061, ITMN‐191, SCH6 and Boceprevir; the NS5B polymerase inhibitor AG‐021541; and to the NS4A antagonist ACH‐806 were observed mostly as sporadic, unrelated cases, at frequencies between 0.3% and 2.8% in the population, including two patients with possible multidrug resistance. Collectively, however, 8.6% of the patients infected with genotype 1a and 1.4% of those infected with genotype 1b carried at least one dominant resistance mutation. Viral loads were high in the majority of these patients, suggesting that drug‐resistant viral strains might achieve replication levels comparable to nonresistant viruses in vivo. Conclusion: Naturally occurring dominant STAT‐C resistance mutations are common in treatment‐naïve patients infected with HCV genotype 1. Their influence on treatment outcome should further be characterized to evaluate possible benefits of drug resistance testing for individual tailoring of drug combinations when treatment options are limited due to previous nonresponse to peginterferon and ribavirin. (HEPATOLOGY 2008;48:1769–1778.)

Science | 2011

Comparative Functional Genomics of the Fission Yeasts

Nicholas Rhind; Zehua Chen; Moran Yassour; Dawn Anne Thompson; Brian J. Haas; Naomi Habib; Ilan Wapinski; Sushmita Roy; Michael F. Lin; David I. Heiman; Sarah K. Young; Kanji Furuya; Yabin Guo; Alison L. Pidoux; Huei Mei Chen; Barbara Robbertse; Jonathan M. Goldberg; Keita Aoki; Elizabeth H. Bayne; Aaron M. Berlin; Christopher A. Desjardins; Edward Dobbs; Livio Dukaj; Lin Fan; Michael Fitzgerald; Courtney French; Sharvari Gujja; Klavs Wörgler Hansen; Daniel Keifenheim; Joshua Z. Levin

A combined analysis of genome sequence, structure, and expression gives insights into fission yeast biology. The fission yeast clade—comprising Schizosaccharomyces pombe, S. octosporus, S. cryophilus, and S. japonicus—occupies the basal branch of Ascomycete fungi and is an important model of eukaryote biology. A comparative annotation of these genomes identified a near extinction of transposons and the associated innovation of transposon-free centromeres. Expression analysis established that meiotic genes are subject to antisense transcription during vegetative growth, which suggests a mechanism for their tight regulation. In addition, trans-acting regulators control new genes within the context of expanded functional modules for meiosis and stress response. Differences in gene content and regulation also explain why, unlike the budding yeast of Saccharomycotina, fission yeasts cannot use ethanol as a primary carbon source. These analyses elucidate the genome structure and gene regulation of fission yeast and provide tools for investigation across the Schizosaccharomyces clade.

PLOS Pathogens | 2012

Whole genome deep sequencing of HIV-1 reveals the impact of early minor variants upon immune recognition during acute infection

Matthew R. Henn; Christian L. Boutwell; Patrick Charlebois; Niall J. Lennon; Karen A. Power; Alexander R. Macalalad; Aaron M. Berlin; Christine M. Malboeuf; Elizabeth Ryan; Sante Gnerre; Michael C. Zody; Rachel L. Erlich; Lisa Green; Andrew Berical; Yaoyu Wang; Monica Casali; Hendrik Streeck; Allyson K. Bloom; Tim Dudek; Damien C. Tully; Ruchi M. Newman; Karen L. Axten; Adrianne D. Gladden; Laura Battis; Michael Kemper; Qiandong Zeng; Terrance Shea; Sharvari Gujja; Carmen Zedlack; Olivier Gasser

Deep sequencing technologies have the potential to transform the study of highly variable viral pathogens by providing a rapid and cost-effective approach to sensitively characterize rapidly evolving viral quasispecies. Here, we report on a high-throughput whole HIV-1 genome deep sequencing platform that combines 454 pyrosequencing with novel assembly and variant detection algorithms. In one subject we combined these genetic data with detailed immunological analyses to comprehensively evaluate viral evolution and immune escape during the acute phase of HIV-1 infection. The majority of early, low frequency mutations represented viral adaptation to host CD8+ T cell responses, evidence of strong immune selection pressure occurring during the early decline from peak viremia. CD8+ T cell responses capable of recognizing these low frequency escape variants coincided with the selection and evolution of more effective secondary HLA-anchor escape mutations. Frequent, and in some cases rapid, reversion of transmitted mutations was also observed across the viral genome. When located within restricted CD8 epitopes these low frequency reverting mutations were sufficient to prime de novo responses to these epitopes, again illustrating the capacity of the immune response to recognize and respond to low frequency variants. More importantly, rapid viral escape from the most immunodominant CD8+ T cell responses coincided with plateauing of the initial viral load decline in this subject, suggestive of a potential link between maintenance of effective, dominant CD8 responses and the degree of early viremia reduction. We conclude that the early control of HIV-1 replication by immunodominant CD8+ T cell responses may be substantially influenced by rapid, low frequency viral adaptations not detected by conventional sequencing approaches, which warrants further investigation. These data support the critical need for vaccine-induced CD8+ T cell responses to target more highly constrained regions of the virus in order to ensure the maintenance of immunodominant CD8 responses and the sustained decline of early viremia.

Genome Research | 2012

Finished bacterial genomes from shotgun sequence data

Filipe J. Ribeiro; Dariusz Przybylski; Shuangye Yin; Ted Sharpe; Sante Gnerre; Amr Abouelleil; Aaron M. Berlin; Anna Montmayeur; Terrance Shea; Bruce J. Walker; Sarah K. Young; Carsten Russ; Chad Nusbaum; Iain MacCallum; David B. Jaffe

Exceptionally accurate genome reference sequences have proven to be of great value to microbial researchers. Thus, to date, about 1800 bacterial genome assemblies have been finished at great expense with the aid of manual laboratory and computational processes that typically iterate over a period of months or even years. By applying a new laboratory design and new assembly algorithm to 16 samples, we demonstrate that assemblies exceeding finished quality can be obtained from whole-genome shotgun data and automated computation. Cost and time requirements are thus dramatically reduced.

Nature | 2006

DNA sequence of human chromosome 17 and analysis of rearrangement in the human lineage

Michael C. Zody; Manuel Garber; David J. Adams; Ted Sharpe; Jennifer Harrow; James R. Lupski; Christine Nicholson; Steven M. Searle; Laurens Wilming; Sarah K. Young; Amr Abouelleil; Nicole R. Allen; Weimin Bi; Toby Bloom; Mark L. Borowsky; Boris Bugalter; Jonathan Butler; Jean L. Chang; Chao-Kung Chen; April Cook; Benjamin Corum; Christina A. Cuomo; Pieter J. de Jong; David DeCaprio; Ken Dewar; Michael Fitzgerald; James Gilbert; Richard Gibson; Sante Gnerre; Steven Goldstein

Chromosome 17 is unusual among the human chromosomes in many respects. It is the largest human autosome with orthology to only a single mouse chromosome, mapping entirely to the distal half of mouse chromosome 11. Chromosome 17 is rich in protein-coding genes, having the second highest gene density in the genome. It is also enriched in segmental duplications, ranking third in density among the autosomes. Here we report a finished sequence for human chromosome 17, as well as a structural comparison with the finished sequence for mouse chromosome 11, the first finished mouse chromosome. Comparison of the orthologous regions reveals striking differences. In contrast to the typical pattern seen in mammalian evolution, the human sequence has undergone extensive intrachromosomal rearrangement, whereas the mouse sequence has been remarkably stable. Moreover, although the human sequence has a high density of segmental duplication, the mouse sequence has a very low density. Notably, these segmental duplications correspond closely to the sites of structural rearrangement, demonstrating a link between duplication and rearrangement. Examination of the main classes of duplicated segments provides insight into the dynamics underlying expansion of chromosome-specific, low-copy repeats in the human genome.

PLOS Genetics | 2013

Distinctive expansion of potential virulence genes in the genome of the oomycete fish pathogen Saprolegnia parasitica.

Rays H. Y. Jiang; Irene de Bruijn; Brian J. Haas; Rodrigo Belmonte; Lars Löbach; James S. Christie; Guido Van den Ackerveken; Arnaud Bottin; Vincent Bulone; Sara M. Díaz-Moreno; Bernard Dumas; Lin Fan; Elodie Gaulin; Francine Govers; Laura J. Grenville-Briggs; Neil R. Horner; Joshua Z. Levin; Marco Mammella; Harold J. G. Meijer; Paul F. Morris; Chad Nusbaum; Stan Oome; Andrew J. Phillips; David van Rooyen; Elzbieta Rzeszutek; Marcia Saraiva; Christopher J. Secombes; Michael F. Seidl; Berend Snel; Joost H. M. Stassen

Oomycetes in the class Saprolegniomycetidae of the Eukaryotic kingdom Stramenopila have evolved as severe pathogens of amphibians, crustaceans, fish and insects, resulting in major losses in aquaculture and damage to aquatic ecosystems. We have sequenced the 63 Mb genome of the fresh water fish pathogen, Saprolegnia parasitica. Approximately 1/3 of the assembled genome exhibits loss of heterozygosity, indicating an efficient mechanism for revealing new variation. Comparison of S. parasitica with plant pathogenic oomycetes suggests that during evolution the host cellular environment has driven distinct patterns of gene expansion and loss in the genomes of plant and animal pathogens. S. parasitica possesses one of the largest repertoires of proteases (270) among eukaryotes that are deployed in waves at different points during infection as determined from RNA-Seq data. In contrast, despite being capable of living saprotrophically, parasitism has led to loss of inorganic nitrogen and sulfur assimilation pathways, strikingly similar to losses in obligate plant pathogenic oomycetes and fungi. The large gene families that are hallmarks of plant pathogenic oomycetes such as Phytophthora appear to be lacking in S. parasitica, including those encoding RXLR effectors, Crinklers, and Necrosis Inducing-Like Proteins (NLP). S. parasitica also has a very large kinome of 543 kinases, 10% of which is induced upon infection. Moreover, S. parasitica encodes several genes typical of animals or animal-pathogens and lacking from other oomycetes, including disintegrins and galactose-binding lectins, whose expression and evolutionary origins implicate horizontal gene transfer in the evolution of animal pathogenesis in S. parasitica.

PLOS Medicine | 2015

Evolution of Extensively Drug-Resistant Tuberculosis over Four Decades: Whole Genome Sequencing and Dating Analysis of Mycobacterium tuberculosis Isolates from KwaZulu-Natal

Keira A. Cohen; Thomas Abeel; Abigail Manson McGuire; Christopher A. Desjardins; Vanisha Munsamy; Terrance Shea; Bruce J. Walker; Nonkqubela Bantubani; Deepak Almeida; Lucia Alvarado; Sinéad B. Chapman; Nomonde R. Mvelase; Eamon Y. Duffy; Michael Fitzgerald; Pamla Govender; Sharvari Gujja; Susanna. Hamilton; Clinton Howarth; Jeffrey D. Larimer; Kashmeel Maharaj; Matthew Pearson; Margaret Priest; Qiandong Zeng; Nesri Padayatchi; Jacques Grosset; Sarah K. Young; Jennifer R. Wortman; Koleka Mlisana; Max O'Donnell; Bruce W. Birren

Background The continued advance of antibiotic resistance threatens the treatment and control of many infectious diseases. This is exemplified by the largest global outbreak of extensively drug-resistant (XDR) tuberculosis (TB) identified in Tugela Ferry, KwaZulu-Natal, South Africa, in 2005 that continues today. It is unclear whether the emergence of XDR-TB in KwaZulu-Natal was due to recent inadequacies in TB control in conjunction with HIV or other factors. Understanding the origins of drug resistance in this fatal outbreak of XDR will inform the control and prevention of drug-resistant TB in other settings. In this study, we used whole genome sequencing and dating analysis to determine if XDR-TB had emerged recently or had ancient antecedents. Methods and Findings We performed whole genome sequencing and drug susceptibility testing on 337 clinical isolates of Mycobacterium tuberculosis collected in KwaZulu-Natal from 2008 to 2013, in addition to three historical isolates, collected from patients in the same province and including an isolate from the 2005 Tugela Ferry XDR outbreak, a multidrug-resistant (MDR) isolate from 1994, and a pansusceptible isolate from 1995. We utilized an array of whole genome comparative techniques to assess the relatedness among strains, to establish the order of acquisition of drug resistance mutations, including the timing of acquisitions leading to XDR-TB in the LAM4 spoligotype, and to calculate the number of independent evolutionary emergences of MDR and XDR. Our sequencing and analysis revealed a 50-member clone of XDR M. tuberculosis that was highly related to the Tugela Ferry XDR outbreak strain. We estimated that mutations conferring isoniazid and streptomycin resistance in this clone were acquired 50 y prior to the Tugela Ferry outbreak (katG S315T [isoniazid]; gidB 130 bp deletion [streptomycin]; 1957 [95% highest posterior density (HPD): 1937–1971]), with the subsequent emergence of MDR and XDR occurring 20 y (rpoB L452P [rifampicin]; pncA 1 bp insertion [pyrazinamide]; 1984 [95% HPD: 1974–1992]) and 10 y (rpoB D435G [rifampicin]; rrs 1400 [kanamycin]; gyrA A90V [ofloxacin]; 1995 [95% HPD: 1988–1999]) prior to the outbreak, respectively. We observed frequent de novo evolution of MDR and XDR, with 56 and nine independent evolutionary events, respectively. Isoniazid resistance evolved before rifampicin resistance 46 times, whereas rifampicin resistance evolved prior to isoniazid only twice. We identified additional putative compensatory mutations to rifampicin in this dataset. One major limitation of this study is that the conclusions with respect to ordering and timing of acquisition of mutations may not represent universal patterns of drug resistance emergence in other areas of the globe. Conclusions In the first whole genome-based analysis of the emergence of drug resistance among clinical isolates of M. tuberculosis, we show that the ancestral precursor of the LAM4 XDR outbreak strain in Tugela Ferry gained mutations to first-line drugs at the beginning of the antibiotic era. Subsequent accumulation of stepwise resistance mutations, occurring over decades and prior to the explosion of HIV in this region, yielded MDR and XDR, permitting the emergence of compensatory mutations. Our results suggest that drug-resistant strains circulating today reflect not only vulnerabilities of current TB control efforts but also those that date back 50 y. In drug-resistant TB, isoniazid resistance was overwhelmingly the initial resistance mutation to be acquired, which would not be detected by current rapid molecular diagnostics employed in South Africa that assess only rifampicin resistance.

PLOS ONE | 2010

Analysis of high-throughput sequencing and annotation strategies for phage genomes

Matthew R. Henn; Matthew B. Sullivan; Nicole Stange-Thomann; Marcia S. Osburne; Aaron M. Berlin; Libusha Kelly; Chandri Yandava; Chinnappa D. Kodira; Qiandong Zeng; Michael Weiand; Todd Sparrow; Sakina Saif; Georgia Giannoukos; Sarah K. Young; Chad Nusbaum; Bruce W. Birren; Sallie W. Chisholm

Background Bacterial viruses (phages) play a critical role in shaping microbial populations as they influence both host mortality and horizontal gene transfer. As such, they have a significant impact on local and global ecosystem function and human health. Despite their importance, little is known about the genomic diversity harbored in phages, as methods to capture complete phage genomes have been hampered by the lack of knowledge about the target genomes, and difficulties in generating sufficient quantities of genomic DNA for sequencing. Of the approximately 550 phage genomes currently available in the public domain, fewer than 5% are marine phage. Methodology/Principal Findings To advance the study of phage biology through comparative genomic approaches we used marine cyanophage as a model system. We compared DNA preparation methodologies (DNA extraction directly from either phage lysates or CsCl purified phage particles), and sequencing strategies that utilize either Sanger sequencing of a linker amplification shotgun library (LASL) or of a whole genome shotgun library (WGSL), or 454 pyrosequencing methods. We demonstrate that genomic DNA sample preparation directly from a phage lysate, combined with 454 pyrosequencing, is best suited for phage genome sequencing at scale, as this method is capable of capturing complete continuous genomes with high accuracy. In addition, we describe an automated annotation informatics pipeline that delivers high-quality annotation and yields few false positives and negatives in ORF calling. Conclusions/Significance These DNA preparation, sequencing and annotation strategies enable a high-throughput approach to the burgeoning field of phage genomics.

Nature | 2006

Analysis of the DNA sequence and duplication history of human chromosome 15

Michael C. Zody; Manuel Garber; Ted Sharpe; Sarah K. Young; Lee Rowen; Keith O'Neill; Charles A. Whittaker; Michael Kamal; Jean L. Chang; Christina A. Cuomo; Ken Dewar; Michael Fitzgerald; Chinnappa D. Kodira; Anup Madan; Shizhen Qin; Xiaoping Yang; Nissa Abbasi; Amr Abouelleil; Harindra Arachchi; Lida Baradarani; Brian Birditt; Scott Bloom; Toby Bloom; Mark L. Borowsky; Jeremy Burke; Jonathan Butler; April Cook; Kurt DeArellano; David DeCaprio; Lester Dorris

Here we present a finished sequence of human chromosome 15, together with a high-quality gene catalogue. As chromosome 15 is one of seven human chromosomes with a high rate of segmental duplication, we have carried out a detailed analysis of the duplication structure of the chromosome. Segmental duplications in chromosome 15 are largely clustered in two regions, on proximal and distal 15q; the proximal region is notable because recombination among the segmental duplications can result in deletions causing Prader-Willi and Angelman syndromes. Sequence analysis shows that the proximal and distal regions of 15q share extensive ancient similarity. Using a simple approach, we have been able to reconstruct many of the events by which the current duplication structure arose. We find that most of the intrachromosomal duplications seem to share a common ancestry. Finally, we demonstrate that some remaining gaps in the genome sequence are probably due to structural polymorphisms between haplotypes; this may explain a significant fraction of the gaps remaining in the human genome.

Explore More