Sharvari Gujja
Broad Institute
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Sharvari Gujja.
Science | 2010
Karen E. Nelson; George M. Weinstock; Sarah K. Highlander; Kim C. Worley; Heather Huot Creasy; Jennifer R. Wortman; Douglas B. Rusch; Makedonka Mitreva; Erica Sodergren; Asif T. Chinwalla; Michael Feldgarden; Dirk Gevers; Brian J. Haas; Ramana Madupu; Doyle V. Ward; Bruce Birren; Richard A. Gibbs; Barbara A. Methé; Joseph F. Petrosino; Robert L. Strausberg; Granger Sutton; Owen White; Richard Wilson; Scott Durkin; Michelle G. Giglio; Sharvari Gujja; Clint Howarth; Chinnappa D. Kodira; Nikos C. Kyrpides; Teena Mehta
News from the Inner Tube of Life A major initiative by the U.S. National Institutes of Health to sequence 900 genomes of microorganisms that live on the surfaces and orifices of the human body has established standardized protocols and methods for such large-scale reference sequencing. By combining previously accumulated data with new data, Nelson et al. (p. 994) present an initial analysis of 178 bacterial genomes. The sampling so far barely scratches the surface of the microbial diversity found on humans, but the work provides an important baseline for future analyses. Standardized protocols and methods are being established for large-scale sequencing of the microorganisms living on humans. The human microbiome refers to the community of microorganisms, including prokaryotes, viruses, and microbial eukaryotes, that populate the human body. The National Institutes of Health launched an initiative that focuses on describing the diversity of microbial species that are associated with health and disease. The first phase of this initiative includes the sequencing of hundreds of microbial reference genomes, coupled to metagenomic sequencing from multiple body sites. Here we present results from an initial reference genome sequencing of 178 microbial genomes. From 547,968 predicted polypeptides that correspond to the gene complement of these strains, previously unidentified (“novel”) polypeptides that had both unmasked sequence length greater than 100 amino acids and no BLASTP match to any nonreference entry in the nonredundant subset were defined. This analysis resulted in a set of 30,867 polypeptides, of which 29,987 (~97%) were unique. In addition, this set of microbial genomes allows for ~40% of random sequences from the microbiome of the gastrointestinal tract to be associated with organisms based on the match criteria used. Insights into pan-genome analysis suggest that we are still far from saturating microbial species genetic data sets. In addition, the associated metrics and standards used by our group for quality assurance are presented.
Science | 2011
Nicholas Rhind; Zehua Chen; Moran Yassour; Dawn Anne Thompson; Brian J. Haas; Naomi Habib; Ilan Wapinski; Sushmita Roy; Michael F. Lin; David I. Heiman; Sarah K. Young; Kanji Furuya; Yabin Guo; Alison L. Pidoux; Huei Mei Chen; Barbara Robbertse; Jonathan M. Goldberg; Keita Aoki; Elizabeth H. Bayne; Aaron M. Berlin; Christopher A. Desjardins; Edward Dobbs; Livio Dukaj; Lin Fan; Michael Fitzgerald; Courtney French; Sharvari Gujja; Klavs Wörgler Hansen; Daniel Keifenheim; Joshua Z. Levin
A combined analysis of genome sequence, structure, and expression gives insights into fission yeast biology. The fission yeast clade—comprising Schizosaccharomyces pombe, S. octosporus, S. cryophilus, and S. japonicus—occupies the basal branch of Ascomycete fungi and is an important model of eukaryote biology. A comparative annotation of these genomes identified a near extinction of transposons and the associated innovation of transposon-free centromeres. Expression analysis established that meiotic genes are subject to antisense transcription during vegetative growth, which suggests a mechanism for their tight regulation. In addition, trans-acting regulators control new genes within the context of expanded functional modules for meiosis and stress response. Differences in gene content and regulation also explain why, unlike the budding yeast of Saccharomycotina, fission yeasts cannot use ethanol as a primary carbon source. These analyses elucidate the genome structure and gene regulation of fission yeast and provide tools for investigation across the Schizosaccharomyces clade.
PLOS Pathogens | 2012
Matthew R. Henn; Christian L. Boutwell; Patrick Charlebois; Niall J. Lennon; Karen A. Power; Alexander R. Macalalad; Aaron M. Berlin; Christine M. Malboeuf; Elizabeth Ryan; Sante Gnerre; Michael C. Zody; Rachel L. Erlich; Lisa Green; Andrew Berical; Yaoyu Wang; Monica Casali; Hendrik Streeck; Allyson K. Bloom; Tim Dudek; Damien C. Tully; Ruchi M. Newman; Karen L. Axten; Adrianne D. Gladden; Laura Battis; Michael Kemper; Qiandong Zeng; Terrance Shea; Sharvari Gujja; Carmen Zedlack; Olivier Gasser
Deep sequencing technologies have the potential to transform the study of highly variable viral pathogens by providing a rapid and cost-effective approach to sensitively characterize rapidly evolving viral quasispecies. Here, we report on a high-throughput whole HIV-1 genome deep sequencing platform that combines 454 pyrosequencing with novel assembly and variant detection algorithms. In one subject we combined these genetic data with detailed immunological analyses to comprehensively evaluate viral evolution and immune escape during the acute phase of HIV-1 infection. The majority of early, low frequency mutations represented viral adaptation to host CD8+ T cell responses, evidence of strong immune selection pressure occurring during the early decline from peak viremia. CD8+ T cell responses capable of recognizing these low frequency escape variants coincided with the selection and evolution of more effective secondary HLA-anchor escape mutations. Frequent, and in some cases rapid, reversion of transmitted mutations was also observed across the viral genome. When located within restricted CD8 epitopes these low frequency reverting mutations were sufficient to prime de novo responses to these epitopes, again illustrating the capacity of the immune response to recognize and respond to low frequency variants. More importantly, rapid viral escape from the most immunodominant CD8+ T cell responses coincided with plateauing of the initial viral load decline in this subject, suggestive of a potential link between maintenance of effective, dominant CD8 responses and the degree of early viremia reduction. We conclude that the early control of HIV-1 replication by immunodominant CD8+ T cell responses may be substantially influenced by rapid, low frequency viral adaptations not detected by conventional sequencing approaches, which warrants further investigation. These data support the critical need for vaccine-induced CD8+ T cell responses to target more highly constrained regions of the virus in order to ensure the maintenance of immunodominant CD8 responses and the sustained decline of early viremia.
Proceedings of the National Academy of Sciences of the United States of America | 2012
Yonatan H. Grad; Marc Lipsitch; Michael Feldgarden; Harindra Arachchi; Gustavo C. Cerqueira; Michael C. Fitzgerald; Paul A. Godfrey; Brian J. Haas; Cheryl Murphy; Carsten Russ; Sean Sykes; Bruce J. Walker; Jennifer R. Wortman; Qiandong Zeng; Amr Abouelleil; James Bochicchio; Sara Chauvin; Timothy DeSmet; Sharvari Gujja; Caryn McCowan; Anna Montmayeur; Scott Steelman; Jakob Frimodt-Møller; Andreas Petersen; Carsten Struve; Karen A. Krogfelt; Edouard Bingen; François-Xavier Weill; Eric S. Lander; Chad Nusbaum
The degree to which molecular epidemiology reveals information about the sources and transmission patterns of an outbreak depends on the resolution of the technology used and the samples studied. Isolates of Escherichia coli O104:H4 from the outbreak centered in Germany in May–July 2011, and the much smaller outbreak in southwest France in June 2011, were indistinguishable by standard tests. We report a molecular epidemiological analysis using multiplatform whole-genome sequencing and analysis of multiple isolates from the German and French outbreaks. Isolates from the German outbreak showed remarkably little diversity, with only two single nucleotide polymorphisms (SNPs) found in isolates from four individuals. Surprisingly, we found much greater diversity (19 SNPs) in isolates from seven individuals infected in the French outbreak. The German isolates form a clade within the more diverse French outbreak strains. Moreover, five isolates derived from a single infected individual from the French outbreak had extremely limited diversity. The striking difference in diversity between the German and French outbreak samples is consistent with several hypotheses, including a bottleneck that purged diversity in the German isolates, variation in mutation rates in the two E. coli outbreak populations, or uneven distribution of diversity in the seed populations that led to each outbreak.
Nature Genetics | 2012
Daniel E. Neafsey; Kevin Galinsky; Rays H. Y. Jiang; Lauren Young; Sean Sykes; Sakina Saif; Sharvari Gujja; Jonathan M. Goldberg; Qiandong Zeng; Sinéad B. Chapman; A. P. Dash; Anupkumar R. Anvikar; Patrick L. Sutton; Bruce W. Birren; Ananias A. Escalante; John W. Barnwell; Jane M. Carlton
We sequenced and annotated the genomes of four P. vivax strains collected from disparate geographic locations, tripling the number of genome sequences available for this understudied parasite and providing the first genome-wide perspective of global variability in this species. We observe approximately twice as much SNP diversity among these isolates as we do among a comparable collection of isolates of P. falciparum, a malaria-causing parasite that results in higher mortality. This indicates a distinct history of global colonization and/or a more stable demographic history for P. vivax relative to P. falciparum, which is thought to have undergone a recent population bottleneck. The SNP diversity, as well as additional microsatellite and gene family variability, suggests a capacity for greater functional variation in the global population of P. vivax. These findings warrant a deeper survey of variation in P. vivax to equip disease interventions targeting the distinctive biology of this neglected but major pathogen.
PLOS Genetics | 2014
Guilhem Janbon; Kate L. Ormerod; Damien Paulet; Edmond J. Byrnes; Vikas Yadav; Gautam Chatterjee; Nandita Mullapudi; Chung Chau Hon; R. Blake Billmyre; François Brunel; Yong Sun Bahn; Weidong Chen; Yuan Chen; Eve W. L. Chow; Jean Yves Coppée; Anna Floyd-Averette; Claude Gaillardin; Kimberly J. Gerik; Jonathan M. Goldberg; Sara Gonzalez-Hilarion; Sharvari Gujja; Joyce L. Hamlin; Yen-Ping Hsueh; Giuseppe Ianiri; Steven J.M. Jones; Chinnappa D. Kodira; Lukasz Kozubowski; Woei Lam; Marco A. Marra; Larry D. Mesner
Cryptococcus neoformans is a pathogenic basidiomycetous yeast responsible for more than 600,000 deaths each year. It occurs as two serotypes (A and D) representing two varieties (i.e. grubii and neoformans, respectively). Here, we sequenced the genome and performed an RNA-Seq-based analysis of the C. neoformans var. grubii transcriptome structure. We determined the chromosomal locations, analyzed the sequence/structural features of the centromeres, and identified origins of replication. The genome was annotated based on automated and manual curation. More than 40,000 introns populating more than 99% of the expressed genes were identified. Although most of these introns are located in the coding DNA sequences (CDS), over 2,000 introns in the untranslated regions (UTRs) were also identified. Poly(A)-containing reads were employed to locate the polyadenylation sites of more than 80% of the genes. Examination of the sequences around these sites revealed a new poly(A)-site-associated motif (AUGHAH). In addition, 1,197 miscRNAs were identified. These miscRNAs can be spliced and/or polyadenylated, but do not appear to have obvious coding capacities. Finally, this genome sequence enabled a comparative analysis of strain H99 variants obtained after laboratory passage. The spectrum of mutations identified provides insights into the genetics underlying the micro-evolution of a laboratory strain, and identifies mutations involved in stress responses, mating efficiency, and virulence.
PLOS Genetics | 2011
Christopher A. Desjardins; Mia D. Champion; Jason W. Holder; Anna Muszewska; Jonathan M. Goldberg; Alexandre M. Bailão; Marcelo M. Brigido; Márcia Eliana da Silva Ferreira; Ana Maria Garcia; Marcin Grynberg; Sharvari Gujja; David I. Heiman; Matthew R. Henn; Chinnappa D. Kodira; Henry León-Narváez; Larissa V. G. Longo; Li-Jun Ma; Iran Malavazi; Alisson L. Matsuo; Flavia V. Morais; Maristela Pereira; Sabrina Rodríguez-Brito; Sharadha Sakthikumar; Silvia Maria Salem-Izacc; Sean Sykes; Marcus de Melo Teixeira; Milene C. Vallejo; Maria Emilia Telles Walter; Chandri Yandava; Qiandong Zeng
Paracoccidioides is a fungal pathogen and the cause of paracoccidioidomycosis, a health-threatening human systemic mycosis endemic to Latin America. Infection by Paracoccidioides, a dimorphic fungus in the order Onygenales, is coupled with a thermally regulated transition from a soil-dwelling filamentous form to a yeast-like pathogenic form. To better understand the genetic basis of growth and pathogenicity in Paracoccidioides, we sequenced the genomes of two strains of Paracoccidioides brasiliensis (Pb03 and Pb18) and one strain of Paracoccidioides lutzii (Pb01). These genomes range in size from 29.1 Mb to 32.9 Mb and encode 7,610 to 8,130 genes. To enable genetic studies, we mapped 94% of the P. brasiliensis Pb18 assembly onto five chromosomes. We characterized gene family content across Onygenales and related fungi, and within Paracoccidioides we found expansions of the fungal-specific kinase family FunK1. Additionally, the Onygenales have lost many genes involved in carbohydrate metabolism and fewer genes involved in protein metabolism, resulting in a higher ratio of proteases to carbohydrate active enzymes in the Onygenales than their relatives. To determine if gene content correlated with growth on different substrates, we screened the non-pathogenic onygenale Uncinocarpus reesii, which has orthologs for 91% of Paracoccidioides metabolic genes, for growth on 190 carbon sources. U. reesii showed growth on a limited range of carbohydrates, primarily basic plant sugars and cell wall components; this suggests that Onygenales, including dimorphic fungi, can degrade cellulosic plant material in the soil. In addition, U. reesii grew on gelatin and a wide range of dipeptides and amino acids, indicating a preference for proteinaceous growth substrates over carbohydrates, which may enable these fungi to also degrade animal biomass. These capabilities for degrading plant and animal substrates suggest a duality in lifestyle that could enable pathogenic species of Onygenales to transfer from soil to animal hosts.
PLOS Medicine | 2015
Keira A. Cohen; Thomas Abeel; Abigail Manson McGuire; Christopher A. Desjardins; Vanisha Munsamy; Terrance Shea; Bruce J. Walker; Nonkqubela Bantubani; Deepak Almeida; Lucia Alvarado; Sinéad B. Chapman; Nomonde R. Mvelase; Eamon Y. Duffy; Michael Fitzgerald; Pamla Govender; Sharvari Gujja; Susanna. Hamilton; Clinton Howarth; Jeffrey D. Larimer; Kashmeel Maharaj; Matthew Pearson; Margaret Priest; Qiandong Zeng; Nesri Padayatchi; Jacques Grosset; Sarah K. Young; Jennifer R. Wortman; Koleka Mlisana; Max O'Donnell; Bruce W. Birren
Background The continued advance of antibiotic resistance threatens the treatment and control of many infectious diseases. This is exemplified by the largest global outbreak of extensively drug-resistant (XDR) tuberculosis (TB) identified in Tugela Ferry, KwaZulu-Natal, South Africa, in 2005 that continues today. It is unclear whether the emergence of XDR-TB in KwaZulu-Natal was due to recent inadequacies in TB control in conjunction with HIV or other factors. Understanding the origins of drug resistance in this fatal outbreak of XDR will inform the control and prevention of drug-resistant TB in other settings. In this study, we used whole genome sequencing and dating analysis to determine if XDR-TB had emerged recently or had ancient antecedents. Methods and Findings We performed whole genome sequencing and drug susceptibility testing on 337 clinical isolates of Mycobacterium tuberculosis collected in KwaZulu-Natal from 2008 to 2013, in addition to three historical isolates, collected from patients in the same province and including an isolate from the 2005 Tugela Ferry XDR outbreak, a multidrug-resistant (MDR) isolate from 1994, and a pansusceptible isolate from 1995. We utilized an array of whole genome comparative techniques to assess the relatedness among strains, to establish the order of acquisition of drug resistance mutations, including the timing of acquisitions leading to XDR-TB in the LAM4 spoligotype, and to calculate the number of independent evolutionary emergences of MDR and XDR. Our sequencing and analysis revealed a 50-member clone of XDR M. tuberculosis that was highly related to the Tugela Ferry XDR outbreak strain. We estimated that mutations conferring isoniazid and streptomycin resistance in this clone were acquired 50 y prior to the Tugela Ferry outbreak (katG S315T [isoniazid]; gidB 130 bp deletion [streptomycin]; 1957 [95% highest posterior density (HPD): 1937–1971]), with the subsequent emergence of MDR and XDR occurring 20 y (rpoB L452P [rifampicin]; pncA 1 bp insertion [pyrazinamide]; 1984 [95% HPD: 1974–1992]) and 10 y (rpoB D435G [rifampicin]; rrs 1400 [kanamycin]; gyrA A90V [ofloxacin]; 1995 [95% HPD: 1988–1999]) prior to the outbreak, respectively. We observed frequent de novo evolution of MDR and XDR, with 56 and nine independent evolutionary events, respectively. Isoniazid resistance evolved before rifampicin resistance 46 times, whereas rifampicin resistance evolved prior to isoniazid only twice. We identified additional putative compensatory mutations to rifampicin in this dataset. One major limitation of this study is that the conclusions with respect to ordering and timing of acquisition of mutations may not represent universal patterns of drug resistance emergence in other areas of the globe. Conclusions In the first whole genome-based analysis of the emergence of drug resistance among clinical isolates of M. tuberculosis, we show that the ancestral precursor of the LAM4 XDR outbreak strain in Tugela Ferry gained mutations to first-line drugs at the beginning of the antibiotic era. Subsequent accumulation of stepwise resistance mutations, occurring over decades and prior to the explosion of HIV in this region, yielded MDR and XDR, permitting the emergence of compensatory mutations. Our results suggest that drug-resistant strains circulating today reflect not only vulnerabilities of current TB control efforts but also those that date back 50 y. In drug-resistant TB, isoniazid resistance was overwhelmingly the initial resistance mutation to be acquired, which would not be detected by current rapid molecular diagnostics employed in South Africa that assess only rifampicin resistance.
Genome Research | 2015
Matthew P. Hirakawa; Diego Martinez; Sharadha Sakthikumar; Matthew Z. Anderson; Aaron M. Berlin; Sharvari Gujja; Qiandong Zeng; Ethan Zisson; Joshua M. Wang; Joshua M. Greenberg; Judith Berman; Richard J. Bennett; Christina A. Cuomo
Candida albicans is a commensal fungus of the human gastrointestinal tract and a prevalent opportunistic pathogen. To examine diversity within this species, extensive genomic and phenotypic analyses were performed on 21 clinical C. albicans isolates. Genomic variation was evident in the form of polymorphisms, copy number variations, chromosomal inversions, subtelomeric hypervariation, loss of heterozygosity (LOH), and whole or partial chromosome aneuploidies. All 21 strains were diploid, although karyotypic changes were present in eight of the 21 isolates, with multiple strains being trisomic for Chromosome 4 or Chromosome 7. Aneuploid strains exhibited a general fitness defect relative to euploid strains when grown under replete conditions. All strains were also heterozygous, yet multiple, distinct LOH tracts were present in each isolate. Higher overall levels of genome heterozygosity correlated with faster growth rates, consistent with increased overall fitness. Genes with the highest rates of amino acid substitutions included many cell wall proteins, implicating fast evolving changes in cell adhesion and host interactions. One clinical isolate, P94015, presented several striking properties including a novel cellular phenotype, an inability to filament, drug resistance, and decreased virulence. Several of these properties were shown to be due to a homozygous nonsense mutation in the EFG1 gene. Furthermore, loss of EFG1 function resulted in increased fitness of P94015 in a commensal model of infection. Our analysis therefore reveals intra-species genetic and phenotypic differences in C. albicans and delineates a natural mutation that alters the balance between commensalism and pathogenicity.
Mbio | 2015
Rhys A. Farrer; Christopher A. Desjardins; Sharadha Sakthikumar; Sharvari Gujja; Sakina Saif; Qiandong Zeng; Yuan Chen; Kerstin Voelz; Joseph Heitman; Robin C. May; Matthew C. Fisher; Christina A. Cuomo
ABSTRACT Cryptococcus gattii is a fungal pathogen of humans, causing pulmonary infections in otherwise healthy hosts. To characterize genomic variation among the four major lineages of C. gattii (VGI, -II, -III, and -IV), we generated, annotated, and compared 16 de novo genome assemblies, including the first for the rarely isolated lineages VGIII and VGIV. By identifying syntenic regions across assemblies, we found 15 structural rearrangements, which were almost exclusive to the VGI-III-IV lineages. Using synteny to inform orthology prediction, we identified a core set of 87% of C. gattii genes present as single copies in all four lineages. Remarkably, 737 genes are variably inherited across lineages and are overrepresented for response to oxidative stress, mitochondrial import, and metal binding and transport. Specifically, VGI has an expanded set of iron-binding genes thought to be important to the virulence of Cryptococcus, while VGII has expansions in the stress-related heat shock proteins relative to the other lineages. We also characterized genes uniquely absent in each lineage, including a copper transporter absent from VGIV, which influences Cryptococcus survival during pulmonary infection and the onset of meningoencephalitis. Through inclusion of population-level data for an additional 37 isolates, we identified a new transcontinental clonal group that we name VGIIx, mitochondrial recombination between VGII and VGIII, and positive selection of multidrug transporters and the iron-sulfur protein aconitase along multiple branches of the phylogenetic tree. Our results suggest that gene expansion or contraction and positive selection have introduced substantial variation with links to mechanisms of pathogenicity across this species complex. IMPORTANCE The genetic differences between phenotypically different pathogens provide clues to the underlying mechanisms of those traits and can lead to new drug targets and improved treatments for those diseases. In this paper, we compare 16 genomes belonging to four highly differentiated lineages of Cryptococcus gattii, which cause pulmonary infections in otherwise healthy humans and other animals. Half of these lineages have not had their genomes previously assembled and annotated. We identified 15 ancestral rearrangements in the genome and over 700 genes that are unique to one or more lineages, many of which are associated with virulence. In addition, we found evidence for recent transcontinental spread, mitochondrial genetic exchange, and positive selection in multidrug transporters. Our results suggest that gene expansion/contraction and positive selection are diversifying the mechanisms of pathogenicity across this species complex. The genetic differences between phenotypically different pathogens provide clues to the underlying mechanisms of those traits and can lead to new drug targets and improved treatments for those diseases. In this paper, we compare 16 genomes belonging to four highly differentiated lineages of Cryptococcus gattii, which cause pulmonary infections in otherwise healthy humans and other animals. Half of these lineages have not had their genomes previously assembled and annotated. We identified 15 ancestral rearrangements in the genome and over 700 genes that are unique to one or more lineages, many of which are associated with virulence. In addition, we found evidence for recent transcontinental spread, mitochondrial genetic exchange, and positive selection in multidrug transporters. Our results suggest that gene expansion/contraction and positive selection are diversifying the mechanisms of pathogenicity across this species complex.