Sujai Kumar | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Sujai Kumar is active.

Explore More

Publication

Featured researches published by Sujai Kumar.

BMC Genomics | 2010

Comparing de novo assemblers for 454 transcriptome data

Sujai Kumar; Mark Blaxter

BackgroundRoche 454 pyrosequencing has become a method of choice for generating transcriptome data from non-model organisms. Once the tens to hundreds of thousands of short (250-450 base) reads have been produced, it is important to correctly assemble these to estimate the sequence of all the transcripts. Most transcriptome assembly projects use only one program for assembling 454 pyrosequencing reads, but there is no evidence that the programs used to date are optimal. We have carried out a systematic comparison of five assemblers (CAP3, MIRA, Newbler, SeqMan and CLC) to establish best practices for transcriptome assemblies, using a new dataset from the parasitic nematode Litomosoides sigmodontis.ResultsAlthough no single assembler performed best on all our criteria, Newbler 2.5 gave longer contigs, better alignments to some reference sequences, and was fast and easy to use. SeqMan assemblies performed best on the criterion of recapitulating known transcripts, and had more novel sequence than the other assemblers, but generated an excess of small, redundant contigs. The remaining assemblers all performed almost as well, with the exception of Newbler 2.3 (the version currently used by most assembly projects), which generated assemblies that had significantly lower total length. As different assemblers use different underlying algorithms to generate contigs, we also explored merging of assemblies and found that the merged datasets not only aligned better to reference sequences than individual assemblies, but were also more consistent in the number and size of contigs.ConclusionsTranscriptome assemblies are smaller than genome assemblies and thus should be more computationally tractable, but are often harder because individual contigs can have highly variable read coverage. Comparing single assemblers, Newbler 2.5 performed best on our trial data set, but other assemblers were closely comparable. Combining differently optimal assemblies from different programs however gave a more credible final product, and this strategy is recommended.

Genome Research | 2009

Analysis of the genome sequences of three Drosophila melanogaster spontaneous mutation accumulation lines.

Peter D. Keightley; Urmi Trivedi; Marian Thomson; Fiona Oliver; Sujai Kumar; Mark Blaxter

We inferred the rate and properties of new spontaneous mutations in Drosophila melanogaster by carrying out whole-genome shotgun sequencing-by-synthesis of three mutation accumulation (MA) lines that had been maintained by close inbreeding for an average of 262 generations. We tested for the presence of new mutations by generating alignments of each MA line to the D. melanogaster reference genome sequence and then compared these alignments base by base. We determined empirically that at least five reads at a site within each line are required for accurate single nucleotide mutation calling. We mapped a total of 174 single-nucleotide mutations, giving a single nucleotide mutation rate of 3.5 x 10(-9) per site per generation. There were no false positives in a random sample of 40 of these mutations checked by Sanger sequencing. Variation in the numbers of mutations among the MA lines was small and nonsignificant. Numbers of transition and transversion mutations were 86 and 88, respectively, implying that transition mutation rate is close to 2x the transversion rate. We observed 1.5x as many G or C --> A or T as A or T --> G or C mutations, implying that the G or C --> A or T mutation rate is close to 2x the A or T --> G or C mutation rate. The base composition of the genome is therefore not at an equilibrium determined solely by mutation. The predicted G + C content at mutational equilibrium (33%) is similar to that observed in transposable element remnants. Nearest-neighbor mutational context dependencies are nonsignificant, suggesting that this is a weak phenomenon in Drosophila. We also saw nonsignificant differences in the mutation rate between transcribed and untranscribed regions, implying that any transcription-coupled repair process is weak. Of seven short indel mutations confirmed, six were deletions, consistent with the deletion bias that is thought to exist in Drosophila.

Nature Communications | 2014

Exosomes secreted by nematode parasites transfer small RNAs to mammalian cells and modulate innate immunity

Amy H. Buck; Gillian Coakley; Fabio Simbari; Henry J. McSorley; Juan F. Quintana; Thierry Le Bihan; Sujai Kumar; Cei Abreu-Goodger; Marissa Lear; Yvonne Harcus; Alessandro Ceroni; Simon A. Babayan; Mark Blaxter; Alasdair Ivens; Rick M. Maizels

In mammalian systems RNA can move between cells via vesicles. Here we demonstrate that the gastrointestinal nematode Heligmosomoides polygyrus, which infects mice, secretes vesicles containing microRNAs (miRNAs) and Y RNAs as well as a nematode Argonaute protein. These vesicles are of intestinal origin and are enriched for homologues of mammalian exosome proteins. Administration of the nematode exosomes to mice suppresses Type 2 innate responses and eosinophilia induced by the allergen Alternaria. Microarray analysis of mouse cells incubated with nematode exosomes in vitro identifies Il33r and Dusp1 as suppressed genes, and Dusp1 can be repressed by nematode miRNAs based on a reporter assay. We further identify miRNAs from the filarial nematode Litomosoides sigmodontis in the serum of infected mice, suggesting that miRNA secretion into host tissues is conserved among parasitic nematodes. These results reveal exosomes as another mechanism by which helminths manipulate their hosts and provide a mechanistic framework for RNA transfer between animal species.

Frontiers in Genetics | 2013

Blobology: exploring raw genome data for contaminants, symbionts, and parasites using taxon-annotated GC-coverage plots

Sujai Kumar; Martin Jones; Georgios Koutsovoulos; Michael Clarke; Mark Blaxter

Generating the raw data for a de novo genome assembly project for a target eukaryotic species is relatively easy. This democratization of access to large-scale data has allowed many research teams to plan to assemble the genomes of non-model organisms. These new genome targets are very different from the traditional, inbred, laboratory-reared model organisms. They are often small, and cannot be isolated free of their environment – whether ingested food, the surrounding host organism of parasites, or commensal and symbiotic organisms attached to or within the individuals sampled. Preparation of pure DNA originating from a single species can be technically impossible, but assembly of mixed-organism DNA can be difficult, as most genome assemblers perform poorly when faced with multiple genomes in different stoichiometries. This class of problem is common in metagenomic datasets that deliberately try to capture all the genomes present in an environment, but replicon assembly is not often the goal of such programs. Here we present an approach to extracting, from mixed DNA sequence data, subsets that correspond to single species’ genomes and thus improving genome assembly. We use both numerical (proportion of GC bases and read coverage) and biological (best-matching sequence in annotated databases) indicators to aid partitioning of draft assembly contigs, and the reads that contribute to those contigs, into distinct bins that can then be subjected to rigorous, optimized assembly, through the use of taxon-annotated GC-coverage plots (TAGC plots). We also present Blobsplorer, a tool that aids exploration and selection of subsets from TAGC-annotated data. Partitioning the data in this way can rescue poorly assembled genomes, and reveal unexpected symbionts and commensals in eukaryotic genome projects. The TAGC plot pipeline script is available from https://github.com/blaxterlab/blobology, and the Blobsplorer tool from https://github.com/mojones/Blobsplorer.

The FASEB Journal | 2012

The genome of the heartworm, Dirofilaria immitis, reveals drug and vaccine targets

Christelle Godel; Sujai Kumar; Georgios Koutsovoulos; Philipp Ludin; Daniel Nilsson; Francesco Comandatore; Nicola Wrobel; Marian Thompson; Christoph D. Schmid; Susumu Goto; Frédéric Bringaud; Adrian J. Wolstenholme; Claudio Bandi; Christian Epe; Ronald Kaminsky; Mark A. Blaxter; Pascal Mäser

The heartworm Dirofilaria immitis is an important parasite of dogs. Transmitted by mosquitoes in warmer climatic zones, it is spreading across southern Europe and the Americas at an alarming pace. There is no vaccine, and chemotherapy is prone to complications. To learn more about this parasite, we have sequenced the genomes of D. immitis and its endosymbiont Wolbachia. We predict 10,179 protein coding genes in the 84.2 Mb of the nuclear genome, and 823 genes in the 0.9‐Mb Wolbachia genome. The D. immitis genome harbors neither DNA transposons nor active retrotransposons, and there is very little genetic variation between two sequenced isolates from Europe and the United States. The differential presence of anabolic pathways such as heme and nucleotide biosynthesis hints at the intricate metabolic interrelationship between the heartworm and Wolbachia. Comparing the proteome of D. immitis with other nematodes and with mammalian hosts, we identify families of potential drug targets, immune modulators, and vaccine candidates. This genome sequence will support the development of new tools against dirofilariasis and aid efforts to combat related human pathogens, the causative agents of lymphatic filariasis and river blindness.—Godel, C., Kumar, S., Koutsovoulos, G., Ludin, P., Nilsson, D., Comandatore, F., Wrobel, N., Thompson, M., Schmid, C. D., Goto, S., Bringaud, F., Wolstenholme, A., Bandi, C., Epe, C., Kaminsky, R., Blaxter, M., Mäser, P. The genome of the heartworm, Dirofilaria immitis, reveals drug and vaccine targets. FASEB J. 26, 4650–4661 (2012). www.fasebj.org

Cell | 2013

Sulfur Amino Acids Regulate Translational Capacity and Metabolic Homeostasis through Modulation of tRNA Thiolation

Sunil Laxman; Benjamin M. Sutter; Xi Wu; Sujai Kumar; Xiaofeng Guo; David C. Trudgian; Hamid Mirzaei; Benjamin P. Tu

Protein translation is an energetically demanding process that must be regulated in response to changes in nutrient availability. Herein, we report that intracellular methionine and cysteine availability directly controls the thiolation status of wobble-uridine (U34) nucleotides present on lysine, glutamine, or glutamate tRNAs to regulate cellular translational capacity and metabolic homeostasis. tRNA thiolation is important for growth under nutritionally challenging environments and required for efficient translation of genes enriched in lysine, glutamine, and glutamate codons, which are enriched in proteins important for translation and growth-specific processes. tRNA thiolation is downregulated during sulfur starvation in order to decrease sulfur consumption and growth, and its absence leads to a compensatory increase in enzymes involved in methionine, cysteine, and lysine biosynthesis. Thus, tRNA thiolation enables cells to modulate translational capacity according to the availability of sulfur amino acids, establishing a functional significance for this conserved tRNA nucleotide modification in cell growth control.

Proceedings of the National Academy of Sciences of the United States of America | 2016

No evidence for extensive horizontal gene transfer in the genome of the tardigrade Hypsibius dujardini

Georgios Koutsovoulos; Sujai Kumar; Dominik R. Laetsch; Lewis Stevens; Jennifer Daub; Claire Conlon; Habib Maroon; Fran Thomas; Aziz Aboobaker; Mark Blaxter

Significance Tardigrades, also known as moss piglets or water bears, are renowned for their ability to withstand extreme environmental challenges. A recently published analysis of the genome of the tardigrade Hypsibius dujardini by Boothby et al. concluded that horizontal acquisition of genes from bacterial and other sources might be key to cryptobiosis in tardigrades. We independently sequenced the genome of H. dujardini and detected a low level of horizontal gene transfer. We show that the extensive horizontal transfer proposed by Boothby et al. was an artifact of a failure to eliminate contaminants from sequence data before assembly. Tardigrades are meiofaunal ecdysozoans that are key to understanding the origins of Arthropoda. Many species of Tardigrada can survive extreme conditions through cryptobiosis. In a recent paper [Boothby TC, et al. (2015) Proc Natl Acad Sci USA 112(52):15976–15981], the authors concluded that the tardigrade Hypsibius dujardini had an unprecedented proportion (17%) of genes originating through functional horizontal gene transfer (fHGT) and speculated that fHGT was likely formative in the evolution of cryptobiosis. We independently sequenced the genome of H. dujardini. As expected from whole-organism DNA sampling, our raw data contained reads from nontarget genomes. Filtering using metagenomics approaches generated a draft H. dujardini genome assembly of 135 Mb with superior assembly metrics to the previously published assembly. Additional microbial contamination likely remains. We found no support for extensive fHGT. Among 23,021 gene predictions we identified 0.2% strong candidates for fHGT from bacteria and 0.2% strong candidates for fHGT from nonmetazoan eukaryotes. Cross-comparison of assemblies showed that the overwhelming majority of HGT candidates in the Boothby et al. genome derived from contaminants. We conclude that fHGT into H. dujardini accounts for at most 1–2% of genes and that the proposal that one-sixth of tardigrade genes originate from functional HGT events is an artifact of undetected contamination.

Molecular Ecology | 2010

Characterization of a hotspot for mimicry: assembly of a butterfly wing transcriptome to genomic sequence at the HmYb/Sb locus

Laura Ferguson; Siu Fai Lee; Nicola Chamberlain; Nicola J. Nadeau; Mathieu Joron; Simon W. Baxter; Paul Wilkinson; Alexie Papanicolaou; Sujai Kumar; Thuan Jin Kee; Richard Clark; Claire Davidson; Rebecca Glithero; Helen Beasley; Heiko Vogel; Richard H. ffrench-Constant; Chris D. Jiggins

The mimetic wing patterns of Heliconius butterflies are an excellent example of both adaptive radiation and convergent evolution. Alleles at the HmYb and HmSb loci control the presence/absence of hindwing bar and hindwing margin phenotypes respectively between divergent races of Heliconius melpomene, and also between sister species. Here, we used fine‐scale linkage mapping to identify and sequence a BAC tilepath across the HmYb/Sb loci. We also generated transcriptome sequence data for two wing pattern forms of H. melpomene that differed in HmYb/Sb alleles using 454 sequencing technology. Custom scripts were used to process the sequence traces and generate transcriptome assemblies. Genomic sequence for the HmYb/Sb candidate region was annotated both using the MAKER pipeline and manually using transcriptome sequence reads. In total, 28 genes were identified in the HmYb/Sb candidate region, six of which have alternative splice forms. None of these are orthologues of genes previously identified as being expressed in butterfly wing pattern development, implying previously undescribed molecular mechanisms of pattern determination on Heliconius wings. The use of next‐generation sequencing has therefore facilitated DNA annotation of a poorly characterized genome, and generated hypotheses regarding the identity of wing pattern at the HmYb/Sb loci.

BMC Genomics | 2010

Experimental evolution, genetic analysis and genome re-sequencing reveal the mutation conferring artemisinin resistance in an isogenic lineage of malaria parasites

Paul Hunt; Axel Martinelli; Katarzyna Modrzynska; Sofia T. Borges; Alison M. Creasey; Louise Rodrigues; Dario Beraldi; Laurence Loewe; Richard Fawcett; Sujai Kumar; Marian Thomson; Urmi Trivedi; Thomas D. Otto; Arnab Pain; Mark Blaxter; Pedro Cravo

BackgroundClassical and quantitative linkage analyses of genetic crosses have traditionally been used to map genes of interest, such as those conferring chloroquine or quinine resistance in malaria parasites. Next-generation sequencing technologies now present the possibility of determining genome-wide genetic variation at single base-pair resolution. Here, we combine in vivo experimental evolution, a rapid genetic strategy and whole genome re-sequencing to identify the precise genetic basis of artemisinin resistance in a lineage of the rodent malaria parasite, Plasmodium chabaudi. Such genetic markers will further the investigation of resistance and its control in natural infections of the human malaria, P. falciparum.ResultsA lineage of isogenic in vivo drug-selected mutant P. chabaudi parasites was investigated. By measuring the artemisinin responses of these clones, the appearance of an in vivo artemisinin resistance phenotype within the lineage was defined. The underlying genetic locus was mapped to a region of chromosome 2 by Linkage Group Selection in two different genetic crosses. Whole-genome deep coverage short-read re-sequencing (Illumina® Solexa) defined the point mutations, insertions, deletions and copy-number variations arising in the lineage. Eight point mutations arise within the mutant lineage, only one of which appears on chromosome 2. This missense mutation arises contemporaneously with artemisinin resistance and maps to a gene encoding a de-ubiquitinating enzyme.ConclusionsThis integrated approach facilitates the rapid identification of mutations conferring selectable phenotypes, without prior knowledge of biological and molecular mechanisms. For malaria, this model can identify candidate genes before resistant parasites are commonly observed in natural human malaria populations.

Nucleic Acids Research | 2012

959 Nematode Genomes: a semantic wiki for coordinating sequencing projects

Sujai Kumar; Philipp H. Schiffer; Mark Blaxter

Genome sequencing has been democratized by second-generation technologies, and even small labs can sequence metazoan genomes now. In this article, we describe ‘959 Nematode Genomes’—a community-curated semantic wiki to coordinate the sequencing efforts of individual labs to collectively sequence 959 genomes spanning the phylum Nematoda. The main goal of the wiki is to track sequencing projects that have been proposed, are in progress, or have been completed. Wiki pages for species and strains are linked to pages for people and organizations, using machine- and human-readable metadata that users can query to see the status of their favourite worm. The site is based on the same platform that runs Wikipedia, with semantic extensions that allow the underlying taxonomy and data storage models to be maintained and updated with ease compared with a conventional database-driven web site. The wiki also provides a way to track and share preliminary data if those data are not polished enough to be submitted to the official sequence repositories. In just over a year, this wiki has already fostered new international collaborations and attracted newcomers to the enthusiastic community of nematode genomicists. www.nematodegenomes.org.

Explore More