Samuel A. Assefa
University of London
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Samuel A. Assefa.
Bioinformatics | 2009
Samuel A. Assefa; Thomas M. Keane; Thomas D. Otto; Chris Newbold; Matthew Berriman
Summary: Due to the availability of new sequencing technologies, we are now increasingly interested in sequencing closely related strains of existing finished genomes. Recently a number of de novo and mapping-based assemblers have been developed to produce high quality draft genomes from new sequencing technology reads. New tools are necessary to take contigs from a draft assembly through to a fully contiguated genome sequence. ABACAS is intended as a tool to rapidly contiguate (align, order, orientate), visualize and design primers to close gaps on shotgun assembled contigs based on a reference sequence. The input to ABACAS is a set of contigs which will be aligned to the reference genome, ordered and orientated, visualized in the ACT comparative browser, and optimal primer sequences are automatically generated. Availability and Implementation: ABACAS is implemented in Perl and is freely available for download from http://abacas.sourceforge.net Contact: [email protected]
PLOS Genetics | 2009
Timothy T. Perkins; Robert A. Kingsley; Maria Fookes; Paul P. Gardner; Keith D. James; Lu-Lu Yu; Samuel A. Assefa; Miao-Xia He; Nicholas J. Croucher; Derek Pickard; Duncan J. Maskell; Julian Parkhill; Jyoti S. Choudhary; Nicholas R. Thomson; Gordon Dougan
High-density, strand-specific cDNA sequencing (ssRNA–seq) was used to analyze the transcriptome of Salmonella enterica serovar Typhi (S. Typhi). By mapping sequence data to the entire S. Typhi genome, we analyzed the transcriptome in a strand-specific manner and further defined transcribed regions encoded within prophages, pseudogenes, previously un-annotated, and 3′- or 5′-untranslated regions (UTR). An additional 40 novel candidate non-coding RNAs were identified beyond those previously annotated. Proteomic analysis was combined with transcriptome data to confirm and refine the annotation of a number of hpothetical genes. ssRNA–seq was also combined with microarray and proteome analysis to further define the S. Typhi OmpR regulon and identify novel OmpR regulated transcripts. Thus, ssRNA–seq provides a novel and powerful approach to the characterization of the bacterial transcriptome.
PLOS Pathogens | 2011
Taisei Kikuchi; James A. Cotton; Jonathan J. Dalzell; Koichi Hasegawa; Natsumi Kanzaki; Paul McVeigh; Takuma Takanashi; Isheng J. Tsai; Samuel A. Assefa; Peter J. A. Cock; Thomas D. Otto; Martin Hunt; Adam J. Reid; Alejandro Sanchez-Flores; Kazuko Tsuchihara; Toshiro Yokoi; Mattias C. Larsson; Johji Miwa; Aaron G. Maule; Norio Sahashi; John T. Jones; Matthew Berriman
Bursaphelenchus xylophilus is the nematode responsible for a devastating epidemic of pine wilt disease in Asia and Europe, and represents a recent, independent origin of plant parasitism in nematodes, ecologically and taxonomically distinct from other nematodes for which genomic data is available. As well as being an important pathogen, the B. xylophilus genome thus provides a unique opportunity to study the evolution and mechanism of plant parasitism. Here, we present a high-quality draft genome sequence from an inbred line of B. xylophilus, and use this to investigate the biological basis of its complex ecology which combines fungal feeding, plant parasitic and insect-associated stages. We focus particularly on putative parasitism genes as well as those linked to other key biological processes and demonstrate that B. xylophilus is well endowed with RNA interference effectors, peptidergic neurotransmitters (including the first description of ins genes in a parasite) stress response and developmental genes and has a contracted set of chemosensory receptors. B. xylophilus has the largest number of digestive proteases known for any nematode and displays expanded families of lysosome pathway genes, ABC transporters and cytochrome P450 pathway genes. This expansion in digestive and detoxification proteins may reflect the unusual diversity in foods it exploits and environments it encounters during its life cycle. In addition, B. xylophilus possesses a unique complement of plant cell wall modifying proteins acquired by horizontal gene transfer, underscoring the impact of this process on the evolution of plant parasitism by nematodes. Together with the lack of proteins homologous to effectors from other plant parasitic nematodes, this confirms the distinctive molecular basis of plant parasitism in the Bursaphelenchus lineage. The genome sequence of B. xylophilus adds to the diversity of genomic data for nematodes, and will be an important resource in understanding the biology of this unusual parasite.
PLOS ONE | 2011
Timothy Robinson; Susana Campino; Sarah Auburn; Samuel A. Assefa; Spencer D. Polley; Magnus Manske; Bronwyn MacInnis; Kirk A. Rockett; Gareth Maslen; Mandy Sanders; Michael A. Quail; Peter L. Chiodini; Dominic P. Kwiatkowski; Taane G. Clark; Colin J. Sutherland
Naturally acquired blood-stage infections of the malaria parasite Plasmodium falciparum typically harbour multiple haploid clones. The apparent number of clones observed in any single infection depends on the diversity of the polymorphic markers used for the analysis, and the relative abundance of rare clones, which frequently fail to be detected among PCR products derived from numerically dominant clones. However, minority clones are of clinical interest as they may harbour genes conferring drug resistance, leading to enhanced survival after treatment and the possibility of subsequent therapeutic failure. We deployed new generation sequencing to derive genome data for five non-propagated parasite isolates taken directly from 4 different patients treated for clinical malaria in a UK hospital. Analysis of depth of coverage and length of sequence intervals between paired reads identified both previously described and novel gene deletions and amplifications. Full-length sequence data was extracted for 6 loci considered to be under selection by antimalarial drugs, and both known and previously unknown amino acid substitutions were identified. Full mitochondrial genomes were extracted from the sequencing data for each isolate, and these are compared against a panel of polymorphic sites derived from published or unpublished but publicly available data. Finally, genome-wide analysis of clone multiplicity was performed, and the number of infecting parasite clones estimated for each isolate. Each patient harboured at least 3 clones of P. falciparum by this analysis, consistent with results obtained with conventional PCR analysis of polymorphic merozoite antigen loci. We conclude that genome sequencing of peripheral blood P. falciparum taken directly from malaria patients provides high quality data useful for drug resistance studies, genomic structural analyses and population genetics, and also robustly represents clonal multiplicity.
Nature Communications | 2014
Preston; Susana Campino; Samuel A. Assefa; D.F. Echeverry; Harold Ocholla; Alfred Amambua-Ngwa; Lindsay B. Stewart; David J. Conway; Steffen Borrmann; Pascal Michon; Issaka Zongo; Jean-Bosco Ouédraogo; Abdoulaye A. Djimde; Ogobara K. Doumbo; François Nosten; Arnab Pain; Teun Bousema; Chris Drakeley; Rick M. Fairhurst; Colin J. Sutherland; Cally Roper; Taane G. Clark
Malaria is a major public health problem that is actively being addressed in a global eradication campaign. Increased population mobility through international air travel has elevated the risk of re-introducing parasites to elimination areas and dispersing drug-resistant parasites to new regions. A simple genetic marker that quickly and accurately identifies the geographic origin of infections would be a valuable public health tool for locating the source of imported outbreaks. Here we analyse the mitochondrion and apicoplast genomes of 711 Plasmodium falciparum isolates from 14 countries, and find evidence that they are non-recombining and co-inherited. The high degree of linkage produces a panel of relatively few single-nucleotide polymorphisms (SNPs) that is geographically informative. We design a 23-SNP barcode that is highly predictive (~92%) and easily adapted to aid case management in the field and survey parasite migration worldwide.
PLOS Pathogens | 2015
Paul Cliff Simon Divis; Balbir Singh; Fread Anderios; Shamilah Hisam; Asmad Matusop; Clemens H. M. Kocken; Samuel A. Assefa; Craig W. Duffy; David J. Conway
Human malaria parasite species were originally acquired from other primate hosts and subsequently became endemic, then spread throughout large parts of the world. A major zoonosis is now occurring with Plasmodium knowlesi from macaques in Southeast Asia, with a recent acceleration in numbers of reported cases particularly in Malaysia. To investigate the parasite population genetics, we developed sensitive and species-specific microsatellite genotyping protocols and applied these to analysis of samples from 10 sites covering a range of >1,600 km within which most cases have occurred. Genotypic analyses of 599 P. knowlesi infections (552 in humans and 47 in wild macaques) at 10 highly polymorphic loci provide radical new insights on the emergence. Parasites from sympatric long-tailed macaques (Macaca fascicularis) and pig-tailed macaques (M. nemestrina) were very highly differentiated (FST = 0.22, and K-means clustering confirmed two host-associated subpopulations). Approximately two thirds of human P. knowlesi infections were of the long-tailed macaque type (Cluster 1), and one third were of the pig-tailed-macaque type (Cluster 2), with relative proportions varying across the different sites. Among the samples from humans, there was significant indication of genetic isolation by geographical distance overall and within Cluster 1 alone. Across the different sites, the level of multi-locus linkage disequilibrium correlated with the degree of local admixture of the two different clusters. The widespread occurrence of both types of P. knowlesi in humans enhances the potential for parasite adaptation in this zoonotic system.
PLOS ONE | 2013
Taane G. Clark; Kim Mallard; Francesc Coll; Mark D. Preston; Samuel A. Assefa; David Harris; Sam Ogwang; Francis Mumbowa; Bruce Kirenga; Denise M. O’Sullivan; Alphonse Okwera; Kathleen D. Eisenach; Moses Joloba; Stephen D. Bentley; Jerrold J. Ellner; Julian Parkhill; Edward C. Jones-López; Ruth McNerney
Background Understanding the emergence and spread of multidrug-resistant tuberculosis (MDR-TB) is crucial for its control. MDR-TB in previously treated patients is generally attributed to the selection of drug resistant mutants during inadequate therapy rather than transmission of a resistant strain. Traditional genotyping methods are not sufficient to distinguish strains in populations with a high burden of tuberculosis and it has previously been difficult to assess the degree of transmission in these settings. We have used whole genome analysis to investigate M. tuberculosis strains isolated from treatment experienced patients with MDR-TB in Uganda over a period of four years. Methods and Findings We used high throughput genome sequencing technology to investigate small polymorphisms and large deletions in 51 Mycobacterium tuberculosis samples from 41 treatment-experienced TB patients attending a TB referral and treatment clinic in Kampala. This was a convenience sample representing 69% of MDR-TB cases identified over the four year period. Low polymorphism was observed in longitudinal samples from individual patients (2-15 SNPs). Clusters of samples with less than 50 SNPs variation were examined. Three clusters comprising a total of 8 patients were found with almost identical genetic profiles, including mutations predictive for resistance to rifampicin and isoniazid, suggesting transmission of MDR-TB. Two patients with previous drug susceptible disease were found to have acquired MDR strains, one of which shared its genotype with an isolate from another patient in the cohort. Conclusions Whole genome sequence analysis identified MDR-TB strains that were shared by more than one patient. The transmission of multidrug-resistant disease in this cohort of retreatment patients emphasises the importance of early detection and need for infection control. Consideration should be given to rapid testing for drug resistance in patients undergoing treatment to monitor the emergence of resistance and permit early intervention to avoid onward transmission.
Proceedings of the National Academy of Sciences of the United States of America | 2015
Samuel A. Assefa; Caeul Lim; Mark D. Preston; Craig W. Duffy; Mridul Nair; Sabir A. Adroub; Khamisah Abdul Kadir; Jonathan M. Goldberg; Daniel E. Neafsey; Paul Cliff Simon Divis; Taane G. Clark; Manoj T. Duraisingh; David J. Conway; Arnab Pain; Balbir Singh
Significance Genome sequence analysis reveals that the zoonotic malaria parasite Plasmodium knowlesi consists of three highly divergent subpopulations. Two, commonly seen in sympatric human clinical infections in Malaysian Borneo, were identified in a previous study as corresponding to parasites seen in long-tailed and pig-tailed macaque hosts, respectively. A third type has been detected in a few laboratory-maintained isolates originally derived in the 1960s elsewhere in Southeast Asia. Divergence between the subpopulations varies significantly across the genome but overall is at a level indicating different subspecies. Analysis of the diversity within the most common type in human infections shows strong signatures of natural selection, including balancing selection and directional selection, on loci distinct from those under selection in endemic human malaria parasites. Malaria cases caused by the zoonotic parasite Plasmodium knowlesi are being increasingly reported throughout Southeast Asia and in travelers returning from the region. To test for evidence of signatures of selection or unusual population structure in this parasite, we surveyed genome sequence diversity in 48 clinical isolates recently sampled from Malaysian Borneo and in five lines maintained in laboratory rhesus macaques after isolation in the 1960s from Peninsular Malaysia and the Philippines. Overall genomewide nucleotide diversity (π = 6.03 × 10−3) was much higher than has been seen in worldwide samples of either of the major endemic malaria parasite species Plasmodium falciparum and Plasmodium vivax. A remarkable substructure is revealed within P. knowlesi, consisting of two major sympatric clusters of the clinical isolates and a third cluster comprising the laboratory isolates. There was deep differentiation between the two clusters of clinical isolates [mean genomewide fixation index (FST) = 0.21, with 9,293 SNPs having fixed differences of FST = 1.0]. This differentiation showed marked heterogeneity across the genome, with mean FST values of different chromosomes ranging from 0.08 to 0.34 and with further significant variation across regions within several chromosomes. Analysis of the largest cluster (cluster 1, 38 isolates) indicated long-term population growth, with negatively skewed allele frequency distributions (genomewide average Tajima’s D = −1.35). Against this background there was evidence of balancing selection on particular genes, including the circumsporozoite protein (csp) gene, which had the top Tajima’s D value (1.57), and scans of haplotype homozygosity implicate several genomic regions as being under recent positive selection.
Scientific Reports | 2013
Steffen Borrmann; Judith Straimer; Leah Mwai; Abdirahman I. Abdi; Anja Rippert; John Okombo; Steven Muriithi; Philip Sasi; Moses Kortok; Brett Lowe; Susana Campino; Samuel A. Assefa; Sarah Auburn; Magnus Manske; Gareth Maslen; Norbert Peshu; Dominic P. Kwiatkowski; Kevin Marsh; Alexis Nzila; Taane G. Clark
Early identification of causal genetic variants underlying antimalarial drug resistance could provide robust epidemiological tools for timely public health interventions. Using a novel natural genetics strategy for mapping novel candidate genes we analyzed >75,000 high quality single nucleotide polymorphisms selected from high-resolution whole-genome sequencing data in 27 isolates of Plasmodium falciparum. We identified genetic variants associated with susceptibility to dihydroartemisinin that implicate one region on chromosome 13, a candidate gene on chromosome 1 (PFA0220w, a UBP1 ortholog) and others (PFB0560w, PFB0630c, PFF0445w) with putative roles in protein homeostasis and stress response. There was a strong signal for positive selection on PFA0220w, but not the other candidate loci. Our results demonstrate the power of full-genome sequencing-based association studies for uncovering candidate genes that determine parasite sensitivity to artemisinins. Our study provides a unique reference for the interpretation of results from resistant infections.
Bioinformatics | 2014
Samuel A. Assefa; Mark D. Preston; Susana Campino; Harold Ocholla; Colin J. Sutherland; Taane G. Clark
Summary: Individuals living in endemic areas generally harbour multiple parasite strains. Multiplicity of infection (MOI) can be an indicator of immune status and transmission intensity. It has a potentially confounding effect on a number of population genetic analyses, which often assume isolates are clonal. Polymerase chain reaction-based approaches to estimate MOI can lack sensitivity. For example, in the human malaria parasite Plasmodium falciparum, genotyping of the merozoite surface protein (MSP1/2) genes is a standard method for assessing MOI, despite the apparent problem of underestimation. The availability of deep coverage data from massively parallizable sequencing technologies means that MOI can be detected genome wide by considering the abundance of heterozygous genotypes. Here, we present a method to estimate MOI, which considers unique combinations of polymorphisms from sequence reads. The method is implemented within the estMOI software. When applied to clinical P.falciparum isolates from three continents, we find that multiple infections are common, especially in regions with high transmission. Availability and implementation: estMOI is freely available from http://pathogenseq.lshtm.ac.uk. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.
Collaboration
Dive into the Samuel A. Assefa's collaboration.
Malawi-Liverpool-Wellcome Trust Clinical Research Programme
View shared research outputs