Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Allan Anthony Kamau is active.

Publication


Featured researches published by Allan Anthony Kamau.


PLOS ONE | 2013

INDIGO – INtegrated Data Warehouse of MIcrobial GenOmes with Examples from the Red Sea Extremophiles

Intikhab Alam; André Antunes; Allan Anthony Kamau; Wail Ba alawi; Manal Kalkatawi; Ulrich Stingl; Vladimir B. Bajic

Background The next generation sequencing technologies substantially increased the throughput of microbial genome sequencing. To functionally annotate newly sequenced microbial genomes, a variety of experimental and computational methods are used. Integration of information from different sources is a powerful approach to enhance such annotation. Functional analysis of microbial genomes, necessary for downstream experiments, crucially depends on this annotation but it is hampered by the current lack of suitable information integration and exploration systems for microbial genomes. Results We developed a data warehouse system (INDIGO) that enables the integration of annotations for exploration and analysis of newly sequenced microbial genomes. INDIGO offers an opportunity to construct complex queries and combine annotations from multiple sources starting from genomic sequence to protein domain, gene ontology and pathway levels. This data warehouse is aimed at being populated with information from genomes of pure cultures and uncultured single cells of Red Sea bacteria and Archaea. Currently, INDIGO contains information from Salinisphaera shabanensis, Haloplasma contractile, and Halorhabdus tiamatea - extremophiles isolated from deep-sea anoxic brine lakes of the Red Sea. We provide examples of utilizing the system to gain new insights into specific aspects on the unique lifestyle and adaptations of these organisms to extreme environments. Conclusions We developed a data warehouse system, INDIGO, which enables comprehensive integration of information from various resources to be used for annotation, exploration and analysis of microbial genomes. It will be regularly updated and extended with new genomes. It is aimed to serve as a resource dedicated to the Red Sea microbes. In addition, through INDIGO, we provide our Automatic Annotation of Microbial Genomes (AAMG) pipeline. The INDIGO web server is freely available at http://www.cbrc.kaust.edu.sa/indigo.


Bioinformatics | 2012

Dragon PolyA Spotter: predictor of poly(A) motifs within human genomic DNA sequences.

Manal Kalkatawi; Farania Rangkuti; Michael Schramm; Boris R. Jankovic; Allan Anthony Kamau; Rajesh Chowdhary; John A. C. Archer; Vladimir B. Bajic

Motivation: Recognition of poly(A) signals in mRNA is relatively straightforward due to the presence of easily recognizable polyadenylic acid tail. However, the task of identifying poly(A) motifs in the primary genomic DNA sequence that correspond to poly(A) signals in mRNA is a far more challenging problem. Recognition of poly(A) signals is important for better gene annotation and understanding of the gene regulation mechanisms. In this work, we present one such poly(A) motif prediction method based on properties of human genomic DNA sequence surrounding a poly(A) motif. These properties include thermodynamic, physico-chemical and statistical characteristics. For predictions, we developed Artificial Neural Network and Random Forest models. These models are trained to recognize 12 most common poly(A) motifs in human DNA. Our predictors are available as a free web-based tool accessible at http://cbrc.kaust.edu.sa/dps. Compared with other reported predictors, our models achieve higher sensitivity and specificity and furthermore provide a consistent level of accuracy for 12 poly(A) motif variants. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.


Scientific Reports | 2016

Metabolic traits of an uncultured archaeal lineage -MSBL1- from brine pools of the Red Sea

Romano Mwirichia; Intikhab Alam; Mamoon Rashid; Manikandan Vinu; Wail Ba-alawi; Allan Anthony Kamau; David Kamanda Ngugi; Markus Göker; Hans-Peter Klenk; Vladimir B. Bajic; Ulrich Stingl

The candidate Division MSBL1 (Mediterranean Sea Brine Lakes 1) comprises a monophyletic group of uncultured archaea found in different hypersaline environments. Previous studies propose methanogenesis as the main metabolism. Here, we describe a metabolic reconstruction of MSBL1 based on 32 single-cell amplified genomes from Brine Pools of the Red Sea (Atlantis II, Discovery, Nereus, Erba and Kebrit). Phylogeny based on rRNA genes as well as conserved single copy genes delineates the group as a putative novel lineage of archaea. Our analysis shows that MSBL1 may ferment glucose via the Embden–Meyerhof–Parnas pathway. However, in the absence of organic carbon, carbon dioxide may be fixed via the ribulose bisphosphate carboxylase, Wood-Ljungdahl pathway or reductive TCA cycle. Therefore, based on the occurrence of genes for glycolysis, absence of the core genes found in genomes of all sequenced methanogens and the phylogenetic position, we hypothesize that the MSBL1 are not methanogens, but probably sugar-fermenting organisms capable of autotrophic growth. Such a mixotrophic lifestyle would confer survival advantage (or possibly provide a unique narrow niche) when glucose and other fermentable sugars are not available.


Applied and Environmental Microbiology | 2016

Comprehensive Genomic Analyses of the OM43 Clade, Including a Novel Species from the Red Sea, Indicate Ecotype Differentiation among Marine Methylotrophs.

Francy Jimenez-Infante; David Kamanda Ngugi; Manikandan Vinu; Intikhab Alam; Allan Anthony Kamau; Jochen Blom; Vladimir B. Bajic; Ulrich Stingl

ABSTRACT The OM43 clade within the family Methylophilaceae of Betaproteobacteria represents a group of methylotrophs that play important roles in the metabolism of C1 compounds in marine environments and other aquatic environments around the globe. Using dilution-to-extinction cultivation techniques, we successfully isolated a novel species of this clade (here designated MBRS-H7) from the ultraoligotrophic open ocean waters of the central Red Sea. Phylogenomic analyses indicate that MBRS-H7 is a novel species that forms a distinct cluster together with isolate KB13 from Hawaii (Hawaii-Red Sea [H-RS] cluster) that is separate from the cluster represented by strain HTCC2181 (from the Oregon coast). Phylogenetic analyses using the robust 16S-23S internal transcribed spacer revealed a potential ecotype separation of the marine OM43 clade members, which was further confirmed by metagenomic fragment recruitment analyses that showed trends of higher abundance in low-chlorophyll and/or high-temperature provinces for the H-RS cluster but a preference for colder, highly productive waters for the HTCC2181 cluster. This potential environmentally driven niche differentiation is also reflected in the metabolic gene inventories, which in the case of the H-RS cluster include those conferring resistance to high levels of UV irradiation, temperature, and salinity. Interestingly, we also found different energy conservation modules between these OM43 subclades, namely, the existence of the NADH:quinone oxidoreductase complex I (NUO) system in the H-RS cluster and the nonhomologous NADH:quinone oxidoreductase (NQR) system in the HTCC2181 cluster, which might have implications for their overall energetic yields.


FEMS Microbiology Ecology | 2014

Genomic differentiation among two strains of the PS1 clade isolated from geographically separated marine habitats

Francy Jimenez-Infante; David Kamanda Ngugi; Intikhab Alam; Mamoon Rashid; Wail Ba-alawi; Allan Anthony Kamau; Vladimir B. Bajic; Ulrich Stingl

Using dilution-to-extinction cultivation, we isolated a strain affiliated with the PS1 clade from surface waters of the Red Sea. Strain RS24 represents the second isolate of this group of marine Alphaproteobacteria after IMCC14465 that was isolated from the East (Japan) Sea. The PS1 clade is a sister group to the OCS116 clade, together forming a putatively novel order closely related to Rhizobiales. While most genomic features and most of the genetic content are conserved between RS24 and IMCC14465, their average nucleotide identity (ANI) is < 81%, suggesting two distinct species of the PS1 clade. Next to encoding two different variants of proteorhodopsin genes, they also harbor several unique genomic islands that contain genes related to degradation of aromatic compounds in IMCC14465 and in polymer degradation in RS24, possibly reflecting the physicochemical differences in the environment they were isolated from. No clear differences in abundance of the genomic content of either strain could be found in fragment recruitment analyses using different metagenomic datasets, in which both genomes were detectable albeit as minor part of the communities. The comparative genomic analysis of both isolates of the PS1 clade and the fragment recruitment analysis provide first insights into the ecology of this group.


Bioinformatics | 2013

Dragon TIS Spotter

Arturo Magana-Mora; Haitham Ashoor; Boris R. Jankovic; Allan Anthony Kamau; Karim Awara; Rajesh Chowdhary; John A. C. Archer; Vladimir B. Bajic

Summary: In higher eukaryotes, the identification of translation initiation sites (TISs) has been focused on finding these signals in cDNA or mRNA sequences. Using Arabidopsis thaliana (A.t.) information, we developed a prediction tool for signals within genomic sequences of plants that correspond to TISs. Our tool requires only genome sequence, not expressed sequences. Its sensitivity/specificity is for A.t. (90.75%/92.2%), for Vitis vinifera (66.8%/94.4%) and for Populus trichocarpa (81.6%/94.4%), which suggests that our tool can be used in annotation of different plant genomes. We provide a list of features used in our model. Further study of these features may improve our understanding of mechanisms of the translation initiation. Availability and implementation: Our tool is implemented as an artificial neural network. It is available as a web-based tool and, together with the source code, the list of features, and data used for model development, is accessible at http://cbrc.kaust.edu.sa/dts. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.


Reproductive Toxicology | 2012

DESTAF: a database of text-mined associations for reproductive toxins potentially affecting human fertility.

Adam Dawe; Aleksandar Radovanovic; Mandeep Kaur; Sunil Sagar; Sundararajan Vijayaraghava Seshadri; Ulf Schaefer; Allan Anthony Kamau; Alan Christoffels; Vladimir B. Bajic

The Dragon Exploration System for Toxicants and Fertility (DESTAF) is a publicly available resource which enables researchers to efficiently explore both known and potentially novel information and associations in the field of reproductive toxicology. To create DESTAF we used data from the literature (including over 10500 PubMed abstracts), several publicly available biomedical repositories, and specialized, curated dictionaries. DESTAF has an interface designed to facilitate rapid assessment of the key associations between relevant concepts, allowing for a more in-depth exploration of information based on different gene/protein-, enzyme/metabolite-, toxin/chemical-, disease- or anatomically centric perspectives. As a special feature, DESTAF allows for the creation and initial testing of potentially new association hypotheses that suggest links between biological entities identified through the database. DESTAF, along with a PDF manual, can be found at http://cbrc.kaust.edu.sa/destaf. It is free to academic and non-commercial users and will be updated quarterly.


Bioinformatics | 2013

Dragon PolyA Spotter

Manal Kalkatawi; Farania Rangkuti; Michael Schramm; Boris R. Jankovic; Allan Anthony Kamau; Rajesh Chowdhary; John A. C. Archer; Vladimir B. Bajic

Motivation: Recognition of poly(A) signals in mRNA is relatively straightforward due to the presence of easily recognizable polyadenylic acid tail. However, the task of identifying poly(A) motifs in the primary genomic DNA sequence that correspond to poly(A) signals in mRNA is a far more challenging problem. Recognition of poly(A) signals is important for better gene annotation and understanding of the gene regulation mechanisms. In this work, we present one such poly(A) motif prediction method based on properties of human genomic DNA sequence surrounding a poly(A) motif. These properties include thermodynamic, physico-chemical and statistical characteristics. For predictions, we developed Artificial Neural Network and Random Forest models. These models are trained to recognize 12 most common poly(A) motifs in human DNA. Our predictors are available as a free web-based tool accessible at http://cbrc.kaust.edu.sa/dps. Compared with other reported predictors, our models achieve higher sensitivity and specificity and furthermore provide a consistent level of accuracy for 12 poly(A) motif variants. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.


Frontiers in Plant Science | 2018

The Genome Sequence of the Wild Tomato Solanum pimpinellifolium Provides Insights Into Salinity Tolerance

Rozaimi Razali; Salim Bougouffa; Mitchell J. L. Morton; Damien J. Lightfoot; Intikhab Alam; Magbubah Essack; Stefan T. Arold; Allan Anthony Kamau; Sandra M. Schmöckel; Yveline Pailles; Mohammed Shahid; Craig T. Michell; Salim Al-Babili; Yung Shwen Ho; Mark Tester; Vladimir B. Bajic; Sónia Negrão

Solanum pimpinellifolium, a wild relative of cultivated tomato, offers a wealth of breeding potential for desirable traits such as tolerance to abiotic and biotic stresses. Here, we report the genome assembly and annotation of S. pimpinellifolium ‘LA0480.’ Moreover, we present phenotypic data from one field experiment that demonstrate a greater salinity tolerance for fruit- and yield-related traits in S. pimpinellifolium compared with cultivated tomato. The ‘LA0480’ genome assembly size (811 Mb) and the number of annotated genes (25,970) are within the range observed for other sequenced tomato species. We developed and utilized the Dragon Eukaryotic Analyses Platform (DEAP) to functionally annotate the ‘LA0480’ protein-coding genes. Additionally, we used DEAP to compare protein function between S. pimpinellifolium and cultivated tomato. Our data suggest enrichment in genes involved in biotic and abiotic stress responses. To understand the genomic basis for these differences in S. pimpinellifolium and S. lycopersicum, we analyzed 15 genes that have previously been shown to mediate salinity tolerance in plants. We show that S. pimpinellifolium has a higher copy number of the inositol-3-phosphate synthase and phosphatase genes, which are both key enzymes in the production of inositol and its derivatives. Moreover, our analysis indicates that changes occurring in the inositol phosphate pathway may contribute to the observed higher salinity tolerance in ‘LA0480.’ Altogether, our work provides essential resources to understand and unlock the genetic and breeding potential of S. pimpinellifolium, and to discover the genomic basis underlying its environmental robustness.


Biotechnology Reports | 2018

Genome sequence analysis of Zooshikella ganghwensis strain VG4 and its potential for the synthesis of antimicrobial metabolites

Zahid Ur Rehman; Intikhab Alam; Allan Anthony Kamau; Vladimir B. Bajic; TorOve Leiknes

Highlights • Draft genome sequence of Z. ganghwensis VG4 is reported.• Culture supernatant of Z. ganghwensis VG4 exhibit antimicrobial properties.• A total of 7634 genes are identified out of which 74% were annotated.• Z. ganghwensis VG4 has genetic potential to synthesize bioactive secondary metabolites, such as, polyketides and nonribosomal peptides.

Collaboration


Dive into the Allan Anthony Kamau's collaboration.

Top Co-Authors

Avatar

Vladimir B. Bajic

King Abdullah University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Intikhab Alam

King Abdullah University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Ulrich Stingl

King Abdullah University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Boris R. Jankovic

King Abdullah University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

David Kamanda Ngugi

King Abdullah University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

John A. C. Archer

King Abdullah University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Manal Kalkatawi

King Abdullah University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Farania Rangkuti

King Abdullah University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Francy Jimenez-Infante

King Abdullah University of Science and Technology

View shared research outputs
Researchain Logo
Decentralizing Knowledge