Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Bruno W. S. Sobral is active.

Publication


Featured researches published by Bruno W. S. Sobral.


Nucleic Acids Research | 2014

PATRIC, the bacterial bioinformatics database and analysis resource

Alice R. Wattam; David Abraham; Oral Dalay; Terry Disz; Timothy Driscoll; Joseph L. Gabbard; Joseph J. Gillespie; Roger Gough; Deborah Hix; Ronald W. Kenyon; Dustin Machi; Chunhong Mao; Eric K. Nordberg; Robert Olson; Ross Overbeek; Gordon D. Pusch; Maulik Shukla; Julie Schulman; Rick Stevens; Daniel E. Sullivan; Veronika Vonstein; Andrew S. Warren; Rebecca Will; Meredith J. C. Wilson; Hyun Seung Yoo; Chengdong Zhang; Yan Zhang; Bruno W. S. Sobral

The Pathosystems Resource Integration Center (PATRIC) is the all-bacterial Bioinformatics Resource Center (BRC) (http://www.patricbrc.org). A joint effort by two of the original National Institute of Allergy and Infectious Diseases-funded BRCs, PATRIC provides researchers with an online resource that stores and integrates a variety of data types [e.g. genomics, transcriptomics, protein–protein interactions (PPIs), three-dimensional protein structures and sequence typing data] and associated metadata. Datatypes are summarized for individual genomes and across taxonomic levels. All genomes in PATRIC, currently more than 10 000, are consistently annotated using RAST, the Rapid Annotations using Subsystems Technology. Summaries of different data types are also provided for individual genes, where comparisons of different annotations are available, and also include available transcriptomic data. PATRIC provides a variety of ways for researchers to find data of interest and a private workspace where they can store both genomic and gene associations, and their own private data. Both private and public data can be analyzed together using a suite of tools to perform comparative genomic or transcriptomic analysis. PATRIC also includes integrated information related to disease and PPIs. All the data and integrated analysis and visualization tools are freely available. This manuscript describes updates to the PATRIC since its initial report in the 2007 NAR Database Issue.


PLOS ONE | 2007

Plasmids and Rickettsial Evolution: Insight from Rickettsia felis

Joseph J. Gillespie; Magda S. Beier; M. Sayeedur Rahman; Nicole C. Ammerman; Joshua M. Shallom; Anjan Purkayastha; Bruno W. S. Sobral; Abdu F. Azad

Background The genome sequence of Rickettsia felis revealed a number of rickettsial genetic anomalies that likely contribute not only to a large genome size relative to other rickettsiae, but also to phenotypic oddities that have confounded the categorization of R. felis as either typhus group (TG) or spotted fever group (SFG) rickettsiae. Most intriguing was the first report from rickettsiae of a conjugative plasmid (pRF) that contains 68 putative open reading frames, several of which are predicted to encode proteins with high similarity to conjugative machinery in other plasmid-containing bacteria. Methodology/Principal Findings Using phylogeny estimation, we determined the mode of inheritance of pRF genes relative to conserved rickettsial chromosomal genes. Phylogenies of chromosomal genes were in agreement with other published rickettsial trees. However, phylogenies including pRF genes yielded different topologies and suggest a close relationship between pRF and ancestral group (AG) rickettsiae, including the recently completed genome of R. bellii str. RML369-C. This relatedness is further supported by the distribution of pRF genes across other rickettsiae, as 10 pRF genes (or inactive derivatives) also occur in AG (but not SFG) rickettsiae, with five of these genes characteristic of typical plasmids. Detailed characterization of pRF genes resulted in two novel findings: the identification of oriV and replication termination regions, and the likelihood that a second proposed plasmid, pRFδ, is an artifact of the original genome assembly. Conclusion/Significance Altogether, we propose a new rickettsial classification scheme with the addition of a fourth lineage, transitional group (TRG) rickettsiae, that is unique from TG and SFG rickettsiae and harbors genes from possible exchanges with AG rickettsiae via conjugation. We offer insight into the evolution of a plastic plasmid system in rickettsiae, including the role plasmids may have played in the acquirement of virulence traits in pathogenic strains, and the likely origin of plasmids within the rickettsial tree.


BMC Genomics | 2010

Transcriptome sequencing and comparative analysis of cucumber flowers with different sex types

Shaogui Guo; Yi Zheng; Je-Gun Joung; Shiqiang Liu; Zhonghua Zhang; Oswald Crasta; Bruno W. S. Sobral; Yong Xu; Sanwen Huang; Zhangjun Fei

BackgroundCucumber, Cucumis sativus L., is an economically and nutritionally important crop of the Cucurbitaceae family and has long served as a primary model system for sex determination studies. Recently, the sequencing of its whole genome has been completed. However, transcriptome information of this species is still scarce, with a total of around 8,000 Expressed Sequence Tag (EST) and mRNA sequences currently available in GenBank. In order to gain more insights into molecular mechanisms of plant sex determination and provide the community a functional genomics resource that will facilitate cucurbit research and breeding, we performed transcriptome sequencing of cucumber flower buds of two near-isogenic lines, WI1983G, a gynoecious plant which bears only pistillate flowers, and WI1983H, a hermaphroditic plant which bears only bisexual flowers.ResultUsing Roche-454 massive parallel pyrosequencing technology, we generated a total of 353,941 high quality EST sequences with an average length of 175bp, among which 188,255 were from gynoecious flowers and 165,686 from hermaphroditic flowers. These EST sequences, together with ~5,600 high quality cucumber EST and mRNA sequences available in GenBank, were clustered and assembled into 81,401 unigenes, of which 28,452 were contigs and 52,949 were singletons. The unigenes and ESTs were further mapped to the cucumber genome and more than 500 alternative splicing events were identified in 443 cucumber genes. The unigenes were further functionally annotated by comparing their sequences to different protein and functional domain databases and assigned with Gene Ontology (GO) terms. A biochemical pathway database containing 343 predicted pathways was also created based on the annotations of the unigenes. Digital expression analysis identified ~200 differentially expressed genes between flowers of WI1983G and WI1983H and provided novel insights into molecular mechanisms of plant sex determination process. Furthermore, a set of SSR motifs and high confidence SNPs between WI1983G and WI1983H were identified from the ESTs, which provided the material basis for future genetic linkage and QTL analysis.ConclusionA large set of EST sequences were generated from cucumber flower buds of two different sex types. Differentially expressed genes between these two different sex-type flowers, as well as putative SSR and SNP markers, were identified. These EST sequences provide valuable information to further understand molecular mechanisms of plant sex determination process and forms a rich resource for future functional genomics analysis, marker development and cucumber breeding.


PLOS ONE | 2008

Rickettsia Phylogenomics: Unwinding the Intricacies of Obligate Intracellular Life

Joseph J. Gillespie; Kelly P. Williams; Maulik Shukla; Eric E. Snyder; Eric K. Nordberg; Shane M. Ceraul; Chitti Dharmanolla; Daphne Rainey; Jeetendra Soneja; Joshua M. Shallom; Nataraj Dongre Vishnubhat; Rebecca Wattam; Anjan Purkayastha; Michael J. Czar; Oswald Crasta; João C. Setubal; Abdu F. Azad; Bruno W. S. Sobral

Background Completed genome sequences are rapidly increasing for Rickettsia, obligate intracellular α-proteobacteria responsible for various human diseases, including epidemic typhus and Rocky Mountain spotted fever. In light of phylogeny, the establishment of orthologous groups (OGs) of open reading frames (ORFs) will distinguish the core rickettsial genes and other group specific genes (class 1 OGs or C1OGs) from those distributed indiscriminately throughout the rickettsial tree (class 2 OG or C2OGs). Methodology/Principal Findings We present 1823 representative (no gene duplications) and 259 non-representative (at least one gene duplication) rickettsial OGs. While the highly reductive (∼1.2 MB) Rickettsia genomes range in predicted ORFs from 872 to 1512, a core of 752 OGs was identified, depicting the essential Rickettsia genes. Unsurprisingly, this core lacks many metabolic genes, reflecting the dependence on host resources for growth and survival. Additionally, we bolster our recent reclassification of Rickettsia by identifying OGs that define the AG (ancestral group), TG (typhus group), TRG (transitional group), and SFG (spotted fever group) rickettsiae. OGs for insect-associated species, tick-associated species and species that harbor plasmids were also predicted. Through superimposition of all OGs over robust phylogeny estimation, we discern between C1OGs and C2OGs, the latter depicting genes either decaying from the conserved C1OGs or acquired laterally. Finally, scrutiny of non-representative OGs revealed high levels of split genes versus gene duplications, with both phenomena confounding gene orthology assignment. Interestingly, non-representative OGs, as well as OGs comprised of several gene families typically involved in microbial pathogenicity and/or the acquisition of virulence factors, fall predominantly within C2OG distributions. Conclusion/Significance Collectively, we determined the relative conservation and distribution of 14354 predicted ORFs from 10 rickettsial genomes across robust phylogeny estimation. The data, available at PATRIC (PathoSystems Resource Integration Center), provide novel information for unwinding the intricacies associated with Rickettsia pathogenesis, expanding the range of potential diagnostic, vaccine and therapeutic targets.


PLOS ONE | 2009

An Anomalous Type IV Secretion System in Rickettsia Is Evolutionarily Conserved

Joseph J. Gillespie; Nicole C. Ammerman; Sheila M. Dreher-Lesnick; M. Sayeedur Rahman; Micah J. Worley; João C. Setubal; Bruno W. S. Sobral; Abdu F. Azad

Background Bacterial type IV secretion systems (T4SSs) comprise a diverse transporter family functioning in conjugation, competence, and effector molecule (DNA and/or protein) translocation. Thirteen genome sequences from Rickettsia, obligate intracellular symbionts/pathogens of a wide range of eukaryotes, have revealed a reduced T4SS relative to the Agrobacterium tumefaciens archetype (vir). However, the Rickettsia T4SS has not been functionally characterized for its role in symbiosis/virulence, and none of its substrates are known. Results Superimposition of T4SS structural/functional information over previously identified Rickettsia components implicate a functional Rickettsia T4SS. virB4, virB8 and virB9 are duplicated, yet only one copy of each has the conserved features of similar genes in other T4SSs. An extraordinarily duplicated VirB6 gene encodes five hydrophobic proteins conserved only in a short region known to be involved in DNA transfer in A. tumefaciens. virB1, virB2 and virB7 are newly identified, revealing a Rickettsia T4SS lacking only virB5 relative to the vir archetype. Phylogeny estimation suggests vertical inheritance of all components, despite gene rearrangements into an archipelago of five islets. Similarities of Rickettsia VirB7/VirB9 to ComB7/ComB9 proteins of ε-proteobacteria, as well as phylogenetic affinities to the Legionella lvh T4SS, imply the Rickettsiales ancestor acquired a vir-like locus from distantly related bacteria, perhaps while residing in a protozoan host. Modern modifications of these systems likely reflect diversification with various eukaryotic host cells. Conclusion We present the rvh (Rickettsiales vir homolog) T4SS, an evolutionary conserved transporter with an unknown role in rickettsial biology. This work lays the foundation for future laboratory characterization of this system, and also identifies the Legionella lvh T4SS as a suitable genetic model.


Veterinary Research | 2009

Louse- and flea-borne rickettsioses: biological and genomic analyses

Joseph J. Gillespie; Nicole C. Ammerman; Magda Beier-Sexton; Bruno W. S. Sobral; Abdu F. Azad

In contrast to 15 or more validated and/or proposed tick-borne spotted fever group species, only three named medically important rickettsial species are associated with insects. These insect-borne rickettsiae are comprised of two highly pathogenic species, Rickettsia prowazekii (the agent of epidemic typhus) and R. typhi (the agent of murine typhus), as well as R. felis, a species with unconfirmed pathogenicity. Rickettsial association with obligate hematophagous insects such as the human body louse (R. prowazekii transmitted by Pediculus h. humanus) and several flea species (R. typhi and R. felis, as well as R. prowazekii in sylvatic form) provides rickettsiae the potential for further multiplications, longer transmission cycles and rapid spread among susceptible human populations. Both human body lice and fleas are intermittent feeders capable of multiple blood meals per generation, facilitating the efficient transmission of rickettsiae to several disparate hosts within urban/rural ecosystems. While taking into consideration the existing knowledge of rickettsial biology and genomic attributes, we have analyzed and summarized the interacting features that are unique to both the rickettsiae and their vector fleas and lice. Furthermore, factors that underlie rickettsial changing ecology, where native mammalian populations are involved in the maintenance of rickettsial cycle and transmission, are discussed.


Virology Journal | 2009

Gene expression in primate liver during viral hemorrhagic fever

Mahmoud Djavani; Oswald Crasta; Yan Zhang; Juan Carlos Zapata; Bruno W. S. Sobral; Melissa G. Lechner; Joseph Bryant; Harry Davis; Maria S. Salvato

BackgroundRhesus macaques infected with lymphocytic choriomeningitis virus (LCMV) provide a model for human Lassa fever. Disease begins with flu-like symptoms and progresses rapidly with fatal consequences. Previously, we profiled the blood transcriptome of LCMV-infected monkeys (M. Djavani et al J. Virol. 2007) showing distinct pre-viremic and viremic stages that discriminated virulent from benign infections. In the present study, changes in liver gene expression from macaques infected with virulent LCMV-WE were compared to gene expression in uninfected monkeys as well as to monkeys that were infected but not diseased.ResultsBased on a functional pathway analysis of differentially expressed genes, virulent LCMV-WE had a broader effect on liver cell function than did infection with non-virulent LCMV-Armstrong. During the first few days after infection, LCMV altered expression of genes associated with energy production, including fatty acid and glucose metabolism. The transcriptome profile resembled that of an organism in starvation: mRNA for acetyl-CoA carboxylase, a key enzyme of fatty acid synthesis was reduced while genes for enzymes in gluconeogenesis were up-regulated. Expression was also altered for genes associated with complement and coagulation cascades, and with signaling pathways involving STAT1 and TGF-β.ConclusionMost of the 4500 differentially expressed transcripts represented a general response to both virulent and mild infections. However, approximately 250 of these transcripts had significantly different expression in virulent infections as compared to mild infections, with approximately 30 of these being differentially regulated during the pre-viremic stage of infection. The genes that are expressed early and differently in mild and virulent disease are potential biomarkers for prognosis and triage of acute viral disease.


PLOS Neglected Tropical Diseases | 2008

Targets of the Entamoeba histolytica Transcription Factor URE3-BP

Carol A. Gilchrist; Duza J. Baba; Yan Zhang; Oswald Crasta; Clive Evans; Elisabet Caler; Bruno W. S. Sobral; Christina B. Bousquet; Megan Leo; Ameilia Hochreiter; Sarah K. Connell; Barbara J. Mann; William A. Petri

The Entamoeba histolytica transcription factor Upstream Regulatory Element 3-Binding Protein (URE3-BP) is a calcium-responsive regulator of two E. histolytica virulence genes, hgl5 and fdx1. URE3-BP was previously identified by a yeast one-hybrid screen of E. histolytica proteins capable of binding to the sequence TATTCTATT (Upstream Regulatory Element 3 (URE3)) in the promoter regions of hgl5 and fdx1. In this work, precise definition of the consensus URE3 element was performed by electrophoretic mobility shift assays (EMSA) using base-substituted oligonucleotides, and the consensus motif validated using episomal reporter constructs. Transcriptome profiling of a strain induced to produce a dominant-positive URE3-BP was then used to identify additional genes regulated by URE3-BP. Fifty modulated transcripts were identified, and of these the EMSA defined motif T[atg]T[tc][cg]T[at][tgc][tg] was found in over half of the promoters (54% p<0.0001). Fifteen of the URE3-BP regulated genes were potential membrane proteins, suggesting that one function of URE3-BP is to remodel the surface of E. histolytica in response to a calcium signal. Induction of URE3-BP leads to an increase in tranwell migration, suggesting a possible role in the regulation of cellular motility.


Chemistry & Biodiversity | 2010

Data Integration for Dynamic and Sustainable Systems Biology Resources: Challenges and Lessons Learned

Daniel E. Sullivan; Joseph L. Gabbard; Maulik Shukla; Bruno W. S. Sobral

Systems‐biology and infectious‐disease (host–pathogen–environment) research and development is becoming increasingly dependent on integrating data from diverse and dynamic sources. Maintaining integrated resources over long periods of time presents distinct challenges. This review describes experiences and lessons learned from integrating data in two five‐year projects focused on pathosystems biology: the Pathosystems Resource Integration Center (PATRIC, http://patric.vbi.vt.edu/), with a goal of developing bioinformatics resources for the research and countermeasures‐development communities based on genomics data, and the Resource Center for Biodefense Proteomics Research (RCBPR, http://www.proteomicsresource.org/), with a goal of developing resources based on the experiment data such as microarray and proteomics data from diverse sources and technologies. Some challenges include integrating genomic sequence and experiment data, data synchronization, data quality control, and usability engineering. We present examples of a variety of data‐integration problems drawn from our experiences with PATRIC and RBPRC, as well as open research questions related to long‐term sustainability, and describe the next steps to meeting these challenges. Novel contributions of this work include 1) an approach for addressing discrepancies between experiment results and interpreted results, and 2) expanding the range of data‐integration techniques to include usability engineering at the presentation level.


PLOS ONE | 2014

Standardized Metadata for Human Pathogen/Vector Genomic Sequences

Vivien G. Dugan; Scott J. Emrich; Gloria I. Giraldo-Calderón; Omar S. Harb; Ruchi M. Newman; Brett E. Pickett; Lynn M. Schriml; Timothy B. Stockwell; Christian J. Stoeckert; Daniel E. Sullivan; Indresh Singh; Doyle V. Ward; Alison Yao; Jie Zheng; Tanya Barrett; Bruce W. Birren; Lauren M. Brinkac; Vincent M. Bruno; Elizabet Caler; Sinéad B. Chapman; Frank H. Collins; Christina A. Cuomo; Valentina Di Francesco; Scott Durkin; Mark Eppinger; Michael Feldgarden; Claire M. Fraser; W. Florian Fricke; Maria Giovanni; Matthew R. Henn

High throughput sequencing has accelerated the determination of genome sequences for thousands of human infectious disease pathogens and dozens of their vectors. The scale and scope of these data are enabling genotype-phenotype association studies to identify genetic determinants of pathogen virulence and drug/insecticide resistance, and phylogenetic studies to track the origin and spread of disease outbreaks. To maximize the utility of genomic sequences for these purposes, it is essential that metadata about the pathogen/vector isolate characteristics be collected and made available in organized, clear, and consistent formats. Here we report the development of the GSCID/BRC Project and Sample Application Standard, developed by representatives of the Genome Sequencing Centers for Infectious Diseases (GSCIDs), the Bioinformatics Resource Centers (BRCs) for Infectious Diseases, and the U.S. National Institute of Allergy and Infectious Diseases (NIAID), part of the National Institutes of Health (NIH), informed by interactions with numerous collaborating scientists. It includes mapping to terms from other data standards initiatives, including the Genomic Standards Consortium’s minimal information (MIxS) and NCBI’s BioSample/BioProjects checklists and the Ontology for Biomedical Investigations (OBI). The standard includes data fields about characteristics of the organism or environmental source of the specimen, spatial-temporal information about the specimen isolation event, phenotypic characteristics of the pathogen/vector isolated, and project leadership and support. By modeling metadata fields into an ontology-based semantic framework and reusing existing ontologies and minimum information checklists, the application standard can be extended to support additional project-specific data fields and integrated with other data represented with comparable standards. The use of this metadata standard by all ongoing and future GSCID sequencing projects will provide a consistent representation of these data in the BRC resources and other repositories that leverage these data, allowing investigators to identify relevant genomic sequences and perform comparative genomics analyses that are both statistically meaningful and biologically relevant.

Collaboration


Dive into the Bruno W. S. Sobral's collaboration.

Top Co-Authors

Avatar

Chunhong Mao

Virginia Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar

Daniel E. Sullivan

Virginia Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Maulik Shukla

Virginia Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Oswald Crasta

Virginia Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Hyun Seung Yoo

Virginia Bioinformatics Institute

View shared research outputs
Researchain Logo
Decentralizing Knowledge