Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Robert Schmieder is active.

Publication


Featured researches published by Robert Schmieder.


Bioinformatics | 2011

Quality control and preprocessing of metagenomic datasets

Robert Schmieder; Robert Edwards

Summary: Here, we present PRINSEQ for easy and rapid quality control and data preprocessing of genomic and metagenomic datasets. Summary statistics of FASTA (and QUAL) or FASTQ files are generated in tabular and graphical form and sequences can be filtered, reformatted and trimmed by a variety of options to improve downstream analysis. Availability and Implementation: This open-source application was implemented in Perl and can be used as a stand alone version or accessed online through a user-friendly web interface. The source code, user help and additional information are available at http://prinseq.sourceforge.net/. Contact: [email protected]; [email protected]


PLOS ONE | 2011

Fast identification and removal of sequence contamination from genomic and metagenomic datasets.

Robert Schmieder; Robert Edwards

High-throughput sequencing technologies have strongly impacted microbiology, providing a rapid and cost-effective way of generating draft genomes and exploring microbial diversity. However, sequences obtained from impure nucleic acid preparations may contain DNA from sources other than the sample. Those sequence contaminations are a serious concern to the quality of the data used for downstream analysis, causing misassembly of sequence contigs and erroneous conclusions. Therefore, the removal of sequence contaminants is a necessary and required step for all sequencing projects. We developed DeconSeq, a robust framework for the rapid, automated identification and removal of sequence contamination in longer-read datasets (150 bp mean read length). DeconSeq is publicly available as standalone and web-based versions. The results can be exported for subsequent analysis, and the databases used for the web-based version are automatically updated on a regular basis. DeconSeq categorizes possible contamination sequences, eliminates redundant hits with higher similarity to non-contaminant genomes, and provides graphical visualizations of the alignment results and classifications. Using DeconSeq, we conducted an analysis of possible human DNA contamination in 202 previously published microbial and viral metagenomes and found possible contamination in 145 (72%) metagenomes with as high as 64% contaminating sequences. This new framework allows scientists to automatically detect and efficiently remove unwanted sequence contamination from their datasets while eliminating critical limitations of current methods. DeconSeqs web interface is simple and user-friendly. The standalone version allows offline analysis and integration into existing data processing pipelines. DeconSeqs results reveal whether the sequencing experiment has succeeded, whether the correct sample was sequenced, and whether the sample contains any sequence contamination from DNA preparation or host. In addition, the analysis of 202 metagenomes demonstrated significant contamination of the non-human associated metagenomes, suggesting that this method is appropriate for screening all metagenomes. DeconSeq is available at http://deconseq.sourceforge.net/.


PLOS ONE | 2009

Metagenomic Analysis of Respiratory Tract DNA Viral Communities in Cystic Fibrosis and Non-Cystic Fibrosis Individuals

Dana Willner; Mike Furlan; Matthew Haynes; Robert Schmieder; Florent E. Angly; Joás L. da Silva; Sassan Tammadoni; Bahador Nosrat; Douglas Conrad; Forest Rohwer

The human respiratory tract is constantly exposed to a wide variety of viruses, microbes and inorganic particulates from environmental air, water and food. Physical characteristics of inhaled particles and airway mucosal immunity determine which viruses and microbes will persist in the airways. Here we present the first metagenomic study of DNA viral communities in the airways of diseased and non-diseased individuals. We obtained sequences from sputum DNA viral communities in 5 individuals with cystic fibrosis (CF) and 5 individuals without the disease. Overall, diversity of viruses in the airways was low, with an average richness of 175 distinct viral genotypes. The majority of viral diversity was uncharacterized. CF phage communities were highly similar to each other, whereas Non-CF individuals had more distinct phage communities, which may reflect organisms in inhaled air. CF eukaryotic viral communities were dominated by a few viruses, including human herpesviruses and retroviruses. Functional metagenomics showed that all Non-CF viromes were similar, and that CF viromes were enriched in aromatic amino acid metabolism. The CF metagenomes occupied two different metabolic states, probably reflecting different disease states. There was one outlying CF virome which was characterized by an over-representation of Guanosine-5′-triphosphate,3′-diphosphate pyrophosphatase, an enzyme involved in the bacterial stringent response. Unique environments like the CF airway can drive functional adaptations, leading to shifts in metabolic profiles. These results have important clinical implications for CF, indicating that therapeutic measures may be more effective if used to change the respiratory environment, as opposed to shifting the taxonomic composition of resident microbiota.


PLOS Computational Biology | 2009

The GAAS Metagenomic Tool and Its Estimations of Viral and Microbial Average Genome Size in Four Major Biomes

Florent E. Angly; Dana Willner; Alejandra Prieto-Davó; Robert Edwards; Robert Schmieder; Rebecca Vega-Thurber; Dionysios A. Antonopoulos; Katie L. Barott; Matthew T. Cottrell; Christelle Desnues; Elizabeth A. Dinsdale; Mike Furlan; Matthew Haynes; Matthew R. Henn; Yongfei Hu; David L. Kirchman; Tracey McDole; John D. McPherson; Folker Meyer; R. Michael Miller; Egbert Mundt; Robert K. Naviaux; Beltran Rodriguez-Mueller; Rick Stevens; Linda Wegley; Lixin Zhang; Baoli Zhu; Forest Rohwer

Metagenomic studies characterize both the composition and diversity of uncultured viral and microbial communities. BLAST-based comparisons have typically been used for such analyses; however, sampling biases, high percentages of unknown sequences, and the use of arbitrary thresholds to find significant similarities can decrease the accuracy and validity of estimates. Here, we present Genome relative Abundance and Average Size (GAAS), a complete software package that provides improved estimates of community composition and average genome length for metagenomes in both textual and graphical formats. GAAS implements a novel methodology to control for sampling bias via length normalization, to adjust for multiple BLAST similarities by similarity weighting, and to select significant similarities using relative alignment lengths. In benchmark tests, the GAAS method was robust to both high percentages of unknown sequences and to variations in metagenomic sequence read lengths. Re-analysis of the Sargasso Sea virome using GAAS indicated that standard methodologies for metagenomic analysis may dramatically underestimate the abundance and importance of organisms with small genomes in environmental systems. Using GAAS, we conducted a meta-analysis of microbial and viral average genome lengths in over 150 metagenomes from four biomes to determine whether genome lengths vary consistently between and within biomes, and between microbial and viral communities from the same environment. Significant differences between biomes and within aquatic sub-biomes (oceans, hypersaline systems, freshwater, and microbialites) suggested that average genome length is a fundamental property of environments driven by factors at the sub-biome level. The behavior of paired viral and microbial metagenomes from the same environment indicated that microbial and viral average genome sizes are independent of each other, but indicative of community responses to stressors and environmental conditions.


PLOS ONE | 2011

Broad surveys of DNA viral diversity obtained through viral metagenomics of mosquitoes.

Terry Fei Fan Ng; Dana Willner; Yan Wei Lim; Robert Schmieder; Betty Chau; Christina Nilsson; Simon J. Anthony; Yijun Ruan; Forest Rohwer; Mya Breitbart

Viruses are the most abundant and diverse genetic entities on Earth; however, broad surveys of viral diversity are hindered by the lack of a universal assay for viruses and the inability to sample a sufficient number of individual hosts. This study utilized vector-enabled metagenomics (VEM) to provide a snapshot of the diversity of DNA viruses present in three mosquito samples from San Diego, California. The majority of the sequences were novel, suggesting that the viral community in mosquitoes, as well as the animal and plant hosts they feed on, is highly diverse and largely uncharacterized. Each mosquito sample contained a distinct viral community. The mosquito viromes contained sequences related to a broad range of animal, plant, insect and bacterial viruses. Animal viruses identified included anelloviruses, circoviruses, herpesviruses, poxviruses, and papillomaviruses, which mosquitoes may have obtained from vertebrate hosts during blood feeding. Notably, sequences related to human papillomaviruses were identified in one of the mosquito samples. Sequences similar to plant viruses were identified in all mosquito viromes, which were potentially acquired through feeding on plant nectar. Numerous bacteriophages and insect viruses were also detected, including a novel densovirus likely infecting Culex erythrothorax. Through sampling insect vectors, VEM enables broad survey of viral diversity and has significantly increased our knowledge of the DNA viruses present in mosquitoes.


Future Microbiology | 2012

Insights into antibiotic resistance through metagenomic approaches

Robert Schmieder; Robert Edwards

The consequences of bacterial infections have been curtailed by the introduction of a wide range of antibiotics. However, infections continue to be a leading cause of mortality, in part due to the evolution and acquisition of antibiotic-resistance genes. Antibiotic misuse and overprescription have created a driving force influencing the selection of resistance. Despite the problem of antibiotic resistance in infectious bacteria, little is known about the diversity, distribution and origins of resistance genes, especially for the unculturable majority of environmental bacteria. Functional and sequence-based metagenomics have been used for the discovery of novel resistance determinants and the improved understanding of antibiotic-resistance mechanisms in clinical and natural environments. This review discusses recent findings and future challenges in the study of antibiotic resistance through metagenomic approaches.


BMC Bioinformatics | 2010

TagCleaner: Identification and removal of tag sequences from genomic and metagenomic datasets

Robert Schmieder; Yan Wei Lim; Forest Rohwer; Robert Edwards

BackgroundSequencing metagenomes that were pre-amplified with primer-based methods requires the removal of the additional tag sequences from the datasets. The sequenced reads can contain deletions or insertions due to sequencing limitations, and the primer sequence may contain ambiguous bases. Furthermore, the tag sequence may be unavailable or incorrectly reported. Because of the potential for downstream inaccuracies introduced by unwanted sequence contaminations, it is important to use reliable tools for pre-processing sequence data.ResultsTagCleaner is a web application developed to automatically identify and remove known or unknown tag sequences allowing insertions and deletions in the dataset. TagCleaner is designed to filter the trimmed reads for duplicates, short reads, and reads with high rates of ambiguous sequences. An additional screening for and splitting of fragment-to-fragment concatenations that gave rise to artificial concatenated sequences can increase the quality of the dataset. Users may modify the different filter parameters according to their own preferences.ConclusionsTagCleaner is a publicly available web application that is able to automatically detect and efficiently remove tag sequences from metagenomic datasets. It is easily configurable and provides a user-friendly interface. The interactive web interface facilitates export functionality for subsequent data processing, and is available at http://edwards.sdsu.edu/tagcleaner.


The ISME Journal | 2012

Spatial distribution of microbial communities in the cystic fibrosis lung

Dana Willner; Matthew Haynes; Mike Furlan; Robert Schmieder; Yan Wei Lim; Paul B. Rainey; Forest Rohwer; Douglas Conrad

Cystic fibrosis (CF) is a common fatal genetic disorder with mortality most often resulting from microbial infections of the lungs. Culture-independent studies of CF-associated microbial communities have indicated that microbial diversity in the CF airways is much higher than suggested by culturing alone. However, these studies have relied on indirect methods to sample the CF lung such as expectorated sputum and bronchoalveolar lavage (BAL). Here, we characterize the diversity of microbial communities in tissue sections from anatomically distinct regions of the CF lung using barcoded 16S amplicon pyrosequencing. Microbial communities differed significantly between different areas of the lungs, and few taxa were common to microbial communities in all anatomical regions surveyed. Our results indicate that CF lung infections are not only polymicrobial, but also spatially heterogeneous suggesting that treatment regimes tailored to dominant populations in sputum or BAL samples may be ineffective against infections in some areas of the lung.


Bioinformatics | 2012

Identification and removal of ribosomal RNA sequences from metatranscriptomes

Robert Schmieder; Yan Wei Lim; Robert Edwards

Summary: Here, we present riboPicker, a robust framework for the rapid, automated identification and removal of ribosomal RNA sequences from metatranscriptomic datasets. The results can be exported for subsequent analysis, and the databases used for the web-based version are updated on a regular basis. riboPicker categorizes rRNA-like sequences and provides graphical visualizations and tabular outputs of ribosomal coverage, alignment results and taxonomic classifications. Availability and implementation: This open-source application was implemented in Perl and can be used as stand-alone version or accessed online through a user-friendly web interface. The source code, user help and additional information is available at http://ribopicker.sourceforge.net/. Contact: [email protected]; [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.


Proceedings of the National Academy of Sciences of the United States of America | 2011

Metagenomic detection of phage-encoded platelet-binding factors in the human oral cavity

Dana Willner; Mike Furlan; Robert Schmieder; Juris A. Grasis; David T. Pride; David A. Relman; Florent E. Angly; Tracey McDole; Ray P. Mariella; Forest Rohwer; Matthew Haynes

The human oropharynx is a reservoir for many potential pathogens, including streptococcal species that cause endocarditis. Although oropharyngeal microbes have been well described, viral communities are essentially uncharacterized. We conducted a metagenomic study to determine the composition of oropharyngeal DNA viral communities (both phage and eukaryotic viruses) in healthy individuals and to evaluate oropharyngeal swabs as a rapid method for viral detection. Viral DNA was extracted from 19 pooled oropharyngeal swabs and sequenced. Viral communities consisted almost exclusively of phage, and complete genomes of several phage were recovered, including Escherichia coli phage T3, Propionibacterium acnes phage PA6, and Streptococcus mitis phage SM1. Phage relative abundances changed dramatically depending on whether samples were chloroform treated or filtered to remove microbial contamination. pblA and pblB genes of phage SM1 were detected in the metagenomes. pblA and pblB mediate the attachment of S. mitis to platelets and play a significant role in S. mitis virulence in the endocardium, but have never previously been detected in the oral cavity. These genes were also identified in salivary metagenomes from three individuals at three time points and in individual saliva samples by PCR. Additionally, we demonstrate that phage SM1 can be induced by commonly ingested substances. Our results indicate that the oral cavity is a reservoir for pblA and pblB genes and for phage SM1 itself. Further studies will determine the association between pblA and pblB genes in the oral cavity and the risk of endocarditis.

Collaboration


Dive into the Robert Schmieder's collaboration.

Top Co-Authors

Avatar

Robert Edwards

San Diego State University

View shared research outputs
Top Co-Authors

Avatar

Forest Rohwer

San Diego State University

View shared research outputs
Top Co-Authors

Avatar

Yan Wei Lim

San Diego State University

View shared research outputs
Top Co-Authors

Avatar

Matthew Haynes

San Diego State University

View shared research outputs
Top Co-Authors

Avatar

Mike Furlan

San Diego State University

View shared research outputs
Top Co-Authors

Avatar

Dana Willner

University of Queensland

View shared research outputs
Top Co-Authors

Avatar

Douglas Conrad

University of California

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Mya Breitbart

University of South Florida

View shared research outputs
Researchain Logo
Decentralizing Knowledge