Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Regina Bohnert is active.

Publication


Featured researches published by Regina Bohnert.


Proceedings of the National Academy of Sciences of the United States of America | 2009

Genomewide SNP variation reveals relationships among landraces and modern varieties of rice.

Kenneth L. McNally; Kevin L. Childs; Regina Bohnert; Rebecca M. Davidson; Keyan Zhao; Victor Jun Ulat; Georg Zeller; Richard M. Clark; Douglas R. Hoen; Thomas E. Bureau; Renee Stokowski; Dennis G. Ballinger; Kelly A. Frazer; D. R. Cox; Badri Padhukasahasram; Carlos Bustamante; Detlef Weigel; David J. Mackill; Richard Bruskiewich; Gunnar Rätsch; C. Robin Buell; Hei Leung; Jan E. Leach

Rice, the primary source of dietary calories for half of humanity, is the first crop plant for which a high-quality reference genome sequence from a single variety was produced. We used resequencing microarrays to interrogate 100 Mb of the unique fraction of the reference genome for 20 diverse varieties and landraces that capture the impressive genotypic and phenotypic diversity of domesticated rice. Here, we report the distribution of 160,000 nonredundant SNPs. Introgression patterns of shared SNPs revealed the breeding history and relationships among the 20 varieties; some introgressed regions are associated with agronomic traits that mark major milestones in rice improvement. These comprehensive SNP data provide a foundation for deep exploration of rice diversity and gene–trait relationships and their use for future rice improvement.


Nature | 2011

Multiple reference genomes and transcriptomes for Arabidopsis thaliana

Xiangchao Gan; Oliver Stegle; Jonas Behr; Joshua G. Steffen; Philipp Drewe; Katie L. Hildebrand; Rune Lyngsoe; Sebastian J. Schultheiss; Edward J. Osborne; Vipin T. Sreedharan; André Kahles; Regina Bohnert; Géraldine Jean; Paul S. Derwent; Paul J. Kersey; Eric J. Belfield; Nicholas P. Harberd; Eric Kemen; Christopher Toomajian; Paula X. Kover; Richard M. Clark; Gunnar Rätsch; Richard Mott

Genetic differences between Arabidopsis thaliana accessions underlie the plant’s extensive phenotypic variation, and until now these have been interpreted largely in the context of the annotated reference accession Col-0. Here we report the sequencing, assembly and annotation of the genomes of 18 natural A. thaliana accessions, and their transcriptomes. When assessed on the basis of the reference annotation, one-third of protein-coding genes are predicted to be disrupted in at least one accession. However, re-annotation of each genome revealed that alternative gene models often restore coding potential. Gene expression in seedlings differed for nearly half of expressed genes and was frequently associated with cis variants within 5 kilobases, as were intron retention alternative splicing events. Sequence and expression variation is most pronounced in genes that respond to the biotic environment. Our data further promote evolutionary and functional studies in A. thaliana, especially the MAGIC genetic reference population descended from these accessions.


Nucleic Acids Research | 2010

rQuant.web: a tool for RNA-Seq-based transcript quantitation

Regina Bohnert; Gunnar Rätsch

We provide a novel web service, called rQuant.web, allowing convenient access to tools for quantitative analysis of RNA sequencing data. The underlying quantitation technique rQuant is based on quadratic programming and estimates different biases induced by library preparation, sequencing and read mapping. It can tackle multiple transcripts per gene locus and is therefore particularly well suited to quantify alternative transcripts. rQuant.web is available as a tool in a Galaxy installation at http://galaxy.fml.mpg.de. Using rQuant.web is free of charge, it is open to all users, and there is no login requirement.


Nucleic Acids Research | 2013

Accurate detection of differential RNA processing

Philipp Drewe; Oliver Stegle; Lisa Hartmann; André Kahles; Regina Bohnert; Andreas Wachter; Karsten M. Borgwardt; Gunnar Rätsch

Deep transcriptome sequencing (RNA-Seq) has become a vital tool for studying the state of cells in the context of varying environments, genotypes and other factors. RNA-Seq profiling data enable identification of novel isoforms, quantification of known isoforms and detection of changes in transcriptional or RNA-processing activity. Existing approaches to detect differential isoform abundance between samples either require a complete isoform annotation or fall short in providing statistically robust and calibrated significance estimates. Here, we propose a suite of statistical tests to address these open needs: a parametric test that uses known isoform annotations to detect changes in relative isoform abundance and a non-parametric test that detects differential read coverages and can be applied when isoform annotations are not available. Both methods account for the discrete nature of read counts and the inherent biological variability. We demonstrate that these tests compare favorably to previous methods, both in terms of accuracy and statistical calibrations. We use these techniques to analyze RNA-Seq libraries from Arabidopsis thaliana and Drosophila melanogaster. The identified differential RNA processing events were consistent with RT–qPCR measurements and previous studies. The proposed toolkit is available from http://bioweb.me/rdiff and enables in-depth analyses of transcriptomes, with or without available isoform annotation.


BMC Bioinformatics | 2009

Transcript quantification with RNA-Seq data

Regina Bohnert; Jonas Behr; Gunnar Rätsch

Motivation Novel high-throughput sequencing technologies open exciting new approaches to transcriptome profiling. Sequencing transcript populations of interest, e.g. from different tissues or variable stress conditions, with RNA sequencing (RNA-Seq) [1] generates millions of short reads. Accurately aligned to a reference genome, they provide digital counts and thus facilitate transcript quantification. As the observed read counts only provide the summation of all expressed sequences at one locus, the inference of the underlying transcript abundances is crucial for further quantitative analyses.


Bioinformatics | 2014

Oqtans: The RNA-seq Workbench in the Cloud for Complete and Reproducible Quantitative Transcriptome Analysis

Vipin T. Sreedharan; Sebastian J. Schultheiss; Géraldine Jean; André Kahles; Regina Bohnert; Philipp Drewe; Pramod Kaushik Mudrakarta; Nico Görnitz; Georg Zeller; Gunnar Rätsch

We present Oqtans, an open-source workbench for quantitative transcriptome analysis, that is integrated in Galaxy. Its distinguishing features include customizable computational workflows and a modular pipeline architecture that facilitates comparative assessment of tool and data quality. Oqtans integrates an assortment of machine learning-powered tools into Galaxy, which show superior or equal performance to state-of-the-art tools. Implemented tools comprise a complete transcriptome analysis workflow: short-read alignment, transcript identification/quantification and differential expression analysis. Oqtans and Galaxy facilitate persistent storage, data exchange and documentation of intermediate results and analysis workflows. We illustrate how Oqtans aids the interpretation of data from different experiments in easy to understand use cases. Users can easily create their own workflows and extend Oqtans by integrating specific tools. Oqtans is available as (i) a cloud machine image with a demo instance at cloud.oqtans.org, (ii) a public Galaxy instance at galaxy.cbio.mskcc.org, (iii) a git repository containing all installed software (oqtans.org/git); most of which is also available from (iv) the Galaxy Toolshed and (v) a share string to use along with Galaxy CloudMan. Contact: [email protected], [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.


BMC Bioinformatics | 2010

Next generation genome annotation with mGene.ngs

Jonas Behr; Regina Bohnert; Georg Zeller; Gabriele Schweikert; Lisa Hartmann; Gunnar Rätsch

An increasingly large number of novel genomes is being sequenced and the task of automatic genome annotation has never been more important. The current revolution in sequencing technologies also allows us to obtain a detailed picture of the whole complement of expressed RNA transcripts. We have developed a novel de novo gene finding system mGene.ngs that combines the benefits of accurate ab initio gene finding with the rich information obtained in RNA sequencing (RNA-seq) experiments. The system is based on the recently developed accurate gene finding system mGene [1], which employs state-of-the-art prediction techniques and which has been shown to perform very well compared to established gene finding systems [2]. In contrast to many HMM-based gene finders, mGene has the conceptual advantage of being very flexible in terms of incorporating heterogeneous input data. The employed inference techniques can exploit the transcriptome information already at the learning stage to appropriately adapt to the relevance of the different evidences. We show that these advantages can be translated into more accurate gene predictions. Moreover, we developed extensions of mGene.ngs to predict and quantify alternative RNA transcripts. To provide de novo genome annotations based on RNA-seq experiments, we first construct a preliminary, highly specific gene set for genes that are well-covered with RNA-seq reads. In a second step, we train predictors for genomic signals on the preliminary gene set. In the third step we train mGene.ngs, using the preliminary gene models while taking advantage of the RNA-seq read coverage and genomic signal predictions. We illustrate the power of our approach for the C. elegans genome and 50M paired-end RNA-seq reads (Illumina; 76nt). Figure 1 shows transcript level evaluation results for all annotated genes (WS200) as a function of the expression level. The ab initio mGene-based system (blue) trained on the annotation achieves an average transcript-level F-score of 49.9%. We achieve a slightly better performance (51.8%) for the de novo annotation system (green) using RNA-seq reads, but without considering the existing genome annotation. If we use the RNA-seq reads and train on the existing annotation (red), we achieve 57.6%, and can therefore take advantage of the previous annotation. We find it remarkable that for medium to high expressed genes the de novo gene predictions are as similar to the genome annotation as the predictions of the system, that has seen parts of the annotation in training. Comparing these results to predictions from the recently published method cufflinks [3] (black) reveals that cufflinks seems not to be able to appropriately adapt to the RNA-seq data at hand. Investigating the contribution of individual features we found that spliced read alignments suggesting introns help most to increase the gene prediction performance; 91.6% of the achieved total improvement is due to spliced read alignments. The read coverage alone is much less informative and only leads to improvements similar to the ones achieved with transcriptome tiling arrays. We employed the developed annotation strategy for the re-annotation of the C. briggsae genome, for which only few transcriptome sequences are available yet. We can show that the new annotation is considerably more accurate than previous ones and additionally includes alternative RNA isoforms. mGene.ngs will be released as open source software on http://mgene.org and is already available as Galaxy-based web-service at http://galaxy.fml.mpg.de. * Correspondence: [email protected] Friedrich Miescher Laboratory of the Max Planck Society, Tubingen, Germany Full list of author information is available at the end of the article Behr et al. BMC Bioinformatics 2010, 11(Suppl 10):O8 http://www.biomedcentral.com/1471-2105/11/S10/O8


RNA Biology | 2012

Multiple insert size paired-end sequencing for deconvolution of complex transcriptomes.

Lisa M. Smith; Lisa Hartmann; Philipp Drewe; Regina Bohnert; André Kahles; Christa Lanz; Gunnar Rätsch

Deep sequencing of transcriptomes allows quantitative and qualitative analysis of many RNA species in a sample, with parallel comparison of expression levels, splicing variants, natural antisense transcripts, RNA editing and transcriptional start and stop sites the ideal goal. By computational modeling, we show how libraries of multiple insert sizes combined with strand-specific, paired-end (SS-PE) sequencing can increase the information gained on alternative splicing, especially in higher eukaryotes. Despite the benefits of gaining SS-PE data with paired ends of varying distance, the standard Illumina protocol allows only non-strand-specific, paired-end sequencing with a single insert size. Here, we modify the Illumina RNA ligation protocol to allow SS-PE sequencing by using a custom pre-adenylated 3′ adaptor. We generate parallel libraries with differing insert sizes to aid deconvolution of alternative splicing events and to characterize the extent and distribution of natural antisense transcription in C. elegans. Despite stringent requirements for detection of alternative splicing, our data increases the number of intron retention and exon skipping events annotated in the Wormbase genome annotations by 127% and 121%, respectively. We show that parallel libraries with a range of insert sizes increase transcriptomic information gained by sequencing and that by current established benchmarks our protocol gives competitive results with respect to library quality.


BMC Bioinformatics | 2011

Oqtans: a Galaxy-integrated workflow for quantitative transcriptome analysis from NGS Data

Sebastian J. Schultheiss; Géraldine Jean; Jonas Behr; Regina Bohnert; Philipp Drewe; Nico Görnitz; André Kahles; Pramod Mudrakarta; Vipin T. Sreedharan; Georg Zeller; Gunnar Rätsch

First published by BioMed Central: Schultheiss, Sebastian J.; Jean, Geraldine; Behr, Jonas; Bohnert, Regina; Drewe, Philipp; Gornitz, Nico; Kahles, Andre; Mudrakarta, Pramod; Sreedharan, Vipin T.; Zeller, Georg; Ratsch, Gunnar: Oqtans: a Galaxy-integrated workflow for quantitative transcriptome analysis from NGS Data - In: BMC Bioinformatics. - ISSN 1471-2105 (online). - 12 (2011), suppl. 11, art. A7. - doi:10.1186/1471-2105-12-S11-A7.


BMC Bioinformatics | 2008

Revealing sequence variation patterns in rice with machine learning methods

Regina Bohnert; Georg Zeller; Richard M. Clark; Kevin L. Childs; Victor Jun Ulat; Renee Stokowski; Dennis G. Ballinger; Kelly A. Frazer; D. R. Cox; Richard Bruskiewich; C. Robin Buell; Jan E. Leach; Hei Leung; Kenneth L. McNally; Detlef Weigel; Gunnar Rätsch

Address: 1Friedrich Miescher Laboratory, Max Planck Society, 72076 Tubingen, Germany, 2Department of Molecular Biology, Max Planck Institute for Developmental Biology, 72076 Tubingen, Germany, 3Department of Biology, University of Utah, Salt Lake City, UT 84112, USA, 4Department of Plant Biology, Michigan State University, East Lansing, MI 48824, USA, 5International Rice Research Institute, Metro Manila, The Philippines, 6Perlegen Sciences, Inc., Mountain View, California, CA 94043, USA and 7Bioagricultural Sciences and Pest Management, Colorado State University, Colorado, CO 80523, USA

Collaboration


Dive into the Regina Bohnert's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

André Kahles

Memorial Sloan Kettering Cancer Center

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

C. Robin Buell

Michigan State University

View shared research outputs
Researchain Logo
Decentralizing Knowledge