Korbinian Grote
Genomatix
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Korbinian Grote.
Bioinformatics | 2005
K. Cartharius; Kornelie Frech; Korbinian Grote; Bernward Klocke; Manuela Haltmeier; Andreas Klingenhoff; Matthias Frisch; M. Bayerlein; Thomas Werner
MOTIVATION Promoter analysis is an essential step on the way to identify regulatory networks. A prerequisite for successful promoter analysis is the prediction of potential transcription factor binding sites (TFBS) with reasonable accuracy. The next steps in promoter analysis can be tackled only with reliable predictions, e.g. finding phylogenetically conserved patterns or identifying higher order combinations of sites in promoters of co-regulated genes. RESULTS We present a new version of the program MatInspector that identifies TFBS in nucleotide sequences using a large library of weight matrices. By introducing a matrix family concept, optimized thresholds, and comparative analysis, the enhanced program produces concise results avoiding redundant and false-positive matches. We describe a number of programs based on MatInspector allowing in-depth promoter analysis (DiAlignTF, FrameWorker) and targeted design of regulatory sequences (SequenceShaper).
Plant Physiology | 2004
Stephen Rudd; Matthias Frisch; Korbinian Grote; Blake C. Meyers; Klaus F. X. Mayer; Thomas Werner
We carried out a genome-wide prediction of scaffold/matrix attachment regions (S/MARs) in Arabidopsis. Results indicate no uneven distribution on the chromosomal level but a clear underrepresentation of S/MARs inside genes. In cases where S/MARs were predicted within genes, these intragenic S/MARs were preferentially located within the 5′-half, most prominently within introns 1 and 2. Using Arabidopsis whole-genome expression data generated by the massively parallel signature sequencing methodology, we found a negative correlation between S/MAR-containing genes and transcriptional abundance. Expressed sequence tag data correlated the same way with S/MAR-containing genes. Thus, intragenic S/MARs show a negative correlation with transcription level. For various genes it has been shown experimentally that S/MARs can function as transcriptional regulators and that they have an implication in stabilizing expression levels within transgenic plants. On the basis of a genome-wide in silico S/MAR analysis, we found a significant correlation between the presence of intragenic S/MARs and transcriptional down-regulation.
Nucleic Acids Research | 2005
Holger Maier; Stefanie Döhr; Korbinian Grote; Sean O'Keeffe; Thomas Werner; Martin Hrabé de Angelis; R. Schneider
The LitMiner software is a literature data-mining tool that facilitates the identification of major gene regulation key players related to a user-defined field of interest in PubMed abstracts. The prediction of gene-regulatory relationships is based on co-occurrence analysis of key terms within the abstracts. LitMiner predicts relationships between key terms from the biomedical domain in four categories (genes, chemical compounds, diseases and tissues). Owing to the limitations (no direction, unverified automatic prediction) of the co-occurrence approach, the primary data in the LitMiner database represent postulated basic gene–gene relationships. The usefulness of the LitMiner system has been demonstrated recently in a study that reconstructed disease-related regulatory networks by promoter modelling that was initiated by a LitMiner generated primary gene list. To overcome the limitations and to verify and improve the data, we developed WikiGene, a Wiki-based curation tool that allows revision of the data by expert users over the Internet. LitMiner () and WikiGene () can be used unrestricted with any Internet browser.
Bioinformatics | 1996
Kerstin Quandt; Korbinian Grote; Thomas Werner
MOTIVATION Most of the sequences determined in current genome sequencing projects remain at least partially unannotated. The available software for DNA sequence analysis is usually limited to the prediction of individual elements (level 1 methods), but does not assess the context of different motifs. However, the functionality of biological units like promoters depends on the correct spatial organization of multiple individual elements. RESULTS Here, we present a second-level software package called GenomeInspector [[http:@www.gsf.de/biodv/genomeinspector.html ]], for further analysis of results obtained with level 1 methods (e.g. MatInspector [[http:@www.gsf.de/biodv/matinspector.html ]] or ConsInspector [[http:@www.gsf.de/biodv/consinspector.html++ +]]). One of the main features of this modular program is its ability to assess distance correlations between large sets of sequence elements which can be used for the identification and definition of basic patterns of functional units. The program provides an easy-to-use graphical user interface with direct comprehensive display of all results for megabase sequences. Sequence elements showing spatial correlations can be easily extracted and traced back to the nucleotide sequence with the program. GenomeInspector identified promoters of glycolytic enzymes in yeast [[http:@www.mips.biochem.mpg.de/mips/yeast/]] as members of a subgroup with unusual location of an ABF1 site. Solely on the basis of distance correlation analysis, the program correctly selected those transcription factors within these promoters already known to be involved in the regulation of glycolytic enzymes, demonstrating the power of this method.
Proceedings of the National Academy of Sciences of the United States of America | 2015
Antti Honkela; Jaakko Peltonen; Hande Topa; Iryna Charapitsa; Filomena Matarese; Korbinian Grote; Hendrik G. Stunnenberg; George Reid; Neil D. Lawrence; Magnus Rattray
Significance Gene transcription is a highly regulated dynamic process. Delays in transcription have important consequences on dynamics of gene expression and consequently on downstream biological function. We model temporal dynamics of transcription using genome-wide time course data measuring transcriptional activity and mRNA concentration. We find a significant number of genes exhibit a long RNA processing delay between transcription termination and mRNA production. These long processing delays are more common for short genes, which would otherwise be expected to transcribe most rapidly. The distribution of intronic reads suggests that these delays are required for splicing to be completed. Understanding such delays is essential for understanding how a rapid cellular response is regulated. Genes with similar transcriptional activation kinetics can display very different temporal mRNA profiles because of differences in transcription time, degradation rate, and RNA-processing kinetics. Recent studies have shown that a splicing-associated RNA production delay can be significant. To investigate this issue more generally, it is useful to develop methods applicable to genome-wide datasets. We introduce a joint model of transcriptional activation and mRNA accumulation that can be used for inference of transcription rate, RNA production delay, and degradation rate given data from high-throughput sequencing time course experiments. We combine a mechanistic differential equation model with a nonparametric statistical modeling approach allowing us to capture a broad range of activation kinetics, and we use Bayesian parameter estimation to quantify the uncertainty in estimates of the kinetic parameters. We apply the model to data from estrogen receptor α activation in the MCF-7 breast cancer cell line. We use RNA polymerase II ChIP-Seq time course data to characterize transcriptional activation and mRNA-Seq time course data to quantify mature transcripts. We find that 11% of genes with a good signal in the data display a delay of more than 20 min between completing transcription and mature mRNA production. The genes displaying these long delays are significantly more likely to be short. We also find a statistical association between high delay and late intron retention in pre-mRNA data, indicating significant splicing-associated production delays in many genes.
Proceedings of the National Academy of Sciences of the United States of America | 2007
Thomas K. Albert; Korbinian Grote; Stefan Boeing; Gertraud Stelzer; Aloys Schepers; Michael Meisterernst
Negative cofactor 2 (NC2) forms a stable complex with TATA-binding protein (TBP) on promoters in vitro. Its association with TBP prevents the binding of TFIIB and leads to inhibition of preinitiation complex formation. Here, we investigate the association of NC2 subunit-α with human RNA polymerase II promoter regions by using gene-specific ChIP and genome-wide promoter ChIPchip analyses. We find NC2α associated with a large number of human promoters, where it peaks close to the core regions. NC2 occupancy in vivo positively correlates with mRNA levels, which perhaps reflects its capacity to stabilize TBP on promoter regions. In single gene analyses, we confirm core promoter binding and in addition map the NC2 complex to enhancer proximal regions. High-occupancy histone genes display a stable NC2/TFIIB ratio during the cell cycle, which otherwise varies markedly from one gene to another. The latter is at least in part explained by an observed negative correlation of NC2 occupancy with the presence of the TFIIB recognition element in core promoter regions. Our data establish the genome-wide basis for general and gene-specific functions of NC2 in mammalian cells.
EBioMedicine | 2015
Gyorgy Petrovics; Hua Li; Tanja Stümpel; Shyh-Han Tan; Denise Young; Shilpa Katta; Qiyuan Li; Kai Ying; Bernward Klocke; Lakshmi Ravindranath; Indu Kohaar; Yongmei Chen; Dezso Ribli; Korbinian Grote; Hua Zou; Joseph Cheng; Clifton L. Dalgard; Shimin Zhang; István Csabai; Jacob Kagan; David Y. Takeda; Massimo Loda; Sudhir Srivastava; Matthias Scherf; Martin Seifert; Timo Gaiser; David G. McLeod; Zoltan Szallasi; Reinhard Ebner; Thomas Werner
Evaluation of cancer genomes in global context is of great interest in light of changing ethnic distribution of the world population. We focused our study on men of African ancestry because of their disproportionately higher rate of prostate cancer (CaP) incidence and mortality. We present a systematic whole genome analyses, revealing alterations that differentiate African American (AA) and Caucasian American (CA) CaP genomes. We discovered a recurrent deletion on chromosome 3q13.31 centering on the LSAMP locus that was prevalent in tumors from AA men (cumulative analyses of 435 patients: whole genome sequence, 14; FISH evaluations, 101; and SNP array, 320 patients). Notably, carriers of this deletion experienced more rapid disease progression. In contrast, PTEN and ERG common driver alterations in CaP were significantly lower in AA prostate tumors compared to prostate tumors from CA. Moreover, the frequency of inter-chromosomal rearrangements was significantly higher in AA than CA tumors. These findings reveal differentially distributed somatic mutations in CaP across ancestral groups, which have implications for precision medicine strategies.
Methods | 2013
Jochen Supper; Claudia Gugenmus; Johannes Wollnik; Tanja Drueke; Matthias Scherf; Alexander Hahn; Korbinian Grote; Nancy Bretschneider; Bernward Klocke; Christian Zinser; Kerstin Cartharius; Martin Seifert
In recent years, gene fusions have gained significant recognition as biomarkers. They can assist treatment decisions, are seldom found in normal tissue and are detectable through Next-generation sequencing (NGS) of the transcriptome (RNA-seq). To transform the data provided by the sequencer into robust gene fusion detection several analysis steps are needed. Usually the first step is to map the sequenced transcript fragments (RNA-seq) to a reference genome. One standard application of this approach is to estimate expression and detect variants within known genes, e.g. SNPs and indels. In case of gene fusions, however, completely novel gene structures have to be detected. Here, we describe the detection of such gene fusion events based on our comprehensive transcript annotation (ElDorado). To demonstrate the utility of our approach, we extract gene fusion candidates from eight breast cancer cell lines, which we compare to experimentally verified gene fusions. We discuss several gene fusion events, like BCAS3-BCAS4 that was only detected in the breast cancer cell line MCF7. As supporting evidence we show that gene fusions occur more frequently in copy number enriched regions (CNV analysis). In addition, we present the Transcriptome Viewer (TViewer) a tool that allows to interactively visualize gene fusions. Finally, we support detected gene fusions through literature mining based annotations and network analyses. In conclusion, we present a platform that allows detecting gene fusions and supporting them through literature knowledge as well as rich visualization capabilities. This enables scientists to better understand molecular processes, biological functions and disease associations, which will ultimately lead to better biomedical knowledge for the development of biomarkers for diagnostics and therapies.
Genome Biology | 2010
Thomas K. Albert; Korbinian Grote; Stefan Boeing; Michael Meisterernst
BackgroundThe general transcription factor TFIIB and its antagonist negative cofactor 2 (NC2) are hallmarks of RNA polymerase II (RNAPII) transcription. Both factors bind TATA box-binding protein (TBP) at promoters in a mutually exclusive manner. Dissociation of NC2 is thought to be followed by TFIIB association and subsequent preinitiation complex formation. TFIIB dissociates upon RNAPII promoter clearance, thereby providing a specific measure for steady-state preinitiation complex levels. As yet, genome-scale promoter mapping of human TFIIB has not been reported. It thus remains elusive how human core promoters contribute to preinitiation complex formation in vivo.ResultsWe compare target genes of TFIIB and NC2 in human B cells and analyze associated core promoter architectures. TFIIB occupancy is positively correlated with gene expression, with the vast majority of promoters being GC-rich and lacking defined core promoter elements. TATA elements, but not the previously in vitro defined TFIIB recognition elements, are enriched in some 4 to 5% of the genes. NC2 binds to a highly related target gene set. Nonetheless, subpopulations show strong variations in factor ratios: whereas high TFIIB/NC2 ratios select for promoters with focused start sites and conserved core elements, high NC2/TFIIB ratios correlate to multiple start-site promoters lacking defined core elements.ConclusionsTFIIB and NC2 are global players that occupy active genes. Preinitiation complex formation is independent of core elements at the majority of genes. TATA and TATA-like elements dictate TFIIB occupancy at a subset of genes. Biochemical data support a model in which preinitiation complex but not TBP-NC2 complex formation is regulated.
PLOS Computational Biology | 2013
Kristian Ovaska; Filomena Matarese; Korbinian Grote; Iryna Charapitsa; Alejandra Cervera; Chengyu Liu; George Reid; Martin Seifert; Hendrik G. Stunnenberg; Sampsa Hautaniemi
Identification of responsive genes to an extra-cellular cue enables characterization of pathophysiologically crucial biological processes. Deep sequencing technologies provide a powerful means to identify responsive genes, which creates a need for computational methods able to analyze dynamic and multi-level deep sequencing data. To answer this need we introduce here a data-driven algorithm, SPINLONG, which is designed to search for genes that match the user-defined hypotheses or models. SPINLONG is applicable to various experimental setups measuring several molecular markers in parallel. To demonstrate the SPINLONG approach, we analyzed ChIP-seq data reporting PolII, estrogen receptor (), H3K4me3 and H2A.Z occupancy at five time points in the MCF-7 breast cancer cell line after estradiol stimulus. We obtained 777 early responsive genes and compared the biological functions of the genes having binding within 20 kb of the transcription start site (TSS) to genes without such binding site. Our results show that the non-genomic action of via the MAPK pathway, instead of direct binding, may be responsible for early cell responses to activation. Our results also indicate that the responsive genes triggered by the genomic pathway are transcribed faster than those without binding sites. The survival analysis of the 777 responsive genes with 150 primary breast cancer tumors and in two independent validation cohorts indicated the ATAD3B gene, which does not have binding site within 20 kb of its TSS, to be significantly associated with poor patient survival.