Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Sabine Dietmann is active.

Publication


Featured researches published by Sabine Dietmann.


Nucleic Acids Research | 2001

A fully automatic evolutionary classification of protein folds: Dali Domain Dictionary version 3

Sabine Dietmann; Jong Park; Cedric Notredame; Andreas Heger; Michael Lappe; Liisa Holm

The Dali Domain Dictionary (http://www.ebi.ac.uk/dali/domain) is a numerical taxonomy of all known structures in the Protein Data Bank (PDB). The taxonomy is derived fully automatically from measurements of structural, functional and sequence similarities. Here, we report the extension of the classification to match the traditional four hierarchical levels corresponding to: (i) supersecondary structural motifs (attractors in fold space), (ii) the topology of globular domains (fold types), (iii) remote homologues (functional families) and (iv) homologues with sequence identity above 25% (sequence families). The computational definitions of attractors and functional families are new. In September 2000, the Dali classification contained 10 531 PDB entries comprising 17 101 chains, which were partitioned into five attractor regions, 1375 fold types, 2582 functional families and 3724 domain sequence families. Sequence families were further associated with 99 582 unique homologous sequences in the HSSP database, which increases the number of effectively known structures several-fold. The resulting database contains the description of protein domain architecture, the definition of structural neighbours around each known structure, the definition of structurally conserved cores and a comprehensive library of explicit multiple alignments of distantly related protein families.


Nature Structural & Molecular Biology | 2001

Identification of homology in protein structure classification

Sabine Dietmann; Liisa Holm

Structural biology and structural genomics are expected to produce many three-dimensional protein structures in the near future. Each new structure raises questions about its function and evolution. Correct functional and evolutionary classification of a new structure is difficult for distantly related proteins and error-prone using simple statistical scores based on sequence or structure similarity. Here we present an accurate numerical method for the identification of evolutionary relationships (homology). The method is based on the principle that natural selection maintains structural and functional continuity within a diverging protein family. The problem of different rates of structural divergence between different families is solved by first using structural similarities to produce a global map of folds in protein space and then further subdividing fold neighborhoods into superfamilies based on functional similarities. In a validation test against a classification by human experts (SCOP), 77% of homologous pairs were identified with 92% reliability. The method is fully automated, allowing fast, self-consistent and complete classification of large numbers of protein structures. In particular, the discrimination between analogy and homology of close structural neighbors will lead to functional predictions while avoiding overprediction.


Proteomics | 2009

PPI spider: A tool for the interpretation of proteomics data in the context of protein–protein interaction networks

Alexey V. Antonov; Sabine Dietmann; Igor V. Rodchenkov; Hans W. Mewes

Recent advances in experimental technologies allow for the detection of a complete cell proteome. Proteins that are expressed at a particular cell state or in a particular compartment as well as proteins with differential expression between various cells states are commonly delivered by many proteomics studies. Once a list of proteins is derived, a major challenge is to interpret the identified set of proteins in the biological context. Protein–protein interaction (PPI) data represents abundant information that can be employed for this purpose. However, these data have not yet been fully exploited due to the absence of a methodological framework that can integrate this type of information. Here, we propose to infer a network model from an experimentally identified protein list based on the available information about the topology of the global PPI network. We propose to use a Monte Carlo simulation procedure to compute the statistical significance of the inferred models. The method has been implemented as a freely available web‐based tool, PPI spider (http://mips.helmholtz‐muenchen.de/proj/ppispider). To support the practical significance of PPI spider, we collected several hundreds of recently published experimental proteomics studies that reported lists of proteins in various biological contexts. We reanalyzed them using PPI spider and demonstrated that in most cases PPI spider could provide statistically significant hypotheses that are helpful for understanding of the protein list.


Genome Biology | 2008

KEGG spider: interpretation of genomics data in the context of the global gene metabolic network

Alexey V. Antonov; Sabine Dietmann; Hans W. Mewes

KEGG spider is a web-based tool for interpretation of experimentally derived gene lists in order to gain understanding of metabolism variations at a genomic level. KEGG spider implements a pathway-free framework that overcomes a major bottleneck of enrichment analyses: it provides global models uniting genes from different metabolic pathways. Analyzing a number of experimentally derived gene lists, we demonstrate that KEGG spider provides deeper insights into metabolism variations in comparison to existing methods.


Nucleic Acids Research | 2010

R spider: a network-based analysis of gene lists by combining signaling and metabolic pathways from Reactome and KEGG databases

Alexey V. Antonov; Esther Schmidt; Sabine Dietmann; Maria Krestyaninova; Henning Hermjakob

R spider is a web-based tool for the analysis of a gene list using the systematic knowledge of core pathways and reactions in human biology accumulated in the Reactome and KEGG databases. R spider implements a network-based statistical framework, which provides a global understanding of gene relations in the supplied gene list, and fully exploits the Reactome and KEGG knowledge bases. R spider provides a user-friendly dialog-driven web interface for several model organisms and supports most available gene identifiers. R spider is freely available at http://mips.helmholtz-muenchen.de/proj/rspider.


FEBS Journal | 2009

TICL – a web tool for network‐based interpretation of compound lists inferred by high‐throughput metabolomics

Alexey V. Antonov; Sabine Dietmann; Philip Wong; Hans W. Mewes

High‐throughput metabolomics is a dynamically developing technology that enables the mass separation of complex mixtures at very high resolution. Metabolic profiling has begun to be widely used in clinical research to study the molecular mechanisms of complex cell disorders. Similar to transcriptomics, which is capable of detecting genes at differential states, metabolomics is able to deliver a list of compounds differentially present between explored cell physiological conditions. The bioinformatics challenge lies in a statistically valid interpretation of the functional context for identified sets of metabolites. Here, we present TICL, a web tool for the automatic interpretation of lists of compounds. The major advance of TICL is that it not only provides a model of possible compound transformations related to the input list, but also implements a robust statistical framework to estimate the significance of the inferred model. The TICL web tool is freely accessible at http://mips.helmholtz‐muenchen.de/proj/cmp.


Current Opinion in Structural Biology | 2002

Automated detection of remote homology

Sabine Dietmann; Narcis Fernandez-Fuentes; Liisa Holm

The classification of a newly identified protein as a member of a superfamily is important for focusing experiments on its most likely functions. Such classification, often performed by hand, has now been fully automated. This sophisticated new approach takes into account not only alignment scores but also a number of other computable attributes, such as functional sites deduced from sequence conservation patterns.


Journal of Proteome Research | 2009

PLIPS, an automatically collected database of protein lists reported by proteomics studies.

Alexey V. Antonov; Sabine Dietmann; Philip Wong; Rodchenkov Igor; Hans W. Mewes

The spectrum of problems covered by proteomics studies range from the discovery of compartment specific cell proteomes to clinical applications, including the identification of diagnostic markers and monitoring the effects of drug treatments. In most cases, the ultimate results of a proteomics study are lists of proteins found to be present (or differentially present) at cell physiological conditions under study. Normally, the results are published directly in the article in one or several tables. In many cases, this type of information remains disseminated in hundreds of proteomics publications. We have developed a Web mining tool which allows the collection of this information by searching through full text papers and automatically selecting tables, which report a list of protein identifiers. By searching through major proteomics journals, we have collected approximately 800 independent studies published recently, which reported about 1000 different protein lists. On the basis of this data, we developed a computational tool PLIPS (Protein Lists Identified in Proteomics Studies). PLIPS accepts as input a list of protein/gene identifiers. With the use of statistical analyses, PLIPS infers recently published proteomics studies, which report protein lists that significantly intersect with a query list. PLIPS is a freely available Web-based tool ( http://mips.helmholtz-muenchen.de/proj/plips ).


Bioinformatics | 2000

Estimating the significance of sequence order in protein secondary structure and prediction

Jong Park; Sabine Dietmann; Andreas Heger; Liisa Holm

MOTIVATIONnHow critical is the sequence order information in predicting protein secondary structure segments? We tried to get a rough insight on it from a theoretical approach using both a prediction algorithm and structural fragments from Protein Databank (PDB).nnnRESULTSnUsing reverse protein sequences and PDB structural fragments, we theoretically estimated the significance of the order for protein secondary structure and prediction. On average: (1) 79% of protein sequence segments resulted in the same prediction in both normal and reverse directions, which indicated a relatively high conservation of secondary structure propensity in the reverse direction; (2) the reversed sequence prediction alone performed less accurately than the normal forward sequence prediction, but comparably high (2% difference); (3) the commonly predicted regions showed a slightly higher prediction accuracy (4%) than the normal sequences prediction; and (4) structural fragments which have counterparts in reverse direction in the same protein showed a comparable degree of secondary structure conservation (73% identity with reversed structures on average for pentamers)[email protected]; [email protected]; [email protected]; [email protected]


Trends in Genetics | 2001

Tools for the modern biologist

Michael D.R Croning; Sabine Dietmann; Steffen Möller

Developing Bioinformatics Computer Skillsby Cynthia Gibas and Per Jambeck OReilly & Associates, 2001.

Collaboration


Dive into the Sabine Dietmann's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Liisa Holm

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Jong Park

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Alvis Brazma

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar

Esther Schmidt

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar

Henning Hermjakob

European Bioinformatics Institute

View shared research outputs
Researchain Logo
Decentralizing Knowledge