Saqib Mir
Heidelberg Institute for Theoretical Studies
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Saqib Mir.
Nucleic Acids Research | 2012
Ulrike Wittig; Renate Kania; Martin Golebiewski; Maja Rey; Lei Shi; Lenneke Jong; Enkhjargal Algaa; Andreas Weidemann; Heidrun Sauer-Danzwith; Saqib Mir; Olga Krebs; Meik Bittkowski; Isabel Rojas; Wolfgang Müller
SABIO-RK (http://sabio.h-its.org/) is a web-accessible database storing comprehensive information about biochemical reactions and their kinetic properties. SABIO-RK offers standardized data manually extracted from the literature and data directly submitted from lab experiments. The database content includes kinetic parameters in relation to biochemical reactions and their biological sources with no restriction on any particular set of organisms. Additionally, kinetic rate laws and corresponding equations as well as experimental conditions are represented. All the data are manually curated and annotated by biological experts, supported by automated consistency checks. SABIO-RK can be accessed via web-based user interfaces or automatically via web services that allow direct data access by other tools. Both interfaces support the export of the data together with its annotations in SBML (Systems Biology Markup Language), e.g. for import in modelling tools.
data integration in the life sciences | 2006
Ulrike Wittig; Martin Golebiewski; Renate Kania; Olga Krebs; Saqib Mir; Andreas Weidemann; Stefanie Anstein; Jasmin Saric; Isabel Rojas
Simulating networks of biochemical reactions require reliable kinetic data. In order to facilitate the access to such kinetic data we have developed SABIO-RK, a curated database with information about biochemical reactions and their kinetic properties. The data are manually extracted from literature and verified by curators, concerning standards, formats and controlled vocabularies. This process is supported by tools in a semi-automatic manner. SABIO-RK contains and merges information about reactions such as reactants and modifiers, organism, tissue and cellular location, as well as the kinetic properties of the reactions. The type of the kinetic mechanism, modes of inhibition or activation, and corresponding rate equations are presented together with their parameters and measured values, specifying the experimental conditions under which these were determined. Links to other databases enable the user to gather further information and to refer to the original publication. Information about reactions and their kinetic data can be exported to an SBML file, allowing users to employ the information as the basis for their simulation models.
BMC Systems Biology | 2007
Isabel Rojas; Martin Golebiewski; Renate Kania; Olga Krebs; Saqib Mir; Andreas Weidemann; Ulrike Wittig
Systems biology is an emerging field that aims at obtaining a system-level understanding of biological processes. The modelling and simulation of networks of biochemical reactions have great and promising application potential but require reliable kinetic data. In order to support the systems biology community with such data we have developed SABIO-RK (System for the Analysis of Biochemical Pathways - Reaction Kinetics), a curated database with information about biochemical reactions and their kinetic properties, which allows researchers to obtain and compare kinetic data and to integrate them into models of biochemical networks. SABIO-RK is freely available for academic use at http://sabio.villa-bosch.de/SABIORK/.
FEBS Journal | 2010
Neil Swainston; Martin Golebiewski; Hanan L. Messiha; Naglis Malys; Renate Kania; Sylvestre Kengne; Olga Krebs; Saqib Mir; Heidrun Sauer-Danzwith; Kieran Smallbone; Andreas Weidemann; Ulrike Wittig; Douglas B. Kell; Pedro Mendes; Wolfgang Müller; Norman W. Paton; Isabel Rojas
A limited number of publicly available resources provide access to enzyme kinetic parameters. These have been compiled through manual data mining of published papers, not from the original, raw experimental data from which the parameters were calculated. This is largely due to the lack of software or standards to support the capture, analysis, storage and dissemination of such experimental data. Introduced here is an integrative system to manage experimental enzyme kinetics data from instrument to browser. The approach is based on two interrelated databases: the existing SABIO‐RK database, containing kinetic data and corresponding metadata, and the newly introduced experimental raw data repository, MeMo‐RK. Both systems are publicly available by web browser and web service interfaces and are configurable to ensure privacy of unpublished data. Users of this system are provided with the ability to view both kinetic parameters and the experimental raw data from which they are calculated, providing increased confidence in the data. A data analysis and submission tool, the kineticswizard, has been developed to allow the experimentalist to perform data collection, analysis and submission to both data resources. The system is designed to be extensible, allowing integration with other manufacturer instruments covering a range of analytical techniques.
Journal of Integrative Bioinformatics | 2007
Olga Krebs; Martin Golebiewski; Renate Kania; Saqib Mir; Jasmin Saric; Andreas Weidemann; Ulrike Wittig; Isabel Rojas
Abstract Systems biology is an emerging field that aims at obtaining a system-level understanding of biological processes. The modelling and simulation of networks of biochemical reactions have great and promising application potential but require reliable kinetic data. In order to support the systems biology community with such data we have developed SABIO-RK (System for the Analysis of Biochemical Pathways - Reaction Kinetics), a curated database with information about biochemical reactions and their kinetic properties, which allows researchers to obtain and compare kinetic data and to integrate them into models of biochemical networks. SABIO-RK is freely available for academic use at http://sabio.villa-bosch.de/SABIORK/.
international semantic web conference | 2010
Saqib Mir; Steffen Staab; Isabel Rojas
In the Linked Open Data cloud one of the largest data sets, comprising of 2.5 billion triples, is derived from the Life Science domain. Yet this represents a small fraction of the total number of publicly available data sources on the Web. We briefly describe past attempts to transform specific Life Science sources from a plethora of open as well as proprietary formats into RDF data. In particular, we identify and tackle two bottlenecks in current practice: Acquiring ontologies to formally describe these data and creating “RDFizer” programs to convert data from legacy formats into RDF. We propose an unsupervised method, based on transformation rules, for performing these two key tasks, which makes use of our previous work on unsupervised wrapper induction for extracting labelled data from complete Life Science Web sites. We apply our approach to 13 real-world online Life Science databases. The learned ontologies are evaluated by domain experts as well as against gold standard ontologies. Furthermore, we compare the learned ontologies against ontologies that are “lifted” directly from the underlying relational schema using an existing unsupervised approach. Finally, we apply our approach to three online databases to extract RDF data. Our results indicate that this approach can be used to bootstrap and speed up the migration of life science data into the Linked Open Data cloud.
data integration in the life sciences | 2009
Saqib Mir; Steffen Staab; Isabel Rojas
We present a novel approach to automatic information extraction from Deep Web Life Science databases using wrapper induction. Traditional wrapper induction techniques focus on learning wrappers based on examples from one class of Web pages, i.e. from Web pages that are all similar in structure and content. Thereby, traditional wrapper induction targets the understanding of Web pages generated from a database using the same generation template as observed in the example set. However, Life Science Web sites typically contain structurally diverse web pages from multiple classes making the problem more challenging. Furthermore, we observed that such Life Science Web sites do not just provide mere data, but they also tend to provide schema information in terms of data labels --- giving further cues for solving the Web site wrapping task. Our solution to this novel challenge of Site-Wide wrapper induction consists of a sequence of steps: 1. classification of similar Web pages into classes, 2. discovery of these classes and 3. wrapper induction for each class. Our approach thus allows us to perform unsupervised information retrieval from across an entire Web site. We test our algorithm against three real-world biochemical deep Web sources and report our preliminary results, which are very promising.
BMC Systems Biology | 2007
Martin Golebiewski; Saqib Mir; Renate Kania; Olga Krebs; Andreas Weidemann; Ulrike Wittig; Isabel Rojas
Systems biology deals with analyzing and predicting the behavior of complex biological systems like cells, organisms or even whole ecosystems. This requires qualitative information about the interplay of genes, proteins, chemical compounds and biochemical reactions, but also calls for quantitative data describing the dynamics of these networks. These data have to be collected, systematically structured and made accessible for the set-up of biochemical model simulations.
Nucleic Acids Research | 2018
Stephen K. Burley; Helen M. Berman; Charmi Bhikadiya; Chunxiao Bi; Li Chen; Luigi Di Costanzo; Cole Christie; Jose M. Duarte; Shuchismita Dutta; Zukang Feng; Sutapa Ghosh; David S. Goodsell; Rachel Kramer Green; Vladimir Guranovic; Dmytro Guzenko; Brian P. Hudson; Yuhe Liang; Robert Lowe; Ezra Peisach; Irina Periskova; Chris Randle; Alexander S. Rose; Monica Sekharan; Chenghua Shao; Yi-Ping Tao; Yana Valasatava; Maria Voigt; John D. Westbrook; Jasmine Young; Christine Zardecki
Abstract The Protein Data Bank (PDB) is the single global archive of experimentally determined three-dimensional (3D) structure data of biological macromolecules. Since 2003, the PDB has been managed by the Worldwide Protein Data Bank (wwPDB; wwpdb.org), an international consortium that collaboratively oversees deposition, validation, biocuration, and open access dissemination of 3D macromolecular structure data. The PDB Core Archive houses 3D atomic coordinates of more than 144 000 structural models of proteins, DNA/RNA, and their complexes with metals and small molecules and related experimental data and metadata. Structure and experimental data/metadata are also stored in the PDB Core Archive using the readily extensible wwPDB PDBx/mmCIF master data format, which will continue to evolve as data/metadata from new experimental techniques and structure determination methods are incorporated by the wwPDB. Impacts of the recently developed universal wwPDB OneDep deposition/validation/biocuration system and various methods-specific wwPDB Validation Task Forces on improving the quality of structures and data housed in the PDB Core Archive are described together with current challenges and future plans.
in Silico Biology | 2007
Isabel Rojas; Martin Golebiewski; Renate Kania; Olga Krebs; Saqib Mir; Andreas Weidemann; Ulrike Wittig