Jon Ison
Technical University of Denmark
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Jon Ison.
Nucleic Acids Research | 2010
Steve Pettifer; Jon Ison; Matúš Kalaš; Dave Thorne; Philip McDermott; Inge Jonassen; Ali Liaquat; José María Fernández; Jose Manuel Rodriguez; David G. Pisano; Christophe Blanchet; Mahmut Uludag; Peter Rice; Edita Bartaseviciute; Kristoffer Rapacki; Maarten L. Hekkelman; Olivier Sand; Heinz Stockinger; Andrew B. Clegg; Erik Bongcam-Rudloff; Jean Salzemann; Vincent Breton; Teresa K. Attwood; Graham Cameron; Gert Vriend
The EMBRACE (European Model for Bioinformatics Research and Community Education) web service collection is the culmination of a 5-year project that set out to investigate issues involved in developing and deploying web services for use in the life sciences. The project concluded that in order for web services to achieve widespread adoption, standards must be defined for the choice of web service technology, for semantically annotating both service function and the data exchanged, and a mechanism for discovering services must be provided. Building on this, the project developed: EDAM, an ontology for describing life science web services; BioXSD, a schema for exchanging data between services; and a centralized registry (http://www.embraceregistry.net) that collects together around 1000 services developed by the consortium partners. This article presents the current status of the collection and its associated recommendations and standards definitions.
Nucleic Acids Research | 2016
Jon Ison; Kristoffer Rapacki; Hervé Ménager; Matúš Kalaš; Emil Rydza; Piotr Jaroslaw Chmura; Christian Anthon; Niall Beard; Karel Berka; Dan Bolser; Tim Booth; Anthony Bretaudeau; Jan Brezovsky; Rita Casadio; Gianni Cesareni; Frederik Coppens; Michael Cornell; Gianmauro Cuccuru; Kristian Davidsen; Gianluca Della Vedova; Tunca Doğan; Olivia Doppelt-Azeroual; Laura Emery; Elisabeth Gasteiger; Thomas Gatter; Tatyana Goldberg; Marie Grosjean; Björn Grüning; Manuela Helmer-Citterich; Hans Ienasescu
Life sciences are yielding huge data sets that underpin scientific discoveries fundamental to improvement in human health, agriculture and the environment. In support of these discoveries, a plethora of databases and tools are deployed, in technically complex and diverse implementations, across a spectrum of scientific disciplines. The corpus of documentation of these resources is fragmented across the Web, with much redundancy, and has lacked a common standard of information. The outcome is that scientists must often struggle to find, understand, compare and use the best resources for the task at hand. Here we present a community-driven curation effort, supported by ELIXIR—the European infrastructure for biological information—that aspires to a comprehensive and consistent registry of information about bioinformatics resources. The sustainable upkeep of this Tools and Data Services Registry is assured by a curation effort driven by and tailored to local needs, and shared amongst a network of engaged partners. As of November 2015, the registry includes 1785 resources, with depositions from 126 individual registrations including 52 institutional providers and 74 individuals. With community support, the registry can become a standard for dissemination of information about bioinformatics resources: we welcome everyone to join us in this common endeavour. The registry is freely available at https://bio.tools.
european conference on computational biology | 2010
Matúš Kalaš; Pæl Puntervoll; Alexandre Joseph; Edita Bartaševičiūtė; Armin Töpfer; Prabakar Venkataraman; Steve Pettifer; Jan Christian Bryne; Jon Ison; Christophe Blanchet; Kristoffer Rapacki; Inge Jonassen
Motivation: The world-wide community of life scientists has access to a large number of public bioinformatics databases and tools, which are developed and deployed using diverse technologies and designs. More and more of the resources offer programmatic web-service interface. However, efficient use of the resources is hampered by the lack of widely used, standard data-exchange formats for the basic, everyday bioinformatics data types. Results: BioXSD has been developed as a candidate for standard, canonical exchange format for basic bioinformatics data. BioXSD is represented by a dedicated XML Schema and defines syntax for biological sequences, sequence annotations, alignments and references to resources. We have adapted a set of web services to use BioXSD as the input and output format, and implemented a test-case workflow. This demonstrates that the approach is feasible and provides smooth interoperability. Semantics for BioXSD is provided by annotation with the EDAM ontology. We discuss in a separate section how BioXSD relates to other initiatives and approaches, including existing standards and the Semantic Web. Availability: The BioXSD 1.0 XML Schema is freely available at http://www.bioxsd.org/BioXSD-1.0.xsd under the Creative Commons BY-ND 3.0 license. The http://bioxsd.org web page offers documentation, examples of data in BioXSD format, example workflows with source codes in common programming languages, an updated list of compatible web services and tools and a repository of feature requests from the community. Contact: [email protected]; [email protected]; [email protected]
PLOS Biology | 2017
Julie McMurry; Nick Juty; Niklas Blomberg; Tony Burdett; Tom Conlin; Nathalie Conte; Mélanie Courtot; John Deck; Michel Dumontier; Donal Fellows; Alejandra Gonzalez-Beltran; Philipp Gormanns; Jeffrey S. Grethe; Janna Hastings; Jean-Karim Hériché; Henning Hermjakob; Jon Ison; Rafael C. Jimenez; Simon Jupp; John Kunze; Camille Laibe; Nicolas Le Novère; James Malone; María Martín; Johanna McEntyre; Chris Morris; Juha Muilu; Wolfgang Müller; Philippe Rocca-Serra; Susanna-Assunta Sansone
In many disciplines, data are highly decentralized across thousands of online databases (repositories, registries, and knowledgebases). Wringing value from such databases depends on the discipline of data science and on the humble bricks and mortar that make integration possible; identifiers are a core component of this integration infrastructure. Drawing on our experience and on work by other groups, we outline 10 lessons we have learned about the identifier qualities and best practices that facilitate large-scale data integration. Specifically, we propose actions that identifier practitioners (database providers) should take in the design, provision and reuse of identifiers. We also outline the important considerations for those referencing identifiers in various circumstances, including by authors and data generators. While the importance and relevance of each lesson will vary by context, there is a need for increased awareness about how to avoid and manage common identifier problems, especially those related to persistence and web-accessibility/resolvability. We focus strongly on web-based identifiers in the life sciences; however, the principles are broadly relevant to other disciplines.
F1000Research | 2017
Rafael C. Jimenez; Mateusz Kuzak; Monther Alhamdoosh; Michelle Barker; Bérénice Batut; Mikael Borg; Salvador Capella-Gutierrez; Neil Chue Hong; Martin Cook; Manuel Corpas; Madison Flannery; Leyla Garcia; Josep Ll. Gelpí; Simon Gladman; Carole A. Goble; Montserrat González Ferreiro; Alejandra Gonzalez-Beltran; Philippa C. Griffin; Björn Grüning; Jonas Hagberg; Petr Holub; Rob W. W. Hooft; Jon Ison; Daniel S. Katz; Brane Leskošek; Federico López Gómez; Luis J. Oliveira; David Mellor; Rowland Mosbergen; Nicola Mulder
Scientific research relies on computer software, yet software is not always developed following practices that ensure its quality and sustainability. This manuscript does not aim to propose new software development best practices, but rather to provide simple recommendations that encourage the adoption of existing best practices. Software development best practices promote better quality software, and better quality software improves the reproducibility and reusability of research. These recommendations are designed around Open Source values, and provide practical suggestions that contribute to making research software and its source code more discoverable, reusable and transparent. This manuscript is aimed at developers, but also at organisations, projects, journals and funders that can increase the quality and sustainability of research software by encouraging the adoption of these recommendations.
International Journal on Software Tools for Technology Transfer | 2016
Hervé Ménager; Matúš Kalaš; Kristoffer Rapacki; Jon Ison
The diversity and complexity of bioinformatics resources presents significant challenges to their localisation, deployment and use, creating a need for reliable systems that address these issues. Meanwhile, users demand increasingly usable and integrated ways to access and analyse data, especially within convenient, integrated “workbench” environments. Resource descriptions are the core element of registry and workbench systems, which are used to both help the user find and comprehend available software tools, data resources, and Web Services, and to localise, execute and combine them. The descriptions are, however, hard and expensive to create and maintain, because they are volatile and require an exhaustive knowledge of the described resource, its applicability to biological research, and the data model and syntax used to describe it. We present here the Workbench Integration Enabler, a software component that will ease the integration of bioinformatics resources in a workbench environment, using their description provided by the existing ELIXIR Tools and Data Services Registry.
GigaScience | 2017
Olivia Doppelt-Azeroual; Fabien Mareuil; Éric Deveaud; Matúš Kalaš; Nicola Soranzo; Marius van den Beek; Björn Grüning; Jon Ison; Hervé Ménager
Abstract Background: Bioinformaticians routinely use multiple software tools and data sources in their day-to-day work and have been guided in their choices by a number of cataloguing initiatives. The ELIXIR Tools and Data Services Registry (bio.tools) aims to provide a central information point, independent of any specific scientific scope within bioinformatics or technological implementation. Meanwhile, efforts to integrate bioinformatics software in workbench and workflow environments have accelerated to enable the design, automation, and reproducibility of bioinformatics experiments. One such popular environment is the Galaxy framework, with currently more than 80 publicly available Galaxy servers around the world. In the context of a generic registry for bioinformatics software, such as bio.tools, Galaxy instances constitute a major source of valuable content. Yet there has been, to date, no convenient mechanism to register such services en masse. Findings: We present ReGaTE (Registration of Galaxy Tools in Elixir), a software utility that automates the process of registering the services available in a Galaxy instance. This utility uses the BioBlend application program interface to extract service metadata from a Galaxy server, enhance the metadata with the scientific information required by bio.tools, and push it to the registry. Conclusions: ReGaTE provides a fast and convenient way to publish Galaxy services in bio.tools. By doing so, service providers may increase the visibility of their services while enriching the software discovery function that bio.tools provides for its users. The source code of ReGaTE is freely available on Github at https://github.com/C3BI-pasteur-fr/ReGaTE.
F1000Research | 2017
Kenzo-Hugo Hillion; Ivan Kuzmin; Anton Khodak; Eric Rasche; Michael R. Crusoe; Hedi Peterson; Jon Ison; Hervé Ménager
Workbench and workflow systems such as Galaxy, Taverna, Chipster, or Common Workflow Language (CWL)-based frameworks, facilitate the access to bioinformatics tools in a user-friendly, scalable and reproducible way. Still, the integration of tools in such environments remains a cumbersome, time consuming and error-prone process. A major consequence is the incomplete or outdated description of tools that are often missing important information, including parameters and metadata such as publication or links to documentation. ToolDog (Tool DescriptiOn Generator) facilitates the integration of tools - which have been registered in the ELIXIR tools registry (https://bio.tools) - into workbench environments by generating tool description templates. ToolDog includes two modules. The first module analyses the source code of the bioinformatics software with language-specific plugins, and generates a skeleton for a Galaxy XML or CWL tool description. The second module is dedicated to the enrichment of the generated tool description, using metadata provided by bio.tools. This last module can also be used on its own to complete or correct existing tool descriptions with missing metadata.
F1000Research | 2013
Matúš Kalaš; Edita Karosiene; László Kaján; Sveinung Gundersen; Jon Ison; Pål Puntervoll; Christophe Blanchet; Kristoffer Rapacki; Inge Jonassen
1 Computational Biology Unit, Uni Computing and 2 Department of Informatics, University of Bergen, Bergen, Norway; 3 Center for Biological Sequence Analysis, Technical University of Denmark, Kongens Lyngby, Denmark; 4 Bioinformatics and Computational Biology Department, Technische Universitat Munchen, Garching, Germany; Institute for Cancer research, Oslo University Hospital, Oslo, Norway; European Bioinformatics Institute, EMBL, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK; 7 Institut de Biologie et Chimie des Proteines, CNRS and Universite Claude Bernard Lyon 1, Lyon, France. BioXSD An XML Schema for sequence data, features, alignments, and identifiers
Archive | 2015
Julie McMurry; Juha Muilu; Michel Dumontier; Henning Hermjakob; Nathalie Conte; Philipp Gormanns; Murat Sariyar; Janna Hastings; Alejandra Gonzalez-Beltran; Niklas Blomberg; Chris Morris; Jean-Karim Hériché; Melissa Haendel; Rafael C. Jimenez; Tony Burdett; Philippe Rocca-Serra; Nicolas Le Novère; Nick Juty; Katherine Wolstencroft; Simon Jupp; Wolfgang Müller; Donal Fellows; María Martín; Neil Swainston; Helen Parkinson; Carole A. Goble; Johanna McEntyre; Camille Laibe; Jacky L. Snoep; Nicole L. Washington