Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Graham Klyne is active.

Publication


Featured researches published by Graham Klyne.


PLOS Computational Biology | 2009

Adventures in Semantic Publishing: Exemplar Semantic Enhancements of a Research Article

David M. Shotton; Katie Portwin; Graham Klyne; Alistair Miles

Scientific innovation depends on finding, integrating, and re-using the products of previous research. Here we explore how recent developments in Web technology, particularly those related to the publication of data and metadata, might assist that process by providing semantic enhancements to journal articles within the mainstream process of scholarly journal publishing. We exemplify this by describing semantic enhancements we have made to a recent biomedical research article taken from PLoS Neglected Tropical Diseases, providing enrichment to its content and increased access to datasets within it. These semantic enhancements include provision of live DOIs and hyperlinks; semantic markup of textual terms, with links to relevant third-party information resources; interactive figures; a re-orderable reference list; a document summary containing a study summary, a tag cloud, and a citation analysis; and two novel types of semantic enrichment: the first, a Supporting Claims Tooltip to permit “Citations in Context”, and the second, Tag Trees that bring together semantically related terms. In addition, we have published downloadable spreadsheets containing data from within tables and figures, have enriched these with provenance information, and have demonstrated various types of data fusion (mashups) with results from other research articles and with Google Maps. We have also published machine-readable RDF metadata both about the article and about the references it cites, for which we developed a Citation Typing Ontology, CiTO (http://purl.org/net/cito/). The enhanced article, which is available at http://dx.doi.org/10.1371/journal.pntd.0000228.x001, presents a compelling existence proof of the possibilities of semantic publication. We hope the showcase of examples and ideas it contains, described in this paper, will excite the imaginations of researchers and publishers, stimulating them to explore the possibilities of semantic publishing for their own research articles, and thereby break down present barriers to the discovery and re-use of information within traditional modes of scholarly communication.


international conference on e-science | 2012

Why workflows break — Understanding and combating decay in Taverna workflows

Jun Zhao; José Manuél Gómez-Pérez; Khalid Belhajjame; Graham Klyne; Esteban García-Cuesta; Aleix Garrido; Kristina M. Hettne; Marco Roos; David De Roure; Carole A. Goble

Workflows provide a popular means for preserving scientific methods by explicitly encoding their process. However, some of them are subject to a decay in their ability to be re-executed or reproduce the same results over time, largely due to the volatility of the resources required for workflow executions. This paper provides an analysis of the root causes of workflow decay based on an empirical study of a collection of Taverna workflows from the myExperiment repository. Although our analysis was based on a specific type of workflow, the outcomes and methodology should be applicable to workflows from other systems, at least those whose executions also rely largely on accessing third-party resources. Based on our understanding about decay we recommend a minimal set of auxiliary resources to be preserved together with the workflows as an aggregation object and provide a software tool for end-users to create such aggregations and to assess their completeness.


Journal of Web Semantics | 2015

Using a suite of ontologies for preserving workflow-centric research objects

Khalid Belhajjame; Jun Zhao; Daniel Garijo; Matthew Gamble; Kristina M. Hettne; Raúl Palma; Eleni Mina; Oscar Corcho; José Manuél Gómez-Pérez; Sean Bechhofer; Graham Klyne; Carole A. Goble

Scientific workflows are a popular mechanism for specifying and automating data-driven in silico experiments. A significant aspect of their value lies in their potential to be reused. Once shared, workflows become useful building blocks that can be combined or modified for developing new experiments. However, previous studies have shown that storing workflow specifications alone is not sufficient to ensure that they can be successfully reused, without being able to understand what the workflows aim to achieve or to re-enact them. To gain an understanding of the workflow, and how it may be used and repurposed for their needs, scientists require access to additional resources such as annotations describing the workflow, datasets used and produced by the workflow, and provenance traces recording workflow executions.In this article, we present a novel approach to the preservation of scientific workflows through the application of research objects-aggregations of data and metadata that enrich the workflow specifications. Our approach is realised as a suite of ontologies that support the creation of workflow-centric research objects. Their design was guided by requirements elicited from previous empirical analyses of workflow decay and repair. The ontologies developed make use of and extend existing well known ontologies, namely the Object Reuse and Exchange (ORE) vocabulary, the Annotation Ontology (AO) and the W3C PROV ontology (PROVO). We illustrate the application of the ontologies for building Workflow Research Objects with a case-study that investigates Huntingtons disease, performed in collaboration with a team from the Leiden University Medial Centre (HG-LUMC). Finally we present a number of tools developed for creating and managing workflow-centric research objects.


Briefings in Bioinformatics | 2009

Linked data and provenance in biological data webs

Jun Zhao; Alistair Miles; Graham Klyne; David M. Shotton

The Web is now being used as a platform for publishing and linking life science data. The Webs linking architecture can be exploited to join heterogeneous data from multiple sources. However, as data are frequently being updated in a decentralized environment, provenance information becomes critical to providing reliable and trustworthy services to scientists. This article presents design patterns for representing and querying provenance information relating to mapping links between heterogeneous data from sources in the domain of functional genomics. We illustrate the use of named resource description framework (RDF) graphs at different levels of granularity to make provenance assertions about linked data, and demonstrate that these assertions are sufficient to support requirements including data currency, integrity, evidential support and historical queries.


Nucleic Acids Research | 2010

FlyTED: the Drosophila Testis Gene Expression Database

Jun Zhao; Graham Klyne; Elizabeth Benson; Elin Gudmannsdottir; Helen White-Cooper; David M. Shotton

FlyTED, the Drosophila Testis Gene Expression Database, is a biological research database for gene expression images from the testis of the fruit fly Drosophila melanogaster. It currently contains 2762 mRNA in situ hybridization images and ancillary metadata revealing the patterns of gene expression of 817 Drosophila genes in testes of wild type flies and of seven meiotic arrest mutant strains in which spermatogenesis is defective. This database has been built by adapting a widely used digital library repository software system, EPrints (http://eprints.org/software/), and provides both web-based search and browse interfaces, and programmatic access via an SQL dump, OAI-PMH and SPARQL. FlyTED is available at http://www.fly-ted.org/.


Journal of Biomedical Informatics | 2010

OpenFlyData: An exemplar data web integrating gene expression data on the fruit fly Drosophila melanogaster

Alistair Miles; Jun Zhao; Graham Klyne; Helen White-Cooper; David M. Shotton

MOTIVATION Integrating heterogeneous data across distributed sources is a major requirement for in silico bioinformatics supporting translational research. For example, genome-scale data on patterns of gene expression in the fruit fly Drosophila melanogaster are widely used in functional genomic studies in many organisms to inform candidate gene selection and validate experimental results. However, current data integration solutions tend to be heavy weight, and require significant initial and ongoing investment of effort. Development of a common Web-based data integration infrastructure (a.k.a. data web), using Semantic Web standards, promises to alleviate these difficulties, but little is known about the feasibility, costs, risks or practical means of migrating to such an infrastructure. RESULTS We describe the development of OpenFlyData, a proof-of-concept system integrating gene expression data on D. melanogaster, combining Semantic Web standards with light-weight approaches to Web programming based on Web 2.0 design patterns. To support researchers designing and validating functional genomic studies, OpenFlyData includes user-facing search applications providing intuitive access to and comparison of gene expression data from FlyAtlas, the BDGP in situ database, and FlyTED, using data from FlyBase to expand and disambiguate gene names. OpenFlyDatas services are also openly accessible, and are available for reuse by other bioinformaticians and application developers. Semi-automated methods and tools were developed to support labour- and knowledge-intensive tasks involved in deploying SPARQL services. These include methods for generating ontologies and relational-to-RDF mappings for relational databases, which we illustrate using the FlyBase Chado database schema; and methods for mapping gene identifiers between databases. The advantages of using Semantic Web standards for biomedical data integration are discussed, as are open issues. In particular, although the performance of open source SPARQL implementations is sufficient to query gene expression data directly from user-facing applications such as Web-based data fusions (a.k.a. mashups), we found open SPARQL endpoints to be vulnerable to denial-of-service-type problems, which must be mitigated to ensure reliability of services based on this standard. These results are relevant to data integration activities in translational bioinformatics. AVAILABILITY The gene expression search applications and SPARQL endpoints developed for OpenFlyData are deployed at http://openflydata.org. FlyUI, a library of JavaScript widgets providing re-usable user-interface components for Drosophila gene expression data, is available at http://flyui.googlecode.com. Software and ontologies to support transformation of data from FlyBase, FlyAtlas, BDGP and FlyTED to RDF are available at http://openflydata.googlecode.com. SPARQLite, an implementation of the SPARQL protocol, is available at http://sparqlite.googlecode.com. All software is provided under the GPL version 3 open source license.


Journal of Biomedical Semantics | 2014

Structuring research methods and data with the research object model: genomics workflows as a case study.

Kristina M. Hettne; Harish Dharuri; Jun Zhao; Katherine Wolstencroft; Khalid Belhajjame; Stian Soiland-Reyes; Eleni Mina; Mark Thompson; Don C. Cruickshank; L. Verdes-Montenegro; Julián Garrido; David De Roure; Oscar Corcho; Graham Klyne; Reinout van Schouwen; Peter A. C. 't Hoen; Sean Bechhofer; Carole A. Goble; Marco Roos

BackgroundOne of the main challenges for biomedical research lies in the computer-assisted integrative study of large and increasingly complex combinations of data in order to understand molecular mechanisms. The preservation of the materials and methods of such computational experiments with clear annotations is essential for understanding an experiment, and this is increasingly recognized in the bioinformatics community. Our assumption is that offering means of digital, structured aggregation and annotation of the objects of an experiment will provide necessary meta-data for a scientist to understand and recreate the results of an experiment. To support this we explored a model for the semantic description of a workflow-centric Research Object (RO), where an RO is defined as a resource that aggregates other resources, e.g., datasets, software, spreadsheets, text, etc. We applied this model to a case study where we analysed human metabolite variation by workflows.ResultsWe present the application of the workflow-centric RO model for our bioinformatics case study. Three workflows were produced following recently defined Best Practices for workflow design. By modelling the experiment as an RO, we were able to automatically query the experiment and answer questions such as “which particular data was input to a particular workflow to test a particular hypothesis?”, and “which particular conclusions were drawn from a particular workflow?”.ConclusionsApplying a workflow-centric RO model to aggregate and annotate the resources used in a bioinformatics experiment, allowed us to retrieve the conclusions of the experiment in the context of the driving hypothesis, the executed workflows and their input data. The RO model is an extendable reference model that can be used by other systems as well.AvailabilityThe Research Object is available at http://www.myexperiment.org/packs/428The Wf4Ever Research Object Model is available at http://wf4ever.github.io/ro


international conference on e-science | 2009

CLAROS - Bringing Classical Art to a Global Public

Donna Kurtz; Greg Parker; David M. Shotton; Graham Klyne; Florian Schroff; Andrew Zisserman; Yorick Wilks

CLAROS (Classical Art Research Online Services; www.clarosweb.org) is an international interdisciplinary research initiative led by the University of Oxford (Humanities and Mathematics and Physical Sciences), hosted by the Oxford e-Research Centre (OeRC, www.oerc.ox.ac.uk), and inspired by the Beazley Archive (www.beazley.ox.ac.uk) participating in EU R&D projects. During 2009, a pump-priming grant from the University’s Fell Fund enabled CLAROS to integrate on line more than two million records and images held in research centres in Oxford, Paris, Cologne and Berlin. CLAROS uses CIDOC CRM (http://cidoc.ics.forth.gr/), developed under UNESCO’s ICOM (http://icom.museum/), to map across datasets and a portfolio of Open Source software to deliver them swiftly to a broad range of global users. Data web applications for integration are being developed by Zoology (http://ibrg.zoo.ox.ac.uk/), image recognition by Engineering Science (www.robots.ox.ac.uk/~vgg/), and artificial intelligence by the Oxford Internet Institute (www.oii.ox.ac.uk). CLAROS will welcome new institutional members and engage with the public to document art and disseminate results.


european semantic web conference | 2008

Building a semantic web image repository for biological research images

Jun Zhao; Graham Klyne; David M. Shotton

Images play a vital role in scientific studies. An image repository would become a costly and meaningless data graveyard without descriptive metadata. We adapted EPrints, a conventional repository software system, to create a biological research image repository for a local research group, in order to publish images with structured metadata with a minimum of development effort. However, in its native installation, this repository cannot easily be linked with information from third parties, and the user interface has limited flexibility. We address these two limitations by providing Semantic Web access to the contents of this image repository, causing the image metadata to become programmatically accessible through a SPARQL endpoint and enabling the images and their metadata to be presented in more flexible faceted browsers, jS-pace and Exhibit. We show the feasibility of publishing image metadata on the Semantic Web using existing tools, and examine the inadequacies of the Semantic Web browsers in providing effective user interfaces. We highlight the importance of a loosely coupled software framework that provides a lightweight solution and enables us to switch between alternative components.


Proceedings of the 2nd International Workshop on Digital Libraries for Musicology | 2015

Music and Science: Parallels in Production

David De Roure; Graham Klyne; Kevin R. Page; John Pybus; David M. Weigl

The music industry has embraced digital technology, from recording and production of music through to distribution and consumption. Meanwhile scholarly communication, including academic publishing and libraries, is also undergoing transformation thanks to the affordances of the digital. We suggest that comparing and contrasting these two sociotechnical systems will provide insights of mutual benefit. We propose a preliminary framing of that comparison, introduce a notion of Digital Music Object that is analogous to the Research Object, and discuss some implications for digital libraries for musicology.

Collaboration


Dive into the Graham Klyne's collaboration.

Top Co-Authors

Avatar

Jun Zhao

University of Oxford

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Raúl Palma

Technical University of Madrid

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Daniel Garijo

Technical University of Madrid

View shared research outputs
Researchain Logo
Decentralizing Knowledge