Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Peter Murray-Rust is active.

Publication


Featured researches published by Peter Murray-Rust.


Nature Biotechnology | 2010

The BioPAX community standard for pathway data sharing

Emek Demir; Michael P. Cary; Suzanne M. Paley; Ken Fukuda; Christian Lemer; Imre Vastrik; Guanming Wu; Peter D'Eustachio; Carl F. Schaefer; Joanne S. Luciano; Frank Schacherer; Irma Martínez-Flores; Zhenjun Hu; Verónica Jiménez-Jacinto; Geeta Joshi-Tope; Kumaran Kandasamy; Alejandra López-Fuentes; Huaiyu Mi; Elgar Pichler; Igor Rodchenkov; Andrea Splendiani; Sasha Tkachev; Jeremy Zucker; Gopal Gopinath; Harsha Rajasimha; Ranjani Ramakrishnan; Imran Shah; Mustafa Syed; Nadia Anwar; Özgün Babur

Biological Pathway Exchange (BioPAX) is a standard language to represent biological pathways at the molecular and cellular level and to facilitate the exchange of pathway data. The rapid growth of the volume of pathway data has spurred the development of databases and computational tools to aid interpretation; however, use of these data is hampered by the current fragmentation of pathway information across many databases with incompatible formats. BioPAX, which was created through a community process, solves this problem by making pathway data substantially easier to collect, index, interpret and share. BioPAX can represent metabolic and signaling pathways, molecular and genetic interactions and gene regulation networks. Using BioPAX, millions of interactions, organized into thousands of pathways, from many organisms are available from a growing number of databases. This large amount of pathway data in a computable form will support visualization, analysis and biological discovery.


Journal of Chemical Information and Modeling | 2006

The Blue Obelisk-interoperability in chemical informatics.

Rajarshi Guha; Michael T. Howard; Geoffrey R. Hutchison; Peter Murray-Rust; Henry S. Rzepa; Christoph Steinbeck; Jörg K. Wegner; Egon Willighagen

The Blue Obelisk Movement (http://www.blueobelisk.org/) is the name used by a diverse Internet group promoting reusable chemistry via open source software development, consistent and complimentary chemoinformatics research, open data, and open standards. We outline recent examples of cooperation in the Blue Obelisk group:  a shared dictionary of algorithms and implementations in chemoinformatics algorithms drawing from our various software projects; a shared repository of chemoinformatics data including elemental properties, atomic radii, isotopes, atom typing rules, and so forth; and Web services for the platform-independent use of chemoinformatics programs.


Journal of Chemical Information and Computer Sciences | 2003

Chemical Markup, XML, and the World Wide Web. 4. CML Schema

Peter Murray-Rust; Henry S. Rzepa

A revision to Chemical Markup Language (CML) is presented as a XML Schema compliant form, modularized into nonchemical and chemical components. STMML contains generic concepts for numeric data and scientific units, while CMLCore retains most of the chemical functionality of the original CML 1.0 and extends it by adding handlers for chemical substances, extended bonding models and names. We propose extension via new namespaced components for chemical queries, reactions, spectra, and computational chemistry. The conformance with XML schemas allows much greater control over datatyping, document validation, and structure.


Journal of Chemical Information and Computer Sciences | 1999

Chemical Markup, XML, and the Worldwide Web. 1. Basic Principles

Peter Murray-Rust; Henry S. Rzepa

Chemical markup language (CML) is an application of XML, the extensible markup language, developed for containing chemical information components within documents. Its design supports interoperability with the XML family of tools and protocols. It provides a base functionality for atomic, molecular, and crystallographic information and allows extensibility for other chemical applications. Legacy files can be imported into CML without information loss and can carry any desired chemical ontology. Some applications of CML (Markush structures, chemical searching) will be discussed in later articles. An XML document type declaration (DTD) for CML is included as a Chart.


Journal of Cheminformatics | 2011

OSCAR4: a flexible architecture for chemical text-mining

David M Jessop; Sam Adams; Egon Willighagen; Lezan Hawizy; Peter Murray-Rust

The Open-Source Chemistry Analysis Routines (OSCAR) software, a toolkit for the recognition of named entities and data in chemistry publications, has been developed since 2002. Recent work has resulted in the separation of the core OSCAR functionality and its release as the OSCAR4 library. This library features a modular API (based on reduction of surface coupling) that permits client programmers to easily incorporate it into external applications. OSCAR4 offers a domain-independent architecture upon which chemistry specific text-mining tools can be built, and its development and usage are discussed.


BMC Bioinformatics | 2007

Bioclipse: an open source workbench for chemo- and bioinformatics

Ola Spjuth; Tobias Helmus; Egon Willighagen; Stefan Kuhn; Martin Eklund; Johannes Wagener; Peter Murray-Rust; Christoph Steinbeck; Jarl E. S. Wikberg

BackgroundThere is a need for software applications that provide users with a complete and extensible toolkit for chemo- and bioinformatics accessible from a single workbench. Commercial packages are expensive and closed source, hence they do not allow end users to modify algorithms and add custom functionality. Existing open source projects are more focused on providing a framework for integrating existing, separately installed bioinformatics packages, rather than providing user-friendly interfaces. No open source chemoinformatics workbench has previously been published, and no sucessful attempts have been made to integrate chemo- and bioinformatics into a single framework.ResultsBioclipse is an advanced workbench for resources in chemo- and bioinformatics, such as molecules, proteins, sequences, spectra, and scripts. It provides 2D-editing, 3D-visualization, file format conversion, calculation of chemical properties, and much more; all fully integrated into a user-friendly desktop application. Editing supports standard functions such as cut and paste, drag and drop, and undo/redo. Bioclipse is written in Java and based on the Eclipse Rich Client Platform with a state-of-the-art plugin architecture. This gives Bioclipse an advantage over other systems as it can easily be extended with functionality in any desired direction.ConclusionBioclipse is a powerful workbench for bio- and chemoinformatics as well as an advanced integration platform. The rich functionality, intuitive user interface, and powerful plugin architecture make Bioclipse the most advanced and user-friendly open source workbench for chemo- and bioinformatics. Bioclipse is released under Eclipse Public License (EPL), an open source license which sets no constraints on external plugin licensing; it is totally open for both open source plugins as well as commercial ones. Bioclipse is freely available at http://www.bioclipse.net.


CompLife'06 Proceedings of the Second international conference on Computational Life Sciences | 2006

High-Throughput identification of chemistry in life science texts

Peter T. Corbett; Peter Murray-Rust

OSCAR3 is an open extensible system for the automated annotation of chemistry in scientific articles, which can process thousands of articles per hour. This XML annotation supports applications such as interactive browsing and chemically-aware searching, and has been designed for integration with larger text-analysis systems. We report its application to the high-throughput analysis of the small-molecule chemistry content of texts in life sciences, such as PubMed abstracts.


New Journal of Chemistry | 2001

Development of chemical markup language (CML) as a system for handling complex chemical content

Peter Murray-Rust; Henry S. Rzepa; Mike Wright

We report the first fully operational system for managing complex chemical content entirely in interoperating XML-based markup languages. This involves the application of version 1.0 of chemical markup language (CML 1.0) and the development of mechanisms allowing the display of CML marked up molecules within a standard web browser (Internet Explorer 5). We demonstrate how an extension to include spectra and reactions could be achieved. Integrating these techniques with existing XML compliant languages (e.g. XHTML and SVG) results in electronic documents with the significant advantages of data retrieval and flexibility over existing HTML/plugin solutions. These documents can be optimised for a variety of purposes (e.g. screen display or printing) by single XSL stylesheet transformations. An XML schema has been developed from the CML 1.0 DTD to allow document validation and the use of data links. A working online demonstration of these concepts, termed ChiMeraL, containing a range of online demonstrations, examples and CML resources such as the CML DTD and schema has been associated with this article ia the supplementary material.


Nature Reviews Drug Discovery | 2011

Minimum information about a bioactive entity (MIABE)

Sandra Orchard; Bissan Al-Lazikani; Steve Bryant; Dominic Clark; Elizabeth Calder; Ian Dix; Ola Engkvist; Mark J. Forster; Anna Gaulton; Michael Gilson; Robert Glen; Martin Grigorov; Kim E. Hammond-Kosack; Lee Harland; Andrew Hopkins; Christopher Larminie; Nick Lynch; Romeena K. Mann; Peter Murray-Rust; Elena Lo Piparo; Christopher Southan; Christoph Steinbeck; David Wishart; Henning Hermjakob; John P. Overington; Janet M. Thornton

Bioactive molecules such as drugs, pesticides and food additives are produced in large numbers by many commercial and academic groups around the world. Enormous quantities of data are generated on the biological properties and quality of these molecules. Access to such data — both on licensed and commercially available compounds, and also on those that fail during development — is crucial for understanding how improved molecules could be developed. For example, computational analysis of aggregated data on molecules that are investigated in drug discovery programmes has led to a greater understanding of the properties of successful drugs. However, the information required to perform these analyses is rarely published, and when it is made available it is often missing crucial data or is in a format that is inappropriate for efficient data-mining. Here, we propose a solution: the definition of reporting guidelines for bioactive entities — the Minimum Information About a Bioactive Entity (MIABE) — which has been developed by representatives of pharmaceutical companies, data resource providers and academic groups.


Journal of Chemical Information and Modeling | 2011

Chemical name to structure: OPSIN, an open source solution.

Daniel M. Lowe; Peter T. Corbett; Peter Murray-Rust; Robert C. Glen

We have produced an open source, freely available, algorithm (Open Parser for Systematic IUPAC Nomenclature, OPSIN) that interprets the majority of organic chemical nomenclature in a fast and precise manner. This has been achieved using an approach based on a regular grammar. This grammar is used to guide tokenization, a potentially difficult problem in chemical names. From the parsed chemical name, an XML parse tree is constructed that is operated on in a stepwise manner until the structure has been reconstructed from the name. Results from OPSIN on various computer generated name/structure pair sets are presented. These show exceptionally high precision (99.8%+) and, when using general organic chemical nomenclature, high recall (98.7-99.2%). This software can serve as the basis for future open source developments of chemical name interpretation.

Collaboration


Dive into the Peter Murray-Rust's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Martin T. Dove

Queen Mary University of London

View shared research outputs
Top Co-Authors

Avatar

Jim Downing

University of Cambridge

View shared research outputs
Top Co-Authors

Avatar

Sam Adams

University of Cambridge

View shared research outputs
Top Co-Authors

Avatar

Roger F. Newton

University of Hertfordshire

View shared research outputs
Top Co-Authors

Avatar

Rp Bruin

University of Cambridge

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge