Joseph A Townsend
University of Cambridge
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Joseph A Townsend.
Journal of Cheminformatics | 2011
Peter Murray-Rust; Joseph A Townsend; Sam Adams; Weerapong Phadungsukanan; Jens Thomas
The semantic architecture of CML consists of conventions, dictionaries and units. The conventions conform to a top-level specification and each convention can constrain compliant documents through machine-processing (validation). Dictionaries conform to a dictionary specification which also imposes machine validation on the dictionaries. Each dictionary can also be used to validate data in a CML document, and provide human-readable descriptions. An additional set of conventions and dictionaries are used to support scientific units. All conventions, dictionaries and dictionary elements are identifiable and addressable through unique URIs.
Journal of Cheminformatics | 2011
Sam Adams; Pablo de Castro; Pablo Echenique; Jorge Estrada; Marcus D. Hanwell; Peter Murray-Rust; Paul Sherwood; Jens Thomas; Joseph A Townsend
Computational Quantum Chemistry has developed into a powerful, efficient, reliable and increasingly routine tool for exploring the structure and properties of small to medium sized molecules. Many thousands of calculations are performed every day, some offering results which approach experimental accuracy. However, in contrast to other disciplines, such as crystallography, or bioinformatics, where standard formats and well-known, unified databases exist, this QC data is generally destined to remain locally held in files which are not designed to be machine-readable. Only a very small subset of these results will become accessible to the wider community through publication.In this paper we describe how the Quixote Project is developing the infrastructure required to convert output from a number of different molecular quantum chemistry packages to a common semantically rich, machine-readable format and to build respositories of QC results. Such an infrastructure offers benefits at many levels. The standardised representation of the results will facilitate software interoperability, for example making it easier for analysis tools to take data from different QC packages, and will also help with archival and deposition of results. The repository infrastructure, which is lightweight and built using Open software components, can be implemented at individual researcher, project, organisation or community level, offering the exciting possibility that in future many of these QC results can be made publically available, to be searched and interpreted just as crystallography and bioinformatics results are today.Although we believe that quantum chemists will appreciate the contribution the Quixote infrastructure can make to the organisation and and exchange of their results, we anticipate that greater rewards will come from enabling their results to be consumed by a wider community. As the respositories grow they will become a valuable source of chemical data for use by other disciplines in both research and education.The Quixote project is unconventional in that the infrastructure is being implemented in advance of a full definition of the data model which will eventually underpin it. We believe that a working system which offers real value to researchers based on tools and shared, searchable repositories will encourage early participation from a broader community, including both producers and consumers of data. In the early stages, searching and indexing can be performed on the chemical subject of the calculations, and well defined calculation meta-data. The process of defining more specific quantum chemical definitions, adding them to dictionaries and extracting them consistently from the results of the various software packages can then proceed in an incremental manner, adding additional value at each stage.Not only will these results help to change the data management model in the field of Quantum Chemistry, but the methodology can be applied to other pressing problems related to data in computational and experimental science.
Organic and Biomolecular Chemistry | 2004
Sam Adams; Jonathan M. Goodman; Richard Kidd; A. D. McNaught; Peter Murray-Rust; F. R. Norton; Joseph A Townsend; Christopher A. Waudby
An experimental data checker has been developed that reads, analyses, and cross-correlates experimental information copied and pasted from authors manuscripts, which will be useful for authors, referees, editors and readers of papers reporting new molecular information, and which makes possible a quantification of the accuracy of journals data.
Journal of Cheminformatics | 2011
Joseph A Townsend; Peter Murray-Rust
CMLLite is a collection of definitions and processes which provide strong and flexible validation for a document in Chemical Markup Language (CML). It consists of an updated CML schema (schema3), conventions specifying rules in both human and machine-understandable forms and a validator available both online and offline to check conformance. This article explores the rationale behind the changes which have been made to the schema, explains how conventions interact and how they are designed, formulated, implemented and tested, and gives an overview of the validation service.
international conference on e-science | 2009
Joseph A Townsend; Jim Downing; Peter Murray-Rust
We have developed a methodology and workflow (CHIC) for the automatic semantification and structuring of legacy textual scientific documents. CHIC imports common document formats (PDF, DOCX and (X)HTML) and uses a number of toolkits to extract components and convert them into SciXML. This is sectioned into text-rich and data-rich streams and stand-off annotation (SAF) is created for each. Embedded domain specific objects can be converted into XML (Chemical Markup Language). The different workflow streams can then be recombined and typically converted into RDF (Resource Description Format).
Archive | 2008
Peter Murray-Rust; Robert C. Glen; Henry S. Rzepa; James J. P. Stewart; Joseph A Townsend; Egon Willighagen; Zhang Yong
Archive | 2011
Sam Adams; Tamas Beke; Pablo Echenique; Jorge Estrada; Marcus D. Hanwell; Peter Murray-Rust; Jens Thomas; Joseph A Townsend; Lance Westerhoff
Archive | 2008
Peter Morgan; Jim Downing; Peter Murray-Rust; Diana Stewart; Alan P. Tonge; Joseph A Townsend; M. J. Harvey; Henry S Rzepa
ABSTR PAP AM CHEM S , 232 ? - ?. (2006) | 2006
Sam Adams; S Kuhn; Peter Murray-Rust; C Steinbeck; Joseph A Townsend; Christopher A. Waudby
Archive | 2005
Joseph A Townsend; Peter Murray-Rust; S. Tyrrell; Yong Zhang