Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where John D. Holliday is active.

Publication


Featured researches published by John D. Holliday.


Combinatorial Chemistry & High Throughput Screening | 2002

Grouping of Coefficients for the Calculation of Inter-Molecular Similarity and Dissimilarity using 2D Fragment Bit-Strings

John D. Holliday; Hu Cy; Peter Willett

This paper compares 22 different similarity coefficients when they are used for searching databases of 2D fragment bit-strings. Experiments with the National Cancer Institute s AIDS and IDAlert databases show that the coefficients fall into several well-marked clusters, in which the members of a cluster will produce comparable rankings of a set of molecules. These clusters provide a basis for selecting combinations of coefficients for use in data fusion experiments. The results of these experiments provide a simple way of increasing the effectiveness of fragment-based similarity searching systems.


Journal of Chemical Information and Computer Sciences | 2003

Combination of fingerprint-based similarity coefficients using data fusion.

Naomie Salim; John D. Holliday; Peter Willett

Many different types of similarity coefficients have been described in the literature. Since different coefficients take into account different characteristics when assessing the degree of similarity between molecules, it is reasonable to combine them to further optimize the measures of similarity between molecules. This paper describes experiments in which data fusion is used to combine several binary similarity coefficients to get an overall estimate of similarity for searching databases of bioactive molecules. The results show that search performances can be improved by combining coefficients with little extra computational cost. However, there is no single combination which gives a consistently high performance for all search types.


Journal of Chemical Information and Computer Sciences | 2003

Analysis and display of the size dependence of chemical similarity coefficients

John D. Holliday; Naomie Salim; Martin Whittle; Peter Willett

We discuss the size-bias inherent in several chemical similarity coefficients when used for the similarity searching or diversity selection of compound collections. Limits to the upper bounds of 14 standard similarity coefficients are investigated, and the results are used to identify some exceptional characteristics of a few of the coefficients. An additional numerical contribution to the known size bias in the Tanimoto coefficient is identified. Graphical plots with respect to relative bit density are introduced to further assess the coefficients. Our methods reveal the asymmetries inherent in most similarity coefficients that lead to bias in selection, most notably with the Forbes and Russell-Rao coefficients. Conversely, when applied to the recently introduced Modified Tanimoto coefficient our methods provide support for the view that it is less biased toward molecular size than most. In this work we focus our discussion on fragment-based bit strings, but we demonstrate how our approach can be generalized to continuous representations.


Journal of Chemical Information and Computer Sciences | 1989

Review of ring perception algorithms for chemical graphs

Geoffrey M. Downs; Valerie J. Gillet; John D. Holliday; Michael F. Lynch

Current ring perception algorithms for use on chemical graphs concentrate on processing specific structures. In this review, the various published ring perception algorithms are classified according to the initial ring set obtained, and each algorithm or method of perception is described in detail. The final ring sets obtained are discussed in terms of their suitability for use in representing the ring systems in structurally explicit parts of generic chemical structures.


Journal of Chemical Information and Modeling | 2012

Similarity coefficients for binary chemoinformatics data: overview and extended comparison using simulated and real data sets

Roberto Todeschini; Viviana Consonni; Hua Xiang; John D. Holliday; Massimo Buscema; Peter Willett

This paper reports an analysis and comparison of the use of 51 different similarity coefficients for computing the similarities between binary fingerprints for both simulated and real chemical data sets. Five pairs and a triplet of coefficients were found to yield identical similarity values, leading to the elimination of seven of the coefficients. The remaining 44 coefficients were then compared in two ways: by their theoretical characteristics using simple descriptive statistics, correlation analysis, multidimensional scaling, Hasse diagrams, and the recently described atemporal target diffusion model; and by their effectiveness for similarity-based virtual screening using MDDR, WOMBAT, and MUV data. The comparisons demonstrate the general utility of the well-known Tanimoto method but also suggest other coefficients that may be worthy of further attention.


Journal of Molecular Graphics & Modelling | 2000

Effectiveness of retrieval in similarity searches of chemical databases: a review of performance measures.

Sarah J Edgar; John D. Holliday; Peter Willett

This article reviews measures for evaluating the effectiveness of similarity searches in chemical databases, drawing principally upon the many measures that have been described previously for evaluating the performance of text search engines. The use of the various measures is exemplified by fragment-based 2D similarity searches on several databases for which both structural and bioactivity data are available. It is concluded that the cumulative recall and G-H score measures are the most useful of those tested.


Future Medicinal Chemistry | 2011

Effectiveness of 2D fingerprints for scaffold hopping

Eleanor J. Gardiner; John D. Holliday; C. O'Dowd; Peter Willett

BACKGROUND It has been suggested that similarity searching using 2D fingerprints may not be suitable for scaffold hopping. METHODS This article reports a detailed evaluation of the effectiveness of six common types of 2D fingerprints when they are used for scaffold-hopping similarity searches of the Molecular Design Limited Drug Data Report database, World of Molecular Bioactivity database and Maximum Unbiased Validation database. RESULTS The results demonstrate that 2D fingerprints can be used for scaffold hopping, with novel scaffolds being identified in nearly every search that was carried out. The degree of enrichment depends on the structural diversity of the actives that are being sought, with the greatest enrichments often being obtained using the extended connectivity fingerprint encoding a circular substructure of diameter four bonds (ECFP4) fingerprint. CONCLUSION 2D fingerprints provide a simple and computationally efficient way of identifying novel chemotypes in lead-discovery programs.


Journal of Chemical Information and Modeling | 2009

Comparison of nonbinary similarity coefficients for similarity searching, clustering and compound selection

Aysha Al Khalifa; Maciej Haranczyk; John D. Holliday

Several recent studies have compared the relative performance of a selection of similarity coefficients when applied to chemical databases represented by binary fingerprints. Considerable variation in performance, when used for (dis)similarity-based techniques, such as similarity searching, database clustering, and dissimilarity-based compound selection, has been reported, the reasons for which are closely related to molecular size. For many of these similarity coefficients, an alternative form can be derived which is applicable to sets of nonbinary data, such as calculated or measured physicochemical properties, or counts of substructural fragments. Here we report on several studies which have been undertaken to investigate the relative performance of twelve coefficients when applied to nonbinary data using such (dis)similarity-based techniques. Results suggest that no single coefficient is appropriate for all methodologies investigated and that the size bias detected with binary data is not as apparent when the data and, hence, coefficient are nonbinary in nature.


Journal of Chemical Information and Computer Sciences | 2004

Clustering Files of Chemical Structures Using the Fuzzy k-Means Clustering Method

John D. Holliday; Sarah L. Rodgers; Peter Willett; Min-You Chen; Mahdi Mahfouf; Kevin Lawson; Graham Mullier

This paper evaluates the use of the fuzzy k-means clustering method for the clustering of files of 2D chemical structures. Simulated property prediction experiments with the Starlist file of logP values demonstrate that use of the fuzzy k-means method can, in some cases, yield results that are superior to those obtained with the conventional k-means method and with Wards clustering method. Clustering of several small sets of agrochemical compounds demonstrate the ability of the fuzzy k-means method to highlight multicluster membership and to identify outlier compounds, although the former can be difficult to interpret in some cases.


Journal of Chemical Information and Computer Sciences | 1996

THE SHEFFIELD GENERIC STRUCTURES PROJECT : A RETROSPECTIVE REVIEW

Michael F. Lynch; John D. Holliday

The problems posed by the requirements for storage and manipulation of generic chemical structure definitions in patents are reviewed. Chemists and patents agents have developed an armory of linguistic devices over many decades so that a generic structure description can describe large and often unlimited numbers of substances as a result of the combinatorial opportunities provided. The nature of these linguistic devices is defined, and the theoretical foundations devised during the Sheffield project for the successful solution of the problems in order to provide the desired retrieval facilities are reviewed. Progress toward the practical implementation of a system based on these solutions is evaluated. The relevance of the data structures and algorithms devised in this work to the issues raised by developments in combinatorial libraries is also reviewed.

Collaboration


Dive into the John D. Holliday's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Shereena M. Arif

National University of Malaysia

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Naomie Salim

Universiti Teknologi Malaysia

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Chia-Wei Chu

University of Sheffield

View shared research outputs
Researchain Logo
Decentralizing Knowledge