Eelke van der Horst
Leiden University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Eelke van der Horst.
BMC Bioinformatics | 2010
Eelke van der Horst; Julio E. Peironcely; Adriaan P. IJzerman; Margot W. Beukers; Jonathan Robert Lane; Herman W. T. van Vlijmen; Michael Emmerich; Yasushi Okuno; Andreas Bender
BackgroundG protein-coupled receptors (GPCRs) represent a family of well-characterized drug targets with significant therapeutic value. Phylogenetic classifications may help to understand the characteristics of individual GPCRs and their subtypes. Previous phylogenetic classifications were all based on the sequences of receptors, adding only minor information about the ligand binding properties of the receptors. In this work, we compare a sequence-based classification of receptors to a ligand-based classification of the same group of receptors, and evaluate the potential to use sequence relatedness as a predictor for ligand interactions thus aiding the quest for ligands of orphan receptors.ResultsWe present a classification of GPCRs that is purely based on their ligands, complementing sequence-based phylogenetic classifications of these receptors. Targets were hierarchically classified into phylogenetic trees, for both sequence space and ligand (substructure) space. The overall organization of the sequence-based tree and substructure-based tree was similar; in particular, the adenosine receptors cluster together as well as most peptide receptor subtypes (e.g. opioid, somatostatin) and adrenoceptor subtypes. In ligand space, the prostanoid and cannabinoid receptors are more distant from the other targets, whereas the tachykinin receptors, the oxytocin receptor, and serotonin receptors are closer to the other targets, which is indicative for ligand promiscuity. In 93% of the receptors studied, de-orphanization of a simulated orphan receptor using the ligands of related receptors performed better than random (AUC > 0.5) and for 35% of receptors de-orphanization performance was good (AUC > 0.7).ConclusionsWe constructed a phylogenetic classification of GPCRs that is solely based on the ligands of these receptors. The similarities and differences with traditional sequence-based classifications were investigated: our ligand-based classification uncovers relationships among GPCRs that are not apparent from the sequence-based classification. This will shed light on potential cross-reactivity of GPCR ligands and will aid the design of new ligands with the desired activity profiles. In addition, we linked the ligand-based classification with a ligand-focused sequence-based classification described in literature and proved the potential of this method for de-orphanization of GPCRs.
Journal of Chemical Information and Modeling | 2009
Eelke van der Horst; Yasushi Okuno; Andreas Bender; Adriaan P. IJzerman
In this study, we conducted frequent substructure mining to identify structural features that discriminate between ligands that do bind to G protein-coupled receptors (GPCRs) and those that do not. In most cases, particular chemical representations resulted in the most significant substructures. Substructures found to be characteristic for the background control set reflected reactions that may have been used to construct this library, e.g., for the ChemBridge DIVERSet library employed these are ester and carboxamide moieties. Alkane amine substructures were identified as most important for GPCR ligands, e.g. the butylamine substructure, often linked to an aromatic system. Hierarchical analysis of targeted GPCRs revealed well-known motives and new substructural features. One example is the imidazole-like substructure common for the histamine binding receptor ligands. Another example is the planar ring system consisting of a fused five- and six-membered ring (indole-like substucture) common for the serotonin receptor ligands.
Journal of Medicinal Chemistry | 2012
Marijn P. A. Sanders; Luc Roumen; Eelke van der Horst; J. Robert Lane; Henry F. Vischer; Jody van Offenbeek; Henk de Vries; Stefan Verhoeven; Ken Y. Chow; Folkert Verkaar; Margot W. Beukers; Ross McGuire; Rob Leurs; Adriaan P. IJzerman; Jacob de Vlieg; Iwan J. P. de Esch; Guido J.R. Zaman; Jan P. G. Klomp; Andreas Bender; Chris de Graaf
We present the systematic prospective evaluation of a protein-based and a ligand-based virtual screening platform against a set of three G-protein-coupled receptors (GPCRs): the β-2 adrenoreceptor (ADRB2), the adenosine A(2A) receptor (AA2AR), and the sphingosine 1-phosphate receptor (S1PR1). Novel bioactive compounds were identified using a consensus scoring procedure combining ligand-based (frequent substructure ranking) and structure-based (Snooker) tools, and all 900 selected compounds were screened against all three receptors. A striking number of ligands showed affinity/activity for GPCRs other than the intended target, which could be partly attributed to the fuzziness and overlap of protein-based pharmacophore models. Surprisingly, the phosphodiesterase 5 (PDE5) inhibitor sildenafil was found to possess submicromolar affinity for AA2AR. Overall, this is one of the first published prospective chemogenomics studies that demonstrate the identification of novel cross-pharmacology between unrelated protein targets. The lessons learned from this study can be used to guide future virtual ligand design efforts.
ChemMedChem | 2011
Eelke van der Horst; Rianne van der Pijl; Thea Mulder-Krieger; Andreas Bender; Adriaan P. IJzerman
A virtual ligand‐based screening approach was designed and evaluated for the discovery of new A2A adenosine receptor (AR) ligands. For comparison and evaluation, the procedures from a recently published virtual screening study that used the A2A AR X‐ray crystal structure for the target‐based discovery of new A2A ligands were largely followed. Several screening models were constructed by deriving the distinguishing structural features from selected sets of A2A AR antagonists, so‐called frequent substructure mining. The best model in statistical terms was subsequently applied to large‐scale virtual screens of a commercial vendor library. This resulted in the selection of 36 candidates for acquisition and testing. Of the selected candidates, eight compounds significantly inhibited radioligand binding at A2A AR (>30 %) at 10 μM, corresponding to a “hit rate” of 22 %. This hit rate is quite similar to that of the referenced target‐based virtual screening study, while both approaches yield new, non‐overlapping sets of ligands.
international conference on evolutionary multi criterion optimization | 2009
Johannes W. Kruisselbrink; Michael Emmerich; Thomas Bäck; Andreas Bender; Adriaan P. IJzerman; Eelke van der Horst
This paper is motivated by problem scenarios in automated drug design. It discusses a modeling approach for design optimization problems with many criteria that can be partitioned into objectives and fuzzy constraints. The purpose of this remodeling is to transform the original criteria such that, when using them in an evolutionary search method, a good view on the trade-off between the different objectives and the satisfaction of constraints is obtained. Instead of reducing a many objective problem to a single-objective problem, it is proposed to reduce it to a multi-objective optimization problem with a low number of objectives, for which the visualization of the Pareto front is still possible and the size of a high-resolution approximation set is affordable. For design problems where it is reasonable to combine certain objectives and/or constraints into logical groups by means of desirability indexes, this method will yield good trade-off results with reduced computational effort. The proposed methodology is evaluated in a case-study on automated drug design where we aim to find molecular structures that could serve as estrogen receptor antagonists.
PLOS ONE | 2015
Alejandra Gonzalez-Beltran; Peter Li; Jun Zhao; Maria Susana Avila-Garcia; Marco Roos; Mark Thompson; Eelke van der Horst; Rajaram Kaliyaperumal; Ruibang Luo; Tin-Lap Lee; Tak Wah Lam; Scott C Edmunds; Susanna-Assunta Sansone; Philippe Rocca-Serra
Motivation Reproducing the results from a scientific paper can be challenging due to the absence of data and the computational tools required for their analysis. In addition, details relating to the procedures used to obtain the published results can be difficult to discern due to the use of natural language when reporting how experiments have been performed. The Investigation/Study/Assay (ISA), Nanopublications (NP), and Research Objects (RO) models are conceptual data modelling frameworks that can structure such information from scientific papers. Computational workflow platforms can also be used to reproduce analyses of data in a principled manner. We assessed the extent by which ISA, NP, and RO models, together with the Galaxy workflow system, can capture the experimental processes and reproduce the findings of a previously published paper reporting on the development of SOAPdenovo2, a de novo genome assembler. Results Executable workflows were developed using Galaxy, which reproduced results that were consistent with the published findings. A structured representation of the information in the SOAPdenovo2 paper was produced by combining the use of ISA, NP, and RO models. By structuring the information in the published paper using these data and scientific workflow modelling frameworks, it was possible to explicitly declare elements of experimental design, variables, and findings. The models served as guides in the curation of scientific information and this led to the identification of inconsistencies in the original published paper, thereby allowing its authors to publish corrections in the form of an errata. Availability SOAPdenovo2 scripts, data, and results are available through the GigaScience Database: http://dx.doi.org/10.5524/100044; the workflows are available from GigaGalaxy: http://galaxy.cbiit.cuhk.edu.hk; and the representations using the ISA, NP, and RO models are available through the SOAPdenovo2 case study website http://isa-tools.github.io/soapdenovo2/. Contact: [email protected] and [email protected].
Statistical Analysis and Data Mining | 2009
Munikumar R. Doddareddy; Gerard J. P. van Westen; Eelke van der Horst; Julio E. Peironcely; Frans Corthals; Adriaan P. IJzerman; Michael Emmerich; Jeremy L. Jenkins; Andreas Bender
An alternator pulley includes a driving member driven and rotated via a belt from an output shaft of an engine. A driving member is disposed on an inner surface of the driving member and a one-way clutch is interposed between the driving and driven member. The one-way clutch includes rollers capable of rolling in a locked side direction along which a rotating power of the driving member is transmitted to the driven member or a free side direction along which the rotating powder is interrupted. Depending on a relative speed difference between the driving member and the driven member, the rollers are biased for pressing in the locked side direction and a torque value of the pressing is set preferably to less than 4 Nm.
Journal of Cheminformatics | 2015
Saber A. Akhondi; Kristina M. Hettne; Eelke van der Horst; Erik M. van Mulligen; Jan A. Kors
BackgroundThe past decade has seen an upsurge in the number of publications in chemistry. The ever-swelling volume of available documents makes it increasingly hard to extract relevant new information from such unstructured texts. The BioCreative CHEMDNER challenge invites the development of systems for the automatic recognition of chemicals in text (CEM task) and for ranking the recognized compounds at the document level (CDI task). We investigated an ensemble approach where dictionary-based named entity recognition is used along with grammar-based recognizers to extract compounds from text. We assessed the performance of ten different commercial and publicly available lexical resources using an open source indexing system (Peregrine), in combination with three different chemical compound recognizers and a set of regular expressions to recognize chemical database identifiers. The effect of different stop-word lists, case-sensitivity matching, and use of chunking information was also investigated. We focused on lexical resources that provide chemical structure information. To rank the different compounds found in a text, we used a term confidence score based on the normalized ratio of the term frequencies in chemical and non-chemical journals.ResultsThe use of stop-word lists greatly improved the performance of the dictionary-based recognition, but there was no additional benefit from using chunking information. A combination of ChEBI and HMDB as lexical resources, the LeadMine tool for grammar-based recognition, and the regular expressions, outperformed any of the individual systems. On the test set, the F-scores were 77.8% (recall 71.2%, precision 85.8%) for the CEM task and 77.6% (recall 71.7%, precision 84.6%) for the CDI task. Missed terms were mainly due to tokenization issues, poor recognition of formulas, and term conjunctions.ConclusionsWe developed an ensemble system that combines dictionary-based and grammar-based approaches for chemical named entity recognition, outperforming any of the individual systems that we considered. The system is able to provide structure information for most of the compounds that are found. Improved tokenization and better recognition of specific entity types is likely to further improve system performance.
PLOS ONE | 2016
Kristina M. Hettne; Mark Thompson; Herman H. H. B. M. van Haagen; Eelke van der Horst; Rajaram Kaliyaperumal; Eleni Mina; Zuotian Tatum; Jeroen F. J. Laros; Erik M. van Mulligen; Martijn J. Schuemie; Emmelien Aten; Tong Shu Li; Richard Bruskiewich; Benjamin M. Good; Andrew I. Su; Jan A. Kors; Johan T. den Dunnen; Gert-Jan B. van Ommen; Marco Roos; Peter A. C. 't Hoen; Barend Mons; Erik Schultes
High-throughput experimental methods such as medical sequencing and genome-wide association studies (GWAS) identify increasingly large numbers of potential relations between genetic variants and diseases. Both biological complexity (millions of potential gene-disease associations) and the accelerating rate of data production necessitate computational approaches to prioritize and rationalize potential gene-disease relations. Here, we use concept profile technology to expose from the biomedical literature both explicitly stated gene-disease relations (the explicitome) and a much larger set of implied gene-disease associations (the implicitome). Implicit relations are largely unknown to, or are even unintended by the original authors, but they vastly extend the reach of existing biomedical knowledge for identification and interpretation of gene-disease associations. The implicitome can be used in conjunction with experimental data resources to rationalize both known and novel associations. We demonstrate the usefulness of the implicitome by rationalizing known and novel gene-disease associations, including those from GWAS. To facilitate the re-use of implicit gene-disease associations, we publish our data in compliance with FAIR Data Publishing recommendations [https://www.force11.org/group/fairgroup] using nanopublications. An online tool (http://knowledge.bio) is available to explore established and potential gene-disease associations in the context of other biomedical relations.
genetic and evolutionary computation conference | 2009
Johannes W. Kruisselbrink; Alexander Aleman; Michael Emmerich; Ad P. IJzerman; Andreas Bender; Thomas Baeck; Eelke van der Horst
There exist several applications of multi-objective evolutionary algorithms for drug design, however, a common drawback in recent approaches is that the diversity of resulting molecule populations is relatively low. This paper seeks to overcome this problem by introducing niching as a technique to enhance search space diversity. A single population approach with dynamic niche identification is studied in the application domain. In order to apply niching in molecular spaces a metric for measuring the dissimilarity of molecules will be introduced. The approach will be validated in case studies and compared with results of an NSGA-II algorithm without niching in the search space.