Christian Lemmen
Center for Information Technology
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Christian Lemmen.
Drug Discovery Today | 2004
Thomas Lengauer; Christian Lemmen; Matthias Rarey; Marc Zimmermann
There are several methods for virtual screening of databases of small organic compounds to find tight binders to a given protein target. Recent reviews in Drug Discovery Today have concentrated on screening by docking and by pharmacophore searching. Here, we complement these reviews by focusing on virtual screening methods that are based on analyzing ligand similarity on a structural level. Specifically, we concentrate on methods that exploit structural properties of the complete ligand molecules, as opposed to using just partial structural templates, such as pharmacophores. The in silico procedure of virtual screening (VS) and its relationship to the experimental procedure, HTS, is discussed, new developments in the field are summarized and perspectives on future research are offered.
Journal of Computer-aided Molecular Design | 2012
Nadine Schneider; Sally A. Hindle; Gudrun Lange; Robert Klein; Jürgen Albrecht; Hans Briem; Kristin Beyer; Holger Claußen; Marcus Gastreich; Christian Lemmen; Matthias Rarey
The HYDE scoring function consistently describes hydrogen bonding, the hydrophobic effect and desolvation. It relies on HYdration and DEsolvation terms which are calibrated using octanol/water partition coefficients of small molecules. We do not use affinity data for calibration, therefore HYDE is generally applicable to all protein targets. HYDE reflects the Gibbs free energy of binding while only considering the essential interactions of protein–ligand complexes. The greatest benefit of HYDE is that it yields a very intuitive atom-based score, which can be mapped onto the ligand and protein atoms. This allows the direct visualization of the score and consequently facilitates analysis of protein–ligand complexes during the lead optimization process. In this study, we validated our new scoring function by applying it in large-scale docking experiments. We could successfully predict the correct binding mode in 93% of complexes in redocking calculations on the Astex diverse set, while our performance in virtual screening experiments using the DUD dataset showed significant enrichment values with a mean AUC of 0.77 across all protein targets with little or no structural defects. As part of these studies, we also carried out a very detailed analysis of the data that revealed interesting pitfalls, which we highlight here and which should be addressed in future benchmark datasets.
Journal of Medicinal Chemistry | 2008
Markus Boehm; Tong-Ying Wu; Holger Claussen; Christian Lemmen
Large collections of combinatorial libraries are an integral element in todays pharmaceutical industry. It is of great interest to perform similarity searches against all virtual compounds that are synthetically accessible by any such library. Here we describe the successful application of a new software tool CoLibri on 358 combinatorial libraries based on validated reaction protocols to create a single chemistry space containing over 10 (12) possible products. Similarity searching with FTrees-FS allows the systematic exploration of this space without the need to enumerate all product structures. The search result is a set of virtual hits which are synthetically accessible by one or more of the existing reaction protocols. Grouping these virtual hits by their synthetic protocols allows the rapid design and synthesis of multiple follow-up libraries. Such library ideas support hit-to-lead design efforts for tasks like follow-up from high-throughput screening hits or scaffold hopping from one hit to another attractive series.
Journal of Chemical Information and Computer Sciences | 2002
Santosh Putta; Christian Lemmen; Paul Beroza; Jonathan Greene
The shape of and the chemical features of a ligand are both critical for biological activity. This paper presents a strategy that uses these descriptors to build a computational model for virtual screening of bioactive compounds. Molecules are represented in a binary shape-feature descriptor space as bit-strings, and their relative activities are used to identify the subset of the bit-string that is most relevant to bioactivity. This subset is used to score virtual libraries. We describe the computational details of the method and present an example validation experiment on thrombin inhibitors.
Journal of Computer-aided Molecular Design | 1997
Christian Lemmen; Thomas Lengauer
We present an efficient algorithm for the structural alignment of medium-sized organic molecules. The algorithm has been developed for applications in 3D QSAR and in receptor modeling. The method assumes one of the molecules, the reference ligand, to be presented in the conformation that it adopts inside the receptor pocket. The second molecule, the test ligand, is considered to be flexible, and is assumed to be given in an arbitrary low-energy conformation. Ligand flexibility is modeled by decomposing the test ligand into molecular fragments, such that ring systems are completely contained in a single fragment. Conformations of fragments and torsional angles of single bonds are taken from a small finite set, which depends on the fragment and bond, respectively. The algorithm superimposes a distinguished base fragment of the test ligand onto a suitable region of the reference ligand and then attaches the remaining fragments of the test ligand in a step-by-step fashion. During this process, a scoring function is optimized that encompasses bonding terms and terms accounting for steric overlap as well as for similarity of chemical properties of both ligands. The algorithm has been implemented in the FLEXS system. To validate the quality of the produced results, we have selected a number of examples for which the mutual superposition of two ligands is experimentally given by the comparison of the binding geometries known from the crystal structures of their corresponding protein–ligand complexes. On more than two-thirds of the test examples the algorithm produces rms deviations of the predicted versus the observed conformation of the test ligand below 1.5 Å. The run time of the algorithm on a single problem instance is a few minutes on a common-day workstation. The overall goal of this research is to drastically reduce run times, while limiting the inaccuracies of the model and the computation to a tolerable level.
Journal of Computer-aided Molecular Design | 2009
Andrea Zaliani; Krisztina Boda; Thomas Seidel; Achim Herwig; Christof H. Schwab; Johann Gasteiger; Holger Claußen; Christian Lemmen; Jörg Degen; Juri Pärn; Matthias Rarey
For computational de novo design, a general retrospective validation work is a very challenging task. Here we propose a comprehensive workflow to de novo design driven by the needs of computational and medicinal chemists and, at the same time, we propose a general validation scheme for this technique. The study was conducted combining a suite of already published programs developed within the framework of the NovoBench project, which involved three different pharmaceutical companies and four groups of developers. Based on 188 PDB protein–ligand complexes with diverse functions, the study involved the ligand reconstruction by means of a fragment-based de-novo design approach. The structure-based de novo search engine FlexNovo showed in five out of eight total cases the ability to reconstruct native ligands and to rank them in four cases out of five within the first five candidates. The generated structures were ranked according to their synthetic accessibilities evaluated by the program SYLVIA. This investigation showed that the final candidate molecules have about the same synthetic complexity as the respective reference ligands. Furthermore, the plausibility of being true actives was assessed through literature searches.
Current Drug Discovery Technologies | 2004
Holger Claussen; Marcus Gastreich; Volker Apelt; Jonathan Greene; Sally A. Hindle; Christian Lemmen
We present an integrated docking environment that allows for iterative and interactive detailed analysis of many docking solutions. All docking information is stored in an ORACLE database. New scoring schemes (e.g. target-specific scoring functions) as well as various types of filters can be easily defined and tested within this environment. As an example application we investigated the validity of the following hypothesis: If a docking procedure can lead to enrichments significantly better than random then a bias towards (partially) correct placements should be detectable. Such bias in terms of a preference for certain interacting groups within the active site can be used to select a set of receptor-based pharmacophore constraints, which in turn might be used to enhance the docking procedure. As a proof of concept for this approach we performed docking studies on three targets: thrombin, the cyclin-dependent kinase 2 (CDK2) and the angiotensin converting enzyme (ACE). We docked a set of known active compounds with standard FlexX and derived three sets of target-specific receptor-based pharmacophore constraints by statistical analysis of the predicted placements. Applying these receptor-based constraints in a virtual screening protocol utilizing FlexX-Pharm led to significantly improved enrichments.
Journal of Computer-aided Molecular Design | 1998
Christian Lemmen; Claus Hiller; Thomas Lengauer
If structural knowledge of a receptor under consideration is lacking, drug design approaches focus on similarity or dissimilarity analysis of putative ligands. In this context the mutual ligand superposition is of utmost importance. Methods that are rapid enough to facilitate interactive usage, that allow to process sets of conformers and that enable database screening are of special interest here. The ability to superpose molecular fragments instead of entire molecules has proven to be helpful too. The RigFit approach meets these requirements and has several additional advantages. In three distinct test applications, we evaluated how closely we can approximate the observed relative orientation for a set of known crystal structures, we employed RigFit as a fragment placement procedure, and we performed a fragment-based database screening. The run time of RigFit can be traded off against its accuracy. To be competitive in accuracy with another state-of-the-art alignment tool, with which we compare our method explicitly, computing times of about 6s per superposition on a common day workstation are required. If longer run times can be afforded the accuracy increases significantly. RigFit is part of the flexible superposition software FlexS which can be accessed on the WWW [http://cartan.gmd.de/FlexS].
Journal of Chemical Information and Computer Sciences | 2003
J. Kevin Lanctot; Santosh Putta; Christian Lemmen; Jonathan Greene
This paper introduces Signal, a novel method for classifying activity against a small molecule drug target. Signal creates an ensemble, or collection, of meaningful descriptors chosen from a much larger property space. The method works with a variety of descriptor types, including fingerprints that represent four-point pharmacophores or shape descriptors. It also exploits information from both active and inactive compounds and generates predictive models suitable for high throughput screening data analysis. Given the fingerprints and activity data for a set of compounds, Signal is a two step process. The first step is to Evaluate the Descriptors: for each descriptor in the fingerprint, quantify and rank the correlation between the activity of the compounds and the presence of that descriptor. The second step is to Create an Ensemble Model: use the high ranking descriptors to create a model of activity against the biological target. For the first step, two possible ranking strategies were investigated: mutual information and chi-square. For the second step, two types of ensemble models were investigated: high ranking and a novel method called high ranking set cover. Of the four possible pairings, the combination of chi-square and high ranking set cover performed the best on a Thrombin data set.
Perspectives in Drug Discovery and Design | 2000
Christian Lemmen; Marc Zimmermann; Thomas Lengauer
Molecular superpositioning is an important task in rational drug design. Usually it is the key step in a comparative analysis of molecules by 3D QSAR methods. Also it is helpful for the elucidation of a pharmacophore and crucial in the attempt to derive a receptor model. Generally speaking, molecular superpositioning can be seen as the analog of molecular docking if the receptor structure is not available, and direct methods are not applicable. Virtual database screening is the computational counterpart to modem experimental techniques like high throughput screening and assaying of combinatorial libraries. Both screening techniques have the common goal to detect active molecules in a large selection of compounds. Usually hundreds of thousands of candidates are to be tested, hence, time is the limiting factor and rapid processing of utmost importance. Descriptor-based methods that usually provide a simple linear encoding of the molecules meet the demands of computational speed and have been used predominantly for the task of virtual screening, for a long time. However, more powerful superposition methods have been developed during the past few years and now begin also to be applicable to screening large databases. Especially in combination with the faster methods, molecular superpositioning as the final step of a filtering protocol provides a power- ful tool for virtual database screening. The present work reports on our latest developments of molecular superpositioning techniques and assessing their applicability to virtual database screening.