Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where O. Anatole von Lilienfeld is active.

Publication


Featured researches published by O. Anatole von Lilienfeld.


Physical Review Letters | 2012

Fast and Accurate Modeling of Molecular Atomization Energies with Machine Learning

Matthias Rupp; Alexandre Tkatchenko; Klaus-Robert Müller; O. Anatole von Lilienfeld

We introduce a machine learning model to predict atomization energies of a diverse set of organic molecules, based on nuclear charges and atomic positions only. The problem of solving the molecular Schrödinger equation is mapped onto a nonlinear statistical regression problem of reduced complexity. Regression models are trained on and compared to atomization energies computed with hybrid density-functional theory. Cross validation over more than seven thousand organic molecules yields a mean absolute error of ∼10  kcal/mol. Applicability is demonstrated for the prediction of molecular atomization potential energy curves.


Journal of Chemical Theory and Computation | 2013

Assessment and Validation of Machine Learning Methods for Predicting Molecular Atomization Energies

Katja Hansen; Grégoire Montavon; Franziska Biegler; Siamac Fazli; Matthias Rupp; Matthias Scheffler; O. Anatole von Lilienfeld; Alexandre Tkatchenko; Klaus-Robert Müller

The accurate and reliable prediction of properties of molecules typically requires computationally intensive quantum-chemical calculations. Recently, machine learning techniques applied to ab initio calculations have been proposed as an efficient approach for describing the energies of molecules in their given ground-state structure throughout chemical compound space (Rupp et al. Phys. Rev. Lett. 2012, 108, 058301). In this paper we outline a number of established machine learning techniques and investigate the influence of the molecular representation on the methods performance. The best methods achieve prediction errors of 3 kcal/mol for the atomization energies of a wide variety of molecules. Rationales for this performance improvement are given together with pitfalls and challenges when applying machine learning approaches to the prediction of quantum-mechanical observables.


New Journal of Physics | 2013

Machine learning of molecular electronic properties in chemical compound space

Grégoire Montavon; Matthias Rupp; Vivekanand V. Gobre; Alvaro Vazquez-Mayagoitia; Katja Hansen; Alexandre Tkatchenko; Klaus-Robert Müller; O. Anatole von Lilienfeld

The combination of modern scientific computing with electronic structure theory can lead to an unprecedented amount of data amenable to intelligent data analysis for the identification of meaningful, novel and predictive structure?property relationships. Such relationships enable high-throughput screening for relevant properties in an exponentially growing pool of virtual compounds that are synthetically accessible. Here, we present a machine learning model, trained on a database of ab initio calculation results for thousands of organic molecules, that simultaneously predicts multiple electronic ground- and excited-state properties. The properties include atomization energy, polarizability, frontier orbital eigenvalues, ionization potential, electron affinity and excitation energies. The machine learning model is based on a deep multi-task artificial neural network, exploiting the underlying correlations between various molecular properties. The input is identical to ab initio methods, i.e.?nuclear charges and Cartesian coordinates of all atoms. For small organic molecules, the accuracy of such a ?quantum machine? is similar, and sometimes superior, to modern quantum-chemical methods?at negligible computational cost.


Proceedings of the National Academy of Sciences of the United States of America | 2012

Collective many-body van der Waals interactions in molecular systems

Robert A. DiStasio; O. Anatole von Lilienfeld; Alexandre Tkatchenko

Van der Waals (vdW) interactions are ubiquitous in molecules and condensed matter, and play a crucial role in determining the structure, stability, and function for a wide variety of systems. The accurate prediction of these interactions from first principles is a substantial challenge because they are inherently quantum mechanical phenomena that arise from correlations between many electrons within a given molecular system. We introduce an efficient method that accurately describes the nonadditive many-body vdW energy contributions arising from interactions that cannot be modeled by an effective pairwise approach, and demonstrate that such contributions can significantly exceed the energy of thermal fluctuations—a critical accuracy threshold highly coveted during molecular simulations—in the prediction of several relevant properties. Cases studied include the binding affinity of ellipticine, a DNA-intercalating anticancer agent, the relative energetics between the A- and B-conformations of DNA, and the thermodynamic stability among competing paracetamol molecular crystal polymorphs. Our findings suggest that inclusion of the many-body vdW energy is essential for achieving chemical accuracy and therefore must be accounted for in molecular simulations.


Journal of Physical Chemistry Letters | 2015

Machine Learning Predictions of Molecular Properties: Accurate Many-Body Potentials and Nonlocality in Chemical Space

Katja Hansen; Franziska Biegler; Raghunathan Ramakrishnan; Wiktor Pronobis; O. Anatole von Lilienfeld; Klaus-Robert Müller; Alexandre Tkatchenko

Simultaneously accurate and efficient prediction of molecular properties throughout chemical compound space is a critical ingredient toward rational compound design in chemical and pharmaceutical industries. Aiming toward this goal, we develop and apply a systematic hierarchy of efficient empirical methods to estimate atomization and total energies of molecules. These methods range from a simple sum over atoms, to addition of bond energies, to pairwise interatomic force fields, reaching to the more sophisticated machine learning approaches that are capable of describing collective interactions between many atoms or bonds. In the case of equilibrium molecular geometries, even simple pairwise force fields demonstrate prediction accuracy comparable to benchmark energies calculated using density functional theory with hybrid exchange-correlation functionals; however, accounting for the collective many-body interactions proves to be essential for approaching the “holy grail” of chemical accuracy of 1 kcal/mol for both equilibrium and out-of-equilibrium geometries. This remarkable accuracy is achieved by a vectorized representation of molecules (so-called Bag of Bonds model) that exhibits strong nonlocality in chemical space. In addition, the same representation allows us to predict accurate electronic properties of molecules, such as their polarizability and molecular frontier orbital energies.


Scientific Data | 2014

Quantum chemistry structures and properties of 134 kilo molecules

Raghunathan Ramakrishnan; Pavlo O. Dral; Matthias Rupp; O. Anatole von Lilienfeld

Computational de novo design of new drugs and materials requires rigorous and unbiased exploration of chemical compound space. However, large uncharted territories persist due to its size scaling combinatorially with molecular size. We report computed geometric, energetic, electronic, and thermodynamic properties for 134k stable small organic molecules made up of CHONF. These molecules correspond to the subset of all 133,885 species with up to nine heavy atoms (CONF) out of the GDB-17 chemical universe of 166 billion organic molecules. We report geometries minimal in energy, corresponding harmonic frequencies, dipole moments, polarizabilities, along with energies, enthalpies, and free energies of atomization. All properties were calculated at the B3LYP/6-31G(2df,p) level of quantum chemistry. Furthermore, for the predominant stoichiometry, C7H10O2, there are 6,095 constitutional isomers among the 134k molecules. We report energies, enthalpies, and free energies of atomization at the more accurate G4MP2 level of theory for all of them. As such, this data set provides quantum chemical properties for a relevant, consistent, and comprehensive chemical space of small organic molecules. This database may serve the benchmarking of existing methods, development of new methods, such as hybrid quantum mechanics/machine learning, and systematic identification of structure-property relationships.


Journal of Chemical Theory and Computation | 2015

Big Data Meets Quantum Chemistry Approximations: The Δ-Machine Learning Approach

Raghunathan Ramakrishnan; Pavlo O. Dral; Matthias Rupp; O. Anatole von Lilienfeld

Chemically accurate and comprehensive studies of the virtual space of all possible molecules are severely limited by the computational cost of quantum chemistry. We introduce a composite strategy that adds machine learning corrections to computationally inexpensive approximate legacy quantum methods. After training, highly accurate predictions of enthalpies, free energies, entropies, and electron correlation energies are possible, for significantly larger molecular sets than used for training. For thermochemical properties of up to 16k isomers of C7H10O2 we present numerical evidence that chemical accuracy can be reached. We also predict electron correlation energy in post Hartree-Fock methods, at the computational cost of Hartree-Fock, and we establish a qualitative relationship between molecular entropy and electron correlation. The transferability of our approach is demonstrated, using semiempirical quantum chemistry and machine learning models trained on 1 and 10% of 134k organic molecules, to reproduce enthalpies of all remaining molecules at density functional theory level of accuracy.


Journal of Chemical Physics | 2005

Variational optimization of effective atom centered potentials for molecular properties

O. Anatole von Lilienfeld; Ivano Tavernelli; Ursula Rothlisberger; Daniel Sebastiani

In plane wave based electronic structure calculations the interaction of core and valence electrons is usually represented by atomic effective core potentials. They are constructed in such a way that the shape of the atomic valence orbitals outside a certain core radius is reproduced correctly with respect to the corresponding all-electron calculations. Here we present a method which, in conjunction with density functional perturbation theory, allows to optimize effective core potentials in order to reproduce ground-state molecular properties from arbitrarily accurate reference calculations within standard density functional calculations. We demonstrate the wide range of possible applications in theoretical chemistry of such optimized effective core potentials (OECPs) by means of two examples. We first use OECPs to tackle the link atom problem in quantum mechanics/molecular mechanics (QM/MM) schemes proposing a fully automatized procedure for the design of link OECPs, which are designed in such a way that they minimally perturb the electronic structure in the QM region. In the second application, we use OECPs in two sample molecules (water and acetic acid) such as to reproduce electronic densities and derived molecular properties of hybrid (B3LYP) quality within general gradient approximated (BLYP) density functional calculations.


Journal of Chemical Physics | 2006

Molecular grand-canonical ensemble density functional theory and exploration of chemical space

O. Anatole von Lilienfeld; Mark E. Tuckerman

We present a rigorous description of chemical space within a molecular grand-canonical ensemble multi-component density functional theory framework. A total energy density functional for chemical compounds in contact with an electron and a proton bath is introduced using Lagrange multipliers which correspond to the energetic response to changes of the elementary particle densities. From a generalized Gibbs-Duhem equation analog, reactivity indices such as the nuclear hardness and a molecular Fukui function, which couples the grand-canonical electronic and nuclear degrees of freedom, are obtained. Maxwell relations between composition particles, ionic displacements, and the external potential are discussed. Numerical results for the molecular Fukui function are presented as well as finite temperature estimates for the oxidation of ammonia.


Journal of Chemical Theory and Computation | 2017

Prediction Errors of Molecular Machine Learning Models Lower than Hybrid DFT Error

Felix A. Faber; Luke Hutchison; Bing Huang; Justin Gilmer; Samuel S. Schoenholz; George E. Dahl; Oriol Vinyals; Steven Kearnes; Patrick F. Riley; O. Anatole von Lilienfeld

We investigate the impact of choosing regressors and molecular representations for the construction of fast machine learning (ML) models of 13 electronic ground-state properties of organic molecules. The performance of each regressor/representation/property combination is assessed using learning curves which report out-of-sample errors as a function of training set size with up to ∼118k distinct molecules. Molecular structures and properties at the hybrid density functional theory (DFT) level of theory come from the QM9 database [ Ramakrishnan et al. Sci. Data 2014 , 1 , 140022 ] and include enthalpies and free energies of atomization, HOMO/LUMO energies and gap, dipole moment, polarizability, zero point vibrational energy, heat capacity, and the highest fundamental vibrational frequency. Various molecular representations have been studied (Coulomb matrix, bag of bonds, BAML and ECFP4, molecular graphs (MG)), as well as newly developed distribution based variants including histograms of distances (HD), angles (HDA/MARAD), and dihedrals (HDAD). Regressors include linear models (Bayesian ridge regression (BR) and linear regression with elastic net regularization (EN)), random forest (RF), kernel ridge regression (KRR), and two types of neural networks, graph convolutions (GC) and gated graph networks (GG). Out-of sample errors are strongly dependent on the choice of representation and regressor and molecular property. Electronic properties are typically best accounted for by MG and GC, while energetic properties are better described by HDAD and KRR. The specific combinations with the lowest out-of-sample errors in the ∼118k training set size limit are (free) energies and enthalpies of atomization (HDAD/KRR), HOMO/LUMO eigenvalue and gap (MG/GC), dipole moment (MG/GC), static polarizability (MG/GG), zero point vibrational energy (HDAD/KRR), heat capacity at room temperature (HDAD/KRR), and highest fundamental vibrational frequency (BAML/RF). We present numerical evidence that ML model predictions deviate from DFT (B3LYP) less than DFT (B3LYP) deviates from experiment for all properties. Furthermore, out-of-sample prediction errors with respect to hybrid DFT reference are on par with, or close to, chemical accuracy. The results suggest that ML models could be more accurate than hybrid DFT if explicitly electron correlated quantum (or experimental) data were available.We investigate the impact of choosing regressors and molecular representations for the construction of fast machine learning (ML) models of thirteen electronic ground-state properties of organic molecules. The performance of each regressor/representation/property combination is assessed using learning curves which report out-of-sample errors as a function of training set size with up to

Collaboration


Dive into the O. Anatole von Lilienfeld's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Klaus-Robert Müller

Technical University of Berlin

View shared research outputs
Top Co-Authors

Avatar

Angelos Michaelides

London Centre for Nanotechnology

View shared research outputs
Top Co-Authors

Avatar

Yasmine S. Al-Hamdani

London Centre for Nanotechnology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge