Exploring the necessary complexity of interatomic potentials

https://doi.org/10.1016/j.commatsci.2021.110752Get rights and content

Highlights

  • Spline-based potentials provide a framework for a systematically-improvable model.

  • These potentials can achieve near-DFT accuracies with classical potential speeds.

  • The Pareto front is dominated by spline MEAM and moment tensor potentials.

  • Custom data structures make fitting spline-based potentials fast and scalable.

Abstract

The application of machine learning models and algorithms towards describing atomic interactions has been a major area of interest in materials simulations in recent years, as machine learning interatomic potentials (MLIPs) are seen as being more flexible and accurate than their classical potential counterparts. This increase in accuracy of MLIPs over classical potentials has come at the cost of significantly increased complexity, leading to higher computational costs and lower physical interpretability and spurring research into improving the speeds and interpretability of MLIPs. As an alternative, in this work we leverage “machine learning” fitting databases and advanced optimization algorithms to fit a class of spline-based classical potentials, showing that they can be systematically improved in order to achieve accuracies comparable to those of low-complexity MLIPs. These results demonstrate that high model complexities may not be strictly necessary in order to achieve near-DFT accuracy in interatomic potentials and suggest an alternative route towards sampling the high accuracy, low complexity region of model space by starting with forms that promote simpler and more interpretable interatomic potentials.

Introduction

For nearly a century of designing interatomic potentials for use in materials simulations, a strong emphasis was placed on constructing classical potentials with physically-motivated forms derived from quantum mechanical theories [1], [2], [3], [4], [5], [6], [7], [8], [9]. More recently, with the enormous success of machine learning in various fields, machine learning interatomic potentials (MLIPs) have come to dominate the attention of the computational materials science community (an incomplete list: [10], [11], [12], [13], [14], [15], [16], [17], [18], [19], [20]). MLIPs have been shown to be able to predict energies and forces on diverse ranges of atomic configurations with unprecedented accuracy, which in combination with active research into improving their speed and interpretability [21], [22], [23] makes them good candidates for being fast and accurate general-purpose models of atomic interactions.

Much of the increased performance of MLIPs over classical potentials comes from the flexibility in their functional forms that allows them to be systematically extended in order to be able to model increasingly diverse sets of atomic environments. However, pushing MLIPs to this limit of high accuracy and generalizability often results in highly complex models that are computationally expensive to use and conceptually difficult to interpret. Classical potentials, on the other hand, are by construction much simpler to interpret due to their basis in known physics and usually have much lower computational costs than even the simplest MLIPs. These two forms thus are generally used in opposing regions of “model space”: classical potentials in the low complexity, low accuracy region; MLIPs in the high complexity, high accuracy region.

Sampling the space between these two regions (low complexity, high accuracy) is a fundamental goal of computational materials science, and in recent years has most often been approached by attempting to decrease the complexity of MLIPs. In this work we show that a family of spline-based classical potentials, when leveraging traditional “machine learning” databases and fitting algorithms, can be systematically improved in order to achieve accuracies on existing benchmark databases that push them into the low complexity, high accuracy region of model space alongside low-complexity MLIPs. Despite the lower interpretability of spline-based potentials relative to classical potentials with explicit analytical definitions, our results show that spline-based classical potentials offer a good balance between speed, interpretability, and accuracy. These results suggest that spline-based classical potentials may be good options alongside low-complexity MLIPs for designing practical and computationally tractable general-purpose potentials.

Section snippets

Machine learning interatomic potentials

For this work, the most important distinctions between MLIPs and classical potentials are in their functional forms and in how they describe local atomic environments. The main advantage of MLIPs is that their functional forms are more easily extended to account for different environments by increasing their degrees of freedom, and that they are able to leverage more advanced descriptors of local atomic environments. The commonly-assumed implications of these differences are that (1) their

Fitting databases

In order to ensure a fair comparison, we train all models on the databases produced in [24], which were specifically designed to encompass a large variety of atomic environments. In total, we explored six different elements (three crystal systems): Ni and Cu (FCC), Li and Mo (BCC), and Si and Ge (diamond). Each database contains the ground state structure for the given element, strained supercells, slab structures, ab initio molecular dynamics (AIMD) sampling of supercells at different

Fitting procedure

The following results were obtained by fitting s-MEAM potentials to a set of six benchmark databases published by [24] for Ni, Cu, Li, Mo, Si, and Ge. The databases were designed to cover a broad collection of atomic environments for each element, including ground state structures, strained configurations, surfaces, liquids, vacancies, and molecular dynamics snapshots. The potentials were fitted with software developed by the authors of this paper using the Covariance Matrix Adaptation

Conclusion

We demonstrate that a family of spline-based classical potentials offer a viable route towards sampling the low complexity, high accuracy region of model space while promoting physical interpretability, making them good candidates to be general-purpose potentials alongside existing low complexity MLIPs. In related recent work [52], an example of an MLIP modeled after the embedded-atom method also achieved high accuracy on the databases used in this paper, highlighting some of the efforts of

Data and Code Availability

The databases of atomic configurations were made available on Github [35] by the authors of [24], who originally created them. Fitted s-MEAM models from our work, and software used for fitting the s-MEAM potentials, are available on Github [41].

CRediT authorship contribution statement

Joshua A. Vita: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing - original draft, Writing - review & editing. Dallas R. Trinkle: Conceptualization, Funding acquisition, Project administration, Resources, Supervision, Writing - review & editing.

Declaration of competing interest

The authors declare no competing financial or non-financial interests.

Acknowledgements

The authors thank Michael R. Fellinger for useful comments and discussions, and Yunxing Zuo for providing the data from [24] and helping to verify their results. This research was supported by the National Science Foundation through awards NSF/BD-SPOKE-1636929, NSF/NRT-1922758, and NSF/HDR-1940303. This research is part of the Blue Waters sustained-petascale computing project, which is supported by the National Science Foundation (awards OCI-0725070 and ACI-1238993) and the state of Illinois.

References (56)

  • M.I. Baskes

    Modified embedded-atom potentials for cubic materials and impurities

    Phys. Rev. B

    (1992)
  • J. Tersoff

    New empirical model for the structural properties of silicon

    Phys. Rev. Lett.

    (1986)
  • Frank H. Stillinger et al.

    Computer simulation of local order in condensed phases of silicon

    Phys. Rev. B

    (1985)
  • Adri C.T. van Duin et al.

    ReaxFF: a reactive force field for hydrocarbons

    J. Phys. Chem. A

    (2001)
  • Donald W Brenner, Olga A Shenderova, Judith A Harrison, Steven J Stuart, Boris Ni, and Susan B Sinnott. A...
  • Tzu-Ray Shan, Bryce D. Devine, Travis W. Kemper, Susan B. Sinnott, and Simon R. Phillpot. Charge-optimized many-body...
  • Jörg Behler et al.

    Generalized neural-network representation of high-dimensional potential-energy surfaces

    Phys. Rev. Lett.

    (2007)
  • Albert P. Bartók et al.

    Gaussian approximation potentials: The accuracy of quantum mechanics, without the electrons

    Phys. Rev. Lett.

    (2010)
  • V. Botu et al.

    Learning scheme to predict atomic forces and accelerate materials simulations

    Phys. Rev. B

    (2015)
  • Alexander V. Shapeev

    Moment tensor potentials: A class of systematically improvable interatomic potentials

    Multiscale Model. Simul.

    (2016)
  • K.T. Schütt et al.

    SchNet – a deep learning architecture for molecules and materials

    J. Chem. Phys.

    (2018)
  • Mitchell A. Wood et al.

    Extending the accuracy of the SNAP interatomic potential form

    J. Chem. Phys.

    (2018)
  • Yaolong Zhang et al.

    Embedded atom neural network potentials: Efficient and accurate machine learning with a physically inspired representation

    J. Phys. Chem. Lett.

    (2019)
  • G.P. Purja Pun et al.

    Physically informed artificial neural networks for atomistic modeling of materials

    Nat. Commun.

    (2019)
  • Rahulkumar Gayatri, Stan Moore, Evan Weinberg, Nicholas Lubbers, Sarah Anderson, Jack Deslippe, Danny Perez, and Aidan...
  • Yu Xie, Jonathan Vandermause, Lixin Sun, Andrea Cepellotti, and Boris Kozinsky. Bayesian force fields from active...
  • Yunxing Zuo, Chi Chen, Xiangguo Li, Zhi Deng, Yiming Chen, Jörg Behler, Gábor Csányi, Alexander V. Shapeev, Aidan P....
  • Jörg Behler

    Atom-centered symmetry functions for constructing high-dimensional neural network potentials

    J. Chem. Phys.

    (2011)
  • Cited by (15)

    View all citing articles on Scopus
    View full text