pyGACE: Combining the genetic algorithm and cluster expansion methods to predict the ground-state structure of systems containing point defects

https://doi.org/10.1016/j.commatsci.2019.109482Get rights and content

Abstract

Searching the most stable atomic-structure of a solid with point defects (including the extrinsic alloying/doping elements), is one of the central issues in materials science. Both adequate sampling of the configuration space and the accurate energy evaluation at relatively low cost are demanding for the structure prediction. In this work, we have employed a framework combining genetic algorithm, cluster expansion (CE) method and first-principles calculations, which can effectively locate the ground-state or meta-stable states of the relatively large/complex systems. We employ this framework to search the stable structures of two distinct systems, i.e., oxygen-vacancy-containing HfO2−x and the Nb-doped SrTi1−xNbxO3, and more stable structures are found compared with the structures available in the literature. The present framework can be applied to the ground-state search of extensive alloyed/doped materials, which is particularly significant for the design of advanced engineering alloys and semiconductors.

Introduction

Point defects occur naturally in materials and affect the properties of the materials in many aspects [1], [2], including the mechanical [3], [4], [5] transport[6], [7], [8], electronic[9], [10], [11], optoelectronic [12], [13], and thermoelectric properties[14], [15], [16]. Explicit configurations of the point defects at relatively high concentration can significantly change the electronic property of the materials, as in the case of the oxides employed in resistance random access memory (RRAM) [17], [18] for information storage: clustering of the oxygen vacancy leading to the low resistance represents the signal ‘1’, while the uniform distribution of the vacancies inducing the high resistance represents the signal ’0’. Unfortunately, locating the ground state structure of the point-defects-containing materials remains a challenge, especially when the concentration of the point defects is high, due to the numerous atomic configurations that can be formed and the huge cost to evaluate their energies accurately with density functional theory (DFT).

In the past decades, extensive efforts have been put on the efficient searching of the most stable crystalline structures given the compositions of the systems, and two strategies are mainly focused so far. The first one is enhancing the sampling of the phase space and reducing the candidate structures for the density functional theory calculations, by using the genetic algorithms[19], [20], [21], [22], evolutionary algorithm [23], [24], [25], particle swarm optimization [26], random sampling [27], [28], deep learning[29], etc. These methods work very well for the unit cell design given the composition of the materials, while when the number of the atoms in the unit cell is large, the high computation cost for the DFT is inevitable. The other strategy is using empirical potential to replace DFT or to prescreen the ground-state candidates for DFT [20]. However, empirical potentials comparable to the accuracy of DFT calculations for extensive materials are still lacking, although recently the machine-learning potentials based on a quite large data set of DFT calculations shed some light on that [30].

Apart from the structure prediction for the materials from ’scratch’ which uses the composition of the materials as the only input as mentioned above, in certain circumstances, searching the stable atomic structures given a rigid lattice is also demanding. These ground-state searching methods with confined phase space are of great importance in engineering-alloy and semiconductor industry, where additional elements or defects are introduced to the parent materials to improve their performance [31], [32], [33], [34], [35]. In principle, figuring out the structures of the alloyed/doped systems with a confined supercell should be easier than structure prediction from scratch where no ‘template’ exist, while the challenges are the expensive energy calculation since for the alloyed/doped systems a larger supercell is required to avoid the periodic interactions between the defects with their images. So far, the efficient techniques designed for the ground-state searching of the relatively large defect-containing systems are seldom reported in the literature. The worth mentioning technique dealing with this issue is the cluster expansion method. Cluster expansion (CE) method, first proposed by Sanchez et al. decades ago [36], [37], describe the energy of the multicomponent systems in terms of the orthogonal discrete Chebyshev’s polynomials, in which the parameters, i.e., the coefficients for the effective cluster interaction (ECI) among clusters formed by two or three atoms, can be obtained by fitting moderate DFT energies. With reliable ECI, energies of any structures can be estimated at a meager cost; then the ground state structures can be localized after enumerating all the possible configurations. CE method coupling with DFT calculations has been integrated in a few codes [38], [39], [40], and widely used in the ground-state searching of the alloying systems (i.e., substitutional defects) [41], [42], [43], [44] or in the thermodynamic calculations in conjunction with Monte Carlo simulation [45]. The key step for the CE method is fitting reliable ECI parameters using DFT calculations. However, for a large supercell, even if its shape is constrained, the fitting process will be computational demanding. Therefore, when the large supercell is involved in the ground-state searching process, both the efficient sampling and efficient calculations are needed.

In the present work, to handle the ground-state search of a relatively large system, we have employed a theoretical framework by combing the genetic algorithm for enhanced sampling of the configuration space and the CE method coupling with DFT for efficient and accurate energy evaluation. Particularly, to obtain the more effective clusters, we use the previously found ground-state structures in GA to re-calculate a new set of clusters in CE, which will be further employed to predicate new structures. The loop will continue until the ground-state candidates do not change anymore compared to the previous step, which guarantees that one can obtain a precise cluster and reasonable ground-state configurations. It is worth mentioned that CE was once connected to GA [46], while we do not see extensive parameter exchange (as using the iteration) between the GA and CE in the previous work [46]. Here the efficiency and accuracy of our framework are validated in two examples, namely the oxygen-vacancies configuration in HfO2−x and the substitutional-defect structures in SrTi1−xNbxO3.

Section snippets

Methodology

In the following parts, the theory, method, and technique used in the present work will be introduced.

Applications

In this section, we pick two distinct examples to show the efficiency of pyGACE. One example is HfO2−x with oxygen vacancies, and the other is SrTi1−xNbxO3 with Nb substituting Ti. The supercell structures of the two systems are depicted in Fig. 2.

Conclusions

In this work, we implemented a python framework ’pyGACE’ which combines genetic algorithm (GA) and cluster expansion (CE) method, to enhance the structures sampling and improve the energy-evaluation efficiency at the same time in the ground-state searching of the defect-containing materials. In pyGACE, CE part is normally dealing with smaller cell and generate a rational ECI which is used as a fitness function in GA to handle larger supercell. More stable structures are found can be found via a

Acknowledgements

This work is financially supported by the National Key Research and Development Program of China (Grant No. 2017YFB0701700) and the National Natural Science Foundation of China (No. 51871009).

References (68)

  • J. Sanchez et al.

    Generalized cluster description of multicomponent systems

    Phys. A

    (1984)
  • A. van de Walle et al.

    The alloy theoretic automated toolkit: a user guide

    Calphad

    (2002)
  • L.G. Caldas et al.

    A design optimization tool based on a genetic algorithm

    Autom. Constr.

    (2002)
  • A. van de Walle

    Multicomponent multisublattice alloys, nonconfigurational entropy and other additions to the Alloy Theoretic Automated Toolkit

    Calphad

    (2009)
  • J. Choi et al.

    Development of hafnium based high-k materials-a review

    Mater. Sci. Eng. R Rep.

    (2011)
  • L. Zhu et al.

    An overview of materials issues in resistive random access memory

    J. Materiomics

    (2015)
  • Y. Cheng et al.

    Electronic structure of strongly reduced (111) surface of monoclinic HfO2

    Appl. Surf. Sci.

    (2018)
  • B.P. Burton et al.

    First principles phase diagram calculations for the octahedral-interstitial system HfOx, 0⩽X⩽1/2

    Calphad

    (2012)
  • J.-M. Spaeth et al.
    (2013)
  • M.D. McCluskey et al.

    Dopants and defects in semiconductors

    (2012)
  • G. Rajasekaran et al.

    Effect of point and line defects on mechanical and thermal properties of graphene: a review

    Crit. Rev. Sol. State Mater. Sci.

    (2016)
  • L. Zhu et al.

    Atomic-scale modeling of the dynamics of titanium oxidation

    J. Phys. Chem. C

    (2012)
  • S. Kasap et al.

    Springer handbook of electronic and photonic materials

    (2017)
  • L. Zhu et al.

    Synergistic resistive switching mechanism of oxygen vacancies and metal interstitials in Ta2O5

    J. Phys. Chem. C

    (2016)
  • J.S. Park et al.

    Point defect engineering in thin-film solar cells

    Nat. Rev. Mater.

    (2018)
  • Y. Pei et al.

    Interstitial point defect scattering contributing to high thermoelectric performance in SnTe

    Adv. Electron. Mater.

    (2016)
  • J.-L. Lan et al.

    Enhanced thermoelectric performance of In2O3-based ceramics via nanostructuring and point defect engineering

    Sci. Rep.

    (2015)
  • R. Waser et al.

    Nanoionics-based resistive switching memories

    Nat. Mater.

    (2007)
  • L. Zhu et al.

    Realization of a reversible switching in TaO2 polymorphs via peierls distortion for resistance random access memory

    Appl. Phys. Lett.

    (2015)
  • E. Atilgan et al.

    First-principle-based computational doping of SrTiO3 using combinatorial genetic algorithms

    Bull. Mater. Sci.

    (2018)
  • A.L.-S. Chua et al.

    A genetic algorithm for predicting the structures of interfaces in multicomponent systems

    Nat. Mater.

    (2010)
  • N.L. Abraham et al.

    A periodic genetic algorithm with real-space representation for crystal structure and polymorph prediction

    Phys. Rev. B

    (2006)
  • Y. Yao et al.

    Metastable high-pressure single-bonded phases of nitrogen predicted via genetic algorithm

    Phys. Rev. B

    (2008)
  • A.R. Oganov et al.

    Crystal structure prediction using ab initio evolutionary techniques: principles and applications

    J. Chem. Phys.

    (2006)
  • Cited by (5)

    • Influence of nitrogen interstitials in α-titanium and nitrogen vacancies in δ-titanium nitride on lattice parameters and bulk modulus - computational study

      2022, Computational Materials Science
      Citation Excerpt :

      However, as the large number of N vacancies arrangements may exist, it can be computationally difficult to scan energies over all possible arrangements, even when including symmetries due to the periodicity of the supercell. To overcome this problem, various global optimization methods such as genetic algorithm (GA) [17], particle swarm Optimization, or Random Sampling had been developed to find the ground or metastable state of the system without scanning all the arrangements [18]. In this work, we chose genetic algorithm (in-house build script in PERL language) as the way to find the distribution of defects in supercells minimizing their total energies.

    View full text