pyGACE: Combining the genetic algorithm and cluster expansion methods to predict the ground-state structure of systems containing point defects
Graphical abstract
Introduction
Point defects occur naturally in materials and affect the properties of the materials in many aspects [1], [2], including the mechanical [3], [4], [5] transport[6], [7], [8], electronic[9], [10], [11], optoelectronic [12], [13], and thermoelectric properties[14], [15], [16]. Explicit configurations of the point defects at relatively high concentration can significantly change the electronic property of the materials, as in the case of the oxides employed in resistance random access memory (RRAM) [17], [18] for information storage: clustering of the oxygen vacancy leading to the low resistance represents the signal ‘1’, while the uniform distribution of the vacancies inducing the high resistance represents the signal ’0’. Unfortunately, locating the ground state structure of the point-defects-containing materials remains a challenge, especially when the concentration of the point defects is high, due to the numerous atomic configurations that can be formed and the huge cost to evaluate their energies accurately with density functional theory (DFT).
In the past decades, extensive efforts have been put on the efficient searching of the most stable crystalline structures given the compositions of the systems, and two strategies are mainly focused so far. The first one is enhancing the sampling of the phase space and reducing the candidate structures for the density functional theory calculations, by using the genetic algorithms[19], [20], [21], [22], evolutionary algorithm [23], [24], [25], particle swarm optimization [26], random sampling [27], [28], deep learning[29], etc. These methods work very well for the unit cell design given the composition of the materials, while when the number of the atoms in the unit cell is large, the high computation cost for the DFT is inevitable. The other strategy is using empirical potential to replace DFT or to prescreen the ground-state candidates for DFT [20]. However, empirical potentials comparable to the accuracy of DFT calculations for extensive materials are still lacking, although recently the machine-learning potentials based on a quite large data set of DFT calculations shed some light on that [30].
Apart from the structure prediction for the materials from ’scratch’ which uses the composition of the materials as the only input as mentioned above, in certain circumstances, searching the stable atomic structures given a rigid lattice is also demanding. These ground-state searching methods with confined phase space are of great importance in engineering-alloy and semiconductor industry, where additional elements or defects are introduced to the parent materials to improve their performance [31], [32], [33], [34], [35]. In principle, figuring out the structures of the alloyed/doped systems with a confined supercell should be easier than structure prediction from scratch where no ‘template’ exist, while the challenges are the expensive energy calculation since for the alloyed/doped systems a larger supercell is required to avoid the periodic interactions between the defects with their images. So far, the efficient techniques designed for the ground-state searching of the relatively large defect-containing systems are seldom reported in the literature. The worth mentioning technique dealing with this issue is the cluster expansion method. Cluster expansion (CE) method, first proposed by Sanchez et al. decades ago [36], [37], describe the energy of the multicomponent systems in terms of the orthogonal discrete Chebyshev’s polynomials, in which the parameters, i.e., the coefficients for the effective cluster interaction (ECI) among clusters formed by two or three atoms, can be obtained by fitting moderate DFT energies. With reliable ECI, energies of any structures can be estimated at a meager cost; then the ground state structures can be localized after enumerating all the possible configurations. CE method coupling with DFT calculations has been integrated in a few codes [38], [39], [40], and widely used in the ground-state searching of the alloying systems (i.e., substitutional defects) [41], [42], [43], [44] or in the thermodynamic calculations in conjunction with Monte Carlo simulation [45]. The key step for the CE method is fitting reliable ECI parameters using DFT calculations. However, for a large supercell, even if its shape is constrained, the fitting process will be computational demanding. Therefore, when the large supercell is involved in the ground-state searching process, both the efficient sampling and efficient calculations are needed.
In the present work, to handle the ground-state search of a relatively large system, we have employed a theoretical framework by combing the genetic algorithm for enhanced sampling of the configuration space and the CE method coupling with DFT for efficient and accurate energy evaluation. Particularly, to obtain the more effective clusters, we use the previously found ground-state structures in GA to re-calculate a new set of clusters in CE, which will be further employed to predicate new structures. The loop will continue until the ground-state candidates do not change anymore compared to the previous step, which guarantees that one can obtain a precise cluster and reasonable ground-state configurations. It is worth mentioned that CE was once connected to GA [46], while we do not see extensive parameter exchange (as using the iteration) between the GA and CE in the previous work [46]. Here the efficiency and accuracy of our framework are validated in two examples, namely the oxygen-vacancies configuration in HfO2−x and the substitutional-defect structures in SrTi1−xNbxO3.
Section snippets
Methodology
In the following parts, the theory, method, and technique used in the present work will be introduced.
Applications
In this section, we pick two distinct examples to show the efficiency of pyGACE. One example is HfO2−x with oxygen vacancies, and the other is SrTi1−xNbxO3 with Nb substituting Ti. The supercell structures of the two systems are depicted in Fig. 2.
Conclusions
In this work, we implemented a python framework ’pyGACE’ which combines genetic algorithm (GA) and cluster expansion (CE) method, to enhance the structures sampling and improve the energy-evaluation efficiency at the same time in the ground-state searching of the defect-containing materials. In pyGACE, CE part is normally dealing with smaller cell and generate a rational ECI which is used as a fitness function in GA to handle larger supercell. More stable structures are found can be found via a
Acknowledgements
This work is financially supported by the National Key Research and Development Program of China (Grant No. 2017YFB0701700) and the National Natural Science Foundation of China (No. 51871009).
References (68)
- et al.
Investigation on electronic structures and mechanical properties of nb-doped tial2 intermetallic compound
J. Alloys Compd.
(2019) - et al.
The damping and mechanical properties of magnesium alloys balanced by aluminum addition
J. Alloys Compd.
(2019) - et al.
Defects controlled hole doping and multivalley transport in SNSE single crystals
Nat. Commun.
(2018) - et al.
Highly anisotropic thermoelectric transport properties responsible for enhanced thermoelectric performance in the hot-deformed tetradymite bi2te2s
J. Alloys Compd.
(2019) - et al.
Point defect of titanium sesquioxide Ti2O3 as the application of next generation Li-ion batteries
J. Alloys Compd.
(2019) - et al.
Optimizing electrospinning-hydrothermal hybrid process based on taguchi method for modulation of point defects in zno micro/nano arrays towards photoelectronic application
J. Alloys Compd.
(2019) - et al.
Reactive spark plasma sintering and thermoelectric properties of nd-substituted bicuseo oxyselenides
J. Alloys Compd.
(2019) - et al.
Xtalopt: an open-source evolutionary algorithm for crystal structure prediction
Comput. Phys. Commun.
(2011) - et al.
An implementation of artificial neural-network potentials for atomistic materials simulations: performance for TiO2
Comput. Mater. Sci.
(2016) - et al.
Mapping the relationship among composition, stacking fault energy and ductility in nb alloys: a first-principles study
Acta Mater.
(2018)
Generalized cluster description of multicomponent systems
Phys. A
The alloy theoretic automated toolkit: a user guide
Calphad
A design optimization tool based on a genetic algorithm
Autom. Constr.
Multicomponent multisublattice alloys, nonconfigurational entropy and other additions to the Alloy Theoretic Automated Toolkit
Calphad
Development of hafnium based high-k materials-a review
Mater. Sci. Eng. R Rep.
An overview of materials issues in resistive random access memory
J. Materiomics
Electronic structure of strongly reduced (11) surface of monoclinic HfO2
Appl. Surf. Sci.
First principles phase diagram calculations for the octahedral-interstitial system HfOx, 0⩽X⩽1/2
Calphad
Dopants and defects in semiconductors
Effect of point and line defects on mechanical and thermal properties of graphene: a review
Crit. Rev. Sol. State Mater. Sci.
Atomic-scale modeling of the dynamics of titanium oxidation
J. Phys. Chem. C
Springer handbook of electronic and photonic materials
Synergistic resistive switching mechanism of oxygen vacancies and metal interstitials in Ta2O5
J. Phys. Chem. C
Point defect engineering in thin-film solar cells
Nat. Rev. Mater.
Interstitial point defect scattering contributing to high thermoelectric performance in SnTe
Adv. Electron. Mater.
Enhanced thermoelectric performance of In2O3-based ceramics via nanostructuring and point defect engineering
Sci. Rep.
Nanoionics-based resistive switching memories
Nat. Mater.
Realization of a reversible switching in TaO2 polymorphs via peierls distortion for resistance random access memory
Appl. Phys. Lett.
First-principle-based computational doping of SrTiO3 using combinatorial genetic algorithms
Bull. Mater. Sci.
A genetic algorithm for predicting the structures of interfaces in multicomponent systems
Nat. Mater.
A periodic genetic algorithm with real-space representation for crystal structure and polymorph prediction
Phys. Rev. B
Metastable high-pressure single-bonded phases of nitrogen predicted via genetic algorithm
Phys. Rev. B
Crystal structure prediction using ab initio evolutionary techniques: principles and applications
J. Chem. Phys.
Cited by (5)
Influence of nitrogen interstitials in α-titanium and nitrogen vacancies in δ-titanium nitride on lattice parameters and bulk modulus - computational study
2022, Computational Materials ScienceCitation Excerpt :However, as the large number of N vacancies arrangements may exist, it can be computationally difficult to scan energies over all possible arrangements, even when including symmetries due to the periodicity of the supercell. To overcome this problem, various global optimization methods such as genetic algorithm (GA) [17], particle swarm Optimization, or Random Sampling had been developed to find the ground or metastable state of the system without scanning all the arrangements [18]. In this work, we chose genetic algorithm (in-house build script in PERL language) as the way to find the distribution of defects in supercells minimizing their total energies.
Defect modeling and control in structurally and compositionally complex materials
2023, Nature Computational SciencePerspective on optimal strategies of building cluster expansion models for configurationally disordered materials
2022, Journal of Chemical PhysicsEvolutionary computing and machine learning for discovering of low-energy defect configurations
2021, npj Computational Materials