Reverse graph self-attention for target-directed atomic importance estimation

doi:10.1016/j.neunet.2020.09.022

Neural Networks

Volume 133, January 2021, Pages 1-10

https://doi.org/10.1016/j.neunet.2020.09.022 Get rights and content

Abstract

Estimating the importance of each atom in a molecule is one of the most appealing and challenging problems in chemistry, physics, and materials science. The most common way to estimate the atomic importance is to compute the electronic structure using density functional theory (DFT), and then to interpret it using domain knowledge of human experts. However, this conventional approach is impractical to the large molecular database because DFT calculation requires large computation, specifically, $O (n^{4})$ time complexity w.r.t. the number of electronic basis functions. Furthermore, the calculation results should be manually interpreted by human experts to estimate the atomic importance in terms of the target molecular property. To tackle this problem, we first exploit the machine learning-based approach for the atomic importance estimation based on the reverse self-attention on graph neural networks and integrating it with graph-based molecular description. Our method provides an efficiently-automated and target-directed way to estimate the atomic importance without any domain knowledge of chemistry and physics.

Introduction

In molecules, each atom has its own contributions in manifesting the entire molecular properties, and estimating such atomic importance plays a key role in interpreting molecular systems. For these reasons, the atomic importance estimation has been consistently studied in the scientific communities (Pan et al., 2018, Tang et al., 2016, Yen and Winefordner, 1976). However, estimating the atomic importance is one of the most challenging tasks in chemistry and physics because the importance of each atom is comprehensively determined based on atomic properties, neighbor atoms, bonding types, target molecular property, and whole structure of the molecule.

The most common approach for estimating the atomic importance in physics and chemistry is to interpret the electronic structure using density functional theory (DFT) (Sholl & Steckel, 2009). In this approach, the atomic importance is estimated through three steps: (1) A human expert selects appropriate functional and basis sets for a given molecule to apply DFT; (2) The electronic structure of the molecule is calculated based on DFT calculation; (3) The human expert estimates the atomic importance by interpreting the calculated electronic structure in terms of target molecular property. Although some methods are proposed to estimate relative contributions of atoms in molecules, their generality is typically limited to certain molecular properties (Glendening et al., 2019, Marenich et al., 2012). For this reason, DFT has been most widely used to interpret molecular systems and to reveal the important atoms according to target molecular property because it can generate a universal description of the molecular systems (Chibani et al., 2018, Crimme et al., 2010, Geerlings et al., 2003, Lee et al., 2018b).

However, the conventional approach based on DFT has three fundamental limitations in efficiency, automation, and generality.

•
Efficiency: As an example of the electronic structure computations, DFT calculation requires $O (n^{4})$ time complexity to compute the electronic structure, where $n$ is the number of basis functions that describe electrons in the molecule (Jensen, 2017). In general, molecules have more electrons than atoms, and thus DFT calculation requires large computation.
•
Automation: Although DFT provides details of the electronic structure that describes many properties of molecules, it does not explain all physical and chemical properties of the molecules without proper analyses (Sholl & Steckel, 2009). Thus, additional calculations that require the domain knowledge of human experts should be applied to explain some molecular properties.
•
Generality: For some molecular properties, the relationship between them and the electronic distributions is not clear. Moreover, sometimes the atomic importance estimation is impossible because the relationships between molecular properties and molecular structures are not interpretable.

Due to these limitations, estimating the atomic importance remains an open problem in physics, chemistry, pharmacy, and materials science.

To overcome the limitations of the conventional approach in estimating the atomic importance, we exploit the machine learning-based approach by proposing a new concept of reverse graph self-attention (RGSA) and integrating it with the graph neural networks for the first time. The self-attention mechanism was originally designed to determine important elements within the input data to accurately predict its corresponding target or label in natural language processing (Vaswani et al., 2017). Similarly, in graph neural networks, the self-attention is used to determine important neighbor nodes within the input graph to generate a more accurate node or graph embeddings (Velickovic et al., 2018). The proposed RGSA is defined as the inverse of the self-attention to calculate how important a selected node is considered by its neighbor nodes in the graph. For a given molecule and target property, the proposed estimation method selects the atom that has the largest RGSA score as the most important atom in terms of the target property.

The proposed method estimates the target-directed atomic importance through two steps: (1) For the given molecular graphs and their corresponding target properties, a self-attention based graph neural network is trained to predict the target properties. (2) After the training, RGSA scores are calculated, and then the atomic importance is estimated based on the calculated RGSA scores. As shown in this estimation process, neither large computation nor human experts in physics and chemistry are required, and the estimation process is fully automated.

To validate the effectiveness of the proposed method for atomic importance estimation, we conducted comprehensive experiments and evaluated the estimation performance using both quantitative and qualitative analyses. The contributions of this paper are summarized as:

•
This paper first proposes a machine learning-based approach to estimate the atomic importance in the molecule.
•
The proposed method drastically reduced the computational cost for the atomic importance estimation from quartic time complexity to the practical time complexity of training graph neural networks.
•
The proposed method provides a fully-automated and target-directed way to estimate the atomic importance.
•
We comprehensively validated the effectiveness of the proposed method using both quantitative and qualitative evaluations with domain knowledge and scientific literature in physics and chemistry. However, since there is no labeled dataset for the atomic importance estimation and a systematic way to quantitatively evaluate the estimation accuracy, we devised a new quantitative evaluation method and validated the effectiveness of the proposed method using it.

Section snippets

Preliminaries

Before describing the atomic importance estimation based on the reverse self-attention, in this section, we will briefly explain two essential concepts for understanding the proposed method: (1) graph-based molecular analysis. (2) graph self-attention and graph attention network.

Machine learning-based atomic importance estimation

In this section, we explain our machine learning-based approach to estimate target-directed atomic importance. To this end, we define a new concept of reverse graph self-attention (RGSA) and integrate it with GAT.

Experiments

To accurately validate the effectiveness of MIAIE, we conducted both quantitative and qualitative evaluations on two well-known molecular datasets. However, to the best of our knowledge, neither a labeled dataset for the atomic importance estimation nor a systematic way to evaluate the performance of the atomic importance estimator exists. For this reason, we devised a validation method to quantitatively evaluate the performance of the atomic importance estimators. We will explain this

Conclusion

This paper first exploited machine the learning approach to estimate the atomic importance in molecules. To this end, the reverse graph self-attention was proposed and integrated with a graph attention network. The proposed method is efficient and fully-automated. Furthermore, it can estimate the atomic importance in terms of the given target molecular property without human experts. However, the proposed method can estimate the importance of the group of atoms that consists of $k$ -hop neighbor

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

This work was supported by the core KRICT project, Republic of Korea [Grant Number SI2051-10] from the Korea Research Institute of Chemical Technology (KRICT), Republic of Korea .

References (40)

ColeyC.W. et al.
A graph-convolutional neural network model for the prediction of chemical reactivity
Chemical Science
(2019)
WuZ. et al.
MoleculeNet: a benchmark for molecular machine learning
Chemical Science
(2018)
ChibaniS. et al.
A DFT study of RuO $_{4}$ interactions with porous materials: metal–organic frameworks (MOFs) and zeolites
Physical Chemistry Chemical Physics
(2018)
CrimmeS. et al.
A consistent and accurate ab initio parametrization of density functional dispersion correction (DFT-d) for the 94 elements H-Pu
Journal of Chemical Physics
(2010)
CybenkoG.
Approximation by superpositions of a sigmoidal function
Math Control Signal Systems
(1989)
DelaneyJ.S.
ESOL: Estimating aqueous solubility directly from molecular structure
Chemical Informatio and Computer Sciences
(2004)
Gao, H., & Ji, S. (2019). Graph U-Nets, In International conference on learning...
GeerlingsP. et al.
Conceptual density functional theory
Chemical Reviews
(2003)
GlendeningE.D. et al.
NBO 7.0: New vistas in localized and delocalized chemical bonding theory
Journal of Computational Chemistry
(2019)
HaninB.
Universal function approximation by deep neural nets with bounded width and ReLU activations
Mathematics
(2019)

Ioffe, S., & Szegedy, C. (2015). Batch normalization: Accelerating deep network training by reducing internal covariate...

JensenF.

Introduction to Computational Chemistry

(2017)

Kingma, D. P., & Ba, J. L. (2015). Adam: A method for stochastic optimization. In International conference on learning...

Krogh, A., & Hertz, J. A. (1992). A simple weight decay can improve generalization. In Conference on neural information...

Lee, J., Lee, I., & Kang, J. (2019). Self-attention graph pooling. In International conference on machine...

Lee, J. B., Rossi, R., & Kong, X. (2018). Graph classification using structural attention. In ACM SIGKDD conference on...

LeeY.-L. et al.

Topological phases in cove-edged and chevron graphene nanoribbons: Geometric structures, Z $_{2}$ invariants, and junction states

Nano Letters

(2018)

LuC. et al.

Molecular property prediction: A multilevel quantum interactions modeling perspective

(2019)

Luong, M.-T., Pham, H., & Manning, C. D. (2017). Effective approaches to attention-based neural machine translation. In...

MarenichA.V. et al.

Charge model 5: An extension of hirshfeld population analysis for the accurate description of molecular interactions in gaseous and condensed phases

Journal of Chemical Theory and Computation

(2012)

Cited by (3)

A meta-framework for multi-label active learning based on deep reinforcement learning
2023, Neural Networks
Multi-label Active Learning (MLAL) is an effective method to improve the performance of the classifier on multi-label problems with less annotation effort by allowing the learning system to actively select high-quality examples (example-label pairs) for labeling. Existing MLAL algorithms mainly focus on designing reasonable algorithms to evaluate the potential values (as previously mentioned quality) of the unlabeled data. These manually designed methods may show totally different results on various types of datasets due to the defect of the methods or the particularity of the datasets. In this paper, instead of manually designing an evaluation method, we propose a deep reinforcement learning (DRL) model to explore a general evaluation method on several seen datasets and eventually apply it to unseen datasets based on a meta framework. In addition, a self-attention mechanism along with a reward function is integrated into the DRL structure to address the label correlation and data imbalanced problems in MLAL. Comprehensive experiments show that our proposed DRL-based MLAL method is able to produce comparable results as compared with other methods reported in the literature.
High-Throughput Screening of Promising Redox-Active Molecules with MolGAT
2023, ACS Omega
Knowledge-Embedded Message-Passing Neural Networks: Improving Molecular Property Prediction with Human Knowledge
2021, ACS Omega

¹: Two authors have equal contribution.

View full text

Reverse graph self-attention for target-directed atomic importance estimation

Abstract

Introduction

Section snippets

Preliminaries

Machine learning-based atomic importance estimation

Experiments

Conclusion

Declaration of Competing Interest

Acknowledgments

Chemical Science

Chemical Science

A DFT study of RuO4 interactions with porous materials: metal–organic frameworks (MOFs) and zeolites

Physical Chemistry Chemical Physics

A consistent and accurate ab initio parametrization of density functional dispersion correction (DFT-d) for the 94 elements H-Pu

Journal of Chemical Physics

Approximation by superpositions of a sigmoidal function

Math Control Signal Systems

ESOL: Estimating aqueous solubility directly from molecular structure

Chemical Informatio and Computer Sciences

Conceptual density functional theory

Chemical Reviews

NBO 7.0: New vistas in localized and delocalized chemical bonding theory

Journal of Computational Chemistry

Universal function approximation by deep neural nets with bounded width and ReLU activations

Mathematics