Elsevier

Fluid Phase Equilibria

Volume 507, 1 March 2020, 112437
Fluid Phase Equilibria

A norm indexes-based QSPR model for predicting the standard vaporization enthalpy and formation enthalpy of organic compounds

https://doi.org/10.1016/j.fluid.2019.112437Get rights and content

Highlights

  • A unified QSPR model was proposed for predicting four thermodynamic properties endpoints.

  • The model for four properties endpoints was established with unified norm indexes used.

  • Norm indexes could be used to describe the ΔHv0, ΔHf0(g), ΔHf0(s) and ΔHf0(l) of organic compounds.

Abstract

As important thermodynamic properties, vaporization enthalpy and formation enthalpy were extensively utilized in the chemical industry process and chemical engineering design, environment and agriculture. Based on the concept of norm index proposed by our group previously, a unified QSPR model was built for predicting four properties endpoints for 14 families of organic compounds. Four thermodynamic properties endpoints, including standard vaporization enthalpy (ΔHv0), standard formation enthalpy in gas state (ΔHf0(g)), standard formation enthalpy in solid state (ΔHf0(s)) and standard formation enthalpy in liquid state (ΔHf0(l)), were involved in the same modelling work. This model has satisfactory fitting effect for four properties endpoints with R2 of 0.967 for ΔHv0, R2 of 0.990 for ΔHf0(g), R2 of 0.989 for ΔHf0(s) and R2 of 0.987 for ΔHf0(l), respectively. Moreover, the results of internal validation, external validation and applicability domain analysis indicated the good stability and robustness of this model. This work not only calculated vaporization enthalpy and formation enthalpy with the same formula, but also covered gas, solid and liquid phases for formation enthalpy. Satisfying results obtained in the present work suggest that this model and norm indexes have good reliability and generalization.

Introduction

The physical and thermodynamic properties of organic compounds play an important role in various chemical industry process and chemical engineering design [[1], [2], [3], [4]], environment and agriculture [[5], [6], [7], [8], [9]]. The vaporization enthalpy (ΔHv) and the formation enthalpy (ΔHf) were basic physical properties of organics. ΔHv is an essential tool for correlating and predicting many physical phenomena, such as vapor pressures [10], surface tension [11], the Hildebrand's and the Hansen's solubility parameters of hydrocarbons [12]. ΔHf could be used to calculate the equilibrium constants of reaction, and is of great significance in the study of resonance energies, bond energies, nature of chemical bond and other related features [13]. It is necessary to understand the fundamental physicochemical properties of these compounds such as ΔHv and ΔHf. Because of the exponential increase in the number of newly synthesized and discovered organic compounds, but the experimental data of most compounds are not measurable, and the data obtained through experimental studies are not only expensive but also time-consuming. Therefore, it is urgent to develop a stable and reliable calculation method to solve these problems.

A great deal of effort has been made in the development of methods for estimating enthalpy over the past decades. The empirical correlation method, the group contribution method (GCM) and the quantitative structure-property relationship (QSPR) method were usually used to calculate ΔHv and ΔHf. Estimation of ΔHv by empirical correlation method, depends on some physicochemical properties including critical properties (Tc, Pc, Vc) and normal boiling point (Tb), as shown in Riedel [14], Chen [15], Alibakhshi [16] and Belghit et al.‘s investigations [17]. In terms of GCM, Benson et al. [[18], [19], [20]] proposed a general method to estimate the thermochemical properties of chemical species on the basis of group additive contributions. Joback and Reid [21] developed the first-order contribution method, which used 41 first-order groups to calculate the ΔHv and ΔHf of organics. Constantinou and Gani [22] proposed the second-order contribution method which calculated physicochemical properties including critical properties and ΔHv and ΔHf. Interestingly, they all tried to distinguish the isomers of some organic compounds. Jia and Wang [[23], [24], [25], [26]] proposed the positional distributive group contribution method, which could effectively distinguish the organic isomers and was successfully used to predict the critical properties and enthalpy of organic compounds. On the whole, the GCM is simple and general, yet it relies on the contribution values of the groups and could not be applied to new structural classification.

Another way to predict physicochemical properties is to establish a QSPR model from the structure of molecules. QSPR [[27], [28], [29], [30], [31], [32], [33]] is an effective method to link the thermodynamic/physical/chemical properties of organics with their compositions and structures. Recently, several QSPR models for estimating ΔHv of organic compounds have been proposed. In Abooali and Sobati ‘s investigation [34], a model was built with MLR (Multiple Linear Regression) method for predicting the ΔHv at Tb of 180 pure refrigerants with R2 of 0.96 and AARD of 6.83%. Krasnykh et al. [35] determined the ΔHv0 (standard enthalpy of vaporization) of trimethylolpropane and carboxylic acids esters by the transpiration method, and good results were obtained with the relative error less than 2%. In addition, by using MLR method, Sosnowska et al. [28] proposed a QSPR model for estimating the ΔHv0 for persistent organic pollutants with good fitting and the external predictive ability (R2 = 0.888, QCV2 = 0.878). In terms of ΔHf, based on neural-network method, Hu et al. [36] presented a QSPR model to predict the ΔHf0 of 180 organic molecules with good effect. Mercader et al. [37] established a QSPR model to predict ΔHf of 51 hydrocarbons by using correlation weighting of local invariants in atomic orbital molecular graphs (AOMGs), their model could provide satisfactory results with low average deviation. Furthermore, Vatani et al. [38] predicted the ΔHf0 of 1115 compounds based on a multivariate linear genetic algorithm, and during their modelling work, five structural descriptors were calculated and selected from 1664 descriptor libraries.

In this study, the four properties endpoints - structure modelling study was performed with uniform norm descriptors for predicting the enthalpy of organic chemicals. The standard vaporization enthalpy (ΔHv0), standard formation enthalpy in gas state (ΔHf0(g)), standard formation enthalpy in solid state (ΔHf0(s)) and standard formation enthalpy in liquid state (ΔHf0(l)) were used as properties endpoints for the same modelling work. Also, the prediction ability of this model was tested by using several validation methods.

Section snippets

Dataset

573 ΔHv0, 964 ΔHf0(g), 367 ΔHf0(s) and 873 ΔHf0(l) experimental data were from CRC Handbook of Chemistry and Physics [39]. The dataset covered 14 families of organic compounds, including chain and cyclic hydrocarbons, alcohols, ketones, carboxylic acids, amines, halogenated hydrocarbons, sulfur compounds and so on. The organics involved in ΔHv0, ΔHf0(g), ΔHf0(s) and ΔHf0(l) together with corresponding experimental values were shown in Table S1~S4 in the Supplementary Material.

Atomic distribution matrix

In this work, the

Model proposed

A unified QSPR model was built for predicting the four properties endpoints as Eq. (17) by using the same norm descriptors. The parameters bk of this model for the four properties were showed in Table 2.ΔH=b0+k=122bkIkwhere, ΔH includes ΔHv0, ΔHf0(g), ΔHf0(s) and ΔHf0(l), Ik represents norm indexes.

The predicted values of the four properties and the corresponding absolute errors were summarized in Table S1~S4. Statistical results for this model prediction were shown in Table 3. The R2 of ΔHv0,

Conclusions

In this work, a set of atomic distribution matrices and norm descriptors were proposed. A unified QSPR model was built to predict the four properties endpoints for 14 families of organic compounds such as chain and cyclic hydrocarbons, alcohols, ketones, carboxylic acids, amines, halogenated hydrocarbons, sulfur compounds, etc. The four thermodynamic properties endpoints of ΔHv0, ΔHf0(g), ΔHf0(s) and ΔHf0(l) were involved in the same modelling work. High R2 and F values showed that this model

Author Contribution Section

Xue Yan: Formal analysis, Writing – original draft, Research concept and design, Collection of data, Data analysis and interpretation, Writing the article, Critical revision of the article. Tian Lan: Formal analysis, Research concept and design, Data analysis and interpretation, Critical revision of the article. Qingzhu Jia: Research concept and design, Critical revision of the article, Final approval of article. Fangyou Yan: Critical revision of the article, Final approval of article. Qiang

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgements

This work was supported by the National Natural Science Foundation of China [NO: 21808167, 21676203 and 21306137].

References (59)

  • D. Abooali et al.

    Novel method for prediction of normal boiling point and enthalpy of vaporization at normal boiling point of pure refrigerants: a QSPR approach

    Int. J. Refrig.

    (2014)
  • E.L. Krasnykh et al.

    Vapor pressure and enthalpy of vaporization of trimethylolpropane and carboxylic acids esters

    Fluid Phase Equilib.

    (2018)
  • A. Mercader et al.

    QSPR modeling of the enthalpy of formation from elements by means of correlation weighting of local invariants of atomic orbital molecular graphs

    Chem. Phys. Lett.

    (2000)
  • K. Roy et al.

    On a simple approach for determining applicability domain of QSAR models

    Chemometr. Intell. Lab. Syst.

    (2015)
  • K. Roy et al.

    How important is to detect systematic error in predictions and understand statistical applicability domain of QSAR models?

    Chemometr. Intell. Lab. Syst.

    (2017)
  • T.A. Albahri et al.

    SGC method for predicting the standard enthalpy of formation of pure compounds from their molecular structures

    Thermochim. Acta

    (2013)
  • Q. Jia et al.

    Norm indexes for predicting enthalpy of vaporization of organic compounds at the boiling point

    J. Mol. Liq.

    (2019)
  • J. Yin et al.

    Predicting heat capacity of gas for diverse organic compounds at different temperatures

    Fluid Phase Equilib.

    (2017)
  • Y. Wang et al.

    Distributive structure-properties relationship for flash point of multiple components mixture

    Fluid Phase Equilib.

    (2018)
  • F. Yan et al.

    Multiple toxicity endpoint–structure relationships for substituted phenols and anilines

    Sci. Total Environ.

    (2019)
  • F. Yan et al.

    Norm index-based QSTR model to predict the eco-toxicity of ionic liquids towards Leukemia rat cell line

    Chemosphere

    (2019)
  • W. He et al.

    Prediction of ionic liquids heat capacity at variable temperatures based on the norm indexes

    Fluid Phase Equilib.

    (2019)
  • G. Xin et al.

    A correlation between formation enthalpy and ionic conductivity in perovskite-structured Li3xLa0.67-xTiO3 solid lithium ion conductors

    J. Mater. Chem.

    (2017)
  • A.H. Mohammadi et al.

    New predictive methods for estimating the vaporization enthalpies of hydrocarbons and petroleum fractions

    Ind. Eng. Chem. Res.

    (2007)
  • S. Genheden

    Predicting partition coefficients with a simple all-atom/coarse-grained hybrid model

    J. Chem. Theory Comput.

    (2016)
  • V.N. Emel'Yanenko et al.

    The gaseous enthalpy of formation of the ionic liquid 1-butyl-3-methylimidazolium dicyanamide from combustion calorimetry, vapor pressure measurements, and ab initio calculations

    J. Am. Chem. Soc.

    (2007)
  • L. Yang et al.

    Quantifying solvophobic effects in nonpolar cohesive interactions

    J. Am. Chem. Soc.

    (2015)
  • L.M.N.B.F. Santos et al.

    Ionic liquids: first direct determination of their cohesive energy

    J. Am. Chem. Soc.

    (2007)
  • Z.k. Kolská et al.

    Group contribution methods for estimation of selected physico-chemical properties of organic compounds

    Thermodyn. Fundament. Appl. Sci.

    (2012)
  • Cited by (14)

    View all citing articles on Scopus
    View full text