A norm indexes-based QSPR model for predicting the standard vaporization enthalpy and formation enthalpy of organic compounds
Introduction
The physical and thermodynamic properties of organic compounds play an important role in various chemical industry process and chemical engineering design [[1], [2], [3], [4]], environment and agriculture [[5], [6], [7], [8], [9]]. The vaporization enthalpy (Hv) and the formation enthalpy (Hf) were basic physical properties of organics. Hv is an essential tool for correlating and predicting many physical phenomena, such as vapor pressures [10], surface tension [11], the Hildebrand's and the Hansen's solubility parameters of hydrocarbons [12]. Hf could be used to calculate the equilibrium constants of reaction, and is of great significance in the study of resonance energies, bond energies, nature of chemical bond and other related features [13]. It is necessary to understand the fundamental physicochemical properties of these compounds such as Hv and Hf. Because of the exponential increase in the number of newly synthesized and discovered organic compounds, but the experimental data of most compounds are not measurable, and the data obtained through experimental studies are not only expensive but also time-consuming. Therefore, it is urgent to develop a stable and reliable calculation method to solve these problems.
A great deal of effort has been made in the development of methods for estimating enthalpy over the past decades. The empirical correlation method, the group contribution method (GCM) and the quantitative structure-property relationship (QSPR) method were usually used to calculate Hv and Hf. Estimation of Hv by empirical correlation method, depends on some physicochemical properties including critical properties (Tc, Pc, Vc) and normal boiling point (Tb), as shown in Riedel [14], Chen [15], Alibakhshi [16] and Belghit et al.‘s investigations [17]. In terms of GCM, Benson et al. [[18], [19], [20]] proposed a general method to estimate the thermochemical properties of chemical species on the basis of group additive contributions. Joback and Reid [21] developed the first-order contribution method, which used 41 first-order groups to calculate the Hv and Hf of organics. Constantinou and Gani [22] proposed the second-order contribution method which calculated physicochemical properties including critical properties and Hv and Hf. Interestingly, they all tried to distinguish the isomers of some organic compounds. Jia and Wang [[23], [24], [25], [26]] proposed the positional distributive group contribution method, which could effectively distinguish the organic isomers and was successfully used to predict the critical properties and enthalpy of organic compounds. On the whole, the GCM is simple and general, yet it relies on the contribution values of the groups and could not be applied to new structural classification.
Another way to predict physicochemical properties is to establish a QSPR model from the structure of molecules. QSPR [[27], [28], [29], [30], [31], [32], [33]] is an effective method to link the thermodynamic/physical/chemical properties of organics with their compositions and structures. Recently, several QSPR models for estimating Hv of organic compounds have been proposed. In Abooali and Sobati ‘s investigation [34], a model was built with MLR (Multiple Linear Regression) method for predicting the Hv at Tb of 180 pure refrigerants with R2 of 0.96 and AARD of 6.83%. Krasnykh et al. [35] determined the (standard enthalpy of vaporization) of trimethylolpropane and carboxylic acids esters by the transpiration method, and good results were obtained with the relative error less than 2%. In addition, by using MLR method, Sosnowska et al. [28] proposed a QSPR model for estimating the for persistent organic pollutants with good fitting and the external predictive ability (R2 = 0.888, = 0.878). In terms of Hf, based on neural-network method, Hu et al. [36] presented a QSPR model to predict the of 180 organic molecules with good effect. Mercader et al. [37] established a QSPR model to predict Hf of 51 hydrocarbons by using correlation weighting of local invariants in atomic orbital molecular graphs (AOMGs), their model could provide satisfactory results with low average deviation. Furthermore, Vatani et al. [38] predicted the of 1115 compounds based on a multivariate linear genetic algorithm, and during their modelling work, five structural descriptors were calculated and selected from 1664 descriptor libraries.
In this study, the four properties endpoints - structure modelling study was performed with uniform norm descriptors for predicting the enthalpy of organic chemicals. The standard vaporization enthalpy (), standard formation enthalpy in gas state (), standard formation enthalpy in solid state () and standard formation enthalpy in liquid state () were used as properties endpoints for the same modelling work. Also, the prediction ability of this model was tested by using several validation methods.
Section snippets
Dataset
573 , 964 , 367 and 873 experimental data were from CRC Handbook of Chemistry and Physics [39]. The dataset covered 14 families of organic compounds, including chain and cyclic hydrocarbons, alcohols, ketones, carboxylic acids, amines, halogenated hydrocarbons, sulfur compounds and so on. The organics involved in , , and together with corresponding experimental values were shown in Table S1~S4 in the Supplementary Material.
Atomic distribution matrix
In this work, the
Model proposed
A unified QSPR model was built for predicting the four properties endpoints as Eq. (17) by using the same norm descriptors. The parameters bk of this model for the four properties were showed in Table 2.where, includes , , and , Ik represents norm indexes.
The predicted values of the four properties and the corresponding absolute errors were summarized in Table S1~S4. Statistical results for this model prediction were shown in Table 3. The R2 of ,
Conclusions
In this work, a set of atomic distribution matrices and norm descriptors were proposed. A unified QSPR model was built to predict the four properties endpoints for 14 families of organic compounds such as chain and cyclic hydrocarbons, alcohols, ketones, carboxylic acids, amines, halogenated hydrocarbons, sulfur compounds, etc. The four thermodynamic properties endpoints of , , and were involved in the same modelling work. High R2 and F values showed that this model
Author Contribution Section
Xue Yan: Formal analysis, Writing – original draft, Research concept and design, Collection of data, Data analysis and interpretation, Writing the article, Critical revision of the article. Tian Lan: Formal analysis, Research concept and design, Data analysis and interpretation, Critical revision of the article. Qingzhu Jia: Research concept and design, Critical revision of the article, Final approval of article. Fangyou Yan: Critical revision of the article, Final approval of article. Qiang
Declaration of competing interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgements
This work was supported by the National Natural Science Foundation of China [NO: 21808167, 21676203 and 21306137].
References (59)
- et al.
Enthalpies of vaporization of petroleum fractions from vapor pressure measurements and their correlation along with pure hydrocarbons
Fluid Phase Equilib.
(2003) - et al.
Molecular modeling of the ideal gas enthalpy of formation of hydrocarbons
Fluid Phase Equilib.
(2013) - et al.
Enthalpy of formation of natural hydrous iron phosphate: Vivianite
J. Chem. Thermodyn.
(2017) - et al.
The correlations of the enthalpy of vaporization and the surface tension of molecular liquids
Fluid Phase Equilib.
(2006) - et al.
Estimation of the enthalpy of vaporization of organic components as a function of temperature using a new group contribution method
J. Mol. Liq.
(2014) - et al.
Maximum topological distances based indices as molecular descriptors for QSPR. Part 1. Application to alkyl benzenes boiling points
J. Mol. Struct.: THEOCHEM
(2000) Enthalpy of vaporization, its temperature dependence and correlation with surface tension: a theoretical approach
Fluid Phase Equilib.
(2017)- et al.
Predicting enthalpy of vaporization for persistent organic pollutants with quantitative structure–property relationship (QSPR) incorporating the influence of temperature on volatility
Atmos. Environ.
(2014) - et al.
QSPR predictions of heat of fusion of organic compounds using Bayesian regularized artificial neural networks
Chemometr. Intell. Lab. Syst.
(2010) - et al.
Solubility of non-aromatic ionic liquids in water and correlation using a QSPR approach
Fluid Phase Equilib.
(2010)
Novel method for prediction of normal boiling point and enthalpy of vaporization at normal boiling point of pure refrigerants: a QSPR approach
Int. J. Refrig.
Vapor pressure and enthalpy of vaporization of trimethylolpropane and carboxylic acids esters
Fluid Phase Equilib.
QSPR modeling of the enthalpy of formation from elements by means of correlation weighting of local invariants of atomic orbital molecular graphs
Chem. Phys. Lett.
On a simple approach for determining applicability domain of QSAR models
Chemometr. Intell. Lab. Syst.
How important is to detect systematic error in predictions and understand statistical applicability domain of QSAR models?
Chemometr. Intell. Lab. Syst.
SGC method for predicting the standard enthalpy of formation of pure compounds from their molecular structures
Thermochim. Acta
Norm indexes for predicting enthalpy of vaporization of organic compounds at the boiling point
J. Mol. Liq.
Predicting heat capacity of gas for diverse organic compounds at different temperatures
Fluid Phase Equilib.
Distributive structure-properties relationship for flash point of multiple components mixture
Fluid Phase Equilib.
Multiple toxicity endpoint–structure relationships for substituted phenols and anilines
Sci. Total Environ.
Norm index-based QSTR model to predict the eco-toxicity of ionic liquids towards Leukemia rat cell line
Chemosphere
Prediction of ionic liquids heat capacity at variable temperatures based on the norm indexes
Fluid Phase Equilib.
A correlation between formation enthalpy and ionic conductivity in perovskite-structured Li3xLa0.67-xTiO3 solid lithium ion conductors
J. Mater. Chem.
New predictive methods for estimating the vaporization enthalpies of hydrocarbons and petroleum fractions
Ind. Eng. Chem. Res.
Predicting partition coefficients with a simple all-atom/coarse-grained hybrid model
J. Chem. Theory Comput.
The gaseous enthalpy of formation of the ionic liquid 1-butyl-3-methylimidazolium dicyanamide from combustion calorimetry, vapor pressure measurements, and ab initio calculations
J. Am. Chem. Soc.
Quantifying solvophobic effects in nonpolar cohesive interactions
J. Am. Chem. Soc.
Ionic liquids: first direct determination of their cohesive energy
J. Am. Chem. Soc.
Group contribution methods for estimation of selected physico-chemical properties of organic compounds
Thermodyn. Fundament. Appl. Sci.
Cited by (14)
QSPR models for solvation enthalpy based on quantum chemical descriptors
2023, Journal of Molecular LiquidsQSPR models for enthalpy and entropy of organic compounds based on a set of norm indices
2023, Fluid Phase EquilibriaGC-NORM-based thermodynamic framework for evaluations of organic reactions involving carbon dioxide utilization
2023, Chemical Engineering ScienceQSPR-based model extrapolation prediction of enthalpy of solvation
2023, Journal of Molecular Liquids