Abstract
The recent increase in terrorist attacks realized using liquid explosives has made it important to develop quick and reliable methods that can distinguish between nonhazardous liquids and other liquids that can be used in these explosives. Since the stability and sensitivity properties of microwave systems are high, microwave frequency band is preferred to differentiate hazardous liquids from non-hazardous liquids. In this study, a noncontact system based on electromagnetic response measurements of liquids in microwave frequency band is proposed to develop a classification approach that can be used in liquid scanners. Naive Bayes, linear discriminant analysis, qualitative data analysis, support vector machine, sequential minimal optimization, K-nearest neighbors classification algorithms are used to classify liquids and their classification performances are analyzed. The results of the set of classification experiments prove the success of the proposed measurement method. As the results prove, K-nearest neighbors is the most appropriate classification algorithm for hazardous liquid detection. Since it can be easily implemented and its detection process is fast, a classification system based on the proposed approach can be very useful in airports and shopping malls.
Similar content being viewed by others
1 INTRODUCTION
Many researchers have worked on industrial wastes and hazardous substances for the environment and human health. The effects of hazardous substances are seen in many areas such as health, safety, military and industry. Liquids, which are readily available in everyday life, threaten human and environmental safety in another way as well. They are especially preferred for terrorist attacks in places like airports, train stations, transportation points, political rallies, shopping malls, concerts and other cultural activities where there are thousands of people. Therefore, detection of hazardous liquids must be done in order to prevent these attacks.
In recent years due to the increase in the number of terrorist attacks, some researchers have examined ways to detect hazardous substances and illegal objects, and have analyzed the existing systems, related techniques, their advantages and limitations. In this way, a vision of what can be done to prevent these attack threats has been created [1]. The main focus of these researches has been the development of systems that can detect the explosive automatically without the intervention of an operator. Accordingly, in the last few years, significant progress has been made in the development of X-ray imaging systems in the detection of explosives. As well as X-ray imaging systems, the use of nuclear quadrupole resonance (NQR) for explosive detection has been heavily investigated [2, 3]. NQR is a spectroscopic technique that can detect explosives with high chemical specificity [2]. Nuclear magnetic resonance (NMR) method was used to investigate and classify the liquid contents in closed nonmetallic containers [4]. For the detection of liquid explosives, Ultra low field magnetic resonance imaging technique has been proposed, too [5].
In the literature, the use of different techniques including nuclear magnetic resonance and X-ray has been proposed to detect explosives [6, 7]. Among these techniques the most commonly used one is X-ray systems [7]. X-ray systems have also been proposed to analyze unknown solid samples that may contain explosives and analyze peroxide-based explosives [8]. As well as nuclear magnetic resonance and X-ray, liquid detection and identification can be performed using THz time-domain spectroscopy [9]. However, although these approaches are very easy to perceive certain peroxide compounds, they cannot distinguish many types of liquids used in daily life. Therefore, there is a need for a system to distinguish these liquids [6].
Microwave measurement methods are widely used for various purposes in several applications related to industry and safety. For instance, these methods have been applied to reduce the environmental impacts of industrial wastes and hazardous materials and satisfactory results have been obtained [10, 11]. They have been also used for sludge stabilization [11]. In addition, the use of coaxial probe measurement techniques which is one of the microwave measurement methods has been proposed to find solutions to biofilm defects and wall thinning problems [12]. The propagation of microwaves in liquids is quite different than their propagation in air. Moreover, both frequency-dependent velocities and attenuations of microwaves vary from liquid to liquid, depending on the molecular composition of the liquid. As it is known, the complex permeability and reflection and transmission coefficients of liquids are different. Microwave and millimeter wave frequency bands can be used to determine the complex permeability, reflection and transmission coefficients of both solids and liquids [13]. A formula model optimized with artificial bee colony (ABC) algorithm is presented to calculate the relative permeability of the materials [14]. They can also be used to determine other properties such as chemical concentration, bio-content, and moisture content [15]. These properties can be used to characterize liquids. Material characterization is not only important in safety related applications but also in food, medical, bioengineering, construction, medical and military related researches and applications [15, 16]. It has also been used to calculate the permeability of liquids, the reflection coefficient, S11, and the transmission coefficient, S21 [17, 18]. Although a vector network analyzer can provide measurement of phase and magnitude in wide microwave frequency range, it is very expensive. Therefore, some researchers prefer simulation based-studies [16].
In the last decade machine learning techniques have been used for different purposes such as predicting compressive strength of concrete [19], diagnosing cancer and Thyroid diseases [20, 21], classifying drugs according to their milk/plasma concentrations [22], automatically classifying good and defective agricultural products and raw materials such as rice, coffee and green tea [23], classifying gasoline [24], and estimating the botanical and geographical origin of honey [24]. Different from the other uses and purposes of machine learning techniques, in this study, different machine learning algorithms are used to classify liquids based on S parameter measurements. The remainder of this paper is as follows. Methodology and experimental setup used in this paper is introduced in the second section. The classification algorithms used in this paper and metrics used in the performance evaluation are introduced in this section, too. The third section presents the results of the performance evaluation study. Finally, this paper is concluded in the fourth section.
2 EXPERIMENTAL SETUP AND METHODOLOGY
Different measurement techniques can be used obtain the dielectric properties of materials. Material state (gas, liquid, or solid), frequency range and temperature (high or low) are important factors in selecting the most appropriate measurement method [26]. In coaxial probe method electromagnetic wave penetrates into the liquid with minimum reflection [27]. Although coaxial probe method can be used for liquid measurements, it is generally not practical and sometimes dangerous to dip something into some hazardous liquids or even open the lid. On the other hand, the noncontact measurement platform used in this study allows measuring without opening the lid of the liquid and immersing it in the liquid. The experimental setup used in this study for liquid classification using microwave patch antenna is shown in Fig. 1. It consists of a microwave circular patch antenna design connected to a vector network analyzer in order to measure of the reflection coefficient of electromagnetic wave. To build the experimental setup, an antenna with a resonant frequency of 1.5 GHz was designed. The design was constructed on a FR4 based dielectric substrate with 1.6 mm height, 4.4 relative permittivity and 10 × 10 cm2 ground plane beneath it. The antenna is feed by 50 Ohm SMA (SubMiniature version A) feed probe. The geometry of the antenna is illustrated in Fig. 2 and the photos of the antenna are shown in Fig. 3.
The antenna diameter is calculated using the equation (1), (2).
where εr is relative permittivity of the substrate, fr is the resonant frequency, h is the height of the substrate, and a is the radius of the patch.
The following was done to handle the overall process. The electromagnetic wave reflection coefficient of the liquids was measured by keeping a distance of approximately 5 mm between the antenna and the bottle. Then, a database from the values of each liquid was created. The data set in this database was later on used for liquid classification. The entire data set in the database was used when classifying liquids. Thus, the success of the algorithms in the classification of liquids found in the database was tested in the classification process. In order to test the success of the algorithms, 10 times cross validation and 5 times cross validation were performed. Then, the performance of the classification algorithms was analyzed using different evaluation metrics. The methodology described here is illustrated in Fig. 4.
2.1 Classification Algorithms and Performance Metrics
Machine learning is used to create a model from existing data using mathematical and statistical methods and to determine which class a new incoming data belongs to as accurately as possible using this model. In this study, naive Bayes, linear discriminant analysis (LDA), qualitative data analysis (QDA), support vector machine (SVM), sequential minimal optimization (SMO), and K-nearest neighbors (KNN) were used as classifiers. In order to evaluate the performances of each classifier, confusion matrices were created.
K-fold cross validation technique has been preferred for the performance evaluation of the proposed system and classification algorithms. K-fold cross validation technique divides the data set into training and test sets in order to avoid possible overfitting and to understand how the model performs on a set of data that it has not seen before. Because in the overfitting problem, the model gives good results on the data set worked on, but makes unsuccessful predictions on new data sets that it has never seen. K-fold cross validation technique divides the training data set into random k segments. k – 1 is used for training, 1 part is used for the test set and k is repeated this time. The values obtained in each round are summed up; and the performance of the model is evaluated. K number is usually 10 or 5, as in this study. Several metrics should be used to evaluate how well a classifier performs at the end of the classification process. In this study, Kappa, RMS, confusion matrix and accuracy are used to evaluate the performance of the classification algorithms.
Kappa. This value is used to measure the consistency between predicted and observed classifications on a group of data. The calculation of Kappa value is given in (5). P(a) indicates the accuracy of the classifier, and P(e) is the weighted average of the expected accuracy of the classifier making random estimates on the same dataset. Kappa value is between –1 and 1. –1 indicates a complete mismatch, i.e. an inverse relationship, and 1 indicates a perfect fit. The closer the value is to 1, the greater the fit, and the smaller the distance. The interpretation of Kappa value is listed in Table 1.
Root mean squared error (RMS). It is used to scale the differences between the actual and predicted values. It is determined by taking the square root of the mean square error as given in (4). P represents the estimated values and a represents the real values. As the RMS value approaches zero, the correct estimate of the classifier increases.
Confusion matrix. A confusion matrix contains information about actual and predicted groups made by a classification system. The diagonal elements of the matrix give the correct number of classified objects.
Accuracy. The most popular and simple method used to measure model performance is model accuracy. The accuracy given in (5) gives the number of samples correctly classified from all the samples.
3 PERFORMANCE EVALUATION
For performance evaluation, the experimental setup described in Section 2 was used to classify a set of 36 liquids. Table 2 lists the set of 36 liquids used by the experimental setup of this study, 12 of these liquids are hazardous ones and 24 of these liquids are nonhazardous ones used in daily life. These liquids include alcoholic beverages. In the measurements, 0.5 liter thin pet bottle which has low reflectance and frequently used in daily life is preferred. The amount of liquid to be analyzed is sufficient to be approximately 7 cm high in the pet bottle. For the consistency, at the same room temperature, the same bottle is used for all of the measurements. The results are shown in Fig. 5. In this study, as listed in Table 3, the health hazards and flammability properties of the liquids in the hazardous group are indicated with a rating of 0 to 4. Here 0 means no hazard and 4 means the highest. Health hazard applies to direct oral use or skin contact. High flammability of materials can recklessly result in starting a fire or causing an explosion which endangers human life.
When the confusion matrix of the naive Bayes algorithm is considered (see Table 4), it can be seen that hazardous liquids were correctly classified. In the table, the green areas indicate the correct number of liquids and the reds indicate the incorrect number of liquids. Particularly, in the classification when the entire training set was used in the classification, naive Bayes correctly classified all of the hazardous liquids but classified 6 of the nonhazardous liquids into hazardous groups. When cross validation process was applied, naive Bayes classified 1 hazardous liquid into nonhazardous group and classified 5 nonhazardous liquids as hazardous.
When the entire training set was used, LDA correctly classified 12 hazardous liquids, while 5 of the nonhazardous liquids were incorrectly classified as hazardous. In the case of cross validation, 1 hazardous liquid was not correctly classified and 6 nonhazardous liquids were classified incorrectly (see Table 5). The confusion matrix of QDA algorithm (see Table 6) is quite similar to LDA algorithm. However, QDA is more stable than LDA algorithm because it provides the same results in the classification used both the training set and the cross validation process. The confusion matrix of SVM algorithm (see Table 7) indicates that SVM algorithm failed to form a model. SVM classified all of the liquids as nonhazardous. Compared to SVM, SMO algorithm obtained better results. In the training set, SMO correctly classified 11 of hazardous liquids and 23 of nonhazardous liquids. When cross-validation was performed for SMO, the number of correct classifications decreased (see Table 8). Among all of the classification algorithms KNN achieved the highest accuracy. KNN algorithm correctly classified all of the hazardous and nonhazardous liquids when all the training data was used in the classification process. On the other hand, when cross validation was performed, 10-fold cross validation resulted in incorrect classification for 1 of the liquids and 5-fold cross validation resulted in incorrect classification for 3 of the liquids (See Table 9).
Table 10 lists the accuracy, Kappa and RMS values of all the classification algorithms when all the training was used and 10-fold and 5-fold cross correlations were applied. Correctly and incorrectly classified instances of all of the classification algorithms are shown in Fig. 6. As can be seen in Fig. 5, KNN algorithm provided the highest number of correct predictions and the lowest number of incorrect predictions. When Table 10 is taken into consideration, it can be seen that SVM algorithm provided the lowest accuracy rate and highest RMS value. Naive Bayes, LDA and QDA algorithms obtained similar results. SMO algorithm obtained a high accuracy rate of 94.4% in the training data set, however when cross validation was applied its accuracy decreased. KNN algorithm obtained the highest accuracy in the training set even when cross validation application was applied. In addition, KNN algorithm obtained the lowest RMS compared to the others. The Kappa value of KNN algorithm was 1 for the training set and close to 1 when cross validations were applied. This confirms the success of KNN algorithm.
4 CONCLUSIONS
In recent years, the increase in the number of terrorist attacks using liquid explosives has necessitated the development of systems that can easily and effectively distinguish between the liquids that can be used in these explosives and nonhazardous liquids. In this study, a noncontact hazardous liquid detection approach has been proposed and the performance of the classification algorithms that could be used in the proposed approach has been evaluated. The novelty of the proposed approach is that while a classification is being made using the proposed approach, the cap of the bottle does not need to be opened or removed from the bottle. After a prototype system based on the proposed approach is developed, the proposed approach can be used in airports, shopping malls and other places. Due to the easy and quite fast detection process, the proposed approach will possibly not result in queuing and loss of time at security points. In addition to proposing a novel approach to detect hazardous liquids, in this study the performance of six different classification algorithms used to identify hazardous liquids has been analyzed in terms of accuracy and time requirement. As the results prove, KNN is the most appropriate classification algorithm for hazardous liquid detection.
REFERENCES
Melnikov, Y., Avtonomov, P., Kornienko, V., and Olshansky, Y., Detection of dangerous materials and illicit objects in cargoes and baggage: current tools, existing problems and possible solutions, J. Homeland Secur. Emerg. Manage., 2011, vol. 8, no. 1. https://doi.org/10.2202/1547-7355.1889
Cardona, L., Jiménez, J., and Vanegas, N., Nuclear quadrupole resonance for explosive detection, Ingeniare Revista chilena de ingeniería, 2015, vol. 23, no. 3, pp. 458–472.
Miller, J. and Barrall, G., Explosives detection with nuclear quadrupole resonance, Am. Sci., 2005, vol. 93, no. 1, p. 50.
Kumar, S., Liquid-contents verification for explosives, other hazards, and contraband by magnetic resonance, Appl. Magn. Reson., 2004, vol. 25, nos. 3–4, pp. 585–597. https://doi.org/10.1007/BF03166550.
Espy, M., Flynn, M., Gomez, J., Hanson, C., Kraus, R., Magnelind, P., et al., Ultra-low-field MRI for the detection of liquid explosives, Supercond. Sci. Technol., 2010, vol. 23, no. 3, p. 034023. https://doi.org/10.1088/0953-2048/23/3/034023
Abidin, Z.Z., Omar, F.N., Yogarajah, P., Biak, D.R.A., and Man, Y.B.C., Dielectric characterization of liquid containing low alcoholic content for potential halal authentication in the 0.5-50 GHz range, Am. J. Appl. Sci., 2014, vol. 11, no. 7, pp. 1104–1112. https://doi.org/10.3844/ajassp.2014.1104.1112
Singh, S. and Singh, M., Explosives detection systems (EDS) for aviation security, Signal Process., 2003, vol. 83, no. 1, pp. 31–55. https://doi.org/10.1016/S0165-1684(02)00391-2.
Schulte-Ladbeck, R., Vogel, M., and Karst, U., Recent methods for the determination of peroxide-based explosives, Anal. Bioanal. Chem., 2006, vol. 386, no. 3, pp. 559–565. https://doi.org/10.1007/s00216-006-0579-y
Choi, K., Hong, T., Sim, K.I., Ha, T., Park, B.C., Chung, J.H., et al. Reflection terahertz time-domain spectroscopy of RDX and HMX explosives, J. Appl. Phys., 2014, vol. 115, no. 2, p. 023105. https://doi.org/10.1063/1.4861616
Windgasse, G. and Dauerman, L., Microwave treatment of hazardous wastes: removal of volatile and semi-volatile organic contaminants from soil, J. Microwave Power Electromagn. Energy, 1992, vol. 27, no. 1, pp. 23–32. https://doi.org/10.1080/08327823.1992.11688167
Mudhoo, A. and Sharma, S.K., Microwave irradiation technology in waste sludge and wastewater treatment research, Crit. Rev. Environ. Sci. Technol., 2011, vol. 41, no. 11, pp. 999–1066. https://doi.org/10.1080/10643380903392767
Liu, L., Application of microwave for remote NDT and distinction of biofouling and wall thinning defects inside a metal pipe, J. Nondestr. Eval., 2015, vol. 34, no. 4. https://doi.org/10.1007/s10921-015-0313-9
Lucic, B., Basic, I., Nadramija, D., Milicevic, A., Trinajstic, N., Suzuki, T., et al., Correlation of liquid viscosity with molecular structure for organic compounds using different variable selection methods, Arkivoc, 2002, vol. 2002, no. 4, pp. 45–59. https://doi.org/10.3998/ark.5550190.0003.406
Tekbas, M., Toktas, A., and Ustun, D., A formulaic model calculating the permittivity of testing materials placed on a circular patch antenna, in 2019 XXIVth Int. Semin./Workshop Direct Inverse Probl. Electromagn. Acoust. Wave Theory (DIPED), 2019. https://doi.org/10.1109/DIPED.2019.8882582.
Büyüköztürk, O., Yu, T.-Y., and Ortega, J.A., A methodology for determining complex permittivity of construction materials based on transmission-only coherent, wide-bandwidth free-space measurements, Cem. Concr. Compos., 2006, vol. 28, no. 4, pp. 349–59. https://doi.org/10.1016/j.cemconcomp.2006.02.004
Al-Mously, S.I.Y., A modified complex permittivity measurement technique at microwave frequency, Int. J. New Comput. Archit. Appl., 2012, vol. 2, pp. 389–401.
Li, Z., Haigh, A., Soutis, C., Gibson, A., and Sloan, R., A Simulation-assisted non-destructive approach for permittivity measurement using an open-ended microwave waveguide, J. Nondestr. Eval., 2018, vol. 37, no. 3, https://doi.org/10.1007/s10921-018-0493-1
Jiang, Y., Ju, Y., and Yang, L., Nondestructive in-situ permittivity measurement of liquid within a bottle using an open-ended microwave waveguide, J. Nondestr. Eval., 2015, vol. 35, no. 1. https://doi.org/10.1007/s10921-015-0322-8
Derousseau, M., Laftchiev, E., Kasprzyk, J., Rajagopalan, B., and Srubar, W., A comparison of machine learning methods for predicting the compressive strength of field-placed concrete, Constr. Build. Mater., 2019, vol. 228, p. 116661. https://doi.org/10.1016/j.conbuildmat.2019.08.042
Aydın, E.A. and Keleş, M.K., Breast cancer detection using K-nearest neighbors data mining method obtained from the bow-tie antenna dataset, Int. J. RF Microwave Comput.-Aided Eng., 2017, vol. 27, no. 6. https://doi.org/10.1002/mmce.21098
Prasad, V., Rao, T.S., and Babu, M.S.P., Thyroid disease diagnosis via hybrid architecture composing rough data sets theory and machine learning algorithms, Soft Comput., 2015, vol. 20, no. 3, pp. 1179–1189. https://doi.org/10.1007/s00500-014-1581-5
Fatemi, M.H. and Ghorbanzad’e, M., Classification of drugs according to their milk/plasma concentration ratio, Eur. J. Med. Chem., 2010, vol. 45, no. 11, pp. 5051–5055. https://doi.org/10.1016/j.ejmech.2010.08.013
Kim, S., Kwak, J., and Ko, B., Automatic classification algorithm for raw materials using mean shift clustering and stepwise region merging in color, J. Broadcast Eng., 2016, vol. 21, no. 3, pp. 425–435. https://doi.org/10.5909/JBE.2016.21.3.425
Balabin, R.M., Safieva, R.Z., and Lomakina, E.I., Gasoline classification using near infrared (NIR) spectroscopy data: Comparison of multivariate techniques, Anal. Chim. Acta, 2010, vol. 671, nos. 1–2, pp. 27–35. https://doi.org/10.1016/j.aca.2010.05.013
Maione, C., Barbosa, F., and Barbosa, R.M., Predicting the botanical and geographical origin of honey with multivariate data analysis and machine learning techniques: a review, Comput. Electron. Agric., 2019, vol. 157, pp. 436–446. https://doi.org/10.1016/j.compag.2019.01.020
Dos Santos, J.C.A., Dias, M.H.C., Aguiar, A., and Borges, I., Jr., Using the coaxial probe method for permittivity measurements of liquids at high temperatures, J. Microwaves Optoelectron. Electromagn. Appl., 2009, vol. 8, pp. 78–91.
Mitani, T., Hasegawa, N., Nakajima, R., Shinohara, N., Nozaki, Y., Chikata, T., et al., Development of a wideband microwave reactor with a coaxial cable structure, Chem. Eng. J., 2016, vol. 299, pp. 209–216. https://doi.org/10.1016/j.cej.2016.04.064
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Ebru Efeoglu, Gurkan Tuna Detection of Hazardous Liquids Using Microwave Data and Well-Known Classification Algorithms. Russ J Nondestruct Test 56, 742–751 (2020). https://doi.org/10.1134/S106183092009003X
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1134/S106183092009003X