Optimal estimation of physical properties of the products of an atmospheric distillation column using support vector regression
Introduction
Crude Distillation Unit (CDU) is the heart of a petroleum refinery where crude oil is separated into its naturally occurring fractions. Atmospheric Distillation Column (ADC) is the most complex unit in the CDU which operates at atmospheric pressure (Gary et al., 2010; Leffler, 1985). The operation parameters of an ADC affect the amounts and physical properties of its products. Dynamically changing demands and prices for end products forces the planners to frequently update the optimal operating parameters and maximize the amount of a particular product while keeping the products within their specified limits. This gives rise to the need for online monitoring of the properties.
The online monitoring of temperature, pressure and flowrate within the ADC is possible with the measurement devices connected to Distributed Control Systems (DCS). However online monitoring of hydrocarbon properties is only possible by installing online analyzers which are very complex, hard-to-maintain and expensive. Therefore, chemical composition and physical properties are monitored by taking samples periodically from the ADC and analyzing the samples in a laboratory with appropriate equipment which takes up considerable amount of time.
The lack of online measurement of hydrocarbon properties can be complemented by online estimation methods. An estimation function of temperature and pressure as input variables can be fitted to laboratory analysis data of the physical properties. The ADC in İzmit Refinery of Turkish Petroleum Refineries Corporation which is the subject of this study has employed linear regression for the online estimation of the critical properties of hydrocarbon products like the 95% boiling point temperature of heavy diesel. This prediction embodies a fixed function of heavy diesel tray temperature and flash zone temperature and pressure which is then fitted to laboratory data by a gain and a bias. There have been studies on rigorous modeling of ADCs. This method is highly complex and embodies a comprehensive physical properties library, refers to the laws of thermodynamics and is hard to fit to a real operating unit. Though many of the hydrocarbon properties can be gathered from the simulation results of a rigorous model, the rigorous model still needs periodic maintenance as it sways away from real system in time.
Recent studies have successfully employed Machine Learning in modeling chemical processes in petroleum refining. Artificial Neural Network (ANN) has been a popular choice in this area but has suffered from over-fitting. ANN like any regression method that embodies Empirical Risk Minimization, minimizes the estimation error regardless of the model complexity. This in turn leads to poor performance in generalization of unseen and unvalidated testing data.
More recent studies have utilized Support Vector Machines (SVM) in regression problems. SVMs, in contrast to ANN embodies Structural Risk Minimization that aims to generate a flatter and less complex function. By implementing a new loss function, Support Vector Regression (SVR) chooses the flattest path where the error is kept within a predefined width for insensitive region and a predefined cost factor handles outlier data. Moreover, by employing kernel functions, SVR can be trained to generate a nonlinear estimation function. Lastly, the user-defined parameters of SVR can be optimized via cross validation embedded in global optimizers like Genetic Algorithm or simpler solvers like Grid Search, to maximize generalization performance.
Section snippets
Literature survey
Since Crude Distillation Units are complex and energy intensive units, safe and optimum operation of the unit has a great significance in oil refining. This leads to a need in proper monitoring of the operation for process engineers and a proper estimation of the operation dynamics for planning engineers. Both objectives prove to be troublesome as the physical properties can be measured by taking a sample from the stream periodically and then analyzing these sample in a laboratory with
Data normalization
The rate of convergence in training the support vector model is greatly affected by the eigenvalues of the Hessian. We can improve this rate by decreasing the difference between the smallest and largest eigenvalue. Therefore, this study strongly recommends and applies normalization for input data to improve the rate of convergence. Some kernel functions can only be defined in restricted intervals which already require data normalization. For unrestricted kernels, this study scales data between
The data
The atmospheric distillation column (ADC) subject to this study is an actual operating unit in İzmit Refinery of Turkish Petroleum Refineries Corporation. The ADC together with its pump-arounds, side-strippers and condenser drum is depicted in Fig. 4.
The crude distillation unit as a total was designed for processing crude oil from Kirkuk, Iraq, but currently processes different kinds of crude oils. A pre-flash column separates LPG and Light Naphtha and sends the remaining stream to the ADC.
Optimization results
The use of , and as decision variables in QP greatly improved the rate of convergence. The CPU time of SVR training with αi and as decision variables is approximately 1.9 seconds while the CPU time of SVR training with the simplified variables was reduced by 80% to a range between 350 and 400 ms even though the problem size is the same. This improvement is achieved by halving the number of decision variables in the second-degree part of the cost function, which creates a
Discussion of optimization results
The results show that different optimization solvers have found different local optimal points with similar mean absolute errors and standard deviations for the same kernels, meaning that there are many similar local optima. Genetic method has triumphed in most of the criteria. SVR with Gaussian RBF kernel optimized by Grid Search has best performance in testing. Grid Search method has highest computational.
It should be noted that any solution linear regression also showed quite bit good
Conclusions
In this paper, Support Vector Regression (SVR) was employed for estimating 95% Boiling Point Temperature property of Heavy Diesel product of a real operating Atmospheric Distillation Column (ADC) in İzmit Refinery of Turkish Petroleum Refineries Corporation. Linear, Polynomial and Gaussian Radial Basis Function (Gaussian RBF) kernels were tested and the SVR parameters were optimized by embedding k-fold cross validation in an optimizer such as Genetic Algorithm (GA) and Grid Search (GS). The
CRediT authorship contribution statement
Ahmet Can Serfidan: Investigation, Data curation, Software, Validation, Visualization, Writing - review & editing. Firat Uzman: Conceptualization, Data curation, Formal analysis, Validation, Writing - original draft. Metin Türkay: Conceptualization, Formal analysis, Investigation, Methodology, Supervision, Writing - review & editing.
Declaration of Competing Interest
None.
References (35)
- et al.
Multi-objective optimization of an industrial crude distillation unit using the elitist non-dominated sorting genetic algorithm
Chem. Eng. Res. Des.
(2004) A crude distillation unit model suitable for online applications
Fuel Process. Technol.
(2001)- et al.
Modeling of isomerization of C8 aromatics by online least squares support vector machine
Chin. J. Chem. Eng.
(2009) Optimization of crude distillation system using aspen plus: effect of binary feed selection on grass-root design
Chem. Eng. Res. Des.
(2010)Equilibrium constants from a modified Redlich–Kwong equation of state
Chem. Eng. Sci.
(1972)- et al.
Soft sensing modeling based on support vector machine and Bayesian model selection
Comput. Chem. Eng.
(2004) - et al.
Operational optimization of a simulated atmospheric distillation column using support vector regression models and information analysis
Chem. Eng. Res. Des.
(2012) - et al.
Theoretical foundations of the potential function method in pattern recognition learning
Autom. Remote Control
(1964) - (2008)
On-line optimization of a crude distillation unit with constraints on product properties
Ind. Eng. Chem. Res.
(2002)
Robust linear programming discrimination of two linearly inseparable sets
Optimiz. Methods Softw.
A training algorithm for optimal margin classifiers
A new class of solution methods for multicomponent, multistage separation processes
Can. J. Chem. Eng.
LIBSVM: a library for support vector machines
ACM Trans. Intell. Syst. Technol.
Application of support vector regression for developing soft sensors for nonlinear processes
Can. J. Chem. Eng.
Support-vector networks
Mach. Learn.
Nonlinear programming and nonsmooth optimization by successive linear programming
Math. Program.
Cited by (16)
A study on corrosion failure of the circulation line with valve openings in atmospheric tower based on CFD
2022, Engineering Failure AnalysisCitation Excerpt :Atmospheric distillation tower is the main device for crude oil processing in petrochemical enterprises and plays a vital role in the refining system. The safety of atmospheric distillation towers has always been the focus of people's attention [1–4]. The water dissolved of corrosive components will damage the tower and other important components.
Pruning Pareto optimal solutions for multi-objective portfolio asset management
2022, European Journal of Operational ResearchMachine learning modeling and analysis of biohydrogen production from wastewater by dark fermentation process
2022, Bioresource TechnologyA framework for transportation infrastructure cost prediction: a support vector regression approach
2022, Transportation Letters