Optimal estimation of physical properties of the products of an atmospheric distillation column using support vector regression

https://doi.org/10.1016/j.compchemeng.2019.106711Get rights and content

Abstract

Atmospheric distillation column is one of the most important units in an oil refinery where crude oil is fractioned into its more valuable constituents. Almost all of the state-of-the art online equipment has a time lag to complete the physical property analysis in real time due to complexity of the analyses. Therefore, estimation of the physical properties from online plant data with a soft sensor has significant benefits. In this paper, we estimate the physical properties of the hydrocarbon products of an atmospheric distillation column by support vector regression using Linear, Polynomial and Gaussian Radial Basis Function kernels and SVR parameters are optimized by using a variety of algorithms including genetic algorithm, grid search and non-linear programming. The optimization-based data analytics approach is shown to produce superior results compared to linear regression, the mean testing error of estimation is improved by 5% with SVR 4.01 °C to 3.8 °C.

Introduction

Crude Distillation Unit (CDU) is the heart of a petroleum refinery where crude oil is separated into its naturally occurring fractions. Atmospheric Distillation Column (ADC) is the most complex unit in the CDU which operates at atmospheric pressure (Gary et al., 2010; Leffler, 1985). The operation parameters of an ADC affect the amounts and physical properties of its products. Dynamically changing demands and prices for end products forces the planners to frequently update the optimal operating parameters and maximize the amount of a particular product while keeping the products within their specified limits. This gives rise to the need for online monitoring of the properties.

The online monitoring of temperature, pressure and flowrate within the ADC is possible with the measurement devices connected to Distributed Control Systems (DCS). However online monitoring of hydrocarbon properties is only possible by installing online analyzers which are very complex, hard-to-maintain and expensive. Therefore, chemical composition and physical properties are monitored by taking samples periodically from the ADC and analyzing the samples in a laboratory with appropriate equipment which takes up considerable amount of time.

The lack of online measurement of hydrocarbon properties can be complemented by online estimation methods. An estimation function of temperature and pressure as input variables can be fitted to laboratory analysis data of the physical properties. The ADC in İzmit Refinery of Turkish Petroleum Refineries Corporation which is the subject of this study has employed linear regression for the online estimation of the critical properties of hydrocarbon products like the 95% boiling point temperature of heavy diesel. This prediction embodies a fixed function of heavy diesel tray temperature and flash zone temperature and pressure which is then fitted to laboratory data by a gain and a bias. There have been studies on rigorous modeling of ADCs. This method is highly complex and embodies a comprehensive physical properties library, refers to the laws of thermodynamics and is hard to fit to a real operating unit. Though many of the hydrocarbon properties can be gathered from the simulation results of a rigorous model, the rigorous model still needs periodic maintenance as it sways away from real system in time.

Recent studies have successfully employed Machine Learning in modeling chemical processes in petroleum refining. Artificial Neural Network (ANN) has been a popular choice in this area but has suffered from over-fitting. ANN like any regression method that embodies Empirical Risk Minimization, minimizes the estimation error regardless of the model complexity. This in turn leads to poor performance in generalization of unseen and unvalidated testing data.

More recent studies have utilized Support Vector Machines (SVM) in regression problems. SVMs, in contrast to ANN embodies Structural Risk Minimization that aims to generate a flatter and less complex function. By implementing a new loss function, Support Vector Regression (SVR) chooses the flattest path where the error is kept within a predefined width for insensitive region and a predefined cost factor handles outlier data. Moreover, by employing kernel functions, SVR can be trained to generate a nonlinear estimation function. Lastly, the user-defined parameters of SVR can be optimized via cross validation embedded in global optimizers like Genetic Algorithm or simpler solvers like Grid Search, to maximize generalization performance.

Section snippets

Literature survey

Since Crude Distillation Units are complex and energy intensive units, safe and optimum operation of the unit has a great significance in oil refining. This leads to a need in proper monitoring of the operation for process engineers and a proper estimation of the operation dynamics for planning engineers. Both objectives prove to be troublesome as the physical properties can be measured by taking a sample from the stream periodically and then analyzing these sample in a laboratory with

Data normalization

The rate of convergence in training the support vector model is greatly affected by the eigenvalues of the Hessian. We can improve this rate by decreasing the difference between the smallest and largest eigenvalue. Therefore, this study strongly recommends and applies normalization for input data to improve the rate of convergence. Some kernel functions can only be defined in restricted intervals which already require data normalization. For unrestricted kernels, this study scales data between

The data

The atmospheric distillation column (ADC) subject to this study is an actual operating unit in İzmit Refinery of Turkish Petroleum Refineries Corporation. The ADC together with its pump-arounds, side-strippers and condenser drum is depicted in Fig. 4.

The crude distillation unit as a total was designed for processing crude oil from Kirkuk, Iraq, but currently processes different kinds of crude oils. A pre-flash column separates LPG and Light Naphtha and sends the remaining stream to the ADC.

Optimization results

The use of βi=αiαi*, and βi*=αi+αi* as decision variables in QP greatly improved the rate of convergence. The CPU time of SVR training with αi and αi* as decision variables is approximately 1.9 seconds while the CPU time of SVR training with the simplified variables was reduced by 80% to a range between 350 and 400 ms even though the problem size is the same. This improvement is achieved by halving the number of decision variables in the second-degree part of the cost function, which creates a

Discussion of optimization results

The results show that different optimization solvers have found different local optimal points with similar mean absolute errors and standard deviations for the same kernels, meaning that there are many similar local optima. Genetic method has triumphed in most of the criteria. SVR with Gaussian RBF kernel optimized by Grid Search has best performance in testing. Grid Search method has highest computational.

It should be noted that any solution linear regression also showed quite bit good

Conclusions

In this paper, Support Vector Regression (SVR) was employed for estimating 95% Boiling Point Temperature property of Heavy Diesel product of a real operating Atmospheric Distillation Column (ADC) in İzmit Refinery of Turkish Petroleum Refineries Corporation. Linear, Polynomial and Gaussian Radial Basis Function (Gaussian RBF) kernels were tested and the SVR parameters were optimized by embedding k-fold cross validation in an optimizer such as Genetic Algorithm (GA) and Grid Search (GS). The

CRediT authorship contribution statement

Ahmet Can Serfidan: Investigation, Data curation, Software, Validation, Visualization, Writing - review & editing. Firat Uzman: Conceptualization, Data curation, Formal analysis, Validation, Writing - original draft. Metin Türkay: Conceptualization, Formal analysis, Investigation, Methodology, Supervision, Writing - review & editing.

Declaration of Competing Interest

None.

References (35)

  • K.P. Bennett et al.

    Robust linear programming discrimination of two linearly inseparable sets

    Optimiz. Methods Softw.

    (1992)
  • B.E. Boser et al.

    A training algorithm for optimal margin classifiers

  • J.F. Boston et al.

    A new class of solution methods for multicomponent, multistage separation processes

    Can. J. Chem. Eng.

    (1974)
  • C.-C. Chang et al.

    LIBSVM: a library for support vector machines

    ACM Trans. Intell. Syst. Technol.

    (2011)
  • S.B. Chitralekha et al.

    Application of support vector regression for developing soft sensors for nonlinear processes

    Can. J. Chem. Eng.

    (2010)
  • C. Cortes et al.

    Support-vector networks

    Mach. Learn.

    (1995)
  • R. Fletcher et al.

    Nonlinear programming and nonsmooth optimization by successive linear programming

    Math. Program.

    (1989)
  • Cited by (16)

    • A study on corrosion failure of the circulation line with valve openings in atmospheric tower based on CFD

      2022, Engineering Failure Analysis
      Citation Excerpt :

      Atmospheric distillation tower is the main device for crude oil processing in petrochemical enterprises and plays a vital role in the refining system. The safety of atmospheric distillation towers has always been the focus of people's attention [1–4]. The water dissolved of corrosive components will damage the tower and other important components.

    View all citing articles on Scopus
    View full text