Abstract

The uses of statistical distributions for modeling real phenomena of nature have received considerable attention in the literature. The recent studies have pointed out the potential of statistical distributions in modeling data in applied sciences, particularly in financial sciences. Among them, the two-parameter Lomax distribution is one of the prominent models that can be used quite effectively for modeling data in management sciences, banking, finance, and actuarial sciences, among others. In the present article, we introduce a new three-parameter extension of the Lomax distribution via using a class of claim distributions. The new model may be called the Lomax-Claim distribution. The parameters of the Lomax-Claim model are estimated using the maximum likelihood estimation method. The behaviors of the maximum likelihood estimators are examined by conducting a brief Monte Carlo study. The potentiality and applicability of the Lomax claim model are illustrated by analyzing a dataset taken from financial sciences representing the vehicle insurance loss data. For this dataset, the proposed model is compared with the Lomax, power Lomax, transmuted Lomax, and exponentiated Lomax distributions. To show the best fit of the competing distributions, we consider certain analytical tools such as the Anderson–Darling test statistic, Cramer–Von Mises test statistic, and Kolmogorov–Smirnov test statistic. Based on these analytical measures, we observed that the new model outperforms the competitive models. Furthermore, a bivariate extension of the proposed model called the Farlie–Gumble–Morgenstern bivariate Lomax-Claim distribution is also introduced, and different shapes for the density function are plotted. An application of the bivariate model to GDP and export of goods and services is provided.

1. Introduction

The Lomax or Pareto II (the shifted Pareto) distribution was proposed by Lomax in the mid of the last century to model business failure data. This model has a wide range of applications in a variety of fields, particularly, in income and wealth inequality, size of cities, and financial and actuarial sciences. Furthermore, it has been applied to model income and wealth data, the size distribution of computer files on servers, reliability, life testing, and curve analysis [1].

A random variable say X is said to follow the two-parameter Lomax distribution if its cumulative distribution function (cdf) denoted by is given bywhere . The probability density function (pdf), survival function (sf), and hazard rate function (hrf) of the Lomax random variable are given byrespectively.

The Lomax model can be obtained in a number of ways. It can be obtained as a special form of the Pearson type VI distribution. It is also considered a mixture of the exponential and gamma distributions. In the context lifetime scenario, the Lomax distribution falls under the domain of decreasing failure rate distributions. This distribution has been proved as a significant alternative to the exponential, Weibull, and gamma distributions to model heavy-tailed data sets. Due to the importance and applicability of the Lomax distribution, it has been extensively generalized and modified to obtain a more flexible extension of the Lomax distributions, for example, power Lomax distribution [2], transmuted Lomax distribution [3], exponentiated Lomax distribution [4], weighted Lomax distribution [5], exponential Lomax distribution [6], gamma Lomax distribution [7], Poisson Lomax distribution [8], an extended Lomax distribution [9], Marshall–Olkin extended Lomax [10], exponentiated Weibull Lomax [11], Kumaraswamy generalized power Lomax [12], Marshall–Olkin length biased Lomax [13], Gompertz Lomax [14], half-logistic Lomax [15], Gumbel Lomax [16], transmuted Weibull Lomax [17], and transmuted exponentiated Lomax [18].

In this article, we focus on proposing a new three-parameter modification of the Lomax distribution called the Lomax-Claim (L-Claim) distribution for modeling financial data. The L-Claim distribution is introduced by adopting the approach of a class of claim distributions of Ahmad et al. [19]. The cdf and pdf of a class of claim distributions are given, respectively, by

In this article, the L-Claim distribution along with its statistical properties will be given intensive statistical treatment. However, the flexibility and applicability of the L-Claim distribution are examined by an application to the insurance loss insurance data.

The rest of this paper is carried out as follows. In Section 2, the L-Claim distribution is introduced. Section 3 is devoted to the bivariate extension of the L-Claim distribution. The quantile function and some possible plots for the mean, variance, skewness, and kurtosis are provided in Section 4. Section 5 is devoted to the maximum likelihood estimation of the L-Claim distribution. The Monte Carlo simulation study is conducted in Section 6. A real-life application of the L-Claim model to insurance loss data Farlie–Gumble–Morgenstern bivariate Lomax-Claim (FGMBL-Claim) is applied to the GDP and export of goods and services is provided in Section 8. Some concluding remarks are given in the last section.

2. The L-Claim Distribution

In the following section, we study a three-parameter L-Claim distribution and investigate the shapes of its pdf. The cdf and pdf of the L-Claim distribution can be obtained by substituting expressions (1) and (2) in (3) and (4), respectively.

A random variable X has the L-Claim distribution if its cdf is given by

The pdf corresponding to (5) is given by

Some possible behaviors of the pdf of the L-Claim distribution are shown in Figure 1. The plots in the right panel of Figure 1 are sketched for (red-line), (green-line), and (black-line), whereas the plots in the left panel of Figure 1 are presented for (red-line), (green-line), and (black-line).

From Figure 1, it is clear that when the parameters and increase and , then the proposed L-Claim posses heavy tails. Due to the right-skewed and heavy-tailed behavior of the proposed model, it can be a good candidate model for modeling heavy-tailed data which are very important in financial and actuarial sciences.

The sf and hrf of the L-Claim distributed random variable are given byrespectively. Some possible behaviors of the hrf of the L-Claim distribution are shown in Figure 2. The plots in the right panel of Figure 2 are sketched for (red-line), (green-line), and (black-line), whereas the plots in the left panel of Figure 2 are presented for (red-line), (green-line), and (black-line).

As we stated above, the two-parameter traditional Lomax distribution belongs to the class of decreasing failure rate distributions. However, from the plots provided in Figure 2, it is clear that the L-Claim distribution has unimodal and increasing failure rate functions. Henceforth, besides the heavy-tailed behavior, this is another superiority of the proposed model over the Lomax distribution.

3. Farlie–Gumble–Morgenstern Bivariate Lomax-Claim Distribution

Copula functions are used to represent the joint cdf of the two marginal univariate distributions. If is the univariate cdf of , the joint cdf denoted by is defined by the copula function given bywhere is the dependence measures between and . The joint cdf and pdf of the Farlie–Gumble–Morgenstern (FGM) copula (Conway [20]) are given, respectively, by

The random variables say follow the FGMBL-Claim distribution if its cdf is defined bywhere . The corresponding pdf is

Different plots for the FGMBL-Claim distribution are provided in Figures 3 and 4 . Whereas plots for the cdf and sf of the FGMBL-Claim distribution are provided in Figure 5.

4. Quantile Function

The quantile function of distribution is very useful to apply for generating random numbers by Monte Carlo simulation. Suppose X follows the L-Claim distribution, then the quantile function of X can be obtained via inverting in (5). We obtain

The nonlinear expression provided in (12) can be used to obtain the random numbers from the L-Claim distribution.

Furthermore, the effects of the shape parameters on the skewness and kurtosis can be detected on quantile measures. We obtain skewness and kurtosis measures of the L-Claim distribution using (12). Bowley’s skewness (Bowley [21]) of X is given as follows:whereas Moor’s kurtosis (Moor [22]) is as follows:

These measures are less sensitive to outliers. Moreover, they do exist for distributions without moments. Some possible plots for the mean, variance, skewness, and kurtosis of the L-Claim distribution are provided in Figures 6 and 7 .

5. Parameters Estimation

Several approaches for the estimation of the unknown parameters have been studied and applied in the literature. Among them, the method of maximum likelihood estimation is the most widely applied approach. The estimators obtained via this method possess several desirable properties and can be used quite effectively for constructing confidence bounds. In this section, we adopt this method to obtain the estimators of the model parameters. Let be observed values of the random sample taken from the L-Claim distribution with parameters , and . The log-likelihood function corresponding to (9) is given by

The expression provided in (8) can be characteristically solved by using Newton’s approach or by fixed-point iteration methods. The partial derivatives of (8), on behalf of parameters, are

6. Monte Carlo Simulation Study

This section deals with assessing the performance of the maximum likelihood estimators of the L-Claim distribution by the Monte Carlo simulation study. The simulation is performed for two different sets of parameters of the L-Claim distribution. The simulation study is carried out as follows:(1)Random samples of different sizes are generated from L-Claim distribution.(2)The model parameters have been estimated via the maximum likelihood method.(3)750 repetitions are made to calculate the biases, absolute biases, and mean square errors (MSEs) of these estimators.(4)The formulas for obtaining the biases and MSEs are given byrespectively.(5)Step (4), is also repeated for the parameters .

For the simulated dataset 1, the box plot and Kernel density estimator are presented in Figure 8. Whereas for the simulated dataset 2, the box plot and Kernel density estimator are presented in Figure 9.

The simulation results of the L-Claim distribution are presented in Tables 1 and 2 . To support the results provided in Tables 1 and 2, the simulation results are also displayed graphically in Figures 10 and 11 .

7. Application of the L-Claim to Vehicle Insurance Loss Data

The main applications of the heavy-tailed distributions are the extreme value theory or insurance loss phenomena. In this section, we illustrate the applicability of the L-Claim distribution by analyzing the vehicle insurance loss data. The data are available at http://www.businessandeconomics.mq.edu.au. We compare the goodness-of-fit results of the L-Claim distribution with the two-parameter Lomax distribution and three-parameter power Lomax (P-Lomax) distribution [2], transmuted Lomax (T-Lomax) distribution [3], and exponentiated Lomax (E-Lomax) distribution [4]. We estimate the unknown parameters of the fitted distributions via the maximum likelihood method using the R-script adequacy model with the “Nelder–Mead” method; see Appendix. The goodness-of-fit statistics including the Anderson–Darling (AD), Cramer–Von Mises (CM), and Kolmogorov–Smirnov (KS) with the corresponding value are used to compare the fitted models.

Table 3 gives the MLEs of the model parameters. Whereas the analytical measures are provided in Table 4. In general, the smaller the values of these measures, the better the fit to the data. After carrying out the analysis, we observe that the L-Claim distribution gives the lowest values for the analytical measures among all the fitted models. So, the L-Claim distribution could be chosen as the best model for modeling heavy-tailed financial datasets. Furthermore, in support of the results provided in Table 3, the estimated pdf and cdf, Kaplan–Meier survival, and probability-probability (PP) and quantile-quantile (QQ) plots of the proposed model along with the box plot of the data are sketched in Figure 12. The plots provided in Figure 12 show that the L-Claim distribution provides a close fit to the insurance loss data.

8. Application of the FGMBL-Claim to Economics Data

The economy is an important sector in many developed and developing countries. Therefore, the government and other responsible institutions are always interested in GDP growth and exports of goods and services. To demonstrate the effectiveness of the proposed FGMBL-Claim distribution, we consider the GDP growth and exports of goods and services. The summary measures of the considered data based on the response variable such as exports of goods and services and GDP growth are provided in Table 5.

8.1. Testing of Normality of the Economics Data

An assessment of the normality of data is a requirement for many statistical tests because normal data are a basic assumption in parametric testing. There are two main methods of assessing normality such as numerically and graphically. In this section, we check the normality of both the variables using the Shapiro–Wilk (SW) normality test and the Anderson–Darling (AD) normality test. Additionally, we use the graphical approach based on the quantile-quantile () plot. The plot draws the correlation between a given sample and the normal distribution and provides a visual judgment about whether the distribution is bell-shaped. It plots a 45-degree reference line and assumes normality if all the points fall approximately along the reference line.

8.1.1. The Shapiro–Wilk Normality Test

The SW normality test can be performed as follows: : the data are normally distributed vs. : the data are not normally distributed. After applying the SW normality test, we observe that the SW values for and are, respectively, given by 0.9421 and 0.9336 with corresponding p values give by 0.0948 and 0.0650, respectively, since the p values are greater than the level of significance 0.05. Therefore, we conclude that the distributions of the data are significant from the normal distribution. In simple words, we can assume normality. From Figure 13, we can see that the data are normally distributed as all the points are scattered around the diagonal line.

8.1.2. The Anderson–Darling Normality Test

The Anderson–Darling (AD) test is another prominent approach to check the normality of the data. In this subsection, we perform the AD normality test to check the normality of the data. Under this test, the null hypothesis and alternative hypothesis can be defined as follows: : the data are normally distributed vs. : the data are not normally distributed. After carrying out the analyses, the values of the AD normality test statistic for and are given by 0.6533 and 0.7338 with respective values give by 0.0798 and 0.0719. We can see the values of the AD test statistic is greater than 0.05. Therefore, we fail to reject the null hypothesis and conclude that the data are normally distributed. From the plots provided in Figure 14, we can see that the data are normally distributed as all the points are scattered around the diagonal line.

8.2. Modeling of the Economics Data

The prime interest of the proposed FMGBL-Claim distribution is to be applied for data analysis purposes, making it useful in many applied fields. Here, this aspect is illustrated by considering an economics dataset based on the GDP growth and exports of goods and services. The total time on test (TTT) plot is an empirical plot and used for model identification purposes. The TTT plots of the economics dataset are presented in Figure 15.

In this subsection, we use the economics data to illustrate the proposed model. We prove that the fit power of the FGMBL-Claim model is better than the Farlie–Gumbel–Morgenstern bivariate Lomax (FGMBL) distribution [23]. The word “Better” is used in the sense that the FGMBL-Claim distribution has the smallest values of the considered fitting measures; this needs to be clarified; the details are given below.

First, we estimate the unknown parameters of the fitted models using the maximum likelihood approach using the R-script with library (BB). The estimated values of the model parameters are presented in Table 6. To the best of our knowledge, there is no strong physical interpretation of the MLEs, and they are through an optimization technique. Here, we use the Newton–Raphson iteration procedure using library (BB) to estimate the model parameters and make the FGMBL-Claim distribution flexible enough to capture the information behind the data.

The comparison is done based on some discrimination measures such as the Akaike information criterion (AIC) and the Bayesian information criterion (BIC). After performing the analysis, the discrimination measures of the fitted models are presented in Table 7. The largest values of these measures are considered the worse performance. The results provided in Table 7 reveal that the FGMBL-Claim distribution is the best candidate model for the economics data because it has the smallest values of the fitting measures.

9. Concluding Remarks

In this study, a new extension of the Lomax distribution called the Lomax-Claim model is introduced. The proposed distribution is very flexible and possess heavy tails. The maximum likelihood estimators of the parameters are obtained. Furthermore, a Monte Carlo simulation study is provided to evaluate the behavior of the estimators. The proposed L-Claim model is illustrated via analyzing a heavy-tailed vehicle insurance loss dataset, and the comparison is made with some well-known competitive models. From the real application, we showed that the L-Claim model provides a better fit to the heavy-tailed vehicle insurance loss data than the other distributions. Furthermore, a bivariate extension of the L-Claim distribution called Farlie–Gumble–Morgenstern bivariate Lomax-Claim distribution is introduced. Finally, the bivariate extension of the L-Claim distribution is illustrated by analyzing economic data related to the GDP and export of goods and services.

Appendix

R-code for the Analysis

Note. In the following R-code, a is used for , s is used for , is used for , and pm is used for the proposed model.library (AdequacyModel)dataread.csv (file.choose(), header = TRUE)data = data [,6]data = data [!is.na(data)]hist (data)########################################################################### Probability density function######################################################pdf_pmfunction (par, x){a = par[1]s = par[2] = par[3](as∗((1+x)(−a−1))∗(1−((1+x)(−a)))∗(2−(1−s)∗(1−((1x)(−a)))))/(((1−(1−s)∗(1−((1x)(−a)))))^2)}########################################################################### Probability density function######################################################cdf_pmfunction (par, x){a = par[1]s = par[2] = par[3](s∗((1−((1x)(−a)))2))/(1−(1−s)∗(1−((1x)(−a))))}set.seed (0)goodness.fit (pdf = pdf_pm, cdf = cdf_pm,starts = c(0.5, 0.5, 0.5), data = data,method = ”Nelder-Mead,” domain = c(0, Inf), mle = NULL)

Data Availability

This work is mainly a methodological development and has been applied on secondary data, but if required, data will be provided.

Conflicts of Interest

The authors declare that they have no conflicts of interest to report regarding the present study.

Acknowledgments

The study was funded by the department of statistics, Yazd University, Yazd, Iran.