Abstract

The accuracy of parameter estimation plays an important role in economic and social models and experiments. Parameter resolution is the capability of an estimation algorithm to distinguish different parameters effectively under given noise level, which can be used to select appropriate algorithm for experimental or empirical data. We use a flexible distinguishing criterion and present a framework to compute the parameter resolution by bootstrap and simulation, which can be used in different models and algorithms, even for non-Gaussian noises. The parameter resolutions are computed for power law models and corresponding algorithms. For power law signal, with the increase of SNR, parameter resolution is finer; with the decrease of parameter, the resolution is finer. The standard deviation of noise and parameter resolution satisfies the linear relation; it relates to interval estimation naturally if the estimation algorithm is asymptotically normal. For power law distribution, parameter and resolution satisfy the linear relation, and experimental slope and theoretical slope tend to be consistent when significance level approaches zero. Last, we select an algorithm with finer resolution to estimate the Pareto index for the Forbes list of global rich data in recent 10 years and analyze the changes in the gap between the rich and the poor.

1. Introduction

The processing of economic and social experimental data requires accurate parameter estimation [14], which can analyze the experimental results more precisely [57]. Parameter estimation in physical model is an important part of physical analysis [8, 9].

In order to measure the accuracy of parameter estimation, we use parameter resolution [10, 11] which was considered in [12] for EM algorithm and estimation algorithms for linear regression model [13] before. Lu and Yu carried out a preliminary study on the resolution of parameters and analyzed the algorithm [12]. Focusing on the linear model, Zhang and Yu [13] further studied the evaluation criteria of parameter resolution. In order to investigate the parameter resolution further, this paper takes the power law model as an example to explore its influencing factors and practical applications. Parameter resolution is the capability of an estimation algorithm to distinguish different parameters effectively under given noise level, which can be used to select appropriate algorithm for experimental or empirical data. In this paper, we investigate parameter resolution systematically and present a framework to compute the parameter resolution by bootstrap and simulation based on classification rather than clustering [12, 13]. Our framework can be used in different models and algorithms, even for non-Gaussian noises. We use power law models, corresponding algorithms, and real data (Forbes rich list) as prototypes to show that the parameter resolution is always computable; it relates to interval estimation in statistics naturally if the estimation is asymptotically normal, but the situation of complicated noise is different.

In fact, the parameter identification is closely related to the selection of model, algorithm, and distinguishing standard. For the same set of experimental data, different models or the same model chooses different algorithms, and the estimated parameters may not be consistent, which makes the prediction capability of the unknown data different. Hence, the selection of the model and algorithm is important in the data analysis. The model and algorithm can be evaluated from the following four aspects: accuracy, robustness, calculation speed, and simplicity of the model. This paper puts forward the concept of parameter resolution, which is used to evaluate the accuracy of model and algorithm.

In this paper, the power-law model is selected, and the parameter resolution is studied by using nonlinear least-square method and maximum likelihood method. In order to study the relationships between the parameter resolution and the influencing factors, this paper selects different discrimination methods, estimation algorithms, parameter values, and noise types. The research shows that the classification algorithm has lower error rate, and lsqcurvefit algorithm can distinguish two more similar signals; with the increase of parameter values, the accuracy of parameter resolution is lower. In other words, the similar signals are more difficult to be separated, and the absolute error of local parameters is small; it is found that when the parameter estimation is asymptotically normal, the parameter resolution can be predicted by interval estimation, and other cases can also be obtained by experimental calculation; the accuracy of parameter resolution increases with the increase of signal-to-noise ratio and decreases with the increase of noise intensity. By changing the significance level, the theoretical and experimental values are compared, and the feasibility of using interval estimation to calculate parameter resolution is verified.

2. Parameter Resolution

In general, the parameter resolution of an algorithm depends on model, noise, and sample size as well as the criterion to distinguish two close signals. The distinguishing criterion is the most important part; we will use a flexible criterion which can be adjusted by two parameters, (precision) and (robustness).

Consider the modelwhere is the signal containing information of some physical process and is a zero mean independent identically distributed (IID) noise. The strength of noise is measured by the signal-to-noise ratio (SNR). If we have a sample for some , then , where and are the mean intensities of signal and noise, respectively. Select an algorithm to obtain an estimation of ; how can we say that algorithm distinguishes two close parameters and effectively for fixed noise level and sample size? Such a problem can be resolved by bootstrap and simulation; we provide the following distinguishing criterion.

2.1. Distinguishing Criterion
(1)Generate two groups of signals for given noise level and sample size, and use algorithm to obtain estimations .(2)Divide the elements in into two groups by distinguishing method , and compute the misjudgement rate or the precision . If is less than a prescribed threshold (e.g., ), algorithm can distinguish the two signals under significance level successfully.(3)Repeat 1 and 2 for times; if the success rate is greater than a prescribed threshold (e.g., ), then algorithm can distinguish the two signals effectively under significance level and robustness .

In the second step, to divide the elements in into two groups, one can use clustering [9, 10], which is an unsupervised learning method. We shall adopt classification, a supervised learning method, because it can lead to more precise parameter resolution (see Figure 1(a)). We use the significance level to adjust the precision of each bootstrap simulation and to control the robustness of all the simulations. The introduction of and makes the distinction criteria for similar signals and distributions more flexible. If and are fixed, the distinguishing criterion is also fixed. The parameter resolution of the estimation algorithm is the minimal interval between two parameters so that can distinguish them effectively.

For model (1), given a distinguishing criterion and a noise, the parameter resolution of algorithm at is the minimal number so that if then can effectively distinguish the two signals and . In other words, if belongs to the ambiguous intervalthe two signals and cannot be effectively distinguished by . In general, the ambiguous interval may not be symmetric, but it is almost symmetric (especially when the noise is symmetric), and we assume that it is symmetric for briefness. Using , we can obtain the absolute error, , and the relative error, .

The steps of computing parameter resolution for an algorithm at are formulated in the following experimental procedures.

2.2. Experimental Procedures for Algorithm A
(1)Distinguishing Criterion. Select appropriate distinguishing criterion, and (2)Produce two noisy signals and , (3)Estimation. Generate groups of parameter estimations and by algorithm (4)Distinguishing. Determine whether two signals are effectively distinguished by the selected distinguishing criterion(5)Calculation. Adjust to reduce until two signals cannot be distinguished effectively by the distinguishing criterion, and obtain the minimum distinguished interval which is the desired parameter resolution

Next, we will choose linear and exponential models as examples to show that parameter resolution can be applied to each model, as is shown in Figure 1. Figure 1(a) is compared with the experiment in [9]; it shows that the parameter resolution of classification is better than that of clustering. The estimation algorithm of (a) and (b) is the least-square method. Figures 1(c) and 1(d) are both exponential models but under different noise type.

Set the following in the experimental procedures: (a) and . , , , and are independent Gaussian noise with zero mean, and . Estimation algorithm is nonlinear least-squares method. Distinguishing methods are Support Vector Machine (SVM) classification and peak-based clustering [14]; and . (b) The model is the same as (a). For power law noise, , with subject to uniform distribution [0, 1], xmin = 0.01, and . (c) The model is as follows: , , , and . (d) The model is the same as (c). For power law noise, , with subject to uniform distribution [0, 1], xmin = 0.01, and .

Power law distribution exists in many fields of nature and daily life [1519]. For example, the distribution of the urban population’s size, the magnitude of earthquake [20], size of interplanetary debris [21], the distribution of the moon pit diameter on the surface of the moon [22], distribution of the scale of the war [23], and distribution of the personal wealth are all subject to the power law distribution. There are various power law distribution phenomena in nature and social life, and the study of them is of great significance [14, 2430]. Accurate estimation of power law indices is important, because different indices lead to different explanation. Power law signal and power law distribution are two forms of power law models. We investigate parameter resolution of the estimation methods for indices of power law models.

3. Power Law Signal

3.1. Compare Distinguishing Methods and Algorithms

Parameter of power law signal can be estimated by two algorithms: lsqcurvefit and nlinfit. Two estimated algorithms linearize the nonlinear function via the first-order term of its Taylor series for the use of linear least squares. Initial value of lsqcurvefit algorithm does not affect the iteration result; it affects the iteration speed. Initial value of nlinfit algorithm is more sensitive. Parameter of power law distribution can be estimated by maximum likelihood method. We will compute the parameter resolutions of these algorithms by the experimental procedures and reveal the properties of parameter resolution.

At first, we compare the effect of distinguishing methods (clustering and classification) on parameter resolution. Set the following in experimental procedures: and . (uniform in logarithm), , , , and are independent Gaussian noise with zero mean and variance equal to . Algorithms are lsqcurvefit and nlinfit. Distinguishing methods are Support Vector Machine (SVM) classification and peak-based clustering [31]; and .

Figure  2 shows the parameter resolution under different noise levels by using different distinguishing methods (clustering and classification) and algorithms (lsqcurvefit and nlinfit). As is shown in Figure  2(a), by comparing the differences of parameter resolution between different distinguishing methods, it can be observed that the number of misjudgements in using classification algorithm is less than that of clustering, and the parameter resolution is more precise. Because classification is supervised learning, objects can be tagged according to certain standards and then classified according to labels. Clustering aggregates samples without their own categories into different groups. Therefore, classification is suitable for situations where categories or classification systems have been determined, and clustering is suitable for situations where there is no classification system, and the number of categories is uncertain. For the two parameters determined by the category, the accuracy of classification is more precise because is smaller. Because of the locality of parameter estimation, when parameter is small, estimation is more accurate, and the parameter resolution is better. We can also find that the parameter resolution of lsqcurvefit algorithm is finer.

From Figure  2(b), it is observed that there is a linear relationship between the absolute error and the standard deviation . Fitting by linear model via least square of the line with SVM classification and lsqcurvefit algorithm, we obtainwith . It indicates that there is a positive linear relationship between the standard deviation of noise and parameter resolution.

3.2. Influence of Noise Type

Now, we investigate the influence of noise type on parameter resolution. We fix the SVM classification and lsqcurvefit algorithm to compare the parameter resolution under IID Gaussian noise and IID power law noise. Set the following values in experimental procedures: , , . , and . (a) , Gaussian noise: , the power law noise admits the following density, , and . (b) , and the others are the same as (a). (c) ; . (d) ; .

The results are presented in Figure 3. From Figure 3(a) and the red line in Figure 3(b), the absolute error is varied at different power law indices. If the noise is changed to power law, the relationship between and the power law index is shown with blue lines in Figures 3(a) and 3(b), which indicates stronger growth with index. It implies that parameter resolution of power law model depends on specific power law index, which is different from linear regression model [10]. The changes of relative error under different and are shown in Figures 3(c) and 3(d). We can find that, with the increase of SNR, parameter resolution decreases; with the increase of , parameter resolution increases. Figure 3 indicates that the parameter resolutions are quite different when the perturbation noise is changed.

4. Power Law Distribution

4.1. Experimental Procedure

In order to study the parameter resolution of maximum likelihood estimation for power law distribution, according to experimental procedure, we select the following parameters to compute the parameter resolution of power law distribution: produce two power law distributions: and , with random numbers obeying the uniform distribution in ; estimated algorithm is maximum likelihood method, data size is 500, 1000, and 2000, respectively, and the results are shown in Figure 4(a).

4.2. Experimental Results and Analysis

It can be seen from Figure 3(a) that there is an obvious linear relationship between absolute error and power index. For sample size , the curve is fitted to be a linear equation with . This indicates that the absolute error can be approximated by the standard deviation of noise.

In the power law model, the parameter resolution based on absolute error and parameter satisfies the positive linear relationship:where is the power law index and is the sample size.

In fact, the maximum likelihood estimation for different parameters and in power law distribution is carried out [32]:

The regions where two normal distribution overlap may be misjudgement. The confidence interval at significance level of can be calculated via the pivot amount .where is quantile of normal distribution. The length of confidence interval is . The length of confidence interval and the ambiguous interval are almost identical when and . The parameters resolution based on absolute error meets the linear relationship if confidence level and the sample size are fixed, with theoretical slope . Equation (3) can be proved in the same way, because and are normally distributed [33]. This indicates that if the estimation is asymptotically normal, it can be approximated by interval estimation. Other noise conditions do not have such a rule, but parameter resolution is always computable.

Parameter resolution is related to distinguishing criterion, especially the significance level and robustness . Fix , and varies from 0.05 to 0.20; the slope of absolute error and experimental slope are obtained. Details are shown in Figure 4(b). The theoretical slope is greater than experimental slope because the parameter resolution in the worst case is obtained by the theory; that is, there may be no points in the shadow part. Figure 4(b) shows that, with the increase of , the experimental slope and theoretical slope decrease, and absolute error will decrease, which means that we can distinguish two similar signals more easily. The difference between experimental slope and theoretical slope increases gradually; it tends to be consistent when . Figure 4(b) also shows the accuracy of interval estimation.

5. Application

Forbes magazine publishes an annual list of the richest people. The list includes the net worth of each individual as well as background information about the businesses that have led to this prosperity. It includes individuals involved in all sectors of the economy, such as computer software, financial investments, retailing, and computer hardware as well as media, entertainment, communication, real estate, and many other sectors.

Although the people included in the Forbes 400 list made their fortunes in various different ways, the distribution of their wealth exhibits a striking statistical regularity. This regularity describes not only the wealth distribution at the very top of wealth range but also provides insights into any order of magnitude below this range of wealth distribution. These empirical findings can clarify the nature of the wealth accumulation process and its relationship to central issues such as market efficiency and market volatility.

Wealth distribution is typically studied by employing data about the number of individuals within each wealth range (e.g., the number of individuals with wealth between $100,000 and $150,000). Here, we employ a different approach; we use the Forbes 400 list to study the distribution of wealth at the top wealth range. The advantage of this approach is to use very specific data about the wealth of everyone on the list, as well as avoiding the aggregation problems of standard approach. The disadvantage, of course, is that this approach is limited to the very top of the wealth distribution. However, as we argue below, the results obtained with this approach may shed light on a much wider range of the wealth scale and on the capital investment process in general [34, 35].

According to top 100 Forbes global list from 2012 to 2020, this paper counts the trade’s distribution each year, and the results are shown in Table 1. We divide the trade into four categories: finance, IT, industry (retail, manufacturing, and mining), and service. In general, the overall distribution of trades remained unchanged, the number of industries has risen steadily in the past six years, the number of financial sectors has also increased, and the other two fields slightly decreased.

The empirical results show that the distribution of wealth is subject to power law distribution [3638]. Render shows that if the distribution of wealth follows the power law distribution, the value of personal wealth and its wealth ranking are satisfied [39]:where is the rank ( being the wealthiest person, etc.), is the wealth of individual with rank , is a constant, and exponent is related to the Pareto exponent, , as [39]. This implies that when wealth is plotted against rank on a log-log scale, a linear relation is observed for a Pareto distribution. Exponent quantifies the level of wealth inequality. As decreases, the wealth distribution becomes more uneven, and few individuals have a substantial amount of wealth. A bigger means a bigger gap between the rich and the poor. We analyze the wealth ranking data from the top 100 Forbes global rich list in the recent 10 years; data can be obtained from https://www.forbes.com.

Determine the original signal based on the data of 2020 of top 400 Chinese mainland rich. Nonlinear least-squares algorithm is used to fit the signal according to model (7), and we obtain the fitting values and . Set the following values in experimental procedures: (a) , , , and Gaussian noise: . Distinguishing method is SVM classification, and . Algorithms are lsqcurvefit and nlinfit, and obtain upper and lower absolute errors, respectively. (b) The Pareto index of top 100 Forbes global rich in the recent 10 years with lsqcurvefit algorithms.

As can be seen from Figure 5(a), the minimum upper and lower bounds of parameters can be distinguished by different algorithms. It indicates that different algorithms will lead to different parameter resolutions. In this condition, the lsqcurvefit algorithm’s capability of resolution is better.

In Figure 5(b), we use the top 100 Forbes global rich list in the recent 10 years. According to the experiment, lsqcurvefit algorithm has finer resolution, so we select it to analyze the changes in indices, which are more accurate. The results show the rationality and effectiveness of the parameter resolution in practical application. Noise level is unknown in real life, but when the noise level , it can provide more accurate estimations and lead to better understanding for practical problems. In the recent 10 years, the global gap between the rich and the poor has increased slightly and has been relatively stable in the last two years. The widening of the gap between the rich and the poor is universal in contemporary world and has a certain relationship with economic globalization. The widening gap between the rich and the poor is inextricably linked to knowledge and education. Antipoverty is a war-preparation problem that all countries attach great importance to. The results show the rationality and effectiveness of the parameter resolution in practical application.

6. Conclusion

In this paper, we investigate parameter resolution systematically and present a framework to compute the parameter resolution. Parameter resolution is the natural extension of the resolution concept; we take the power law model as an example to explain the rationality, feasibility, and effectiveness of the concept of parameter resolution. Our framework can be used in different models and algorithms, even for non-Gaussian noises. For power law model, the influence of each factor on the resolution system is analyzed. With the increase of the SNR, the parameter resolution is finer. Parameter resolution can be calculated in every case for any algorithm, as well as distinguishing criterion and noise, and so forth. If the estimations are asymptotically normal, parameter resolution can be approximated by interval estimation in statistics. With the decrease of significance level, the experimental slope and the theoretical slope both decrease, and the result tends to be consistent when approaches zero. It can be used to select a more precise algorithm for experimental data. We use Forbes rich list as a prototype to show how to apply the parameter resolution to real data. All computation codes and data are available upon request.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that there are no conflicts of interest.

Acknowledgments

This work was supported by the National Key Research and Development Program of China (no. 2020YFA0714200).