Abstract

The study of the Yangtze River Delta urban agglomeration provides a reference for regional construction. We set up a suitable cluster model and a comprehensive evaluation model to evaluate the urban economic development and provide the basis for the government to formulate and adjust the economic measures. We use the heat map of correlation coefficient to eliminate the economic indicators that are less correlated with per capita GDP and use grey correlation analysis to detect the degree of correlation of economic indicators. The optimal method is determined by comparing the results of K-means clustering and fuzzy C-means clustering. In addition, we use principal component analysis to rank the status of urban economic development. The results show that the relationship between the added value of primary industry, the added value of agriculture, forestry, animal husbandry and fishery, and per capita GDP is close to 0. Regional development level is not balanced. Chizhou, Xuancheng, Bengbu, and other cities lag behind in the economic rankings. Finally, it is suggested to reform the laggard cities and try to improve their economic strength from various economic indicators.

1. Introduction

The Yangtze River Delta is one of the key regions of economic development in China, which has abundant development potential. At present, there is still a lot of room for economic development in the Yangtze River Delta region. The economic level of the middle city lags far behind the world’s fourth bay area. The Outline of the Plan for the Integrated Development of the Yangtze River Delta Region emphasizes that “cintegration” and “high quality” are the keys to giving full play to its leading and exemplary role.

In terms of realistic measurement models, domestic and foreign scholars take regional integration, coordination, and competition of urban agglomerations and other spatial econometrics as the logical starting point to explore the collaborative development path of urban agglomerations. Among them, Wu et al. [1], by building the Malmquist model and grey model, found that the green development efficiency of the Yangtze River Delta varies greatly and some cities are in a declining level in recent years, but the gap between different provinces is narrowing year by year. Zhang et al. [2] found that the difference of scale economy and structure economy is more prominent in the Yangtze River Delta High-Tech Zone. Fan et al. [3], by using GDP per capita, divide it into central cities, more developed cities, and less developed cities. Chao et al. [4] concluded by constructing the principal component model that the Yangtze River Delta has been developing towards low carbonization since its entry into WTO. Ma et al. [5] believed that most cities in the Yangtze River Delta region should strongly strengthen sustainable development. Foreign scholars creatively use the theory of breakpoint and geo-economic relationship to explore the spatiotemporal evolution characteristics of economic structure and spatial organization of urban agglomeration.

After consulting a large number of literature, we find that most scholars only focus on one or several aspects, but lack of research on the measurement of the overall economic development level of the Yangtze River Delta. Taking this as a starting point, this paper analyzes the impact on urban economic development under the background of integration, finds out the economic advantages of each city, and makes up for the shortcomings of economic development. It not only provides a demonstration for the coordinated development of national regions but also provides Chinese solutions for the coordinated development of global regions.

2. Analysis of Current Economic Situation

As high-quality development sets new requirements and urbanization enters a new stage, in 2019, the GDP of the Yangtze River Delta region accounted for 20 percent of the national total, and the per capita GDP of the region increased year by year. Derived by the vigorous development of digital economy, the integration process of many fields within the region is accelerating, giving rise to a series of high-tech, high-sales new enterprises. Its overall valuation ranks the first among China’s urban agglomerations, accounting for 16 of the world’s top 500, with optimistic development prospects and infinite potential. Figure 1 is the GDP line chart of the Yangtze River Delta and its internal regions from 2010 to 2019. The GDP of the four provinces has been growing steadily since its establishment in 2010. Among them, the GDP growth of Jiangsu Province is the most obvious, and the GDP development of Anhui Province is the slowest, and the overall level is the lowest.

3. Index System Construction and Data

3.1. Data Source and Description

The index data are all from the statistical yearbook of the corresponding cities in 2019. After referring to relevant literatures and selecting variables [610], we introduced the added value of various industries to measure the overall industrial economic development, introduced the added value of agriculture, forestry, animal husbandry, fishery, industry, construction industry, wholesale, and retail industry as specific economic indicators and introduced the total import and export volume to measure the regional trade balance level. We have removed the effect of the size of the workforce on the level of economic development.

3.2. Data Preprocessing

In Figure 2, we process the data through three steps to ensure the feasibility of subsequent studies.

3.3. Indicator System

Data symbols and interpretation are shown in Table 1.

4. Correlation Analysis of Economic Indicators

4.1. Methods

The rank correlation coefficient is calculated. Run n experiments on random variables X and Y, get the sample , among them, . Assume that , , their rank orders were, respectively, and (, ). Then the rank correlation coefficient of random variables X and Y for this group of samples is

If there is no same rank, the following formula can be used to calculate:

As X and Y get closer and closer to the strictly monotonic function relationship, the Spearman rank correlation coefficient will get larger and larger numerically. When X and Y had a strictly monotonically increasing relationship, their Spearman rank correlation coefficient was 1. On the contrary, when X and Y have a strictly monotonically decreasing relationship, the Spearman rank correlation coefficient is −1. When the Spearman rank correlation coefficient is 0, it means that with the increase of X, Y has no tendency to increase or decrease, in other words, there is no linear correlation with Y.

4.2. Results Analysis

According to the above steps, Figure 3 is a heat map drawn with Python software. In Figure 3, the correlation coefficient between the added value of the primary industry and the added value of agriculture, forestry, and animal husbandry and fishery X2 and the per capita GDP is close to 0, which should be eliminated. Because with the development of modern economy, the agricultural economy plays a small part in the economic growth.

5. Economic Index Factor Analysis Based on Grey Correlation Analysis

The grey correlation analysis method can judge whether the connection is close by determining the similarity degree of the geometric shape between the reference data column and several comparison data column. As the analysis is conducted according to the development trend, this method has no requirement on the sample size and does not need the typical distribution law, and the calculation amount is relatively small, so this method has no requirement on sample size. We use this method to test whether the economic indicators selected above are appropriate.

5.1. Methods

Each index of the data has different dimensional orders of magnitude, and direct analysis will affect the results. Therefore, the data are firstly normalized [1113]:where is the element in the i-th row and j-th column in the normalized matrix and is the element in the i-th row and j-th column in the original data matrix. In addition, .

The pretreated per capita GDP G and the relevant economic indicators X2, X3, and X5X9 after X1 and X4 are excluded are formed into an analysis matrix:where represents the observed value of per capita GDP in the n-th city and represents the observed value of the m-th index in the n-th city.

The analysis matrix is formed into an initial value matrix :

Form the initial value matrix into a difference matrix ,

Form into a correlation coefficient matrix ,where and are the minimum and maximum values in the correlation coefficient matrix , respectively. is the resolution coefficient, which is usually 0.5.

5.2. Results Analysis

MATLAB program is used to solve the correlation degree table of each related economic index to GDP per capita, as shown in Table 2. In Table 2, the correlation degrees of indicators X2, X3, X5, X6, X7, X8, and X9 are all greater than 0.9, indicating that other economic indicators are closely related to per capita GDP, and the indicators are selected appropriately.

6. An Empirical Analysis of the Current Situation of Urban Economy in the Yangtze River Delta Region

The fuzzy clustering analysis method is widely used in exploratory research. By using fuzzy set theory to analyze data, similar samples can be grouped into one group. The result is simple and intuitive. After consulting the relevant literature, we find that many scholars at home and abroad generally use the cluster analysis method in the study of urban agglomeration development.

Among many fuzzy clustering algorithms, the C-means clustering algorithm has been applied more and more widely and successfully in many fields in recent years. In most cases, objects in the data set cannot be divided into distinct clusters, and probabilistic methods can also obtain such weights, but sometimes it is difficult to determine an appropriate statistical model, so the use of fuzzy C-means with natural and nonprobabilistic characteristics is a better choice. The K-means clustering method has a strong degree of interpretation. When the clusters are dense, spherical, or lumpy, and the difference between clusters is obvious, its clustering effect is good. We take into account that each method has its drawbacks. In order to make the research results more accurate, based on Henderson’s study on city scale [14], we study the status quo of urban economy in the Yangtze River Delta region and determined the optimal method by comparing K-means clustering and fuzzy C-means clustering.

6.1. Economic Status Analysis Based on Fuzzy C-Means Clustering Algorithm
6.1.1. Methods

Let , where represents n-dimensional real vector space. Let represent the membership degree of the k-th sample to the i-th class, , , , and represent the cluster center of the i-th class. Then a fuzzy C-means [15] clustering of X is to find the minimum of the following objective function: where is the Euclidean distance from the k-th sample to the cluster center of the i-th class.

6.1.2. Research Ideas

Determine the number of clusters c and the number of m, and initialize the membership matrix :where is the cluster center, and .

Update the initial membership matrix :

For a given , in the actual calculation, the initial value is iteratively calculated, and when , the algorithm is terminated. If , then .

6.1.3. Empirical Analysis

After removing the two indicators, the 27 cities were divided into three categories, namely, the first level, the second level, and the third level. Among them, the first level indicates that the city has the best level of economic development, the second level indicates that the city’s economic development is at a medium level, and the third level indicates that the city’s economic development is poor. The classification of 27 cities is obtained, as shown in Figure 4.

From Figure 4, it can be seen that the economic development of Shanghai is the best among 27 cities, followed by 5 cities in Jiangsu and 3 cities in Zhejiang. Anhui’s cities are all in the third level.

6.2. Analysis of Economic Status Based on K-Means Clustering Method
6.2.1. Research Ideas

Select the initial center of mass. See model establishment for specific methods. Calculate the mean value of the corresponding samples assigned to each centroid in the previous step as the new centroid. Calculate the difference value of these two centroids, and repeat the steps until the value is less than the threshold.

6.2.2. Model Building

k observations randomly selected from the sample set are taken as the initial centroid [1618]. Each observation value is randomly assigned an initial centroid, that is, the initial cluster, and then enters the update cycle step to calculate the mean value as the random initial centroid of the cluster.

After selecting the initial set of k means according to the above steps, the algorithm will alternate between the steps, including the cycle between the following two steps:(i)Assign steps: the least squares Euclidean distance is calculated, and each observed value is assigned to the cluster with the closest mean value:where is the p-th observation value, the i-th cluster in the t-th cycle, and is the i-th cluster average in the t-th cycle.(ii)Update steps: recalculate the mean of the observations assigned to each cluster:where is the i-th cluster average in the t + 1 cycle.

When the assignment is no longer changed, the algorithm has converged, and clustering has been completed.

6.2.3. Empirical Analysis

The clustering results are obtained by Python program. See Figure 5 for details.

It can be seen from Figure 5 that the economic development of Shanghai is the best among 27 cities, followed by 13 cities such as Nanjing, Wuxi, Changzhou, and Zhoushan, while the economic development level of Hefei, Bengbu, Yancheng, and Chuzhou is relatively weak, and most of the weaker cities are from Anhui Province.

6.3. Analysis of Economic Status Based on Principal Component Analysis
6.3.1. Research Ideas

Figure 6 shows the specific steps of principal component analysis. In this way, the determination of weight is objective and reasonable, and it overcomes the defect of some evaluation methods in determining weight. The higher the correlation degree between indexes, the better the effect of PCA is [1922]. This method linearly converts multiple metrics into a new set of fewer metrics. They are independent of each other and retain most of the information of the original index data. Generally, select a line that minimizes the average square distance from point to line, and select the next line similarly from the direction perpendicular to the first one. Repeating this process will produce an orthogonal basis, on which each individual dimension of data is irrelevant, and these basic vectors are called principal components.

6.3.2. Research Technique

The preprocessed data matrix X is shown below:

The mean vector and covariance matrix S of the data matrix are calculated:

The eigenvalue and variance contribution rate k of covariance matrix S are calculated:

Sorting k from big to small, and calculating the cumulative variance contribution rate, should make the first m cumulative variance contribution rate greater than 85%, then the first m principal components y can be used to explain all indexes [2327].

The unit orthogonal eigendirections of the eigenvalue are coefficients. The expression of principal component y is as follows:

Calculate by the weighted product method:where is the vector after normalization of the first m variance contribution rates.

6.3.3. Analysis of Results

Table 3 shows the results obtained after calculation by MATLAB program.

It can be seen from Table 3 and Figure 7 that the three principal components y1, y2, and y3 cover most of the variable information. Therefore, these three principal components replaced the original eight indexes, and the original eight dimensions were reduced to three dimensions, which played a role in dimension reduction [2831].

Table 4 shows the first three principal component coefficients. Therefore, the expressions of the first three principal components are obtained as follows:

Table 5 shows the specific values (partial).

In Figure 8, the darker the color, the higher the score. According to the comprehensive scores and rankings of economic development of each city in Table 6, Shanghai ranks first and scores far ahead of other cities, so it can be said that it is the leader of the Yangtze River Delta urban agglomeration. Most cities in Zhejiang and Jiangsu have good economic development level, while most cities in Anhui Province have weak economic development by comparison, and only Hefei, the provincial capital, is in the middle and upper class.

7. Conclusion

Based on the above research, we find that no matter which classification method is used, Shanghai’s economy is clearly ahead that of other cities. Therefore, the further use of the principal component analysis method, the result is that Shanghai ranked first, Ningbo, Hangzhou ranked second, third, and Chizhou, Xuancheng, Bengbu and other cities in the economic ranking behind. In order to realize the integration of the Yangtze River Delta, it is obvious to reform these backward cities and try to improve their economic strength from various economic indicators [3234].

The novel feature of this paper is that multiple evaluation clustering methods are used to comprehensively evaluate the level of urban development to make the results more objective. On the research of international regional development problems, due to multifarious factors influencing the regional development level, so we are cautious of comprehensive selection index data at the same time because of considering the rationality of the method, by using a variety of methods, a comprehensive measure of regional development to further ease international regional development imbalances, promote international cooperation. Our research provides theoretical reference for the further construction of a community with a shared future for mankind.

Rational development planning among regions is necessary but should not be relied on blindly. Combes [35] found that urban agglomeration would hinder the economic growth of industrial and service sectors in France, while Gardiner et al. [36] believed that different conclusions may be drawn when discussing the relationship between spatial agglomeration and economy without considering spatial scale. Therefore, how to properly use the geographical advantages to achieve a win-win situation is worth thinking about. Based on the above research, we hereby cautiously propose the following suggestions:

Firstly, carry forward the local advantages, acknowledge the differences, but adhere to the overall integration of development. The gap between regions of the Yangtze River Delta will be narrowed, coordinating inter-city relations through strengthening the construction of social infrastructure. Promote the flow of human resources to underdeveloped areas, pay attention to the siphon effect between cities, so as to change the scale disadvantage of Anhui and other regions, so that they can get more benefits from the core cities.

Secondly, local development drives overall development. Actively promote the construction of demonstration areas, especially happy technology industries and key industries. By taking the lead in pilot projects at the local level, we will innovate a new model of integrated development while maintaining ecological balance. Adhere to sustainable, integrated development, pay attention to output efficiency, so as to drive the economic development of backward areas.

Thirdly, break administrative boundaries. The government has improved the platform for regional cooperation and the policy service system and actively promoted resource sharing. In terms of financial policy, the government should vigorously support emerging industries and industries with promising development, help enterprises connect with market resources, and conduct upstream and downstream cooperation.

Data Availability

The data used to support the findings of this study are included within the article.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This study was funded by the university-level research project “A Nostalgic Song of the Reclusive—A Reinterpretation of Walden Pond” (ACKYC19066), “First-Class Course” (acylkc202008), and the Teaching and Research Fund Project of the Anhui University of Finance and Economics (acjyyb2020011 and acjyyb2020014).