Abstract

Quantitative precipitation estimation (QPE) based on Doppler radar plays an important role in severe weather monitoring, industrial and agricultural production, and natural disaster prediction and prevention. However, the temporal and spatial variability of precipitation leads to large errors in radar estimates of mixed precipitation. To improve the accuracy of radar QPE, we propose an offline spatiotemporal deep fusion model that uses the reflectivity data of the Shijiazhuang Doppler radar Z9311 and the precipitation data from 17 national weather stations (NWSs) and 260 automatic weather stations (AWSs). Considering the abrupt spatial changes in precipitation, a three-dimensional radar data structure is proposed, and the spatial features of multielevation and multiscale radar data are extracted and merged using the feature fusion network (FFNet). Finally, the time dependence of the precipitation is captured using the long short-term memory (LSTM) network, and the precipitation estimation is obtained. Based on a comparison of the results of the proposed model (FFNet-LSTM) with those of the ordinary kriging (OK) interpolation, two Z-R relationship, the multilayer perceptron (MLP), the LSTM, and the FFNet, the proposed method is superior to these models, has a promising performance, and is a general-purpose rainfall algorithm.

1. Introduction

A more accurate and real-time quantitative precipitation estimation (QPE) provides a reliable data source for hydrology, meteorology, and disaster forecasting [1, 2]. However, the spatial distribution of the rainfall process is extremely complicated, and 70–80% of the uncertainty of the terrestrial hydrological process is attributed to the temporal and spatial variability of precipitation, which poses a severe challenge in QPE [3, 4]. A rain gauge is a tool commonly used in QPE, which can provide accurate measurements at point locations. However, the sparse network of rain gauges leads to a low spatial coverage [5]. The spatial resolution of radar data is the best of all types of remote sensing data [6]. While the spatial resolution of satellite images is greater than 5–10 km, the spatial resolution of radar data is about 1 km [79]. Therefore, radar data are often used in QPE research due to their better temporal and spatial resolution [1012], but precipitation estimations using radar data contain various errors. The large-scale deployment of dual-polarization weather radar has not yet been achieved in mainland China [13], and it is difficult to obtain its data. Therefore, this research is based on single-polarization radar data. Hereinafter, single-polarization radar is referred to as radar for convenience. Currently, the QPE methods based on rain gauge and radar data include two types—the rain gauge interpolation method and the radar conversion method.

In previous studies, the Kriging algorithm is the most widely used and studied rain gauge interpolation methods [14]. The Kriging interpolation method obtains the precipitation value at an unknown location by estimating the continuous attribute value between the rain gauges. The methods based on Kriging all make the basic assumption of Gaussian characteristics, and the fitting of the variogram model is complicated [15, 16]. In order to relax the Gaussian assumption and make the interpolation method more flexible, some researchers have conducted experiments from different angles. Lu and Wong [17] studied the spatial structure of data that cannot be effectively modeled using a typical variogram and proposed an adaptive inverse distance weighting (AIDW) spatial interpolation technique. Chen et al. [18] compared five interpolation methods based on daily precipitation data from 753 stations in mainland China; these were the ordinary nearest neighbor, local polynomial, radial basis function, inverse distance weighting (IDW), and ordinary kriging (OK) methods. They concluded that the OK and IDW methods rank the highest in terms of the quality of the interpolation of the daily precipitation in China. To obtain the variogram of the OK method with better adaptability in daily precipitation interpolation, Wang et al. [19] compared and analyzed the four commonly used variograms (Exponential, Spherical, Gaussian, and Linear). The experimental results show that the Exponential and Spherical functions have good adaptability to the OK method in daily precipitation interpolation. However, rain gauges have several drawbacks, such as poor spatial representation, limited spatial coverage, and uneven spatial distribution [20, 21]. Therefore, the performances of precipitation estimations based on the rain gauge interpolation method are unsatisfactory in practice.

The Z-R relationship is a traditional radar conversion QPE algorithm. The rainfall intensity is calculated using the nonlinear relationship between the radar reflectivity (Z) and the precipitation rate (R) [2224]. In early research, the simple Z-R relationship was obtained statistically in different climate zones, and then a quantitative estimation of the precipitation was realized [25, 26]. Ryde predicts various weather systems based on radar reflectivity of different sizes (5–10 cm) [27]. Based on China’s New Generation Weather Radar (CINRAD) system data, the QPE in mainland China follows the fixed relationship of [28]. Limited by the technical conditions at the time, most radar reflectivity and rain gauge measurement data contain errors, and the Z-R relationship cannot be corrected in time. Moreover, the parameters in the Z-R relationship are affected by factors such as synoptic weather situations, hydrology, and geography, thus varying in time and space [2933]. Such a situation makes the Z-R relationship an idealized model that has difficulty reproducing the actual conditions.

Many studies aimed to reduce radar QPE errors based on the Z-R relationship. Brandes and Edward [34] used the ratio between the radar estimates and the rain gauge measurement data as a correction factor (without considering the spatial variation between the ratios) to optimize the Z-R relationship. Yuan et al. [35] proposed an improved least squares method as a discriminant function to optimize the parameters in the fixed Z-R relationship. They summarized the Z-R relationship optimized by the optimization method, and both the average relative error at the station and the relative error of area rainfall are greatly reduced. Ramli et al. [36] divided the rainfall into seven categories according to the rainfall intensity and season and derived new Z-R relationships using the optimization method, which significantly improved the radar QPE accuracy. They also concluded that the Z-R relationship depends largely on the location and type of rainfall. Similar conclusions have been obtained in other studies; that is, the Z-R relationship does not apply to all regions [3739].

Radar QPE research based on data-driven methods has been conducted for many years, such as artificial neural networks (ANNs) [40, 41], nonparametric methods [42], support vector machines (SVMs) [43], and random forest (RF) [44]. Unlike the Z-R relationship, these methods attempt to directly capture the relationship between the radar reflectivity and the rain gauge measurements. Kusiak et al. [41] used a multilayer perceptron (MLP) to estimate the rainfall intensity on multiple time scales, and the estimation results were more accurate than those obtained using the four data-mining algorithms (i.e., the RF, classification and regression tree, SVM, and K-nearest neighbor). Tang et al. [5] explored the time dependence of precipitation and constructed a continuous conditional random field (CCRF) with geographic and time weighting for radar QPE, which produced a more accurate precipitation estimate than the Z-R relationship. Liu et al. [45] studied the local spatial correlation of rainfall using a radial basis function neural network (RBFNN) with local mapping characteristics to build an adaptive neural network (ANN), which considered the variability of the relationship between the radar reflectivity and the rainfall estimation. Liu et al. [46] demonstrated that reflectivity 1–4 km above the rain gauge is the best input vector for the RBFNN. The spatial variation during the rainfall process was fully considered by Xiao and Chandrasekar [47] and Trafalis et al. [48]. They conducted a preliminary study on the structure of the three-dimensional radar data; that is, they obtained constant altitude plan position indicating (CAPPI) data at different altitudes above a ground station. Three-dimensional radar data can reflect the spatial structure of the rainfall and be used to obtain more information that may affect the precipitation estimate. However, the above research uses the same size radar data at different altitudes, regardless of the influence of the horizontal advection. The data observed by the radar are related to the rainfall rate on the ground, with a certain time delay and spatial movement [22]. Therefore, three-dimensional radar data that use CAPPI of the same size at different altitudes contain more information that has nothing to do with precipitation. Chiang et al. [49] considered the influence of the horizontal advection in the atmosphere and concluded that the coverage of 3D radar data at each altitude is related to the horizontal wind speed and the vertical (terminal) velocity of the raindrops. Therefore, before designing the size of the data grid, it is necessary to determine the time it takes the raindrops to reach the ground from different altitudes. They assumed a raindrop size of 4 mm, calculated the terminal velocity using an empirical formula, and obtained the time for the raindrop to reach the ground from the CAPPI at different altitudes. The horizontal offset was determined by assuming the tangential wind speed in the area. Finally, they determined the size of the CAPPI at different altitudes and established a radar QPE and a quantitative precipitation prediction (QPF) in Taiwan based on a dynamic ANN.

Although the above methods considered the time dependence and spatial correlation of the precipitation to a certain extent and have achieved good results, the model in the previous study is only suitable for point-shaped radar data due to the structural characteristics. The input of the model is multiple data points in three-dimensional radar data, which cannot describe the real rainfall process. In addition, the estimated precipitation time scale is usually 1 hour or longer in these studies. Compared with the models used in these studies, the precipitation estimates on the 6-minute time scale are intense and susceptible to noise, especially the estimation of light rain [50, 51]. In this study, a radar QPE method based on the spatiotemporal deep fusion model is proposed to solve the abovementioned problems. It is expected to further improve the accuracy of radar QPEs because the new method considers the spatial variations and temporal integrity of the precipitation simultaneously. The rest of this manuscript is structured as follows. Section 2 presents the information about the study area and the data source. Section 3 describes the details of the model and the evaluation metrics. Section 4 covers the evaluation of the proposed model and the comparison models at a 6-minute time scale. The main conclusions of the study are summarized in Section 5.

1.1. Study Area and Data Source

The study area is over Shijiazhuang (37°27′–38°47′N, 113°30′–115°20′E) (red area in the inset in Figure 1) and spans two major geomorphological units: the Taihang Mountains and the North China Plain. The study area is low in the southeast and high in the northwest, with a large elevation difference and complex topography. Under the influence of the topographic uplift, the precipitation in the region decreases from west to east, and the rainfall is concentrated in the summer (June-September) [41]. The typical characteristics of the summer precipitation in the study area are a high precipitation intensity, short duration, and strong locality. The heavy rain in the area is difficult to predict and is often underreported [5254]. Therefore, the local short-term mixed precipitation estimation is the focus of our research.

All the data used in this study were obtained from the Shijiazhuang Meteorological Center (SMC), including January to December 2017–2019 (UTC+8) Z9311 Doppler radar reflectivity and National Weather Stations (NWSs) data and Automatic Weather Stations (AWSs) data from January to December 2018 (UTC+8). The precipitation data from the NWSs are part of a national ground weather station precipitation dataset established by the China National Meteorological Center based on long-term historical data and quality control. 97.5% of NWS data have an accuracy rate higher than 99%, which have reliable authenticity [55]. The data quality of the AWSs is affected by factors such as the observation instrument, observation technique, station location, observation time, and observation method. Various nonclimatic factors have a greater impact on the ground meteorological data obtained via long-term observations [56]. The rainfall resolution of the AWSs is 0.1 mm per minute, and the maximum error is ±3%. In this study, the Grubbs algorithm [57, 58] was used to control the quality of the AWSs precipitation data. Specifically, the abnormal values in the precipitation data within 3 hours were removed, which improved the authenticity of the AWSs precipitation data. Affected by the East Asian monsoon, the water vapor transported by the East Asian monsoon in June begins to affect the study area. The monsoon enters an active period from July to August, providing a large amount of water vapor for precipitation in the area. In late August, the subtropical high retreats south, but the water vapor carried by the monsoon can still affect the area during this retreat [59]. Therefore, the precipitation data from June to September each year were used as the research data. As shown in Table 1, the Doppler radar Z9311 is located in Xinle, Shijiazhuang (38°21′6″ N, 114°42′42″E), and the effective scanning radial distance is 230 km. The radar completes 9 volume scans with different elevation angles in 6 minutes (0.49°, 1.40°, 2.38°, 3.20°, 4.27°, 5.93°, 9.79°, 14.50°, and 19.42°), and the scan mode is the volume coverage pattern, scan strategy , version 1 (VCP21). Therefore, the radar file is updated every 6 minutes. To better obtain the spatial structure of the precipitation, the CAPPI was used as the radar data in the research. Specifically, reflectivity of less than 0 dB was eliminated, and then the original polar coordinate radar data were converted into reflectivity in Cartesian coordinates, with a grid size of 1 × 1 km. In addition, the missing data at a certain elevation were interpolated using a Cressman weighting function (Equations (1) and (2)) [60]. The radius used in the X, Y, and Z directions was 1.5 km, 1.5 km, and 1 km, respectively. Finally, the CAPPIs of different elevations were generated with each rain gauge as the center.

In equations (1) and (2), R is the radius of influence and and r is the distance between the gate and the grid point.

Correspondingly, there are 17 NWSs and 260 AWSs within the coverage of this radar (Figure 1). The data from NWSs and AWSs were calculated every 1 minute. To match the precipitation and reflectivity data, we marked the reflectivity according to time. The radar reflectivity data recording time was taken as a mark, the corresponding precipitation data were located, and the rain gauge rainfall in the previous 6 minutes was collected. This is assumed that the recording time of the radar data file is T and the time of the collected precipitation data file is T, T−1, T−2, ..., T−5. The unit of the generated precipitation data is mm/6 min. Then, based on the longitude and latitude of the weather station, we matched the reflectivity with the precipitation. The precipitation and reflectivity data were also labeled based on time.

2. Methodology

The proposed deep learning-based spatiotemporal deep fusion model framework is shown in Figure 2. The entire process can be divided into three parts: (1) data preprocessing, including geometric correction, time alignment, conversion of raw radar data into CAPPI, training, verification, and test data acquisition; (2) combining the FFNet and LSTM to establish a spatiotemporal deep neural network model; and (3) evaluation of the precipitation estimation model, which includes training, verifying, and testing the proposed model based on the data and quantifying the performance of the model through certain evaluation indicators.

2.1. Data Processing

Thorndahl et al. [38] proposed the three-dimensional radar data structure shown in Figure 3. The grid size of the 0.5 km elevation is 3 × 3 km2, the grid size of the 1.0 km elevation is 7 × 7 km2, the grid size of the 1.5 km elevation is 11 × 11 km2, and the grid size of the 2.0 km elevation is 15 × 15 km2. In addition, they concluded that the correlation between the reflectivity of the lowest elevation and the rainfall is more significant and stable than at the other three elevations, and the variability at the three higher elevations is greater than that at the lowest elevation. Therefore, the 9 data points at the lowest height and the average reflectivity of the three higher elevations were taken as the input, for a total of 12 data points. However, this method greatly destroys the spatial correlation between the three-dimensional data structure, and it is also difficult to capture the relationship between the high-level reflectivity and the ground rain gauge. According to Hao et al. [61], the rainfall in the study area from June to September in the past years did not exceed 150 mm per month and the average rainfall intensity of the heavy rain over the years was 75 mm/d. A method of sorting and averaging based on two parameters (SATP) [62] is used to establish the relationship between the rainfall intensity R and the raindrop volume diameter D, and it is concluded that the raindrop size of the study area does not exceed 4 mm, which is the same size as the data assumed in the study of Thorndahl et al. [38]. Therefore, the same three-dimensional radar data structure was also used in our research. The difference is that instead of the average reflectivity at the three higher altitudes, we used all the grid values at the higher altitudes as the input and merged the reflectivity at the different altitudes.

The best training data period of the radar precipitation model is within 1 hour [45]; that is, the precipitation within 1 hour has a strong correlation. Therefore, based on the continuity of the precipitation process, we used 10 precipitation data and 10 three-dimensional radar data within 1 hour as a precipitation sample of a station to estimate the precipitation at the last timestamp within an hour. Finally, 16,603 and 43,324 precipitation samples were generated based on the NWSs and AWSs data, respectively.

2.2. Basic Neural Network Models

Feature fusion networks (FFNets) are widely used in the field of computer vision, such as image segmentation [63], target tracking [64], and content-aware image resizing [65]. An FFNet mainly uses convolutional neural networks (CNNs) to extract target features in a layer-by-layer abstract manner. A CNN can automatically extract the important feature information from the image through the view of the convolution kernel and the combined characteristics of the neurons. Therefore, it can capture the spatial features of two-dimensional data. In addition, an FFNet fuses two feature maps using the addition algorithm, so it can extract the edge information better. In the estimation of the precipitation, the spatial distribution of the rainfall above the rain gauge is complicated, and it is difficult to accurately estimate the rainfall on the ground rain gauge through the radar reflectivity information for a certain elevation. The rainfall information for different elevations can be integrated through the FFNet to establish the relationship between the high-level radar features and the ground rain gauge. Moreover, the precipitation information for each grid point is related to the surrounding information, and the spatial dependence of the target grid and the surrounding area is automatically learned through the different perception domains in the CNN.

A recurrent neural network (RNN) is a neural network commonly used to process sequence data. Specifically, the RNN layer has feedback connections, which allow the past information to affect the current input. Through this structure, the RNN can remember past information. Long short-term memory (LSTM) is a special RNN, which can solve the problem of gradient disappearance and gradient explosion in the training process of long sequence data. For longer sequences, the LSTM performs better than the ordinary RNN. The structure is shown in Figure 4.

The LSTM is composed of three internal gates: the input gate i, the output gate o, and the forget gate f. These structures control the information of the input and output memory units, optimize the hidden state ht, and obtain Ct using the following equations:

Gate vectors:

Adapt:

Update status:

In equations (3)–(8), and b are the matrix weights and deviations, respectively; , and are the gate vectors; is the sigmoid function; xt is the input vector; and denotes element-wise multiplication. Specifically, the united state is used to store the accumulation of past information, and the forget gate ft determines how much of the cell state at the previous moment is retained to the current time, and the input gate it determines how much information of the network input is saved to the cell state at the current time, and the output gate ot determines how much information the cell state outputs. Precipitation is a continuous process, and the precipitation information for each timestamp is related to the precipitation information in the past time. Therefore, the time dependence of the precipitation information at different timestamps can be obtained using the LSTM.

2.3. Spatiotemporal Deep Fusion Model (FFNet-LSTM)

In order to capture the spatial and temporal correlations between the rain gauge and radar reflectivity simultaneously, an FFNet was used to blend the spatial features of the multielevation and multisize radar reflectivity, and the LSTM was used to capture the time dependence. Finally, a spatiotemporal deep fusion network combining the FFNet and LSTM was developed to achieve radar quantitative precipitation estimation. Figure 5 illustrates the framework of the proposed model.

The spatiotemporal deep fusion network includes two parts: the FFNet model and the LSTM model. In the precipitation process, the precipitation at each spatial location does not exist in isolation but is associated with the surrounding space. The FFNet extracts the feature vectors related to the precipitation at the grid center points from the multielevation and multisize 3D radar reflectivity data and merges them from high level to low level. The FFNet is divided into a feature extraction function and a feature fusion function, which is composed of multiple convolutional layers with different convolution kernel sizes and a matrix addition algorithm. The four network features are flattened and concatenated as the output of the model. Given the integrity constraints of the time series, the FFNet network is built on the LSTM network and the complete sequence is returned. It is difficult for a single-layer LSTM network to learn complex spatiotemporal characteristics in a shallow structure, and the use of multilayer LSTM units gives the model the ability to learn by leaps and bounds. The ground rainfall is affected by the weather conditions before and after the timeline. A bidirectional-LSTM (Bi-LSTM) can not only learn past information but can also capture future information. Therefore, based on the continuity of precipitation, a multilayer Bi-LSTM model was used to obtain the time dependence of the precipitation. The output of the fusion network at the last time stamp (R10) was used as the value of the radar QPE.

The precipitation samples from the NWSs were randomly divided into 10 parts using ten-fold cross-validation, nine of which were used for training and the remaining one was used for validation at each time (Figure 6). The precipitation samples from the AWSs were used as the test set. The training dataset was used to optimize the model parameters and prevent overfitting. To build a regression model using deep learning algorithms, the hyperparameters of the model should be adjusted during the training phase. The initial values of the parameters were randomly selected from the standard normal distribution. The mean square error (MSE) was defined as the minimum loss function to calculate the error between the estimated value and the true value (Equation (9)). The output of the FFNET-LSTM network is multi-time stamp precipitation, which contains the output of the time stamp considered and for the 9 previous time stamps; and the estimated precipitation of the time stamp considered is the output of the final model, so we propose two forms of loss function: Loss_T10 (Equation (10)) and Loss_Sum (Equation (11)).

Loss_T10 is the error between the estimated value and the true value in the last time point. Loss_Sum is the sum of the errors in 10 time stamps. In addition, the learning rate update strategy was introduced in the model training stage, and the learning rate was updated every 10 iterations. The error in the training phase was trained using the gradient descent method, and the parameters of the model were updated to minimize the error of each epoch. Various hyperparameter combinations were tested, the learnable parameters of the model were optimized, and the lowest MSE was obtained. Finally, a minibatch size of 20 and an epoch size of 150 were determined through the minimizing processes.

2.4. Evaluation Parameters

The mean value of the 10 trained models was used to evaluate the performance of each rain gauge. The root mean square error (RMSE), mean bias (MB), mean absolute error (MAE), and correlation coefficient (CC) were selected as the 6-minute time-scale precipitation estimation criteria. The specific calculation formulas are as follows:

In equations (12)–(15), is the measured value of the rain gauge (mm/6 min), is the estimated value of the model (mm/6 min), and are the mean values of and (mm/6 min), respectively, and n is the number of all observations.

The threat score (TS) was selected as the criterion for evaluating the quality of the precipitation estimation. According to the classification standard proposed by Zhang et al. [66], the 6-minute time scale of the precipitation was divided into 5 different grades (Table 2), allowing the TS for precipitation of all intensities to be determined. The TSs arewhere the k values range from 1 to 4, meaning that the rainfall estimated by the model is ≥0.1 mm, ≥0.7 mm, ≥1.5 mm, and ≥4 mm, respectively. NA, NB, and NC are the hit counts (true positive events), false alarms (false positive events), and miss times (missing events), respectively.

3. Results and Discussion

3.1. Performances of Different Models for Precipitation Estimation

To illustrate the priority of the proposed model, the Ordinary Kriging (OK) interpolation based on Spherical variogram [19], (Z–R(1)), Z-R relationship based on the optimization algorithm (ZR(2)) [35], the MLP model, the FFNet model, and the LSTM model were selected for comparison with the proposed model. The CAPPI data at an elevation of 2.0 km were used as the input of the FFNet (FFNet (Grid 2.0 km)) to verify the effectiveness of the FFNet model’s feature fusion. Two loss functions (Loss_T10 and Loss_Sum) were considered in the model with the LSTM unit.

As can be seen from Figure 7 and Table 3, the correlation of the OK interpolation based on the cross-validation method [67] is the lowest, the TSs of the four precipitation categories are the lowest, and the TSs of the rainstorm and downpour are less than 5%. The degree of precipitation’s stationarity decreases as the amount of precipitation increases, and the increase in precipitation sometimes leads to a decrease in the correlation described by the semivariogram, which affects the quality of the precipitation interpolation [18]. The RMSE, MAE, and MB of the Z-R (1) are the largest, indicating that the estimated rainfall of the Z-R (1) is higher than the rain gauge value, and the error is the largest. Therefore, the fixed Z-R relationship does not apply to all regions of China. The Z-R (2) based on the optimization algorithm is more suitable for the study area than the fixed-relationship Z-R (1), and its error is less than that of the Z-R (1). In addition, the quality of the precipitation estimate of the Z-R (2) is better than that of the Z-R (1), and in particular, the estimate for the downpour increased by 87%. The TSs of the two Z-R relationships for light/moderate rain, heavy rain, and rainstorms are all greater than 50%, which is one of the reasons that the Z-R relationship is an effective algorithm for radar QPE, which is consistent with the results of other researchers [45]. The error of the MLP model is higher than that of all the other deep learning networks, and the TSs of the 4 categories of rainfall are only higher than those of the two Z-R relationships. This may be because the MLP only fits the radar data and precipitation data nonlinearly, and it does not consider their temporal and spatial correlations [68]. The error of the FFNet model is lower than that of the MLP, indicating that the spatial information has a significant improvement effect on the radar QPE. The RMSE and MAE of the FFNet (Grid 2.0 km) are significantly higher than those of the FFNet, and its TSs for heavy rain and rainstorms are the lowest among all of the models, which is due to the low correlation between the high elevation radar reflectivity information and the surface precipitation [69] and the fact that it only uses a single layer of reflectivity data and ignores the spatial structure of the rainfall system and the effectiveness of the rainfall feature fusion. When the LSTM (LSTM (Loss_T10) and LSTM (Loss_Sum)) models are used to estimate the precipitation, only the time dependence of the precipitation is considered, and the error is lower than that of the MLP and Z-R relationships. Therefore, the temporal correlation of the precipitation can also improve the accuracy of the radar QPE. Moreover, the error of the LSTM model using the loss function Loss_Sum is less than that using the Loss_T10, which is because the sum of the loss values of multiple timestamps can reflect the overall performance of the fusion precipitation within one hour, so the network can better learn the correlation between the time series data, correct the precipitation estimation errors of the past timestamps in advance, and improve the overall accuracy of the estimate of the fusion precipitation intensity. Therefore, the MB of the LSTM (Loss_Sum) is the smallest among all the models. The proposed model using the loss function Loss_Sum (FFNet-LSTM (Loss_Sum)) fully considers the dependence of the precipitation on both time and space, and it is better than all of the comparable models. The corresponding RMSEs and MAEs are the lowest, i.e., 2.52 mm/6 min and 0.66 mm/6 min lower than the Z-R (1), respectively; and the correlation coefficient increases from 0.65 to 0.85. The TSs of the FFNet-LSTM (Loss_Sum) model for heavy rain, rainstorms, and downpours are the highest among all of the models.

3.2. Spatial Distribution of the Metrics for the Different Models

The Z-R (1) and MLP are the most representative radar QPE models in terms of meteorology and deep learning, respectively. Therefore, they are compared with the FFNet-LSTM (Loss_Sum) to assess the spatial distribution of the metrics. The spatial distributions of the metrics for the three models’ 6-minute precipitation estimations over the study area are shown in Figure 7. Based on the observations from the 260 Automatic Weather Stations (AWSs), the CC, RMSE, MAE, and MB of the precipitation estimates obtained using the Z-R (1), MLP model, and FFNet-LSTM (Loss_Sum) were calculated for each gauge, and the gauge values were interpolated into others using the inverse distance weighting (IDW) interpolation to obtain the spatial distribution of the metrics [70]. As shown in Figure 8, the correlation coefficients of the Z-R (1) and MLP model have a large area of low values in the center of the study area, while the FFNet-LSTM (Loss_Sum) reduces the area of the low CC values in the center area and further improves the overall correlation coefficient of the study area. The MAE, RMSE, and MB values of the Z-R (1) are the highest among all of the models in the study area, indicating that there is a large discrepancy between the estimated values and the reference values and that the amount of precipitation is overestimated. The spatial distributions of the RMSE, MAE, and MB values of the MLP and FFNet_LSTM (Loss_Sum) are similar, but the error value of the FFNet-LSTM (Loss_Sum) is lower than that of the MLP overall. The three models all have different degrees of unevenness in terms of the distributions of their metric values. This is due to two possible factors. First, the distribution of the AWSs is uneven, and the actual precipitation measurement error cannot be used in areas with sparse stations. Second, the simple quality control algorithms have limitations in solving the problem of the AWS data quality improvement [71].

3.3. Performance of the Models on Different Time Scales

Figure 9 shows the detection and estimation of the rainfall intensity in different evolution stages of a rainfall event that occurred in Shijiazhuang from 8:00 UTC on June 8th to 16:00 UTC on June 9th in 2018 obtained using the proposed model and the comparison models. Time-series plots for the hourly rainfall estimates obtained using the AWSs, Z-R (1), MLP, FFNet, LSTM (Loss_Sum), and FFNet-LSTM (Loss_Sum) are shown in Figure 9(a). The Z-R (1) almost overestimates the entire rainfall event. The MLP underestimates the rainfall intensity during multiple periods of the rainfall process. The FFNet improves the accuracy of the precipitation estimation, but the results are not good during the period when the rainfall intensity changes suddenly. The LSTM (Loss_Sum) underestimates the rainfall intensity during the most intense period of rainfall. The estimate of the FFNet-LSTM (Loss_Sum) is in good agreement with the AWS observations although there was a slight underestimation from 0500 to 0900 UTC on June 9. During periods of heavy precipitation, the positive deviation of the model tends to increase, but the trend of the FFNet-LSTM (Loss_Sum) is the least obvious (Figure 9(b)). The Z-R (1), MLP, FFNet, and LSTM (Loss_Sum) models have maximum positive bias values of 28.7, 13.6, 11.5, and 6.9 mm/h, respectively. However, the positive bias of the FFNet-LSTM (Loss_Sum) is significantly smaller than that of the comparison models (about 5 mm/h).

Quantitative precipitation estimates on a longer time scale are of great significance for studying water resource changes during the flood season and for coping with secondary disasters caused by precipitation [50, 51, 72]. Figure 10 demonstrates how the proposed model and the comparison models perform in terms of detecting and estimating the daily rainfall intensity in Shijiazhuang during the flood season from July 7 to September 1, 2018. Similar to the quantitative hourly precipitation estimation, the Z-R (1) overestimates the amount of rainfall during the flood season. The precipitation estimated by the MLP is lower than the measured values from the AWSs. The evaluation results of the FFNet and LSTM (Loss_Sum) are similar, and underestimation occurs during periods of high rainfall intensity. The estimated values of the FFNet-LSTM (Loss_Sum) correspond well with the AWS observations although there is a slight underestimation when there is a sudden change in the rainfall intensity (Figure 10(a)). The maximum values of the positive bias for the Z-R (1), MLP, FFNet, LSTM (Loss_Sum), and FFNet-LSTM (Loss_Sum) models are about 108.1, 121.1, 114.2, 66.6, and 34.6 mm/d, respectively; and the numbers of days when the positive deviation is less than 5 mm/d are 19, 21, 22, 24, and 33, respectively (Figure 10(b)). The estimated precipitation time series of the model on the hourly and daily time scales show that the proposed model can estimate long-term precipitation events very well, and it can accurately describe the rainfall in the flood season.

4. Conclusions

Considering the spatial correlation and time dependence of precipitation and recognizing the advantages of the FFNet in feature fusion and extraction and those of the LSTM in capturing the time dependence, an offline radar QPE method based on the FFNet and LSTM was developed. First, considering the temporal and spatial characteristics of precipitation, radar CAPPI data were selected and three-dimensional radar data were used as the input for the feature fusion and extraction. Second, the dependence of the time series radar data was captured based on the sensitivity of the LSTM to the sequence data. Finally, the FFNet and LSTM were creatively combined, and it was found that the loss sum based on the time series data (Loss_Sum) greatly improves the accuracy of the precipitation estimation. The experimental results show that the proposed FFNet-LSTM method is significantly better than the basic estimation methods.

In addition, the FFNet-LSTM provides a new perspective for the utilization of high-elevation radar data and quantitative precipitation estimation in an accurate and effective way. Based on the better performance of the FFNet-LSTM model with regard to the spatial distributions of the metrics and precipitation estimations on different time scales, we conclude that once it is trained and large-scale data are input, it can be applied to the radar QPE of a large area and a long time series. In future investigations, it will be interesting to investigate how the FFNet-LSTM model performs in far-from-sample situations associated with climate variability and changes. If successful, it may provide a new technical choice for quantitative precipitation estimation based on the Doppler radar.

Data Availability

The Doppler weather radar data, national weather stations (NWSs) data, and automatic weather stations (AWSs) data used to support the findings of this study were supplied by Shijiazhuang Meteorological Observatory under license and so cannot be made freely available.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This study was supported by the National Natural Science Foundation of China (Grant nos. 41875027 and 41871238), National Key Research and Development Program of China (Grant no. 2018YFC1506502), Jiangsu Postgraduate Practice Innovation Program (Grant no. SJCX21_0367), and Shijiazhuang Meteorological Bureau (Grant no. SJZQXJHT2019-45).