CARM30: China annual rapeseed maps at 30 m spatial resolution from 2000 to 2022 using multi-source data

Liu, Wenbin; Li, Shu; Tao, Jianbin; Liu, Xiangyu; Yin, Guoying; Xia, Yu; Wang, Ting; Zhang, Hongyan

doi:10.1038/s41597-024-03188-1

Download PDF

Data Descriptor
Open access
Published: 08 April 2024

CARM30: China annual rapeseed maps at 30 m spatial resolution from 2000 to 2022 using multi-source data

Wenbin Liu ORCID: orcid.org/0000-0003-4547-2506^1,2,
Shu Li¹,
Jianbin Tao³,
Xiangyu Liu²,
Guoying Yin²,
Yu Xia²,
Ting Wang⁴ &
…
Hongyan Zhang⁵

Scientific Data volume 11, Article number: 356 (2024) Cite this article

695 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

Rapeseed is a critical cash crop globally, and understanding its distribution can assist in refined agricultural management, ensuring a sustainable vegetable oil supply, and informing government decisions. China is the leading consumer and third-largest producer of rapeseed. However, there is a lack of widely available, long-term, and large-scale remotely sensed maps on rapeseed cultivation in China. Here this study utilizes multi-source data such as satellite images, GLDAS environmental variables, land cover maps, and terrain data to create the China annual rapeseed maps at 30 m spatial resolution from 2000 to 2022 (CARM30). Our product was validated using independent samples and showed average F1 scores of 0.869 and 0.971 for winter and spring rapeseed. The CARM30 has high spatial consistency with existing 10 m and 20 m rapeseed maps. Additionally, the CARM30-derived rapeseed planted area was significantly correlated with agricultural statistics (R² = 0.65–0.86; p < 0.001). The obtained rapeseed distribution information can serve as a reference for stakeholders such as farmers, scientific communities, and decision-makers.

The 10-m crop type maps in Northeast China during 2017–2019

Article Open access 02 February 2021

High-resolution crop yield and water productivity dataset generated using random forest and remote sensing

Article Open access 21 October 2022

Mapping 20 years of irrigated croplands in China using MODIS and statistics and existing irrigation products

Article Open access 15 July 2022

Background & Summary

Rapeseed (Brassica napus L.) is a vital cash crop type, responsible for 13% of the world’s vegetable oil production^1,2,3. It has been utilized for centuries as a source of cooking oils, animal feed, and protein meals^4,5. In Europe, rapeseed production has experienced significant growth in recent decades due to the expansion of cropland and its potential for biofuels^6,7. However, in many less-developed countries and regions, labor-intensive and smallholder-operated rapeseed cultivation is closely correlated with the rural population structure. The rural economy and agricultural plantation are subject to significant transformations due to the rapid marketization and urbanization processes^8,9,10. Understanding the spatiotemporal dynamics of rapeseed can assist agricultural producers, scientific communities, and decision-makers in comprehending the potential of the vegetable-oil market, implementing refined agricultural management, and promoting the sustainable production of regional and national oilseeds.

Several publicly available remotely sensed land cover map products include specific rapeseed layers, such as the 30 m Cropland Data Layer produced by the United States Department of Agriculture and National Agricultural Statistics Service using moderate-resolution satellite imagery¹¹, the 10 m European crop type map created by the European Commission using synthetic aperture radar (SAR) data and LUCAS Copernicus in-situ observations¹², the 30 m Annual Crop Inventory product released by Canadian Agriculture and Agri-Food using both optical and SAR satellite images¹³, the 10 m annual Crop Map of England (CROME) from 2016 to 2020 produced by the Rural Payments Agency of the UK using Sentinel-1/2 imagery, and the 10 m RapeseedMap10 database across 33 countries in America and Europe published by Beijing Normal University, China using Sentinel-1/2 imagery¹⁴. These resources provide field-level information on rapeseed planting, benefiting government agencies, farmers, insurance companies, and other stakeholders and offering a promising future for the industry. However, despite these advancements, annual maps for national-scale rapeseed cultivation covering long period are currently limited due to a lack of ground survey samples, low availability of satellite imagery in cloudy areas, and regional differences in the phenology of rapeseed.

Rapeseed has a long history in China and occupies the largest planting area among oil crops, consistently providing over 13 million tons of oilseed to 1.4 billion people annually^15,16. However, various factors such as rural labor being absorbed by urbanization, changes in vegetable oil consumption, and even an international oilseed trade deficit may impact enthusiasm for smallholder rapeseed planting^17,18,19,20. To evaluate changes in rapeseed planting, Tao, et al.¹⁸ employed MODIS data to map the spatiotemporal dynamics of rapeseed cultivation in the middle reaches of the Yangtze River Valley of China in 2003 and 2015. However, the low resolution of 250 m MODIS data is not sufficient to adequately describe the fragmented landscape in southern China, let alone the intricate spatiotemporal dynamics. Liu and Zhang²¹ created rapeseed extent maps (REMs) at 10 m resolution in southern China using optical, SAR, and the Global Land Data Assimilation System (GLDAS) data, offering a detailed examination of rapeseed spatiotemporal patterns and revealing the process of field intensification. However, it is only capable of mapping winter rapeseed and ignores spring rapeseed produced in northern China. This gap was subsequently filled by the first nationwide 20 m annual rapeseed map for 2017 to 2021 generated by Zang, et al.²². Despite this, existing rapeseed maps are limited in terms of spatial accuracy, particularly in central and southern China, due to the scarcity of reliable ground samples and poor satellite observations caused by cloudy weather conditions^23,24. Developing remote sensing mapping techniques that are more reliable and less dependent on ground samples is essential for accurate rapeseed monitoring.

Recent years have seen the proposal of various methods for mapping rapeseed, including (1) spectral-based methods^25,26, (2) phenological-based methods^14,27,28, (3) machine learning-derived methos²⁹, and (4) collaborative mapping methods^21,22,24,30. Rapeseed blooms are easily identifiable due to their distinctive bright yellow hue during flowering^31,32. Many rapeseed mapping methods aim to improve the phenological or spectral separability of rapeseed from other contemporaneous crops, such as the normalized difference yellow index (NDYI)²⁷ and the canola index (CI)²⁶. However, due to the frequent cloud cover degrading the flowering image quality of rapeseed, these unsupervised classification approaches are challenging to deploy in southern China²⁴. Machine learning classifiers can reduce reliance on flowering images for mapping rapeseed by inferring the characteristics distinguishing rapeseed from other crops using non-flowering variables³³. However, the large-scale use of these classifiers can be hindered by the scarcity of well-represented training samples³⁴. To properly address this issue, collaborative mapping strategies have been proposed to automatically generate training samples from existing datasets or phenological-based methods^33,35,36. For instance, Zhang, et al.²⁴ proposed a seamless and automated rapeseed mapping (SARM) method that fuses phenology and random forest (RF) classifiers and was applied in southern China²¹. Zang, et al.²² integrated a rule-based sample generation strategy and a one-class classifier to map the 20 m national-scale rapeseed maps from 2017 to 2021 in China. Despite some challenges, this collaborative mapping method, which combines unsupervised and supervised approaches, provides a feasible theoretical framework for creating high-precision and long-term rapeseed maps in China.

In light of the aforementioned issues, we developed an automated method for creating a nation-scale, long-term rapeseed map at 30 m spatial resolution (CARM30) from 2000 to 2022, using multi-source data (Table S1). As illustrated in Fig. 1, our process comprises four components: (1) identifying rapeseed flowering time across China using field surveys, high-resolution satellite images, and GLDAS data; (2) generating and optimizing training samples for rapeseed mapping algorithm; (3) producing annual maps of rapeseed extent on Google Earth Engine (GEE); and (4) validating the accuracy of the CARM30 using independent ground samples, other existing rapeseed maps, and official agricultural statistics.

Methods

Study area

China is the third-largest producer of rapeseed globally, following Canada and India^3,37, with 24% of vegetable oil consumption coming from rapeseed³⁸. Due to the diverse climate types (Fig. 2a), rapeseed in China is split into winter and spring variants depending on planting and vernalization timing^39,40. Winter rapeseed is primarily found in southern China (Fig. 2b). It is typically planted from November until harvest before June in the following year, with a growing season of 210 to 230 days³⁷. In contrast, spring rapeseed is planted in April and harvested before September after around 140 days of growth, mainly in Xinjiang, Qinghai, Gansu, and Nei Mongol⁴¹. To accurately map rapeseed across China, the mapping task was divided into two subtasks: winter and spring rapeseed mapping (Fig. 2c). The planting boundaries for winter and spring rapeseed were determined by their growing seasons, with clearly different data collection times for the two planting areas.

Landsat data

The Landsat program has provided over 50 years of moderate-resolution Earth observation images suitable for large-scale and long-term crop types mapping⁴². The GEE platform was used to access and process the surface reflectance data from November 1999 to September 2022 from four Landsat satellite sensors, including Landsat-5 Thematic Mapper (TM), Landsat-7 Enhanced Thematic Mapper Plus (ETM+), Landsat-8 Operational Land Imager (OLI), and Landsat-9 Operational Land Imager 2 (OLI-2). Images for the winter rapeseed growing region were gathered between November 1 of the preceding year and June 1 of the current year, while images for the spring rapeseed growing region were gathered between April 1 and September 1 of every year. To increase the observation frequency within the time series, multiple Landsat sensors were combined within overlapping periods. An inter-sensor harmonization approach was applied to eliminate the spectral differences between TM/ETM+ and OLI sensors^43,44.

Several vegetation indices (VIs) were then calculated for each Landsat image. The normalized difference vegetation index (NDVI)⁴⁵ and NDYI²⁷ are commonly used due to their ability to detect time-series flowering signals^31,32. The CI effectively captures variations in spectral reflectance of rapeseed during flowering and is commonly used for automatic rapeseed mapping²⁶. The Winter Rapeseed Index (WRI) enhances the separability of winter rapeseed from winter crops²⁴. Therefore, time-series NDVI, NDYI, CI, and WRI were combined to describe the growth status of rapeseed (Eqs. (1–4)):

$$NDVI=\frac{\rho NIR-\rho Red}{\rho NIR+\rho Red},$$

(1)

$$NDYI=\frac{\rho Green-\rho Blue}{\rho Green+\rho Blue},$$

(2)

$$CI=\rho NIR\times (\rho Red+\rho Green),$$

(3)

$$WRI=\frac{\rho NIR-\rho Green}{\rho NIR+\rho Green}\times \frac{\rho Blue}{\rho Green+\rho Red},$$

(4)

where ρNIR, ρRed, ρGreen, and ρBlue denote the surface reflectance of the near-infrared, red, green, and blue bands in the Landsat images.

GLDAS data

GLDAS data from 1999–2022 was used to extract environmental variables that can characterize the phenological changes in rapeseed⁴⁶. We selected ten variables that were closely related to vegetation growth as the input of the regression model. These variables included plant canopy surface water, canopy water evaporation, potential evaporation rate, soil moisture at 0–10 cm and 10–40 cm depths, soil temperature at 0–10 cm and 10–40 cm depths, accumulated air temperature, downward shortwave radiation, and total precipitation rate. The 3-h GLDAS data was composited into daily time series. For the winter rapeseed growing region, the time series from November 1 of the preceding year to February 1 of the current year was used, while for the spring rapeseed growing region, the time series from April 1 to July 1 of each year was used. Additionally, a latitude, longitude, and elevation grid with 0.25° spatial resolution was constructed as the base geographic elements. These environmental variables have been applied in monitoring the flowering phenology of winter rapeseed in southern China²¹.

Land cover data

Cropland layers were extracted from land cover data sources including GlobeLand30 for the years 2000, 2010, and 2020^47,48, and WorldCover 10 m v100 for 2020⁴⁹. To align the spatial resolution with that of GlobeLand30, the ESA WorldCover product was downsampled to 30 m using the nearest neighbor method. A unified cropland mask layer was subsequently created by merging the cropland coverage from both GlobeLand30 and WorldCover v100 products. This layer was employed to eliminate non-cropland pixels from the final rapeseed maps.

Terrain data

The 30 m Digital Elevation Model (DEM) data were used to create a mask layer for rapeseed resultant maps⁵⁰. We used the DEM to calculate the slope and removed pixels of rapeseed that had a slope > 25° as these lands are banned in China due to the risk of soil erosion, especially in southern mountains⁵¹.

Collected rapeseed samples

The rapeseed point collection version 2 (RPC-2) dataset was compiled using field surveys and visual interpretation of satellite images (Table S2). This dataset builds upon our previous research on rapeseed mapping in southern China from 2017–2021²¹. Here we expanded the dataset by manually labeling over 900,000 samples using a hexagonal grid and stratified sampling strategy (Fig. 1c,d)⁵² to avoid spatial autocorrelation of the samples. The RPC-2 dataset was constructed by utilizing historical Google Earth very-high-resolution (GE-VHR) images from all of China to fill gaps in field survey data. Specifically, these images were divided into numbers of equal-area hexagon grids with an area of 10 km². For hexagonal grids that did not contain rapeseed fields, five points were randomly selected as nonrapeseed samples, while five to ten points for rapeseed fields were further collected for grids that contain rapeseed fields. The RPC-2 dataset covers over one million km² and spans the period of 2000–2022, being a useful resource for large-scale and long-term rapeseed mapping.

Sentinel-1 data

Time-series Sentinel-1 C-band SAR data spanning from 2014 to 2022 were used to rectify the flowering time of rapeseed as recorded in the RPC-2 dataset. Given the Sentinel-1 satellite’s capability for continuous, all-weather imaging and its high temporal resolution, we adopted the method proposed by d’Andrimont⁶ and Liu²¹ to identify peak rapeseed signals during the flowering period using the VV polarization of the Sentinel-1 data. The value of the VV polarization band reaches the peak with the peak-flowering period of rapeseed. We removed rapeseed samples with a time difference greater than five days by comparing the peak-flowering date of rapeseed obtained from the SAR data with the flowering date recorded in the RPC-2 dataset.

Agricultural statistical data

We obtained information on the annual planted area of rapeseed from agricultural statistics yearbooks at the provincial and municipal levels in China. In cases where municipal-level data was not accessible, equivalent provincial data was used. Specifically, we collected data on the planted area of rapeseed for approximately 220 administrative regions from 2000 to 2021 and compared the results of satellite mapping with statistical data for consistency.

Existing rapeseed products

To evaluate the accuracy and consistency of the CARM30 dataset, we compared it with two publicly available Chinese rapeseed products: (1) Liu’s REM product at 10 m resolution, which covered the Yangtze River Economic Belt from 2017 to 2021^21,53; and (2) Zang’s rapeseed maps at 20 m resolution, which spanned the whole China from 2017 to 2021^22,54.

Monitoring rapeseed peak flowering phenology

We used the RPC-2 dataset and the GLDAS environmental variables to study the flowering phenophase of rapeseed in China. A method proposed by Liu and Zhang²¹ for estimating rapeseed flowering phenology using the random forest regression (RFR) model and environmental variables was applied to map peak flowering dates. The RPC-2 dataset provided the training data, which were calibrated using time-series Sentinel-1 data and Whitaker smoothers^55,56. Rapeseed samples with a time difference of five days were retained by comparing the flowering dates from both Sentinel-1 data and the RPC-2 dataset. To avoid model local overfitting, the calibrated training data were aggregated to a 0.25° spatial resolution and matched with GLDAS data, whose distribution is depicted in Fig. 3. The training data were predominantly located in the Yangtze River Basin, Xinjiang, Nei Mongol, and the Tibetan Plateau. Subsequently, we developed an RFR model to estimate the peak flowering date of rapeseed in China from 2000 to 2022. The model incorporated 13 time-series environmental variables, including 10 from GLDAS and factors such as latitude, longitude, and elevation. The mtry and ntree parameters of the model were set to the square root of the total number of input variables and 100. We used 3,687 training samples for the RFR model and an additional 1,580 independent samples to validate the performance of the RFR model. The R² of the estimated peak flowering dates for winter rapeseed and spring rapeseed were 0.88 and 0.83, indicating the high accuracy and reliability of the model. (Fig. S1).

Generating and optimizing training samples

We automatically generated training samples for a machine learning classifier using a phenology-based rapeseed mapping approach to address the spatial imbalance of the manually collected samples. Four VIs were used to analyze the spectral and phenological patterns of rapeseed (Fig. 4). Due to the varying orbiting times of Landsat satellites, there were four satellite combination cases: L5+L7 (2000–2012), L7 (2013), L7+L8 (2014–2022), and L7+L8+L9 (2022). The flowering phenology of rapeseed was determined from the previous subsection and the estimated peak flowering dates of rapeseed are shown in Fig. S2. Winter rapeseed typically blooms in March, while spring rapeseed blooms in July. These VIs amplified the signal and enhanced the separability of rapeseed during flowering and can adaptively generate training samples for supervised classifiers.

Table 1 shows a statistical analysis evaluating the ability of four VIs to enhance the separability of rapeseed from other land covers under different Landsat combination scenarios. The t-value is the statistic which is calculated for comparing two phenophase pairs, while the p-value is the significance level. Higher values of |t| represent significant fluctuations of index values. For winter rapeseed, both CI and WRI performed well (p <0.01), indicating their ability to highlight rapeseed from pre-flowering to flowering stages. However, NDVI and NDYI had p > 0.1 in some scenarios, indicating that these two indices were less robust. WRI was less effective for spring rapeseed as it was designed for winter rapeseed. Overall, CI performed well in almost all Landsat combination scenarios, achieving the highest or second-highest t-values. More importantly, CI can identify rapeseed using only images during flowering, reducing its dependence on multi-temporal. Therefore, the CI-based identification method was adopted to generate initial machine-learning training samples.

Table 1 Results of statistical analysis for winter and spring rapeseed under different Landsat combination scenarios.

Full size table

We calculated CI bands for each Landsat surface reflectance image collected during the flowering period according to the peak flowering time of rapeseed determined in the preceding section. These CI bands were classified into rapeseed and non-rapeseed categories by threshold segmentation (the segmentation threshold of the CI was set to 0.07 as suggested by Ashourloo, et al.²⁶). Finally, we generated 2000 rapeseed and non-rapeseed initial samples respectively for each classification block with a stratified random sampling method.

To increase the purity of the training sample pool, the spectral angle mapper (SAM)^57,58,59,60 was used to eliminate noisy samples. The reflectance spectrum of noisy samples generated by CI differs from that of the reference rapeseed endmember, making them distinguishable using spectral similarity measures. The spectral angle α for a pixel i can be calculated using the following arc cosine Eq. (5).

$$\alpha {=\cos }^{-1}\left(\frac{\mathop{\sum }\limits_{i=1}^{nb}{u}_{i}{r}_{i}}{\sqrt{\mathop{\sum }\limits_{i=1}^{nb}{u}_{i}}\sqrt{\mathop{\sum }\limits_{i=1}^{nb}{r}_{i}}}\right),$$

(5)

where u is the spectrum of unknown pseudo pixels, r is the spectrum of reference rapeseed, nb is the number of spectral channels, and the spectral angle α is the error metric of the SAM. An unknown pixel is classified as a noisy pixel if its spectral angle α with the reference rapeseed spectrum is higher than a determined threshold. In this study, we determined the threshold of the spectral angle as the standard deviation of the reference rapeseed spectrum in each subregion.

Mapping rapeseed on GEE platform

The SARM method that combines the CI-based approach and multiple RF classifiers^61,62 was implemented on GEE to map rapeseed across China²⁴. This method assumes that for a pixel i, there are at least m cloud-free images available in a time series of n Landsat images with cloud contamination to support supervised classification. The results of the classification on cloud-free images are used to supplement the results of the cloud-contaminated images. The final classification of rapeseed is a combination of m probability maps of rapeseed, represented by Eq. (6).

$${P}_{i}=\frac{\mathop{\sum }\limits_{j=1}^{m}\,{p}_{i,j}}{m},$$

(6)

where p_i,j is the classification probability of pixel i in j temporal image, m is the number of cloud-free images, 1 ≤ m ≤ n. P_i is the probability that pixel i is finally classified into rapeseed. In the binary classification task of rapeseed and non-rapeseed, pixels with P_i > 50% are classified as rapeseed category.

Considering the variations in rapeseed cultivation patterns across China, we divided each mapping region into several 2° × 2° grids and trained local RF classifiers to minimize spatial heterogeneity. The ntree for each RF classifier was set to 100 and the mtry was set to the square root of the total number of input features. The input variables included Landsat monthly median composite images and derived VIs. Training samples for each grid were taken from adjacent 3 × 3 grids following a tile-based sampling strategy⁵². Additionally, 10-fold cross-validation was employed for RF classifiers to minimize uncertainty from potentially biased samples.

The classified rapeseed maps were optimized by removing non-cropland pixels and morphological post-processing to generate the CARM30 product. First, non-cropland pixels erroneously classified as rapeseed were eliminated using a unified cropland mask derived from GlobeLand30 and WorldCover v100, as shown in Table S3. These pixels represented 0.035% and 1.567% of the cropland and rapeseed pixels, respectively, potentially leading to an average annual mapping error of 111.022 k ha. Second, the bwareaopen function in MATLAB R2022a was used to fill holes in closed objects, preserving the integrity of the rapeseed fields. In northern China, rapeseed fields are large and regular while rapeseed fields are small and scattered in southern China. To account for these differences, the hole thresholds were set at 1 ha and 5 ha for winter rapeseed and spring rapeseed respectively.

Data Records

The produced CARM30 dataset from 2000 to 2022 are available at Mendeley Data (https://doi.org/10.17632/hxhkphgmtt.1)⁶³. The format of these datasets is GeoTiff and the coordinate system is set to WGS 1984 UTM Zone 48 N. The values 0 and 1 of CARM30 represent non-rapeseed and rapeseed, respectively.

Technical Validation

Validation metrics

Four metrics were used for accuracy assessment, including user’s accuracy (UA), producer’s accuracy (PA), overall accuracy (OA), and F1 score. These metrics were derived from the confusion matrix, which can be measured by Eqs. (7–10):

$$UA=\frac{TP+TN}{N},$$

(7)

$$PA=\frac{TP}{TP+FP},$$

(8)

$$OA=\frac{TP+TN}{N},$$

(9)

$$F1=\frac{2\times UA\times PA}{UA+PA},$$

(10)

where N is the total number of validation samples, TP is the number of pixels correctly classified as rapeseed, TN is the number of pixels correctly as non-rapeseed, FP and FN refer to the number of pixels that are incorrectly classified as rapeseed and non-rapeseed, respectively.

Second, the CARM30 product was compared to two existing available rapeseed products. Third, the spatial consistency of the mapped rapeseed areas with agricultural statistics was measured. The evaluation metrics included coefficient of determination (R²), root mean square error (RMSE), and mean absolute error (MAE) (Eqs. (11–13)):

$${R}^{2}=\frac{{\left(\mathop{\sum }\limits_{i=1}^{n}({x}_{i}-\overline{{x}_{i}})\times ({y}_{i}-\overline{{y}_{i}})\right)}^{2}}{\mathop{\sum }\limits_{i=1}^{n}{({x}_{i}-\overline{{x}_{i}})}^{2}\times \mathop{\sum }\limits_{i=1}^{n}{({y}_{i}-\overline{{y}_{i}})}^{2}},$$

(11)

$$RMSE=\sqrt{\mathop{\sum }\limits_{i=1}^{n}\frac{{({x}_{i}-{y}_{i})}^{2}}{n}},$$

(12)

$$MAE=\frac{\mathop{\sum }\limits_{i=1}^{n}\left|{x}_{i}-{y}_{i}\right|}{n},$$

(13)

where n is the number of cities collected, x_i is the rapeseed mapped area for city i from satellite imagery, y_i is the statistical rapeseed area for city i, $\overline{{x}_{i}}$ and $\overline{{y}_{i}}$ are the corresponding mean values, respectively.

Comparison with existing rapeseed maps

A qualitative assessment was conducted to compare CARM30 to 10 m REM and Zang’s 20 m rapeseed maps from 2017–2021. As depicted in Fig. 5, winter and spring rapeseed are represented by red and blue pixels. Six and three zoom-in views were selected for winter and spring rapeseed to compare the mapping details using satellite images. CARM and REM were only compared in southern China due to the absence of spring rapeseed in the REM dataset. Overall, CARM30 demonstrated high agreement with existing maps and accurately depicted the spatial distribution of rapeseed fields in both well-regularized northern China and scattered southern China. However, some discrepancies were observed (i.e., sites 7 and 8). Rapeseed fields described in CARM30 appeared more misclassified due to the coarser spatial resolution of Landsat imagery. This difference in satellite sensors has impacted the accuracy of the proposed method in southern China (as illustrated in Table 2). Nevertheless, CARM30 captured some omissions present in Zang’s map (as shown in site 4). In conclusion, our CARM30 product depicted the detailed distribution of rapeseed across China and maintained high consistency with existing rapeseed maps obtained from Sentinel data.

Table 2 Accuracy of CARM30 for winter and spring rapeseed from 2000 to 2022.

Full size table

Pixel-wise validation using ground reference data

The accuracy of the CARM30 product was qualitatively evaluated using ground reference data. This dataset comprises field survey data and sample points interpreted visually from GE-VHR imagery, with the field survey data and corresponding photos illustrated in Fig. 6. Fieldwork conducted in Hubei Province of China between 2018 and 2022 provided land cover sample data, including rapeseed. The sub-maps in Fig. 6 indicate a strong spatial correlation between CARM30 and the field survey data, accurately representing rapeseed field types. However, the delineation of rapeseed field boundaries remains imprecise due to the coarse spatial resolution of the Landsat image.

The accuracy of the CARM30 product from 2000 to 2022 was verified using RPC-2 ground samples, as presented in Table 2. Overall, CARM30 demonstrated high precision with F1 scores exceeding 0.8 for most years. The accuracy of winter rapeseed was significantly lower than that of spring rapeseed, with average F1 scores of 0.869 and 0.971. In 2001 and 2002, the precision of winter rapeseed was relatively poor with F1 scores below 0.7. In contrast, the accuracy of spring rapeseed remained consistently high throughout all years with F1 scores above 0.9.

We further evaluated the performance of Zang’s rapeseed maps using the same validation data, as shown in Table 3. In the winter rapeseed growing area, Zang’s product showed comparable performance to CARM30, with an average F1 score of 0.877 (CARM: 0.88). However, in the spring rapeseed growing area, Zang’s product underperformed compared to CARM30, with an average F1 score of 0.899 (CARM30: 0.964). Both CARM30 and Zang’s product exhibited lower validation accuracy for winter rapeseed compared to spring rapeseed.

Table 3 Accuracy of the Zang’s product for winter and spring rapeseed from 2017 to 2022.

Full size table

Several factors contribute to the relatively poor performance of winter rapeseed compared to spring rapeseed. First, the topographical complexity in southern China results in extreme fragmentation of cropland, with most fields measuring less than 0.04 ha^64,65. This has led to confusion between rapeseed and other land cover types⁶⁶. Second, smallholder agriculture in southern China exhibits more complex cultivation patterns than intensive farm operations in northern China⁶⁷. The arbitrary intercropping patterns result in high spatial heterogeneity of cropland, affecting the accuracy of classification algorithms. Lastly, the unavailability of high-quality Landsat images introduces uncertainty to the produced rapeseed maps. Cloudy and rainy conditions in southern China result in fewer high-quality satellite images being available than in northern China⁶⁸, posing challenges for rapeseed mapping.

Comparison with agricultural statistics

The CARM30 product was compared with statistics for the years 2000 to 2020. As depicted in Fig. 7, the mapped rapeseed area from CARM30 showed a strong correlation with agricultural statistics with an R² ranging from 0.65 to 0.86 (p <0.001). According to RMSE and MAE, the two data sources differed by 28.79 to 63.63 k ha and 15.33 to 22.43 k ha, respectively. In earlier years, CARM30 showed a high association with statistical data. The remotely sensed rapeseed planting area, however, has been smaller than the statistical area since 2012, resulting in a lower R² and higher RMSE and MAE.

The REM dataset and Zang’s rapeseed maps were further compared with agricultural statistics (Fig. 8). Even though REM had a high R², it nevertheless underestimated the officially reported rapeseed area (Fig. 8a). RMSE and MAE for Zang’s rapeseed map were 98.41 k ha and 30.23 k ha, respectively, with an R² of 0.2 (Fig. 8b). The substantial discrepancies between statistics and the remotely sensed rapeseed area in recent years may be due to changes in the market and in policy. Rapeseed is a labor-intensive crop with high costs and low returns¹⁰. Agricultural producers in China are choosing more cost-effective crops as a result of the country’s urbanization and industrialization, which have reduced the manpower resources available for agricultural cultivation^69,70. Additionally, inaccurate figures may result from farmers overstating the rapeseed planted area due to economic factors like agricultural subsidies^71,72.

Spatial patterns of rapeseed cultivation in China

Figure 9 depicts the spatial distribution and field patterns of 30 m rapeseed across various regions in China. In northern China, spring rapeseed fields are large and regularly shaped due to large-scale intensive management. In contrast, winter rapeseed fields are fragmented and irregular, primarily found in central China (i.e., Sichuan, Guizhou, Hubei, Hunan, and Jiangxi). Field patterns vary with the topography. In mountainous southwest China, rapeseed is concentrated in narrow valleys. While in the central region, rapeseed is fragmented by the river network. In Jiangsu and Anhui, small-scale household cultivation has resulted in a dense and spotty distribution of rapeseed fields.

The spatial distribution of rapeseed at different latitudes, longitudes, and city-level scales was further analyzed and visualized using the CARM30 product (Fig. 10). Rapeseed in China is primarily distributed between 100°E–122°E and 21°N–37°N. Spring rapeseed is mainly found in Yili and Hulunbeier, while winter rapeseed is primarily distributed in cities near 30°N (i.e., Chengdu, Chongqing, Jingmen, and Changde). The purple curve represents the average planting area of rapeseed from 2000 to 2022, while the gray strip indicates the fluctuation in the planting area of rapeseed. The significant range fluctuation suggests that the planting area of rapeseed has undergone major changes since the 21st century.

The temporal trend of rapeseed cultivation from 2000 to 2022 was characterized using Sen’s slope analysis⁷³, as depicted in Fig. 11. China’s rapeseed planting area has exhibited a clear decreasing trend since the 21st century (decrease: 199 cities; increase: 94 cities). The most decreased areas in rapeseed production were primarily Sichuan, Chongqing, Hubei, Hunan, Anhui, Jiangxi, and Jiangsu. In contrast, the areas with increased rapeseed production were mainly Nei Mongol and Yunnan. Overall, the number of municipalities that detected a decrease and an increase in rapeseed production was 95 and 23, respectively (at a significance level of 5%).

Usage Notes

Advantages of our rapeseed maps

This study represents the inaugural large-scale mapping of rapeseed across China from 2000 to 2022. An initial collection of approximately 910,000 samples, both rapeseed and non-rapeseed, was conducted at a national scale. These samples were utilized to estimate the peak flowering dates of rapeseed and to assess the accuracy of the obtained CARM30 product. A Landsat image-based strategy was subsequently implemented for sample generation and purification, thereby providing training samples for the SARM algorithm. The CI-based rapeseed mapping approach was employed to automatically generate training samples, while the SAM algorithm was utilized to filter out noisy samples. The procured data is publicly accessible, facilitating researchers in investigating the spatial-temporal dynamics and phenological shifts of rapeseed in China.

This study presents several notable advantages. First, the RPC-2 dataset and an RFR model were employed to estimate the flowering period of rapeseed in China, addressing the issue of phenological variability. This approach enabled the estimation of a long time series of flowering phenology information for rapeseed nationwide, compensating for the previously unknown phenology of spring rapeseed²¹. The resulting rapeseed flowering maps have potential applications in yield estimation studies. Second, the scarcity of training samples on a national scale was addressed by employing an unsupervised CI-based mapping method and SAM to automatically generate and purify training samples. This approach, previously applied to rapeseed mapping in southern China²¹, was adapted to accommodate the temporal and spatial resolution of Landsat images. Multiple spectral and phenological indices were analyzed and compared, leading to the selection of the CI-based method for sample generation. The SAM method was employed to purify the samples, mitigating the impact of noise. The proposed process offers valuable technical insights for other crop mapping tasks. To our knowledge, CARM30 is the longest-period rapeseed dataset currently accessible. Results from both qualitative and quantitative assessments demonstrated that CARM30 is accurate and competitive with other 10 m and 20 m resolution rapeseed maps.

Specifically, CARM30 provides a wider temporal window than both REM and Zang’s rapeseed maps, allowing the tracking of rapeseed planting trends in China since the 21st century. Furthermore, compared to Zang’s rapeseed maps, CARM30 displays a higher association with statistical data and provides a wider spatial coverage than the REM dataset.

Uncertainties

Despite the advantages achieved, there are still some uncertainties in CARM30. The fragmentation of cultivated land affects the accuracy of rapeseed mapping. In southern China, smallholder-operated croplands are often less than 0.07 ha in size, smaller than the size of a single Landsat pixel, making it challenging to differentiate between rapeseed and other crops⁶⁷. One potential solution to this issue is to utilize Sentinel-2 data to create 10 m Landsat image products^74,75. The availability of Landsat observations can also have an impact on CARM30 accuracy in some regions with heavy cloud cover.

To understand the uncertainty caused by the availability of Landsat images, we evaluated the variation in the accuracy of CARM30 with different monthly image combinations (Fig. 12). We utilized seven and five monthly compositions of images for the winter and spring rapeseed mapping tasks, resulting in 127 and 31 image combination scenarios, respectively. The lowest classification accuracy for rapeseed was obtained when only one monthly composited image from the sowing stage was used, while the highest accuracy was achieved when all monthly images were used. Compared to spring rapeseed, the classification accuracy of winter rapeseed improved gradually with an increasing number of images, indicating the complexity of the winter rapeseed mapping task.

We visualized the impact of different image combinations on rapeseed mapping in China. Figure 13 illustrates the number of cloud-free Landsat monthly composited images (Fig. 13a) and the corresponding F1 scores for CARM30 during the rapeseed growing season (Fig. 13b). Spatially, the highest errors in CARM30 were found in southern and southwestern China (i.e., Guangxi, Guangdong, Hunan, Jiangxi, and Tibet). These regions often experience cloudy weather, leading to a lack of high-quality Landsat images and introducing uncertainty to CARM30. Temporarily, CARM30’s potential uncertainty peaked in 2012, the year that only Landsat-7 images were employed. However, the uncertainty in CARM30 due to data availability has decreased in recent years as a result of the expansion of Landsat satellites.

The uncertainty regarding the quality of training samples in the CARM30 dataset was further examined through a simulation experiment, with the test region depicted in Fig. S3. Figure 14 illustrates the contribution of the training sample cleaning strategy in enhancing accuracy under varying noise intensities. Observations indicate that the classification accuracy of the proposed method inversely correlates with the noise level in the CI-derived training samples. It suggests that using uncleaned training samples may result in high classification errors. The SAM method improves the purity of the initially generated samples and yields a higher classification accuracy, thereby demonstrating the effectiveness of our mapping strategy.

Code availability

The core codes and associated files we used for mapping rapeseed flowering phenology and spatial distribution are available at https://github.com/liuwenbinwhu/China-annual-rapeseed-maps30. Moreover, MATLAB R2022a, ArcGIS Pro, and Origin 2017 were used for data pre-processing, spatial analysis, and figure production.

References

Li, L. F. & Olsen, K. M. in Current Topics in Developmental Biology Vol. 119 (ed V., Orgogozo) 63–109 (Academic Press, 2016).
Amar, S., Becker, H. C. & Möllers, C. Genetic Variation and Genotype × Environment Interactions of Phytosterol Content in Three Doubled Haploid Populations of Winter Rapeseed. Crop Sci. 48, 1000–1006, https://doi.org/10.2135/cropsci2007.10.0578 (2008).
Article Google Scholar
FAO. Oilseeds: World Markets and Trade. (FAOSTAT, 2022).
Shahidi, F. in Canola and Rapeseed: Production, Chemistry, Nutrition and Processing Technology (ed Fereidoon Shahidi) 3–13 (Springer US, 1990).
Bell, J. M. From Rapeseed to Canola: A Brief History of Research for Superior Meal and Edible Oil1. Poult. Sci. 61, 613–622, https://doi.org/10.3382/ps.0610613 (1982).
Article Google Scholar
d’Andrimont, R. et al. Detecting flowering phenology in oil seed rape parcels with Sentinel-1 and -2 time series. Remote Sens. Environ. 239, 111660, https://doi.org/10.1016/j.rse.2020.111660 (2020).
Article PubMed PubMed Central Google Scholar
Alcock, T. D., Salt, D. E., Wilson, P. & Ramsden, S. J. More sustainable vegetable oil: Balancing productivity with carbon storage opportunities. Sci. Total Environ. 829, 154539, https://doi.org/10.1016/j.scitotenv.2022.154539 (2022).
Article ADS CAS PubMed Google Scholar
Chen, S. et al. Two-Stepwise Hierarchical Adaptive Threshold Method for Automatic Rapeseed Mapping over Jiangsu Using Harmonized Landsat/Sentinel-2. Remote Sens. 14 (2022).
Tian, Z. et al. The potential contribution of growing rapeseed in winter fallow fields across Yangtze River Basin to energy and food security in China. Resour. Conserv. Recycl. 164, 105159, https://doi.org/10.1016/j.resconrec.2020.105159 (2021).
Article Google Scholar
Hu, Q. et al. Rapeseed research and production in China. The Crop Journal 5, 127–135, https://doi.org/10.1016/j.cj.2016.06.005 (2017).
Article Google Scholar
Boryan, C., Yang, Z., Mueller, R. & Craig, M. Monitoring US agriculture: the US Department of Agriculture, National Agricultural Statistics Service, Cropland Data Layer Program. Geocarto Int. 26, 341–358, https://doi.org/10.1080/10106049.2011.562309 (2011).
Article ADS Google Scholar
d’Andrimont, R. et al. From parcel to continental scale – A first European crop type map based on Sentinel-1 and LUCAS Copernicus in-situ observations. Remote Sens. Environ. 266, 112708, https://doi.org/10.1016/j.rse.2021.112708 (2021).
Article Google Scholar
McNairn, H., Champagne, C., Shang, J., Holmstrom, D. & Reichert, G. Integration of optical and Synthetic Aperture Radar (SAR) imagery for delivering operational annual crop inventories. ISPRS J. Photogramm. Remote Sens. 64, 434–449, https://doi.org/10.1016/j.isprsjprs.2008.07.006 (2009).
Article ADS Google Scholar
Han, J. et al. The RapeseedMap10 database: annual maps of rapeseed at a spatial resolution of 10 m based on multi-source data. Earth Syst. Sci. Data 13, 2857–2874, https://doi.org/10.5194/essd-13-2857-2021 (2021).
Article ADS Google Scholar
Zhang, Y. et al. Flavor of rapeseed oil: An overview of odorants, analytical techniques, and impact of treatment. Compr. Rev. Food Sci. Food Saf. 20, 3983–4018, https://doi.org/10.1111/1541-4337.12780 (2021).
Article CAS PubMed Google Scholar
NBS. China Statistical Yearbook, National Bureau of Statistics of China. (2021).
Qiang, W., Liu, A., Cheng, S., Kastner, T. & Xie, G. Agricultural trade and virtual land use: The case of China’s crop trade. Land Use Policy 33, 141–150, https://doi.org/10.1016/j.landusepol.2012.12.017 (2013).
Article Google Scholar
Tao, J., Wu, W., Liu, W. & Xu, M. Exploring the Spatio-Temporal Dynamics of Winter Rape on the Middle Reaches of Yangtze River Valley Using Time-Series MODIS Data. Sustainability 12, 466, https://doi.org/10.3390/su12020466 (2020).
Article Google Scholar
Yan, D. et al. Arable land and water footprints for food consumption in China: From the perspective of urban and rural dietary change. Sci. Total Environ. 838, 155749, https://doi.org/10.1016/j.scitotenv.2022.155749 (2022).
Article ADS CAS PubMed Google Scholar
Wang, S. et al. Urbanization can benefit agricultural production with large-scale farming in China. Nat. Food 2, 183–191, https://doi.org/10.1038/s43016-021-00228-6 (2021).
Article PubMed Google Scholar
Liu, W. & Zhang, H. Mapping annual 10 m rapeseed extent using multisource data in the Yangtze River Economic Belt of China (2017–2021) on Google Earth Engine. Int. J. Appl. Earth Obs. Geoinf. 117, 103198, https://doi.org/10.1016/j.jag.2023.103198 (2023).
Article Google Scholar
Zang, Y. et al. Mapping rapeseed in China during 2017-2021 using Sentinel data: an automated approach integrating rule-based sample generation and a one-class classifier (RSG-OC). GISci. Remote Sens. 60, 2163576, https://doi.org/10.1080/15481603.2022.2163576 (2023).
Article Google Scholar
Zhang, C., Zhang, H. & Zhang, L. Spatial domain bridge transfer: An automated paddy rice mapping method with no training data required and decreased image inputs for the large cloudy area. Comput. Electron. Agric. 181, 105978, https://doi.org/10.1016/j.compag.2020.105978 (2021).
Article Google Scholar
Zhang, H., Liu, W. & Zhang, L. Seamless and automated rapeseed mapping for large cloudy regions using time-series optical satellite imagery. ISPRS J. Photogramm. Remote Sens. 184, 45–62, https://doi.org/10.1016/j.isprsjprs.2021.12.001 (2022).
Article ADS Google Scholar
Wang, D. et al. A regional mapping method for oilseed rape based on HSV transformation and spectral features. ISPRS Int. J. Geo-Inf. 7, 224, https://doi.org/10.3390/ijgi7060224 (2018).
Article Google Scholar
Ashourloo, D. et al. Automatic canola mapping using time series of sentinel 2 images. ISPRS J. Photogramm. Remote Sens. 156, 63–76, https://doi.org/10.1016/j.isprsjprs.2019.08.007 (2019).
Article ADS Google Scholar
Sulik, J. J. & Long, D. S. Spectral considerations for modeling yield of canola. Remote Sens. Environ. 184, 161–174, https://doi.org/10.1016/j.rse.2016.06.016 (2016).
Article ADS Google Scholar
Han, J., Zhang, Z., Cao, J. & Luo, Y. Mapping rapeseed planting areas using an automatic phenology- and pixel-based algorithm (APPA) in Google Earth Engine. The Crop Journal 10, 1483–1495, https://doi.org/10.1016/j.cj.2022.04.013 (2022).
Article Google Scholar
Meng, S. et al. Optimal Temporal Window Selection for Winter Wheat and Rapeseed Mapping with Sentinel-2 Images: A Case Study of Zhongxiang in China. Remote Sens. 12, 226, https://doi.org/10.3390/rs12020226 (2020).
Article ADS Google Scholar
Tao, J., Liu, W., Tan, W., Kong, X. & Xu, M. Fusing multi-source data to map spatio-temporal dynamics of winter rape on the Jianghan Plain and Dongting Lake Plain, China. J. Integr. Agric. 18, 2393–2407, https://doi.org/10.1016/S2095-3119(19)62577-3 (2019).
Article Google Scholar
Sulik, J. J. & Long, D. S. Spectral indices for yellow canola flowers. Int. J. Remote Sens. 36, 2751–2765, https://doi.org/10.1080/01431161.2015.1047994 (2015).
Article Google Scholar
Shen, M., Chen, J., Zhu, X. & Tang, Y. Yellow flowers can decrease NDVI and EVI values: evidence from a field experiment in an alpine meadow. Can. J. Remote Sens. 35, 99–106, https://doi.org/10.5589/m09-003 (2009).
Article ADS Google Scholar
Wang, S., Azzari, G. & Lobell, D. B. Crop type mapping without field-level labels: Random forest transfer and unsupervised clustering techniques. Remote Sens. Environ. 222, 303–317, https://doi.org/10.1016/j.rse.2018.12.026 (2019).
Article ADS Google Scholar
Skakun, S. et al. Early season large-area winter crop mapping using MODIS NDVI data, growing degree days information and a Gaussian mixture model. Remote Sens. Environ. 195, 244–258, https://doi.org/10.1016/j.rse.2017.04.026 (2017).
Article ADS Google Scholar
Belgiu, M., Bijker, W., Csillik, O. & Stein, A. Phenology-based sample generation for supervised crop type classification. Int. J. Appl. Earth Obs. Geoinf. 95, 102264, https://doi.org/10.1016/j.jag.2020.102264 (2021).
Article Google Scholar
Hao, P., Di, L., Zhang, C. & Guo, L. Transfer Learning for Crop classification with Cropland Data Layer data (CDL) as training samples. Sci. Total Environ. 733, 138869, https://doi.org/10.1016/j.scitotenv.2020.138869 (2020).
Article ADS CAS PubMed Google Scholar
Bonjean, A. P., Dequidt, C., Sang, T. & Groupe, L. Rapeseed in China. OCL 23, https://doi.org/10.1051/ocl/2016045 (2016).
Jamet, J.-P. & Chaumet, J.-M. Soybean in China: adaptating to the liberalization. OCL 23, https://doi.org/10.1051/ocl/2016044 (2016).
Qian, W. et al. Introgression of genomic components from Chinese Brassica rapa contributes to widening the genetic diversity in rapeseed (B. napus L.), with emphasis on the evolution of Chinese rapeseed. Theor. Appl. Genet. 113, 49–54, https://doi.org/10.1007/s00122-006-0269-3 (2006).
Article CAS PubMed Google Scholar
Qian, W. et al. Heterotic patterns in rapeseed (Brassica napus L.): I. Crosses between spring and Chinese semi-winter lines. Theor. Appl. Genet. 115, 27–34, https://doi.org/10.1007/s00122-007-0537-x (2007).
Article CAS PubMed Google Scholar
Tuvdendorj, B. et al. Performance and the Optimal Integration of Sentinel-1/2 Time-Series Features for Crop Classification in Northern Mongolia. Remote Sens. 14 (2022).
Wulder, M. A. et al. Current status of Landsat program, science, and applications. Remote Sens. Environ. 225, 127–147, https://doi.org/10.1016/j.rse.2019.02.015 (2019).
Article ADS Google Scholar
Roy, D. P. et al. Characterization of Landsat-7 to Landsat-8 reflective wavelength and normalized difference vegetation index continuity. Remote Sens. Environ. 185, 57–70, https://doi.org/10.1016/j.rse.2015.12.024 (2016).
Article ADS Google Scholar
Vogeler, J. C., Braaten, J. D., Slesak, R. A. & Falkowski, M. J. Extracting the full value of the Landsat archive: Inter-sensor harmonization for the mapping of Minnesota forest canopy cover (1973–2015). Remote Sens. Environ. 209, 363–374, https://doi.org/10.1016/j.rse.2018.02.046 (2018).
Article ADS Google Scholar
Rouse, J. Jr, Haas, R. H., Deering, D., Schell, J. & Harlan, J. C. Monitoring the vernal advancement and retrogradation (green wave effect) of natural vegetation. (1974).
Rodell, M. et al. The Global Land Data Assimilation System. Bull. Am. Meteorol. Soc. 85, 381–394, https://doi.org/10.1175/BAMS-85-3-381 (2004).
Article ADS Google Scholar
Jun, C., Ban, Y. & Li, S. Open access to Earth land-cover map. Nature 514, 434–434, https://doi.org/10.1038/514434c (2014).
Article ADS CAS PubMed Google Scholar
Chen, J. & Chen, J. GlobeLand30: Operational global land cover mapping and big-data analysis. Science China. Earth Sciences 61, 1533–1534, https://doi.org/10.1007/s11430-018-9255-3 (2018).
Article ADS Google Scholar
Zanaga, D. et al. (2021).
Farr, T. G. et al. The Shuttle Radar Topography Mission. Rev. Geophys. 45, https://doi.org/10.1029/2005RG000183 (2007).
Bennett, M. T. China’s sloping land conversion program: institutional innovation or business as usual? Ecol. Econ. 65, 699–711, https://doi.org/10.1016/j.ecolecon.2007.09.017 (2008).
Article Google Scholar
Zhang, M. et al. Automatic high-resolution land cover production in Madagascar using sentinel-2 time series, tile-based image classification and google earth engine. Remote Sens. 12, 3663, https://doi.org/10.3390/rs12213663 (2020).
Article ADS MathSciNet Google Scholar
Liu, W. & Zhang, H. Data for: Mapping annual 10-m rapeseed extent using multi-source data in the Yangtze River Economic Belt of China (2017–2021) on Google Earth Engine. Mendeley Data V2, https://doi.org/10.17632/6p6b86bwv5.2 (2022).
Zang, Y. et al. (2022).
Atzberger, C. & Eilers, P. H. A time series for monitoring vegetation activity and phenology at 10-daily time steps covering large parts of South America. Int. J. Digital Earth 4, 365–386, https://doi.org/10.1080/17538947.2010.505664 (2011).
Article ADS Google Scholar
Eilers, P. H. A perfect smoother. Anal. Chem. 75, 3631–3636, https://doi.org/10.1021/ac034173t (2003).
Article CAS PubMed Google Scholar
Kruse, F. A. et al. The spectral image processing system (SIPS)—interactive visualization and analysis of imaging spectrometer data. Remote Sens. Environ. 44, 145–163, https://doi.org/10.1016/0034-4257(93)90013-N (1993).
Article ADS Google Scholar
Dennison, P. E., Halligan, K. Q. & Roberts, D. A. A comparison of error metrics and constraints for multiple endmember spectral mixture analysis and spectral angle mapper. Remote Sens. Environ. 93, 359–367, https://doi.org/10.1016/j.rse.2004.07.013 (2004).
Article ADS Google Scholar
Tu, B., Zhou, C., He, D., Huang, S. & Plaza, A. Hyperspectral Classification With Noisy Label Detection via Superpixel-to-Pixel Weighting Distance. IEEE Trans. Geosci. Remote Sens. 58, 4116–4131, https://doi.org/10.1109/TGRS.2019.2961141 (2020).
Article ADS Google Scholar
Mafanya, M., Tsele, P., Zengeya, T. & Ramoelo, A. An assessment of image classifiers for generating machine-learning training samples for mapping the invasive Campuloclinium macrocephalum (Less.) DC (pompom weed) using DESIS hyperspectral imagery. ISPRS J. Photogramm. Remote Sens. 185, 188–200, https://doi.org/10.1016/j.isprsjprs.2022.01.015 (2022).
Article ADS Google Scholar
Breiman, L. Random Forests. Machine Learning 45, 5–32, https://doi.org/10.1023/A:1010933404324 (2001).
Article Google Scholar
Belgiu, M. & Drăguţ, L. Random forest in remote sensing: A review of applications and future directions. ISPRS J. Photogramm. Remote Sens. 114, 24–31, https://doi.org/10.1016/j.isprsjprs.2016.01.011 (2016).
Article ADS Google Scholar
Liu, W. & Zhang, H. China annual rapeseed maps at 30 m spatial resolution from 2000 to 2022. Mendeley Data https://doi.org/10.17632/hxhkphgmtt.1 (2023).
Tan, M., Robinson, G. M., Li, X. & Xin, L. Spatial and temporal variability of farm size in China in context of rapid urbanization. Chinese Geographical Science 23, 607–619, https://doi.org/10.1007/s11769-013-0610-0 (2013).
Article Google Scholar
Yu, Q., Hu, Q., van Vliet, J., Verburg, P. H. & Wu, W. GlobeLand30 shows little cropland area loss but greater fragmentation in China. Int. J. Appl. Earth Obs. Geoinf. 66, 37–45, https://doi.org/10.1016/j.jag.2017.11.002 (2018).
Article ADS Google Scholar
Li, S. et al. An estimation of the extent of cropland abandonment in mountainous regions of China. Land Degradation & Development 29, 1327–1342, https://doi.org/10.1002/ldr.2924 (2018).
Article ADS Google Scholar
Liu, L. et al. Mapping cropping intensity in China using time series Landsat and Sentinel-2 images and Google Earth Engine. Remote Sens. Environ. 239, 111624, https://doi.org/10.1016/j.rse.2019.111624 (2020).
Article Google Scholar
Huang, L., Luo, Y. & Zhang, D.-L. The Relationship Between Anomalous Presummer Extreme Rainfall Over South China and Synoptic Disturbances. J. Geophys. Res.: Atmos. 123, 3395–3413, https://doi.org/10.1002/2017JD028106 (2018).
Article ADS Google Scholar
Fei, R., Lin, Z. & Chunga, J. How land transfer affects agricultural land use efficiency: Evidence from China’s agricultural sector. Land Use Policy 103, 105300, https://doi.org/10.1016/j.landusepol.2021.105300 (2021).
Article Google Scholar
Liao, L., Long, H., Gao, X. & Ma, E. Effects of land use transitions and rural aging on agricultural production in China’s farming area: A perspective from changing labor employing quantity in the planting industry. Land Use Policy 88, 104152, https://doi.org/10.1016/j.landusepol.2019.104152 (2019).
Article Google Scholar
Sharma, S. K. WTO and policy space for agriculture and food security: issues for China and India. Agricultural Economics Research Review 31, 207–219, https://doi.org/10.5958/0974-0279.2018.00038.1 (2018).
Article Google Scholar
Sun, S. China’s Ban on Canadian Canola: Reasons, Impacts, and Policy Perspectives. https://doi.org/10.7939/r3-bzhn-d142 (2020).
Article Google Scholar
Gocic, M. & Trajkovic, S. Analysis of changes in meteorological variables using Mann-Kendall and Sen’s slope estimator statistical tests in Serbia. Global Planet. Change 100, 172–182, https://doi.org/10.1016/j.gloplacha.2012.10.014 (2013).
Article ADS Google Scholar
Wu, J. et al. Fusing Landsat 8 and Sentinel-2 data for 10-m dense time-series imagery using a degradation-term constrained deep network. Int. J. Appl. Earth Obs. Geoinf. 108, 102738, https://doi.org/10.1016/j.jag.2022.102738 (2022).
Article Google Scholar
Shao, Z., Cai, J., Fu, P., Hu, L. & Liu, T. Deep learning-based fusion of Landsat-8 and Sentinel-2 images for a harmonized surface reflectance product. Remote Sens. Environ. 235, 111425, https://doi.org/10.1016/j.rse.2019.111425 (2019).
Article Google Scholar
Beck, H. E. et al. Present and future Köppen-Geiger climate classification maps at 1-km resolution. Sci. Data 5, 1–12, https://doi.org/10.1038/sdata.2018.214 (2018).
Article ADS Google Scholar

Download references

Acknowledgements

This work was supported in part by the National Key Research and Development Program of China under Grant 2022YFB3903605, in part by the National Natural Science Foundation of China under Grant 42071322, and in part by the Major Science and Technology Project of the Ministry of Water Resources of China under Grant SKS-2022082.

Author information

Authors and Affiliations

Changjiang Institute of Survey Technical Research, MWR, Wuhan, Hubei, 430011, China
Wenbin Liu & Shu Li
The State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan, Hubei, 430072, China
Wenbin Liu, Xiangyu Liu, Guoying Yin & Yu Xia
The Key Laboratory for Geographical Process Analysis & Simulation of Hubei Province/School of Urban and Environmental Sciences, Central China Normal University, Wuhan, Hubei, 430079, China
Jianbin Tao
Hubei Research Institute of Spatial Planning, Wuhan, Hubei, 430064, China
Ting Wang
School of Computer Sciences, China University of Geosciences, Wuhan, Hubei, 430074, China
Hongyan Zhang

Authors

Wenbin Liu
View author publications
You can also search for this author in PubMed Google Scholar
Shu Li
View author publications
You can also search for this author in PubMed Google Scholar
Jianbin Tao
View author publications
You can also search for this author in PubMed Google Scholar
Xiangyu Liu
View author publications
You can also search for this author in PubMed Google Scholar
Guoying Yin
View author publications
You can also search for this author in PubMed Google Scholar
Yu Xia
View author publications
You can also search for this author in PubMed Google Scholar
Ting Wang
View author publications
You can also search for this author in PubMed Google Scholar
Hongyan Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

W.L., S.L. and H.Z. designed the research. X.L., G.Y., T.W. and Y.X. collected the datasets. W.L., X.L., G.Y., Y.X., T.W. and J.T. performed the experiments and analysis. W.L. and H.Z. wrote the original manuscript. W.L., J.T. and H.Z. revised the manuscript.

Corresponding author

Correspondence to Hongyan Zhang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

SUPPLEMENTARY INFORMATION

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Liu, W., Li, S., Tao, J. et al. CARM30: China annual rapeseed maps at 30 m spatial resolution from 2000 to 2022 using multi-source data. Sci Data 11, 356 (2024). https://doi.org/10.1038/s41597-024-03188-1

Download citation

Received: 12 October 2023
Accepted: 25 March 2024
Published: 08 April 2024
DOI: https://doi.org/10.1038/s41597-024-03188-1

Subjects

Abstract

Similar content being viewed by others

The 10-m crop type maps in Northeast China during 2017–2019

High-resolution crop yield and water productivity dataset generated using random forest and remote sensing

Mapping 20 years of irrigated croplands in China using MODIS and statistics and existing irrigation products

Background & Summary

Methods

Study area

Landsat data

GLDAS data

Land cover data

Terrain data

Collected rapeseed samples

Sentinel-1 data

Agricultural statistical data

Existing rapeseed products

Monitoring rapeseed peak flowering phenology

Generating and optimizing training samples

Mapping rapeseed on GEE platform

Data Records

Technical Validation

Validation metrics

Comparison with existing rapeseed maps

Pixel-wise validation using ground reference data

Comparison with agricultural statistics

Spatial patterns of rapeseed cultivation in China

Usage Notes

Advantages of our rapeseed maps

Uncertainties

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

SUPPLEMENTARY INFORMATION

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links