ML-LUM: A system for land use mapping by machine learning algorithms

https://doi.org/10.1016/j.cola.2019.100908Get rights and content

Abstract

The land use mapping refers to mapping and assessing changes and patterns of land use. The use of agricultural land maps becomes increasingly important. The governments, private sectors, research agencies, and community groups rely on land use mapping data for natural resource assessment, monitoring, and planning. Finding an effective mapping approach is thereby crucial for natural resource condition monitoring and investment, agricultural productivity, sustainability and planning, biodiversity conservation, natural disaster management, and bio-security.

In this paper, four machine learning algorithms, i.e., the classic k-Nearest Neighbour (kNN), Support Vector Machines (SVMs), Convolutional Neural Network (CNN), and newly developed Capsule Network (CapsNet), are applied to classify satellite images for land use. For comprehensively comparing the performance of different algorithms for land use mapping, the experiments have been conducted on real-world datasets. Based on the experiment results, several improvements on the algorithms are proposed in order to fulfil the requirement of a large-scale land mapping. In addition, we design and implement these algorithms for land use mapping in a Machine Learning Land Use Mapping (ML-LUM) system. The system is able to train the models, predict classifications of satellite images, map the land use, display the land use statistic data, and predict production yields. With a friendly graphic user interface for farmers, the system is implemented by using the cloud computing technique for processing large land use data. Furthermore, we present a case study. For the case study, a banana plantation area from a given satellite image is correctly marked and the area size is then calculated, together with predicting banana production.

Introduction

Land use mapping becomes increasingly important. For example, the remote sensing centre in Queensland relies on satellite remote sensing data to map land use and land use changes, under the Australian Land Use Mapping Program (ALUMP) [1]. The program aims to build nationally consistent land use datasets so as to meet a wide range of user needs and to make the best use of existing data and available resources [1]. In particular, ALUMP provides nationally consistent spatial information in order to assist in planning, natural resource condition monitoring and investment, agricultural productivity and sustainability, biodiversity conservation, as well as natural disaster management and bio-security. As such, the Australian government, private sectors, research agencies and community groups can use the ALUMP datasets for natural resource assessment, monitoring, and planning [1].

The majority current methodologies of mapping land use still require skilled teams to manually interpret satellite imagery data and manually classify land use features [2]. It is obvious that this process is time and resource intensive. Finding an effective mapping approach will significantly reduce such costs.

With the rapid advances in technologies, automatic land use mapping becomes possible. Advances in machine learning (ML) enable to train a machine learning model on ALUMP's datasets for automatically mapping land use features [2]. In other words, ML algorithms make automatic land use mapping possible, by training models on all of the available data. As such, machine learning algorithms can be used to identify the land use from satellite images, to reduce the intensive manual effort required and to produce results in a shorter period of time [3]. Furthermore, the use of machine learning for land use mapping can make predictions. This can guide better decisions and smart actions for land use in real time. In addition, it can help seasonal crop mapping and monitoring [3] and control, preventing Banana Bunchy Top Virus (BBTV) outbreak [4].

This study will investigate the use of machine learning to identify land use from satellite imagery so as to reduce intensive manual efforts required and obtain results in shorter timeframes. The question is how to apply appropriate machine learning algorithms to automatically identify and classify land use.

In order to answer this question, we start by identifying the requirements of land use mapping as follows:

  • 1.

    Accuracy: a high accuracy is expected for the land use mapping; otherwise manual corrections by experienced staff are still needed.

  • 2.

    Processing speed: the speed of data processing is essential. This is particularly true when the outcome results are in an urgent need, such as disaster management.

These requirements for land use mapping pose challenges to automatic mapping systems. Further, the computing capacity is essential for the huge size of the data. For example, Queensland Australia has approximately 173 million hectares in area. A baseline dataset of land use mapping in Queensland contains approximately 160,000 features [3]. An example is shown in Fig. 1. It is a challenge for a program and algorithm to process such a large dataset.

In this paper, we have made the following contributions:

  • we conduct extensive experiments on comparing several important ML algorithms for land use mapping.

In particular, the deep learning algorithms of CNN and the newly developed CapsNet are used. To the best of our knowledge, this is the first work on applying deep learning algorithms, particular the newly developed CapsNet algorithm to land use mapping. Some improvements on the algorithms for land mapping use have been proposed, particularly for the settings of parameters in the algorithms.

  • We make recommendations on how to choose algorithms for land use mapping under different cases are made. The recommendation is based on comparing and analysing the experimental results on both public datasets and satellite imagery datasets.

  • We design and develop a system that implemented these machine learning algorithms in land use mapping (ML-LUM). By giving a set of satellite images, the ML-LUM system uses different machine learning algorithms to produce classification outputs. The system is implemented in a cloud computing environment that uses supercomputer resources for processing huge data quickly.

The organization of this paper is as follows. Section 2 reviews related work. Section 3 presents the compared approaches, followed by Section 4 that presents the cloud implementation of the system. Section 5 describes the experimental methodologies. Section 6 provides an illustrative case study. The paper concludes in Section 7.

Section snippets

Algorithms in land mapping systems

A number of methods for classifying satellite images are available. These methods can be classified into four categories [5]: template matching-based methods, knowledge-based methods, OBIA-based methods, and machine learning-based methods.

The template matching-based methods, knowledge-based methods, and OBIA-based methods are built on traditional techniques such as segmentation, edge detection, feature extraction, and classification. These methods have been used in remote sensing image analysis

Implemented algorithms in the ML-LUM system

In our ML-LUM system, the following algorithms are implemented: kNN, SVM, CNN, and CapsNet. The former two are traditional ones while the latter two are recently advanced ones. We briefly review the four algorithms in the following.

k-Nearest Neighbour (kNN): As one of the simplest classification algorithms available for supervised learning [12,13], kNN searchs for the closest match of the test data in feature space [12].

Input: a training sample S = (x1,y1),…,(xm,ym)

Output: for every point xX,

Overview of ML-LUM

For comparing the algorithms, the ML-LUM system is designed and developed to implement the ML algorithms in land use mapping. The modular design is applied to make the system extendable.

Experiments

Four ML algorithms have been implemented to process various satellite images and to produce classification results by using Python 3. The selection of Python 3 as the programming language is based on its great range of new libraries for machine learning applications. Python 3 supports TensorFlow which was developed by Google Brain Team. It is also able to build applications with multi-thread and GPU accelerate support.

The main libraries include TensorFlow-GPU, Keras, OpenCV and Matplotlib. The

A case study: estimation of banana plantation areas using ML-LUM

In order to further validate the experiment conclusions, we compare these algorithms by applying them to a real-world example. Given high definition satellite images as inputs, the trained model will identify which area is a banana plantation. After this, the ML-LUM system calculates the size of the identified areas, and predicts the productions of the banana plantation.

The ML-LUM system makes use of the trained model to classify land use. It also marks the banana plantation (Banana Farms) in

Conclusion

Land use mapping becomes increasingly important. In this paper, we have compared several ML algorithms ranging from classic to state-of-the-art against the three datasets for the mapping task. In order to achieve the best performance, we have improved the algorithms for this particular task. Through the experiments, we conclude that CNN and CapsNet are good candidates for land use mapping in terms of their prediction accuracy. In particular, CapsNet uses much more computing resource and time

Acknowledgment

This work was supported in part by the National Natural Science Foundation of China (Nos. 61877020). We acknowledge CSU's Spatial Data Analysis Network (SPAN) Team and Mr Simon McDonald for providing valuable support including data, tools and software.

Cloud implementation of this research was supported by use of the NeCTAR Research Cloud, a collaborative Australian research platform supported by the National Collaborative Research Infrastructure Strategy.

References (25)

  • T.E. Parece

    Remote sensing in an arcmap environment

    Classif. Landsat Image (Supervised)

    (2014)
  • C. Makowski et al.

    Geoform and landform classification of continental shelves using geospatially integrated ikonos satellite imagery

    J. Coast. Res.

    (2016)
  • View full text