Exploration of OpenStreetMap missing built-up areas using twitter hierarchical clustering and deep learning in Mozambique

doi:10.1016/j.isprsjprs.2020.05.007

ISPRS Journal of Photogrammetry and Remote Sensing

Volume 166, August 2020, Pages 41-51

https://doi.org/10.1016/j.isprsjprs.2020.05.007

Abstract

Accurate and detailed geographical information digitizing human activity patterns plays an essential role in response to natural disasters. Volunteered geographical information, in particular OpenStreetMap (OSM), shows great potential in providing knowledge of human settlements to support humanitarian aid, although the availability and quality of OSM data remain a major concern. Most existing work on assessing OSM data quality focuses on either extrinsic or intrinsic analysis, which is, to a certain degree, insufficient for the humanitarian mapping scenario. This paper explores OSM missing built-up areas from an integrative perspective of social sensing and remote sensing. First, by applying the hierarchical DBSCAN (HDBSCAN) clustering algorithm, clusters of geo-tagged tweets are generated as proxies of human active regions. Then a deep learning based model fine-tuned on existing OSM data is proposed to further map the missing built-up areas. Hit by Cyclones Idai and Kenneth in 2019, the Republic of Mozambique is selected as the study area to evaluate the proposed method at a national scale. As a result, 13 OSM missing built-up areas are identified and mapped with an overall accuracy of over 90%, which is competitive with state-of-the-art products and confirms the effectiveness of the proposed method.
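For readers unfamiliar with the "overall accuracy" figure quoted above, the following is a minimal sketch of how such a metric is usually derived from a binary confusion matrix (built-up vs. non-built-up). The function names and the counts in the usage lines are illustrative assumptions only; they are not the counts or the evaluation code behind the result reported in this paper.

```python
def overall_accuracy(tp: int, fp: int, fn: int, tn: int) -> float:
    """Share of all validation samples that were classified correctly."""
    total = tp + fp + fn + tn
    if total == 0:
        raise ValueError("confusion matrix is empty")
    return (tp + tn) / total


def precision(tp: int, fp: int) -> float:
    """Share of predicted built-up samples that really are built-up."""
    if tp + fp == 0:
        raise ValueError("no positive predictions")
    return tp / (tp + fp)


def recall(tp: int, fn: int) -> float:
    """Share of real built-up samples that the model found."""
    if tp + fn == 0:
        raise ValueError("no positive reference samples")
    return tp / (tp + fn)


# Made-up counts for illustration (NOT taken from the paper).
tp, fp, fn, tn = 45, 3, 5, 47
print(overall_accuracy(tp, fp, fn, tn))
print(precision(tp, fp))
print(recall(tp, fn))
```

Overall accuracy alone can be misleading when one class dominates the validation set, which is why precision and recall are shown alongside it.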

Introduction

Over the last decades, Volunteered Geographic Information (VGI) has been collected in a much more detailed, dynamic, and manifold way than ever before from heterogeneous data sources, such as location-based services, global positioning systems (GPS), high-resolution earth observation data, and crowdsourced geographic information (Goodchild, 2007). OpenStreetMap (OSM) is considered the most active and widely used VGI platform. However, its reliability and accessibility remain variable due to the high diversity of volunteers’ mapping behavior (Barron et al., 2014). Data quality is regarded as the first topic that suggests itself to anyone encountering VGI for the very first time (Goodchild and Glennon, 2010). Therefore, exploring the data quality and accessibility of OSM data requires further research towards developing sophisticated methods that integrate multiple social and geographical perspectives. Better quality-oriented awareness is essential to improve the data quality and boost the application of OSM data in general.

Existing work on the quality of OSM data falls mainly into two streams. One common approach is to compare OSM data with authoritative reference data sets (Fan et al., 2014, Zielstra et al., 2013, Neis et al., 2012, Mooney and Corcoran, 2012), which are collected by federal agencies or commercial map providers. However, the acquisition of such reference data sets depends heavily on socio-economic factors (e.g., time, costs, and human labor restrictions), which limits the application of this extrinsic analysis approach. As an alternative, intrinsic data analysis looks into the historical data, where intrinsic indicators show great potential as alternate indicators of OSM data quality (Barron et al., 2014, Zhang et al., 2018, Jackson et al., 2013, Ostermann and Spinsanti, 2011). In a data-sparse scenario where most settlement and street features are simply missing from OSM data, these established approaches are no longer adequate because both reference data and historical data are lacking. Therefore, robust and efficient quality indicators are necessary, and they should be easy to generate from widely available open geospatial data.

With the ever faster growing need for disaster response worldwide, we have witnessed increasing demand for accurate geographical information on the spatial distribution of human settlements. Examples include the 2008 Wenchuan earthquake, the 2010 Haiti earthquake, and the 2019 Cyclones Idai and Kenneth in Mozambique, which all caused tremendous damage, injuries, and loss of human lives. VGI, especially OpenStreetMap, has opened a new window in supporting such disaster response by establishing humanitarian mapping projects, with the motto of “mapping the most vulnerable places in the world” (Scholz et al., 2018). When considering the quality of OSM data, we should keep in mind that quality may have diverse meanings, depending on the application to which the information is to be put (Goodchild and Glennon, 2010). In a disaster response mapping scenario, the first priority is to map as many built-up areas with potential human settlements as possible, since these could be vulnerable to the disaster. In other words, positional accuracy is often outweighed by completeness as a dimension of quality. Towards indicating the overall completeness of OSM data with a focus on human settlements, we dedicate this work to the exploration of OSM missing built-up areas by integrating remote sensing and social sensing (Liu et al., 2015b) perspectives.

In this paper, we explore how social media and earth observation data can be used as reliable alternate sources to estimate OSM data quality in terms of missing built-up areas. To this end, we propose a novel method that exploits the complementary value of hierarchical clustering of geo-tagged tweets and deep learning based built-up area mapping for large-scale OSM data quality indication. By implementing the proposed method in Mozambique, Africa, we successfully explored a range of OSM missing built-up areas, which deserve detailed mapping by volunteers in the future. The research questions answered in this paper are twofold.

• (RQ1): How can we discover the relationship between human active regions and geo-tagged tweets clusters at diverse scales of density, shape, and random cluster number?
• (RQ2): How can we further estimate and map OSM missing built-up areas within the discovered regions even with little prior knowledge?
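To make the idea behind RQ1 more concrete, the following is a minimal sketch of density-based clustering of geo-tagged points. It deliberately uses plain DBSCAN (Ester et al., 1996) with a single distance threshold, not the hierarchical HDBSCAN used in this paper, so it cannot handle clusters of varying density. The coordinates are synthetic and the parameters `eps_km` and `min_pts` are assumptions chosen for illustration.

```python
import math
from typing import List, Optional, Tuple

Point = Tuple[float, float]  # (latitude, longitude) in decimal degrees
NOISE = -1
EARTH_RADIUS_KM = 6371.0


def haversine_km(a: Point, b: Point) -> float:
    """Great-circle distance between two (lat, lon) points in kilometres."""
    lat1, lon1, lat2, lon2 = map(math.radians, (a[0], a[1], b[0], b[1]))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(h))


def dbscan(points: List[Point], eps_km: float, min_pts: int) -> List[int]:
    """Return one label per point: a cluster id (0, 1, ...) or NOISE (-1)."""
    labels: List[Optional[int]] = [None] * len(points)

    def neighbours(i: int) -> List[int]:
        # Brute-force range query; the point itself counts as a neighbour.
        return [j for j, q in enumerate(points)
                if haversine_km(points[i], q) <= eps_km]

    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue  # already assigned or marked as noise
        seeds = neighbours(i)
        if len(seeds) < min_pts:
            labels[i] = NOISE  # may later become a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster  # noise reachable from a core point
            if labels[j] is not None:
                continue
            labels[j] = cluster
            reachable = neighbours(j)
            if len(reachable) >= min_pts:
                queue.extend(reachable)  # j is a core point: keep expanding
    return [NOISE if lab is None else lab for lab in labels]


# Synthetic points: two dense groups and one isolated point.
tweets = [(-19.84, 34.84), (-19.85, 34.85), (-19.83, 34.83), (-19.84, 34.86),
          (-25.97, 32.57), (-25.96, 32.58), (-25.98, 32.56),
          (-15.00, 39.00)]
labels = dbscan(tweets, eps_km=5.0, min_pts=3)
print(labels)
```

With one global `eps_km`, sparse rural groups and dense urban groups cannot both be captured well, which is the limitation that motivates a hierarchical, multi-density method for RQ1.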

The remainder of this paper is organized as follows: Section 2 introduces the relevant work of assessing OSM data quality, summarizing state-of-the-art studies. The overall methodology is presented in Section 3, followed by Section 4 that evaluates the performance of our OSM missing built-up areas mapping method in Mozambique. Section 5 discusses the results and provides suggestions for future work. Finally, Section 6 wraps up this paper with conclusions.

Section snippets

Assessing OpenStreetMap data quality

With the rapid growth of the OSM community and its applications, data quality has become a more crucial research topic than ever before, especially for authoritative consumers (e.g., humanitarian organizations and local governments) (Fonte et al., 2015). Quality measurements of spatial data usually follow the principles of the International Organization for Standardization (ISO) under ISO 19113 and ISO 19157, which consist of multiple elements such as accuracy, completeness, logical consistency, etc.
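As a rough illustration of the completeness element, the sketch below compares two sets of grid cells: cells flagged as built-up by some detector and cells that already contain OSM buildings. The grid-cell representation, the function names, and the ratio itself are assumptions made for this example; they are neither the formal ISO definition nor the measure used in this paper.

```python
from typing import Set, Tuple

Cell = Tuple[int, int]  # (row, col) index of a grid cell


def missing_cells(detected: Set[Cell], osm_mapped: Set[Cell]) -> Set[Cell]:
    """Cells detected as built-up that have no OSM buildings yet."""
    return detected - osm_mapped


def completeness(detected: Set[Cell], osm_mapped: Set[Cell]) -> float:
    """Share of detected built-up cells that are already mapped in OSM."""
    if not detected:
        raise ValueError("no detected built-up cells to compare against")
    return len(detected & osm_mapped) / len(detected)


# Hypothetical toy grid: four detected cells, two of them mapped in OSM.
detected = {(0, 0), (0, 1), (1, 0), (1, 1)}
osm_mapped = {(0, 0), (1, 1), (5, 5)}

print(completeness(detected, osm_mapped))           # 0.5
print(sorted(missing_cells(detected, osm_mapped)))  # [(0, 1), (1, 0)]
```

The cell `(5, 5)` is mapped in OSM but not detected, so it affects neither value here; a fuller treatment would also have to account for detector errors.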

Exploring OpenStreetMap missing built-up areas

With respect to large-scale exploration of OSM missing built-up areas, it would be too optimistic to rely on a single module of either hierarchical tweets clustering or fine-tuned building detection neural networks, since the two modules focus on different geographical scales and each has its limitations. On the one hand, through spatial clustering of geo-tagged tweets, we could define the human active regions (HAR) as those areas where Twitter users have clustered and posted a significant amount of

Study areas and data description

The Republic of Mozambique, which was severely devastated by Cyclones Idai and Kenneth in 2019, is selected as our study area (Fig. 4) in this paper. This is the first time in recorded history that two strong tropical cyclones hit Mozambique in the same season. Nearly 2.2 million people in Mozambique have been left in urgent need of humanitarian assistance, such as health care, nutrition, protection, and water and sanitation. Correspondingly, Humanitarian OpenStreetMap Team (HOT)

Discussions

Considering the HDBSCAN clustering results of geo-tagged tweets in Section 4.2, we believe that the desired clusters should have highly individual search radii, shapes, and densities, which reflects the heterogeneous level of socioeconomic development across Mozambique. This fact further distinguishes our work from existing social media data mining works, such as Steiger et al., 2016, Liu et al., 2019, which mainly focus on individual human activity patterns in an urban city

Conclusions

In this paper, we presented a novel method for exploring OSM missing built-up areas from a joint perspective of social sensing and remote sensing. The proposed method consists of two core modules: identifying human active regions with geo-tagged tweets clustering, and mapping built-up areas by deep learning from existing OSM buildings and satellite imagery. To conclude the results answering RQ1, we demonstrated the capability of HDBSCAN in deriving multi-density tweets clusters with random

Declaration of Competing Interest

No potential conflict of interest was reported by the author.

Acknowledgements

The authors would like to take this opportunity to thank the editors and reviewers for their valuable comments and suggestions. This work has been partly supported by the Klaus Tschira Stiftung (KTS) Heidelberg.

References (55)

Y. Hu et al., Extracting and understanding urban areas of interest using geotagged photos, Comput. Environ. Urban Syst. (2015)
W. Huang et al., Understanding human activity patterns based on space-time-semantics, ISPRS J. Photogramm. Remote Sens. (2016)
A. Luque et al., The impact of class imbalance in classification performance metrics based on the binary confusion matrix, Pattern Recogn. (2019)
J.E. Vargas-Muñoz et al., Correcting rural building annotations in openstreetmap using convolutional neural networks, ISPRS J. Photogramm. Remote Sens. (2019)
M. Wurm et al., Semantic segmentation of slums in satellite images using transfer learning on fully convolutional neural networks, ISPRS J. Photogramm. Remote Sens. (2019)
C. Barron et al., A comprehensive framework for intrinsic openstreetmap quality analysis, Trans. GIS (2014)
R.J.G.B. Campello et al., Density-based clustering based on hierarchical density estimates
R.J.G.B. Campello et al., Hierarchical density estimates for data clustering, visualization, and outlier detection, ACM Trans. Knowl. Discov. Data (2015)
J. De Albuquerque et al., The tasks of the crowd: A typology of tasks in geographic information crowdsourcing and a case study in humanitarian mapping, Remote Sens. (2016)
Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L., 2009. ImageNet: A Large-Scale Hierarchical Image...
Ertöz, L., Steinbach, M., Kumar, V., 2003. Finding clusters of different sizes, shapes, and densities in noisy, high...
T. Esch et al., Urban footprint processor—fully automated processing chain generating settlement masks from global data of the tandem-x mission, IEEE Geosci. Remote Sens. Lett. (2013)
Ester, M., Kriegel, H.P., Sander, J., Xu, X., et al., 1996. A density-based algorithm for discovering clusters in large...
M. Everingham et al., The pascal visual object classes (voc) challenge, Int. J. Comput. Vision (2010)
H. Fan et al., Quality assessment for building footprints data on openstreetmap, Int. J. Geograph. Informat. Sci. (2014)
C.C. Fonte et al., Usability of vgi for validation of land cover maps, Int. J. Geograph. Informat. Sci. (2015)
J.F. Girres et al., Quality assessment of the french openstreetmap dataset, Trans. GIS (2010)
M.F. Goodchild, Citizens as sensors: The world of volunteered geography, GeoJournal (2007)
M.F. Goodchild et al., Crowdsourcing geographic information for disaster response: a research frontier, Int. J. Digital Earth (2010)
M. Haklay, How good is volunteered geographical information? a comparative study of openstreetmap and ordnance survey datasets, Environ. Plann. B: Plann. Des. (2010)
He, K., Gkioxari, G., Dollar, P., Girshick, R., 2017. Mask r-cnn. In: The IEEE International Conference on Computer...
R. Hecht et al., Measuring completeness of building footprints in openstreetmap over space and time, ISPRS Int. J. Geo-Informat. (2013)
B. Herfort et al., Mapping human settlements with higher accuracy and less volunteer efforts by combining crowdsourcing and deep learning, Remote Sensing (2019)
Huang, J., Rathod, V., Sun, C., Zhu, M., Korattikara, A., Fathi, A., Fischer, I., Wojna, Z., Song, Y., Guadarrama, S.,...
S.P. Jackson et al., Assessing completeness and spatial error of features in volunteered geographic information, ISPRS Int. J. Geo-Informat. (2013)
A.K. Jain et al., Algorithms for Clustering Data (1988)
P. Kaiser et al., Learning aerial image segmentation from online maps, IEEE Trans. Geosci. Remote Sens. (2017)
