Assessing Global OpenStreetMap building completeness to generate large-scale 3D city models
2020-07-05, 10:00–10:20, Academic Track | Track 2 - Sunday, July 5

This presentation describes the ongoing work at the Urban Analytics Lab at the National University of Singapore, developing novel methods to assess building completeness at a multi-country scale, as part of a broader project of generating 3D city models on a large-scale using OpenStreetMap.

Quality assessment of OpenStreetMap (OSM) data has been an important topic since the inception of the project. Much research has been done on this topic by many research groups around the world, and it can mostly be seen as permutations of three aspects: (1) spatial data quality element(s) in focus (e.g. positional accuracy, completeness), (2) theme (e.g. amenities, buildings, roads), (3) geographical area (e.g. particular city or country); e.g. positional accuracy of cultural features in Italy.
Completeness is one of the principal quality aspects of geospatial data, and our research focuses on developing a method to assess the completeness of buildings in OSM on a large scale (spanning several countries). While there are many robust OpenStreetMap completeness techniques and studies developed, they mainly focus on limited areas, mostly developed countries with ground truth data at hand for comparisons.
Doing the same for less developed regions is rare as the lack of authoritative data inherently hampers it, and the methods hardly ever scale: such an analysis done simultaneously for more than one administrative region is seldom carried out as there are other research challenges such as disparate urban morphology, different data sources and standards to bridge in order to facilitate ground truth, and varying understanding of what a building is. Furthermore, the development of a method that would scale across dozens of countries is limited by computational resources.
We are currently developing a method that uses several indicators derived from remote sensing, which are available on the global level, that may hint at the building completeness and would scale across the world. A regression model to predict the approximate volume of buildings in a given area is trained in areas in which there is an indication of high completeness of buildings in OSM.
OSM building completeness is estimated by comparing the number of mapped buildings against their expected (predicted) amount in reality. The method has the potential to scale at a worldwide level, and completeness is estimated for a grid of resolution of approximately 1x1 square kilometres, and simultaneously for administrative regions to enable cross-country comparisons.
The work is being implemented in Google Earth Engine, mostly relying on imagery and indicators such as normalised difference built-up index (NDBI) and normalised difference vegetation index (NDVI).
Preliminary results suggest a substantial disparity in OSM building completeness around the world, with areas that are entirely complete to those with inadequate completeness.
This presentation aims to report the progress of the ongoing work and encountered challenges in the project such as selecting indicators that are consistently available on the global level, scaling the method to the global level, accounting for different mapping practices around the world, different notions of a building, and varying morphologies of cities and urban areas. We also investigate the relation between OSM building completeness and socio-economic parameters such as GDP to understand their relationships to mapping intensity and quality. These may offer the potential to be used as additional predictors.
This work is part of a broader project conducted at the Urban Analytics Lab at the National University of Singapore on investigating the potential of generating 3D city models by extruding building footprints in OpenStreetMap to a building height that is predicted using artificial intelligence. This ongoing portion of the project is the crucial first step in the project, as it will enable us to understand what is the completeness of building footprints in OpenStreetMap around the world and manage expectations about the potential coverage of 3D city models.