Review
Utilizing geo-referenced imagery for systematic social observation of neighborhood disorder

https://doi.org/10.1016/j.compenvurbsys.2021.101691Get rights and content

Highlights

  • This study brings conceptual clarity in four ‘generations’ of research conducting SSO based on geo-referenced imagery.

  • Cutting-edge technologies provide large, and largely unexplored, opportunities for studying neighborhood disorder.

  • We propose steps forward to leverage ‘big primary data’ to enhance insights for policy, practice and research.

Abstract

Research methods in social science take advantage from broader trends such as digitalization and increasing computational power, however, this is an evolving explorative search. The main purpose of this article is to describe the methodological innovations in the collection and processing of geo-referenced imagery for the observation of neighborhood disorder. In this narrative review, attention is paid to advances in both the data sources and the data processing methods used. Neighborhood disorder is traditionally measured by means of survey methods and (systematic) (social) observations, but these methods have specific shortcomings, such as respectively the subjective measurement that does not deliver a valid measure of actual prevalence of disorderly phenomena and the intensive use of resources in terms of time and money. This has repercussions for (the interpretation of) the results based on these data. Today, scholars have innovative data sources and cutting-edge data processing methods at their disposal that can meet (some of) these shortcomings, but which have not yet been fully explored. In this article, the evolutions in the use of geo-referenced imagery for the observation of neighborhood disorder from the last 25 years are described with a focus on the empirical opportunities, and the methodological challenges and prospects. We conclude by outlining the road ahead: promising avenues for future research to exploit the full potential of ‘big primary data’.

Introduction

In this review article, an overview of the developments in the use of geo-referenced imagery for the observation of neighborhood disorder is provided. The main purpose of this study is to outline the methodological advances and to focus on the empirical opportunities, and methodological challenges and prospects. We specifically focus on the developments in the use of systematic social observation (hereafter: SSO). SSO is considered as a combination of qualitative and quantitative methods, i.e. “in-depth observational work championed by ethnographers and the large-scale, systematic, and generalizable approaches of quantitative social science” (Brunton-Smith, 2018, p. 294). Reiss (1971) described SSO as a method where:

… observation and recording are done according to explicit procedures which permit replication and that rules are followed which permit the use of the logic of scientific inference. The means of observation and recording, whether a person or some form of technology, must be independent of that which is observed and the effects of observing and measuring must be measurable. (p. 4).

The occurring developments regarding SSO are considered on two axes: the first one based on the type of data used (primary versus secondary), and the second axis based on the nature of the data processing methods (manual versus automated). Those two axes can be represented as a quadrant, resulting in four possible combinations which can also be situated in time (see Fig. 1). Primary data are data collected by researchers for specific research goals and are used in that context. Secondary data are data that are not collected for specific research goals but are used in the context of research or primary data used secondary by other researchers (Hox & Boeije, 2005). In other words: with secondary data, existing data are used retroactively to answer a specific research question. Manual coding means that the researchers are responsible for the processing of the data, with human effort in the coding process itself. In the case of SSO, this often involves the systematic coding of observed facts in which the researcher is interested (e.g. number of graffiti-tags or -pieces, number of units of illegal waste, number of cigarette ends). Automated coding means that this coding is executed computationally, without human effort in the coding process itself. More specifically, techniques that belong to the field of computer vision are used for this automatic coding.

It is not the intention to provide a systematic overview of the use of geo-referenced imagery. For this, we refer to systematic reviews of previous research in different fields (e.g., Aghaabbasi, Moeinaddini, Shah, & Asadi-Shekari, 2018; Rzotkiewicz, Pearson, Dougherty, Shortridge, & Wilson, 2018; Vandeviver, 2014). This narrative review is centered around a number of seminal studies, that give an impetus to or are illustrative for the observation of neighborhood disorder. The aim is to give a concise description of how SSO using geo-referenced imagery is being applied, from a quantitative point of view. After all, social observation also has qualitative applications, such as spatial video geo-narratives1 (e.g., Bell, Phoenix, Lovell, & Wheeler, 2015; Curtis, Curtis, Porter, Jefferis, & Shook, 2016), but these will not be discussed further in this article. SSOs are leveraged across many disciplines, for example in public health (e.g., Bader et al., 2015), sociology (e.g., Hwang & Sampson, 2014), and geography (e.g., Conley, Stein, & Davis, 2014). In this article, we focus specifically on the application of SSO in the field of criminology, by applying it to the observation of social and physical neighborhood disorder, as initiated by Sampson and Raudenbush in the mid-90s.

In what follows, we will first focus on disorder and how this is traditionally measured (Section 2). Next, we will chronologically describe the considered developments, as visualized in Fig. 1. In the first phase, researchers used imagery that they collected themselves and then manually coded (Section 3). In the second phase, they started using secondary data that was being coded manually, this under the influence of an increase in the publicly available secondary imagery data (Section 4). In the third phase, researchers started applying automated coding (based on advances in computer sciences, in particular machine learning and more specifically computer vision) to the publicly available secondary imagery data (Section 5). In the last phase, researchers started the application of this automated coding on imagery data that they collected themselves (Section 6). In the concluding section of this article, we reflect on the relatively fast developments of this research domain and the practical implications of this (Section 7). Finally, we will also consider the promising avenues for future research (Section 8).

Section snippets

Disorder as a study object and its traditional measurement methods

Disorder, also known as incivilities, is a frequently studied phenomenon in criminology (see, among others, Kubrin, 2008; Skogan, 2012). Furthermore, disorder has already been studied for a long time in urban contexts and in particular in research into the relevant processes of social (dis)organization (O'Brien, Farrell, & Welsh, 2019). Disorder is associated with various negative effects, both on an ecological level as well as on an individual level. On an ecological level, we speak of

Primary data, manual coding

An important first step in the application of SSO was the collection of primary data that were coded manually. Since SSO was introduced by Reiss in sociology and criminology in the early 1970s, this research method has been applied on a small scale compared to the more established research methods, such as surveys and interviews. Reiss (1975 in Earls, Raudenbush, Reiss, & Sampson, 2005) states that SSO has several specific advantages, such as higher reliability than self-reports, more precise

Secondary data, manual coding

With the arrival of online mapping software, an important step was taken concerning the amount of (publicly) available imagery. This type of secondary data was first available in a bird's-eye view perspective. Later on, various applications of panoramic imagery on micro-level became available. Sampson (2013) states that “[o]ne of the world's powerful companies in effect implemented PHDCN SSO procedures for observing public places” (p. 9). Google Street View is by far the best known application

Secondary data, automated coding

The use of secondary data results in a significant improvement in efficiency with regard to the data collection, but not with regard to the data processing and analysis. The coding of imagery on a large scale, as described in the previous two paragraphs, still requires a large amount of resources in terms of time and labor costs (Hwang, 2017). The emergence of research and applications in the field of artificial intelligence has led to major advances in the automated coding of imagery data.

Primary data, automated coding

The last phase that we describe relates to developments that are visibly unfolding nowadays. New technologies and artificial intelligence offer perspectives in many areas, and thus also in the area of scientific research. As explained in the previous sections, the use of secondary data has important shortcomings. Efforts have already been made to increase the quality of secondary imagery using sparse representation-based image resolution improvement methods (see Li et al., 2018), but the

Discussion

Synthesizing prior research on the use of imagery in the context of SSO of neighborhood disorder learns that there are large opportunities for both the use of new technologies for the purpose of data collection and the use of artificial intelligent methods for the purpose of data processing in this research domain. These evolutions in the methodological toolkit perfectly align with the original purpose of systematic observation, as initiated in sociology and criminology by Albert J. Reiss Jr.

Avenues for future research

The combination of these ‘big primary data’ and the artificial intelligent data processing and analysis methods is for the time being largely unexplored territory with still plenty of room for exploratory research. The steps that could be taken in the coming years thanks to contemporary technological developments will not only enrich scientific research but will also benefit society when applications are developed and used to measure, detect and prevent disorder and crime. With the promises and

Funding

The authors received no financial support for the research, authorship, and/or publication of this article.

Declaration of Competing Interest

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

References (88)

  • A. Rzotkiewicz et al.

    Systematic review of the use of Google Street View in health research: Major themes, strengths, weaknesses and possibilities for future research

    Health & Place

    (2018)
  • N. Sastry et al.

    The design of a multilevel survey of children, families, and communities: The Los Angeles family and neighborhood survey

    Social Science Research

    (2006)
  • D. Wallace et al.

    Testing the temporal nature of social disorder through abandoned buildings and interstitial spaces

    Social Science Research

    (2015)
  • C.C. Aggarwal

    Neural networks and deep learning

    (2018)
  • A. Amaya et al.

    Total error in a big data world: Adapting the TSE framework to big data

    Journal of Survey Statistics and Methodology

    (2020)
  • A.B. Arrieta et al.

    Explainable artificial intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI

    Information Fusion

    (2020)
  • S.L. Bell et al.

    Using GPS and geo-narratives: A methodological approach for understanding and situating everyday green space encounters

    Area

    (2015)
  • W. Bernasco et al.

    Promise into practice: Application of computer vision in empirical research on social distancing

    (2021)
  • S. Bloch

    An on-the-ground challenge to uses of spatial big data in assessing neighborhood character

    Geographical Review

    (2020)
  • D. Britz

    Understanding convolutional neural networks for NLP

  • I. Brunton-Smith

    Systematic social observation

  • D.T. Campbell et al.

    Convergent and discriminant validation by the multitrait-multimethod matrix

    Psychological Bulletin

    (1959)
  • C. Clews et al.

    Alcohol in urban streetscapes: A comparison of the use of Google street view and on-street observation

    BMC Public Health

    (2016)
  • J. Conley et al.

    A spatial analysis of the neighborhood scale of residential perceptions of physical disorder

    Applied Spatial Analysis and Policy

    (2014)
  • A. Curtis et al.

    Context and spatial nuance inside a neighborhood’s drug hotspot: Implications for the crime-health nexus

    Annals of the American Association of Geographers

    (2016)
  • K. Dakin et al.

    Built environment attributes and crime: An automated machine learning approach

    Crime Science

    (2020)
  • S. Das

    CNN architectures: LeNet, AlexNet, VGG, GoogLeNet, ResNet and more

  • J. Deng et al.

    Imagenet: A large-scale hierarchical image database [Paper presentation]

  • F.J. Earls et al.

    Project on Human Development in Chicago Neighborhoods (PHDCN): Systematic Social Observation, 1995

  • K. Eykholt et al.

    Robust physical-world attacks on deep learning visual classification [Paper presentation]

  • J.M. Gau et al.

    Broken windows or window dressing? Citizens’ (in)ability to tell the difference between disorder and crime

    Criminology & Public Policy

    (2008)
  • T. Gebru et al.

    Using deep learning and Google Street View to estimate the demographic makeup of neighborhoods across the United States

    Proceedings of the National Academy of Sciences

    (2017)
  • I.J. Goodfellow et al.

    Explaining and harnessing adversarial examples

    arXiv

    (2014)
  • Google. (n.d.). Google-contributed Street View imagery policy....
  • R.M. Groves et al.

    Survey Methodology

    (2004)
  • R.M. Groves et al.

    Total survey error: Past, present, and future

    Public Opinion Quarterly

    (2010)
  • D. Gunning

    Explainable artificial intelligence (xai)

  • K. Hao

    Making face recognition less biased doesn't make it less scary

  • W. Hardyns et al.

    A multilevel analysis of collective efficacy, neighborhood disorder, and individual social capital on avoidance behavior

    Crime & Delinquency

    (2019)
  • J.R. Hipp et al.

    Measuring the built environment with Google Street View and machine learning: Consequences for crime on street segments

    Journal of Quantitative Criminology.

    (2021)
  • E.M. Hoeben et al.

    Measuring disorder: Observer bias in systematic social observations at streets and neighborhoods

    Journal of Quantitative Criminology

    (2018)
  • Y.P. Hsieh et al.

    Total twitter error: Decomposing public opinion measurement on Twitter from a total survey error perspective

  • J. Hwang

    Invited commentary: Observing neighborhood physical disorder in an age of technological innovation

    American Journal of Epidemiology

    (2017)
  • J. Hwang et al.

    Divergent pathways of gentrification: Racial inequality and the social order of renewal in Chicago neighborhoods

    American Sociological Review

    (2014)
  • Cited by (4)

    View full text