Introduction

The use of artificial intelligence and algorithmic decision-making now permeates many parts of society and the economy, with an increasing number of government agencies as well as private sector entities considering—and indeed using—this type of technology. In the summer of 2020, just a few months before the experiment described in this paper took place, the use of algorithms was suddenly thrown into the public eye in the UK with the announcement that, as a result of the COVID-19 pandemic, A-Level exam results in England had been calculated in this way. This had serious consequences for the students sitting the exams, with almost 40% of results being “downgraded” (Coughlan, 2020). The resulting furore led to the withdrawal of the algorithmically determined grades and the use of teacher-predicted marks instead.

In the light of such developments—including, pertinent to the current paper, concerns about “algorithmic justice” (Huq, 2019)—a good deal of public and scholarly attention has recently focused on the use of algorithms to aid or replace human decision-making (Dhasarathy et al., 2020). This work has raised a number of questions concerning the fairness and consistency of algorithmic decisions. Like other public sector organizations, the police have begun incorporating new technologies into their working environment, with a range of programmes, trials and other implementations either in place now or currently under development. The use of algorithms and artificial intelligence by police—along with data analytics and the broader shift to “data-driven policing” (Kearns & Muir, 2019)—has been driven both by the reduction of resources for the public sector and by the availability of digital technology (Ferguson, 2017; David and Ola, 2020). Because of their potential for high accuracy, low cost, effectiveness and efficiency, artificial intelligence and algorithmic tools can be appealing to public services (Shrestha & Yang, 2019). Use cases range from Live Facial Recognition technology used to identify wanted people (Fussey & Murray, 2019) to predictive policing (David and Ola, 2020), risk assessment (Oswald et al., 2018) and the Most Serious Violence Tool (Home Office, 2019). More widely, there is an expectation that police should have the same access to, and ability to utilise, technology as the general public and private sector actors (Mackey, 2020), and police use of new technologies in many ways merely mirrors wider societal change.

It seems likely that more and more responsibility will be handed to these new technologies and that there will be greater reliance on policing decisions made primarily by machines (Ridgeway, 2019). However, while the potential benefits are notable, there is also scepticism and concern about the fairness of such systems, and their potential to inadvertently discriminate against, for example, certain minority groups. Previous research has highlighted concerns about potential negative effects arising from the use of predictive policing tools (Ferguson, 2017; Couchman, 2019; David and Ola, 2020), most notably in relation to the inability of the technology to take all relevant information into account, and the potential for “baking in” disproportionality and discrimination (Babuta & Oswald, 2020; Brayne, 2020).

In this paper, we explore how the use of algorithmic tools to make operational policing decisions affects lay reactions to police activity. Public reactions to the introduction of algorithmic decision-making in policing are comparatively understudied, but at the threshold it seems likely that two inter-related sets of concerns are in play. First, will people trust the decisions made by algorithms? Will they view policing decisions made by machines as more or less trustworthy than those made by human actors? As we describe below, the literature on public views of algorithmic decision-making suggests people can hold a complex and quite subtle set of opinions, and the answers to these and related questions remain unclear. Second, will people trust the police to use this technology appropriately? A growing body of research has explored public trust as a critical factor shaping public attitudes towards, and acceptance of, police uptake of new technology (see among many others Ariel et al., 2018; St Louis et al., 2019; Meijer & Wessels, 2019; Ridgeway, 2019; Bradford et al., 2020). Such trust is of course not “free-floating” or entirely prior to the development under consideration, but is built up through the direct, vicarious and mediated experiences people have of policing (Jackson et al., 2013). The current study uses an experimental approach to test, first, public acceptability of the use of algorithmic tools in specific operational policing decisions and, second, support for (or opposition to) police use of this technology in a general sense.

Who or what is making the decision, and does it have trustworthy motives?

We know (or think we know) how we make decisions, and we have (or think we have) some insight into the potentially complex amalgamation of information that goes into the decision-making process. Most people, however, are much less aware of how machines make these same calculations and decisions (Grzymek & Puntschuh, 2019; Lee, 2018). Evidence suggests that algorithmic decision-making processes are perceived as having less agency and emotional capability than humans, rendering algorithmic decision-makers more rational and less intentional or emotional (Lee, 2018). Statistical models have been shown to outperform humans at predicting various outcomes across several disciplines (Kleinberg et al., 2018). Within healthcare, for instance, algorithmic tools are predicted to perform with expert-level accuracy and deliver cost-effective healthcare at scale—often outperforming human healthcare providers (Longoni et al., 2019). It is easy to imagine that this superior accuracy would be preferable to many, with people willing to follow the advice of data-driven technology over human intuition.

Yet research has suggested that more weight is often placed on advice given by a human expert than on the same advice given by an algorithm (Dietvorst et al., 2015; Önkal et al., 2009); people are, for example, more likely to follow the recommendation of a physician than of a computer (Promberger & Baron, 2006). Indeed, Longoni and colleagues (2019) demonstrated, across a variety of medical decisions, a robust reluctance to rely on algorithms and artificial intelligence compared to human care providers. Aversion to healthcare delivered by artificial intelligence—to the prospect of being cared for by a decision-making machine—may reflect a concern that one’s unique characteristics, circumstances and symptoms will be neglected by a “cold”, impersonal machine that lacks motives that could be deemed trustworthy or untrustworthy.

The type of decision to be made (and/or what the decision is about) also seems to influence public perceptions. Lee (2018) found that when the decision-maker (either algorithmic or human) was making a managerial decision about a mechanical task (for example scheduling employees’ shift patterns), algorithm-made and human-made decisions were perceived as equally fair and trustworthy. However, differences appeared when the decision-maker was considering a human task (for example whether to arrest someone, although that was not a focus of Lee’s study); here, algorithmic decisions were perceived as less fair and trustworthy and evoked more negative emotions than human decisions. People seem to think that algorithms lack intuition and the capacity for subjective judgement (Lee, 2018). Applied to the current context, the ability to understand and incorporate unique characteristics, and to show sensitivity in doing so, could be key to public acceptance if the police were to adopt such technology. Each situation the police are presented with will have its own specific qualities that need to be taken into consideration, and people may be reluctant to accept that a machine can do this adequately.

The phrase “algorithm aversion” (Dietvorst et al., 2015) has been used to capture why people might be wary of, or opposed to, the use of algorithmic decision-making in a context such as policing. Burton et al. (2018) set out five sets of reasons behind such aversion, including false expectations that shape responses to algorithmic decision-making (for example the idea that error is systematic, “baked in” and therefore irreparable); concerns about a lack of decision control; and an emphasis on the need for human decision-making in contexts marked by uncertainty, where “alternatives, consequences, and probabilities are unknown and optimization is unfeasible” (ibid: 226)—i.e. where people feel there is not enough formalised information to make algorithmic decision-making a plausible option.

Relatedly, the importance of trust in generating acceptance of AI and related technologies has also been emphasised. Drawing on Mayer et al.’s (1995: 712) widely cited definition of trust, “the willingness of a party to be vulnerable to the actions of another party based on the expectation that the other will perform a particular action important to the trustor, irrespective of the ability to monitor or control that other party”, Glikson and Woolley (2020: 629) argue that trust among users will predict the extent of reliance on a new technology, and that this can take positive and negative forms: “Low trust in highly capable technology would lead to disuse and high costs in terms of lost time and work efficiency … whereas high trust in incapable technology would lead to over-trust and misuse, which in turn may cause … undesirable outcomes”. Based on a systematic review, they outline the characteristics of trustworthy AI, including tangibility or presence, reliability, transparency, immediacy (e.g. the ability to respond to human presence and speech) and, under some conditions, anthropomorphism. AI technology that does not display these characteristics is less likely to be trusted, and its use is thus less likely to be supported. Of particular note, given the vignettes used in the experiment described below, is Glikson and Woolley’s (ibid) suggestion that in the context of “embedded AI”, where there is no visual representation or “identity”, reliability and transparency are likely to be particularly important factors.

Public perceptions of police use of new technology

Traditional UK policing relies heavily on the Peelian ideology of policing by consent, in which public views of police legitimacy and trustworthiness are based on transparency about, and integrity in, the use of police powers, accountability and justice. On this account, while police should, and indeed must, embrace new technology, they also need to understand the ethical issues arising from doing so. There is also a strong need to test the acceptability of new technologies since implementing new tools that transgress boundaries of appropriateness (Huq et al., 2017; Trinkner et al., 2018) risks significant damage to public trust.

There are important issues of privacy, fairness and accountability involved when policing relies on algorithmic technology to inform decisions that can have an impact on the liberty of the individual or on whether they receive outcomes they favour (Mackey, 2020). In the context of police decision-making, procedural justice has been found to be a key factor in generating public trust and police legitimacy (Mazerolle et al., 2013). Procedural justice relates to the fairness, consistency and accuracy of the decision-making process, and to the quality of interpersonal interaction across dimensions of dignity, respect and voice. It has been found to be central to generating support for decisions, sometimes irrespective of their favourability to the people involved (Brockner & Wiesenfeld, 2005; Brown et al., 2019), satisfaction with the decision-maker and the outcome achieved (Tyler, 2006), and other outcomes including institutional trust, legitimacy and compliance (Tyler & Jackson, 2014). The literature on policing—rather unlike that on work organisations (e.g. Colquitt, 2001)—also regularly finds that procedural justice is more important than distributive justice (typically defined as the fair allocation of policing outcomes across aggregate social groups) in shaping people’s responses to authority (Hinds & Murphy, 2007; Reisig et al., 2007; Sunshine & Tyler, 2003; Tankebe, 2009; Tyler & Wakslak, 2004). One reason for this may be that, while in employment situations people can often see the outcomes provided to others (i.e. co-workers), this is less often the case in policing, where the wider outcomes achieved by police are often hidden from those involved in any one interaction. Indeed, this may lead people to infer distributive justice from procedural justice in policing contexts (Solomon & Chenane, 2021; cf. van den Bos et al., 1997a, b).

According to procedural justice theory, the effect or outcome of officer decision-making is therefore only one factor driving public trust and support for police actions. Perceptions of the nature of the decision-making process and the quality of interpersonal treatment can be equally if not more important; indeed, a “good” process can make up for a “bad” outcome. People tend to support police decisions they believe have been made in the right way, even if the outcome of those decisions is not favourable to them (Tyler, 2006). One important reason for this is that procedural justice generates a sense of motive-based trust in the decision-maker—that they are at least trying to do the right thing for the right reasons and have the interests of the trustor in mind—and this mitigates the effect of any failure to actually achieve desired ends (Tyler & Huo, 2002).

It might be imagined that procedural justice would be a prominent feature of algorithmic decision-making, and of people’s perceptions of it, since this technology follows the same set of rules and procedures every time. Furthermore, algorithms can sometimes be perceived as higher in quality and objectivity (Sundar & Nass, 2001) because they are not influenced by emotional factors or overt biases. All this should enhance trust. However, trust is often lower in algorithmic than in human decision-making because people do not believe that algorithms have the ability to learn from their mistakes (Dietvorst et al., 2015) or the capacity to successfully execute a task (Lee, 2018). The mere fact that decisions are made by algorithms rather than by humans may influence perceptions of those decisions, regardless of the quality of the outcomes they produce (Lee, 2018).

Concerns about algorithmic fairness are not, of course, unfounded. Algorithms are developed by programming a set of parameters, which are necessarily founded in and shaped by the values and interests of their designers—values and interests that, inescapably, become built into the process (Brey & Søraker, 2009). Evidence has shown that algorithmic decisions can not only counteract and expose biases, but also afford new mechanisms for introducing biases with unintended and detrimental effects (Mittelstadt et al., 2016). Algorithmic decisions have been shown to amplify biases and unfairness embedded in data in relation to sensitive features such as gender, culture and race (Shrestha & Yang, 2019). Used in policing contexts, algorithmic decision-making has the potential to “compound the crisis of unfair treatment of marginalised communities … [predictive policing] provides a front of allegedly ‘impartial’ statistical evidence, putting a neutral technology veneer on pre-existing discriminatory policing practices” (Couchman, 2019: 15). When algorithms are trained on historical data, past discrimination and stereotypes prevalent in the organization and in society can be reflected in their predictions (Barocas & Selbst, 2016; Grimshaw, 2020; Sweeney, 2013). The application of algorithmic decision-making derived from historical data with embedded biases may thus undermine people’s sense that the police act impartially and in a neutral fashion.

While it may seem, then, that algorithmic decision-making tools will make police decisions fairer—because they remove the potential for human bias—this is by no means a given, and there is much to suggest that bias can be “built in” to the process. There is evidence that these sorts of concerns have filtered into “public consciousness”: Araujo et al. (2018), for example, found that their Dutch respondents expressed concern that algorithmic decision-making may lead to misuse or cause worry, and over half of the respondents thought that the technology might lead directly to unacceptable outcomes. More importantly for the current study, perhaps, it also seems that people tend at the very least to be cautious about machine-driven decision-making processes and, in many cases, seem to prefer the involvement of a human being. While a system using algorithmic technology might be fully compliant with formal regulations, it may lack the “social license” (Brown et al., 2019) required for acceptance by the community and stakeholders within which it operates. Without this acceptability, the decisions made and outcomes achieved may not be tolerated, particularly, we might suggest, when desired ends are not achieved. Because decisions made by algorithms are opaque, lack the involvement of an identifiable human actor and are harder to trust, there is less chance for process-based factors to mitigate the effect of outcome failure.

These issues are made yet more salient by the fact that trust is vital for the acceptance of new technology, in policing and elsewhere. People who trust the police are more willing to accept their vulnerability in relation to police—to place their trust in police—because they expect officers to be willing and able to behave fairly and effectively (Hamm et al., 2017). Trust (and/or associated constructs such as confidence and legitimacy) appears to be an important factor shaping public support for police powers, including the potential for, and use of, force by police (Bradford et al., 2017; Kyprianides et al., 2021), as well as new technologies such as Body Worn Video (Lawrence et al., 2021) and the use of “big data” (Lee & Park, 2021). Trust can act as a heuristic, providing a mental shortcut towards a decision or judgement in situations where people know little about the power or technology in question (which is often the case in policing and when AI is involved). Yet such trust is not “free-floating”, nor does it exist entirely prior to people’s encounters—of whatever kind—with police and exposure to the judgements they make. Rather, trust is developed during experiences of policing (Oliveira et al., 2020), not least because these provide people with some information about police activity, its fairness and its effectiveness. As research on algorithm aversion implies, being exposed to apparently effective, justified and/or appropriate AI decision-making should build trust, making people more likely to accept the use of this technology (Glikson and Woolley, 2020).

The current study

Although police organisations are increasingly turning towards automated systems, there is as yet very little evidence on how the public will respond. We do not know whether people trust algorithmic decision-making in policing, whether trust forms a basis for judging the use of algorithms (un)acceptable or indeed whether being exposed to the use of this technology in policing makes people more or less likely to accept it. Do people trust algorithmic police decision-making and do they generally approve of algorithmic policing? It is this gap that the current paper seeks to address.

We conducted an online experiment using text-based vignettes to explore perceptions of police use of algorithmic decision-making. We manipulated three factors in the vignettes. First, we manipulated whether an operational decision was made by a human (i.e. a police officer) or an algorithm. The research outlined above would suggest that while some may view algorithmic decision-making as accurate, it is more likely that people will on average be less accepting of decisions made purely by an algorithm, which they may perceive as unable to take full account of the complex characteristics of particular situations (Longoni et al., 2019) marked by inherent uncertainty (Burton et al., 2018), lacking in intuition (Lee, 2018) and/or opaque (Glikson and Woolley, 2020). Thus, we test the following hypothesis:

  • H1: Participants will view police decision-making as more trustworthy, and less biased, when the decisions are made by a human (i.e. a police officer) compared to an algorithm.

Second, we manipulated the outcome of the operational decision (i.e. a reduction in crime or no change to crime levels). Previous research suggests the public are particularly concerned about the use of algorithms resulting in “unacceptable outcomes” (Araujo et al., 2018), and introducing biases and issues of accountability (Mittelstadt et al., 2016) and, in a general sense, about their reliability (Dietvorst et al., 2015; Glikson and Woolley, 2020). Thus, we test the following hypothesis:

  • H2: Participants will view police decision-making as more trustworthy, and less biased, when the outcome of the decision is successful (i.e. a reduction in crime). This will be particularly true for decisions made by an algorithm; outcome success should be relatively less important in cases where the decision is made by an officer.

Third, we manipulated the type of scenario described in the vignette, which was either (1) an individual police officer in a localised situation (i.e. a stop and search decision) or (2) an area-based decision where a senior officer has to decide whether to allocate resources to a crime hotspot. While the second scenario represents most closely the way algorithmic technology is currently used within police forces in England and Wales, it would seem important to test the acceptability of algorithmic decision-making across a range of use scenarios. However, as previous research does not clearly indicate whether public perceptions will vary depending on whether the decision impacts individuals (e.g. being stopped and searched by police) or neighbourhoods (e.g. allocation of police resources), we have no a priori hypothesis for this condition.

Fourth, we consider whether being exposed to an instance of algorithmic decision-making that was judged as fair and trustworthy was also linked to greater acceptance of police use of this new technology:

  • H3: Participants who view police decision-making as more trustworthy, and less biased, will be more likely to support the police use of algorithmic technology.

At the threshold, and in the context of the experiment described below, support for police use of algorithmic technology could be triggered by trust in police decision-making developed in one (or some combination) of three ways. First, human decision-making could be seen as more trustworthy than AI decision-making, meaning people in the “human” conditions may be more generally supportive of police use of algorithmic technology. Second, successful decision-making could be seen as more trustworthy than unsuccessful decision-making, meaning people in the “success” conditions may be more generally supportive of police use of algorithmic technology. Third, it may be that being exposed to successful algorithmic decision-making, specifically, generates wider acceptance of the technology. We explore these possibilities below.

Method

Participants

A total of 642 residents in the UK were recruited via the online crowdsourcing platform Prolific Academic on 16 November 2020. Participants were aged between 18 and 84 years old, with the majority (55%) aged between 18 and 34 years old. Females accounted for two-thirds (68%) of the participants. Some 511 participants (80%) reported their ethnic group to be White-British, White-Irish or any other White background; 9% (57) were Asian or Asian British, 5% (29) were Black or Black British, 3% (20) were mixed and 2% (15) were other. There were no significant differences in demographics across the experimental conditions. In line with Prolific recruitment protocols, participants were paid £6.02/h (£0.88) for taking part in the study.

Procedure

The online platform Qualtrics was used to build and host the experiment. We conducted two pilot studies with a total of 440 participants, which confirmed that participants understood the scenarios and were able to appropriately answer questions based on what they had read. The experiment then used a 2 (scenario: individual vs area) × 2 (decision-making: human vs algorithm) × 2 (outcome: successful vs unsuccessful) between-subjects design.

First, participants were randomly allocated to read one of two scenarios that described either:

  • Individual—an incident in which a single police sergeant observes suspicious males and has to make a decision about whether or not to conduct a stop and search (adapted from Ferguson, 2017).

  • Area based—a scenario in which a crime hotspot has been identified and a police inspector has to make a decision about whether or not to direct officers to the crime hotspot for increased proactive policing (e.g. more stop and searches), thus leaving fewer resources elsewhere.

Participants were also randomly allocated to one of two decision-making conditions in which the sergeant or inspector made the operational decision, either:

  • Human—using their own knowledge, observations and expertise.

  • Algorithm—using a new piece of algorithmic technology incorporating a range of data/information.

In all conditions, the sergeant/inspector makes the operational decision to act (i.e. to conduct the stop and search/allocate resources to the crime hotspot). After reading the vignette, participants were randomly allocated to read one of two outcomes:

  • Successful—in the individual scenario, the sergeant recovers items that could be used to commit a crime after conducting the stop and search. In the area-based scenario, the allocation of resources to the crime hotspot reduces crime in that area by 16%.

  • Unsuccessful—in the individual scenario, no items are recovered from the stop and search. In the area-based scenario, the allocation of resources to the crime hotspot has no impact on crime levels.

Following the vignette, participants responded to a range of questions about the decision made by the police officer described in the vignette (i.e. whether they trusted the officer had made an effective decision, was competent and unbiased in their decision-making). Participants then responded to questions about the use of decision-making technology by the police and their knowledge of algorithms.

Constructs and measures

Confirmatory factor analysis in the package MPlus 7.11 was used to derive and validate three latent variables that comprise our dependent variables (factor scores were obtained and saved for analysis). A robust maximum likelihood approach (MLR) was used (see Appendix Table 2 for a list of the items used, factor loadings and model fit statistics). The first factor—trustworthy decision-making—consisted of six items measured on a 5-point agree/disagree scale and capturing whether participants thought the officer in the vignette dealt with the situation effectively and made the appropriate decision (e.g. “I would feel confident in the decision Sergeant/Inspector McFadden made” and “The Sergeant/Inspector took the most appropriate action to the situation”).

The second factor—fair decision-making—consisted of three items and measured whether participants thought the officer made a fair and unbiased decision (e.g. “Sergeant/Inspector McFadden’s decision making was impartial”).

The third factor—use of technology—consisted of five items and measured whether participants had confidence in the police use of technology (e.g. “I feel confident that technology/algorithms are accurate in the decisions they make” and “Police use of algorithms will make it easier for the police to catch criminals”). Immediately before answering these questions, participants in the officer decision-making conditions were provided a short summary of police use of algorithms, similar to that provided in the vignettes read by participants in the algorithmic decision-making conditions.
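As an illustration of this measurement approach, the following is a minimal sketch of how three latent factors of this kind could be specified, and factor scores extracted, in Python using the semopy package. The item names and data file are hypothetical, and the analysis reported above was conducted in MPlus with robust maximum likelihood rather than with this code.

    import pandas as pd
    from semopy import Model, calc_stats

    # Hypothetical item-level survey data (5-point agree/disagree responses).
    items = pd.read_csv("survey_items.csv")

    # Lavaan-style measurement model for the three latent constructs.
    measurement_model = """
    trustworthy =~ trust1 + trust2 + trust3 + trust4 + trust5 + trust6
    fair        =~ fair1 + fair2 + fair3
    technology  =~ tech1 + tech2 + tech3 + tech4 + tech5
    """

    cfa = Model(measurement_model)
    cfa.fit(items)            # maximum likelihood estimation
    print(cfa.inspect())      # factor loadings and (co)variances
    print(calc_stats(cfa))    # model fit statistics (CFI, RMSEA, etc.)

    # Factor scores saved for use as dependent variables in later analyses
    # (predict_factors is available in recent semopy versions).
    scores = cfa.predict_factors(items)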

Results

To test H1 and H2, we conducted a series of 2 (scenario: individual vs area) × 2 (decision: human vs algorithm) × 2 (outcome: successful vs unsuccessful) between-subjects ANOVAs with the two latent variables (trustworthy decision-making and fair decision-making) as the dependent variables. Table 1 presents the descriptive statistics for each condition.
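For readers who wish to see the structure of this analysis, the following is a minimal sketch of how such a three-way between-subjects ANOVA could be run in Python with pandas and statsmodels. The file name, column names and factor coding are hypothetical, and the original analysis was not necessarily conducted with these tools.

    import pandas as pd
    import statsmodels.api as sm
    from statsmodels.formula.api import ols

    # Hypothetical data: one row per participant, with the saved factor score
    # for trustworthy decision-making and the three experimental factors.
    df = pd.read_csv("experiment_data.csv")  # columns: trustworthy, scenario, decision, outcome

    # 2 x 2 x 2 between-subjects ANOVA on the trustworthy decision-making score.
    model = ols("trustworthy ~ C(scenario) * C(decision) * C(outcome)", data=df).fit()
    print(sm.stats.anova_lm(model, typ=2))  # F-tests for main effects and interactions

    # Cell means and standard deviations by condition (cf. Table 1).
    print(df.groupby(["scenario", "decision", "outcome"])["trustworthy"].agg(["mean", "std"]))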

Table 1 Descriptive statistics for dependent variables by condition

Trustworthy decision-making

First, consistent with H1, there was a significant main effect of decision-making, F(1, 634) = 25.64, p < 0.001. Participants were more trusting of decisions made by a human (M = 0.155, SD = 0.780) than of decisions made using an algorithm (M = −0.156, SD = 0.879). Second, consistent with H2, there was a significant main effect of outcome, F(1, 634) = 56.39, p < 0.001, with participants granting more trust when the outcome of the decision was successful (M = 0.232, SD = 0.741) than when it was unsuccessful (M = −0.232, SD = 0.879). However, we found no significant interaction between decision-making and outcome, F(1, 634) = 0.116, p = 0.733. Across both the human and algorithm decision-making conditions, participants were more trusting of decisions with a successful outcome than of those with an unsuccessful one.

Third, we found a significant main effect of scenario, F(1, 634) = 13.07, p < 0.001. Participants were more likely to trust the decision made in the individual scenario (M = 0.111, SD = 0.835) than that made in the area-based scenario (M = −0.112, SD = 0.841). There was a significant interaction between scenario and decision-making on trust, F(1, 634) = 21.95, p < 0.001 (see Fig. 1). In the individual scenario, the decision-making method made no difference to participants’ levels of trust: participants were equally trusting of decisions made using human experience (M = 0.122, SD = 0.853) and those made using an algorithm (M = 0.100, SD = 0.819). However, in the area-based scenario, who or what made the decision mattered. Here, participants exhibited far greater trust when the decision was made using human knowledge and experience (M = 0.188, SD = 0.700) than when it was made by an algorithm (M = −0.414, SD = 0.865). There was no significant interaction between scenario and outcome, F(1, 634) = 0.034, p = 0.853, nor was there a significant three-way interaction, F(1, 634) = 0.105, p = 0.746.

Fig. 1 Interaction between scenario and decision-making on trustworthy decision-making

Fair decision-making

One of the biggest concerns about incorporating algorithms into operational decision-making is the potential for biases to become embedded in ways that are difficult to identify. Inconsistent with H1, we found no main effect of decision-making on the perceived fairness of the decision, F(1, 634) = 1.37, p = 0.242. Participants in the algorithm condition (M = −0.035, SD = 0.801) were just as likely to think the officer’s decision was unbiased as participants in the human condition (M = 0.034, SD = 0.732). Yet, consistent with H2 and the findings for trustworthiness above, there was a significant main effect of outcome, F(1, 634) = 12.78, p < 0.001: decisions with a successful outcome (M = 1.07, SD = 0.754) were considered significantly less biased than decisions with an unsuccessful outcome (M = −1.07, SD = 0.766). Again, there was no significant interaction between decision-making and outcome, F(1, 634) = 0.275, p = 0.600. Across both decision-making conditions, participants were more likely to perceive decisions with a successful outcome as fair.

Unlike the findings for trustworthy decision-making above, there was no significant main effect of scenario, F(1, 634) = 0.002, p = 0.966. However, there was a significant interaction between scenario and decision-making on perceptions of fair decision-making (see Fig. 2), F(1, 634) = 8.76, p = 0.003. In the individual scenario, participants were more likely to think the decision made by the algorithm was fair (M = 0.053, SD = 0.839) than the decision made by a human (M = −0.055, SD = 0.785). In contrast, in the area-based scenario, participants were more likely to think the decision made by a human was unbiased (M = 0.125, SD = 0.663) than that made by the algorithm (M = −0.123, SD = 0.754). Again, there was no significant interaction between scenario and outcome, F(1, 634) = 0.389, p = 0.533, and no significant three-way interaction, F(1, 634) = 0.011, p = 0.915.

Fig. 2 Interaction between scenario and decision-making on fair decision-making

Support for police use of algorithms

The above results indicate that participants were more likely to perceive the police to have made a trustworthy, competent and unbiased decision when the decision was made by a police officer (human) and when the outcome of the decision was successful. But did the apparent trustworthiness of the decision affect support for police use of AI technology in a wider sense? As a first step towards addressing H3, we conducted the same 2 (scenario: individual vs area) × 2 (decision: human vs algorithm) × 2 (outcome: successful vs unsuccessful) between-subjects ANOVA, this time with support for police use of algorithmic technology as the dependent variable (see Table 1).

We found a significant main effect of decision-making on support for police use of algorithms, F(1, 634) = 5.72, p = 0.017. Participants exposed to the algorithm vignette (M = 0.093, SD = 0.987) showed more subsequent support for police use of algorithms than participants exposed to the human vignette (M = −0.092, SD = 0.994). There was no significant main effect of outcome (successful vs unsuccessful); however, there was an interaction between decision-making and outcome that was significant at the p < 0.10 level, F(1, 634) = 3.77, p = 0.053 (see Fig. 3). In the algorithm condition, participants were significantly more supportive of police use of technology after being exposed to a vignette with a successful outcome (M = 0.220, SD = 0.983) than with an unsuccessful outcome (M = −0.034, SD = 0.976). In contrast, in the human condition, the outcome made no difference to participants’ subsequent support for police use of algorithms (successful M = −0.115, SD = 1.00; unsuccessful M = −0.070, SD = 0.991).

Fig. 3 Interaction between outcome and decision-making on support for police use of algorithms

There was a main effect of scenario, F(1, 634) = 5.74, p = 0.017. Participants exposed to the area scenario (M = 0.093, SD = 0.978) showed more support for police use of algorithms than participants exposed to the individual scenario (M = −0.092, SD = 1.00). There was a significant interaction between scenario and decision-making, F(1, 634) = 6.33, p = 0.012 (see Fig. 4). In the algorithm condition, the scenario made no difference to participants’ support for police use of technology (individual M = 0.099, SD = 0.989; area M = 0.088, SD = 0.987). In the human condition, participants were more supportive of police use of algorithms when exposed to the area-based scenario (M = 0.099, SD = 0.971) than to the individual scenario (M = −0.281, SD = 0.983). There was no significant interaction between scenario and outcome, F(1, 634) = 1.43, p = 0.232, and no significant three-way interaction, F(1, 634) = 1.14, p = 0.286.

Fig. 4 Interaction between scenario and decision-making on support for police use of algorithms

These findings indicate that exposure to a scenario in which the police used algorithmic technology to make a successful decision led participants to be more supportive of the general use of algorithms within policing. According to H3, however, this should be because exposure to a successful use case increases trustworthiness, which in turn generates support. We used structural equation modelling (SEM) in MPlus 7.11 to test the associations between perceptions of police decision-making (as trustworthy and fair) and support for police use of algorithmic technology, and whether these perceptions mediated the link between the outcome condition and support. Here, we focus only on participants in the algorithm condition since, as the analysis above suggests, they were the ones who had been exposed to a scenario from which they could make some judgement about the apparent trustworthiness of police in this area. Support for police use of algorithmic technology was regressed on trustworthiness and fairness of decision-making, which were in turn regressed on the outcome condition (successful vs unsuccessful). Figure 5 presents the standardized path coefficients.
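To make the structure of this mediation model concrete, here is a minimal sketch of how it could be specified in Python with semopy. The variable names, the outcome dummy coding and the data file are hypothetical, and the reported analysis was run in MPlus rather than with this code.

    import pandas as pd
    from semopy import Model

    df = pd.read_csv("experiment_data.csv")        # hypothetical file
    alg = df[df["decision"] == "algorithm"]        # algorithm-condition participants only

    # Support regressed on perceived trustworthiness, fairness and the outcome dummy;
    # trustworthiness and fairness regressed on the outcome condition (successful = 1).
    mediation_model = """
    support     ~ trustworthy + fair + outcome_success
    trustworthy ~ outcome_success
    fair        ~ outcome_success
    """

    sem = Model(mediation_model)
    sem.fit(alg)
    # std_est requests standardized estimates in recent semopy versions (cf. Fig. 5).
    print(sem.inspect(std_est=True))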

Fig. 5 SEM predicting support for police use of algorithms

As shown in Fig. 5, both trustworthiness of decision-making (B = 0.377, p < 0.001) and fairness of decision-making (B = 0.277, p = 0.002) were significant predictors of support for police use of algorithms. In other words, respondents who felt that the police had used a particular algorithm to make a competent, effective and fair decision were more likely to support police use of new algorithmic technology. Turning to the outcome condition, as above, we find that participants exposed to a scenario where the outcome was successful were significantly more likely to grant trust (B = 0.549, p < 0.001) and to believe the officer had made an unbiased decision (B = 0.255, p = 0.044). Conditioning on these associations, there was no direct effect of outcome (successful vs unsuccessful) on participants’ support for police use of technology (B = −0.005, p = 0.966). In other words, all of the association between outcome and support was mediated by perceptions of fairness and, in particular, trustworthiness.

Discussion

Algorithms now pervade our lives. They determine the news we see and the products we buy, and they shape many areas of our economy and society, where new technologies and data-driven tools are being adopted in the pursuit of effectiveness and efficiency. Policing is not exempt from this process, with algorithmic decision-making and AI used more and more across multiple operational contexts. As police organisations increasingly turn towards automated systems, ethical questions arise about the police use of these new technologies—including bias and discrimination (cf. Barocas & Selbst, 2016) and the lack of transparency and accountability (cf. Citron & Pasquale, 2014)—as well as concerns about the need to maintain the public’s trust and confidence (Mackey, 2020). Yet, public reactions to the introduction of police use of algorithmic decision-making are not yet fully understood. This study sought to address this gap.

To return to the hypotheses that motivated our analysis, we found partial support for H1. Overall, people were more trusting of decisions made by a police officer compared to an algorithm. However, this effect was only present in the area-based scenario (i.e. the allocation of resources to a crime hotspot). In the individual scenario (i.e. the stop and search encounter), the decision-making method made no difference to the perceived trustworthiness of the decision. A similar pattern of results was found when looking at the perceived fairness of the decision. In the area-based scenario, participants were more likely to think the decision was fair when it was made by a police officer compared to an algorithm. In the individual scenario, there was little difference across the two decision-making conditions (although participants were slightly more likely to think the decision was fair when it was made by the algorithm).

We also found support for H2. Across all conditions, when the outcome of the decision was successful, participants demonstrated higher levels of trust in police decision-making and perceived the decision as less biased, compared to when the decision led to an unsuccessful outcome. However, contrary to our expectations, outcome effectiveness was apparently no more (or less) important to participants in the algorithmic decision-making conditions.

Lastly, we found support for H3. Specifically, participants who were exposed to successful algorithmic decision-making expressed more support for police use of this technology, and this seemed to be because they perceived the police as being more trustworthy and fair in their decision-making (at least in comparison to those in the unsuccessful algorithmic conditions). If they are to offer their support, it is essential the public believe that any new technology introduced by the police would be effective and used appropriately. Previous research has shown that when people have trust in the police, they are more accepting of changes in the tools police use (Bradford et al., 2020). At the core of the concept of trust is a willingness to accept vulnerability in relation to the trust object (PytlikZillig & Kimbrough, 2015). What we see here may be a reflection of the fact that trust in the police is partly rooted in direct and vicarious experiences of policing (Bradford et al., 2009; Oliveira et al., 2020), which can have important implications for people’s acceptance of wider powers and policies. If people experience a particular instance of policing as trustworthy—a judgement shaped, here, by the outcome it achieves—they are more likely to support the use of powers about which, it is important to note, they are likely to know very little. Trust does, indeed, seem to be used as a heuristic.

Taken together, though, our findings suggest that the public still “prefer” decisions to be made by police officers rather than algorithms. This seems to be especially true for decisions that impact a community or neighbourhood, compared to decisions that impact an individual during a one-on-one encounter such as a stop and search. The finding that people prefer human decisions fits with theoretical perspectives suggesting that more weight is often placed on advice when it is given by a human expert than when the same advice is given by an algorithm (Dietvorst et al., 2015). Both may be opaque, but at least with a human, one can infer trustworthy motives. When one assumes that the other is taking one’s own interests into account, one gives the decision-maker the “benefit of the doubt”.

Yet, although many of the results here reflect the “reluctant” viewpoints often associated with the acceptance and trustworthiness of algorithmic technology within the medical profession, there are also some differences. In particular, medical patients voice concerns that decision-making technology may neglect their unique characteristics, circumstances and symptoms (Longoni et al., 2019). By contrast, in our individual scenario—a face-to-face encounter—the use of an algorithm was just as acceptable as a decision made by a human. Arguably, this type of situation is the one most similar to people’s experiences of healthcare, where one might expect the concern that decision-making technology will neglect individuals’ unique characteristics and circumstances to be most acute. However, the widespread and well-documented bias and disproportionality evident in the UK criminal justice system, including in stop and search encounters (Ashby, 2020; Police Foundation, 2020), may mean the British public is particularly attuned to issues of human bias, including both overt racism and unconscious bias. Because some or indeed many people are aware that there is a current issue with disproportionality in stop and search, they may be more open to the idea of decisions being “taken over” by machines.

Indeed, public perceptions about the fairness of police decision-making appear more nuanced than first thought, with the type of scenario being particularly important. Despite following the same set of rules and procedures every time, algorithmic technology has the potential to amplify biases and unfairness embedded in data (Mittelstadt et al., 2016; Shrestha & Yang, 2019). This could explain why participants felt that, when a decision was being made about a community or neighbourhood (the area-based scenario), the use of an algorithm would lead to more biased decision-making. It may, however, be implausible to suggest the average person is sufficiently aware of algorithmic decision-making processes to draw these kinds of conclusions. Another possibility is that decisions that affect whole areas are viewed as more serious than those affecting only individual people, and this in effect raises the bar, leading people to prefer that a human actor makes the choice.

We have demonstrated here that there remains a scepticism among the public about the use of algorithmic technology, which is likely to be fuelled by ethical concerns about the potential effects of this new capability. Given this, it might seem rather paradoxical that we also found that respondents exposed to an apparently successful use case of algorithmic decision-making were more likely to support police use of this power. This seems likely to reflect (a) the complexity and fuzziness of people’s opinions on the issues at hand, but also (b) the way those opinions are shaped by experiences of policing (which may be trust building or trust undermining). Coming to the question “cold”, people preferred a human decision-maker. But having been presented with an example of apparently successful AI decision-making, those in the relevant experimental condition were more likely to support wider police use of this technology than either those exposed to an unsuccessful use case or those not exposed to an example of AI decision-making at all. Crucially, this support was forthcoming to the extent that they judged the police decision-making involved to be trustworthy.

Limitations and future work

Some limitations of the current research must be acknowledged. First, the hypothetical nature of the scenarios means they cannot fully capture the nuances of how police make decisions. We used scenarios that intentionally described a situation where a police officer used solely an algorithm, or solely their own knowledge and experience, to make a decision, in order to understand how these extreme scenarios might affect people’s views. In reality, in the UK, the kinds of decisions presented here—a stop and search encounter and the allocation of resources across a borough—would likely be made using a combination of decision methods. That said, some police agencies in the USA have already adopted algorithmic technology (e.g. Predpol) to predict when and where crime will occur. The decisions made by this technology are akin to the area-based scenario here, as Predpol identifies crime hotspots and directs resources to them. Our scenarios are also similar to those used by Ferguson (2017) when discussing issues surrounding the rise of “big data policing”.

Second, there are the typical concerns about reliability, generalizability and validity that arise from using a non-probability convenience sample recruited from a crowdsourcing platform. While the sampling methodology used is common in the study of public attitudes towards the police (e.g. Gerber & Jackson, 2017), the results are not representative of the general population. Additionally, by virtue of the nature of the research, experimental conditions and fictional vignette scenarios cannot fully replicate real instances of police decision-making, as influential factors relating to the complexity of the decision were not fully described here. Future investigation should explore these topics using more robust methodological approaches that employ stronger manipulations or that are based on real-world interventions. For example, participants could be exposed more directly to police decision-making or activity via CGI or virtual reality technology (Vasser & Aru, 2020), or deliberative polling methods could be used to create greater space for discussion of the inputs, risks, rewards and consequences of particular policy developments in this area.

Finally, we recommend researchers delve further into the idea that the context of algorithmic decision-making matters. Why do the public perceive algorithmic decision-making to be less trustworthy when the decision affects a whole neighbourhood or community? Examining more nuanced applications of algorithmic technology could better elucidate the particular situations in which these tools could be incorporated into operational police decisions while gaining the support and acceptance of a currently rather sceptical public.

Conclusion

The growth of AI, data-driven policing and algorithmic technology all pose potential ethical challenges to policing, and the proliferation of disinformation and the massive growth in the use of social media are providing new opportunities to question, at speed, how policing is done (Mackey, 2020). Policing methods that incorporate such technology need to be transparent and used sensitively, certainly at first only in very specific situations, if the public are to be supportive of such measures. There is a clear need to maintain the public’s trust in the use of data for decision-making, and our results suggest that the police still have some way to go to bring the public fully on board and gain their acceptance.

A key issue here may be that of accountability. For the police to be seen as trustworthy, there must be a clear and transparent chain of command and a decision-making process that can be audited. But identifying the human subjectivity embedded in algorithmic decision-making processes is difficult, with underlying values remaining obscured until a problematic case arises (Mittelstadt et al., 2016). Police leaders may struggle to explain what is going on “inside the box” (Mackey, 2020), not least because when harms are caused by algorithmic decisions, it can be difficult to locate their cause within the complex decision-making structures, hundreds of rules and probabilistic reasoning involved. In contrast to human decision-making, where an individual can usually articulate their decision-making process when required, the rationale of an algorithm is often incomprehensible to humans, making the fairness and accountability of decisions difficult to challenge (Mittelstadt et al., 2016; Vestby & Vestby, 2021). These technologies, and the opaque manner in which they are deployed, raise concerns that they may have unintended consequences and operate outside the scope of traditional oversight and public accountability mechanisms (Binns et al., 2018; Brown et al., 2019). Looking forward, it is important that the police think carefully about the situations in which they adopt algorithmic technology and follow a clear and transparent methodology that can be open to scrutiny when required. Equally, more work needs to be done to understand public reluctance to fully accept this transition to technology-driven decision-making.