1 Introduction

Tracing and isolation of people who were in contact with an infectious person (contact tracing) can be used to control the spread of communicable diseases [1, 2]. In the traditional understanding of contact tracing (CT), public health employees interview known carriers (index cases) of the disease and then track down people who had the type of close contact with the index case necessary to transmit the disease. Contacts are then diagnosed and isolated. This implementation is only suited for infections that spread relatively slowly, and where cases can be easily diagnosed [3]. SARS-CoV-2, with its unspecific symptoms, high number of asymptomatic carriers, and incubation times as short as a day, does not fit this mold.Footnote 1 Recent studies indicate that an outbreak of SARS-CoV-2 could be controlled using fast and efficient digital contact tracing (DCT) [4, 5]. DCT systems using cellphone applications based on Bluetooth proximity measurements are currently being developed and/or deployed in many countries [615]. Predicting the effect of DCT on an outbreak is challenging, especially since the values of many epidemiological parameters that describe the outbreak dynamics have not been determined accurately yet, with clinical studies yielding conflicting results. Almost no practical experience for DCT is available. Partly automated CT systems were in use during the Ebola outbreak 2014–2016 [16] and it turned out that the technical difficulties that come with that approach must not be underestimated.

Simulation studies focusing on classical CT for COVID-19 consistently indicate that around 70% of the contacts need to be traced, and that the tracing delay has to be as short as 1 day [5, 17]. For these reasons, [17] doubt that COVID-19 can be controlled by traditional CT in practice. Ferretti et al. [4] however point out that DCT could significantly reduce tracing delays, so that the outbreak could be controlled for tracing probabilities much smaller than 70%. Meanwhile, several other simulation studies indicate that a high tracing probability and a combination of fast CT and testing is required to control SARS-CoV-2 [16]. However, it also became clear that not only the reduction of the reproduction number, or the final size of the epidemic, need attention, but also the number of persons that go to quarantine [18]: A naive application of DCT leads to a situation that resembles a lock-down as a large fraction of the population is quarantined. An appropriate choice of the tracing and testing protocol is central.

We developed individual-based models with the Monte Carlo (MC) simulation technique, flanked and cross-checked by deterministic models, to evaluate the dynamics of a COVID-19 outbreak under different intervention protocols, focusing on DCT and DCT combined with random testing (administering a SARS-CoV-2 test to some fraction of the population at random, that is regardless of each person’s symptoms or contact history) and social distancing. We determine not just the immediate effective reproductive number \({R_{e}}\), but also the daily \({R_{e}}\), the number of healthy people in quarantine, and the number of people infected, for up to a year of continuous interventions. The sensitivity of all outcomes to the reported ranges of values of the epidemiological parameters is studied in detail.

The goal of this paper is to study quantitatively under which conditions and to which degree DCT combined with fast testing and social distancing can replace rigorous shelter-in-place policies for keeping the effective reproduction rate \({R_{e}}\leq 1\).

All abbreviations used can be found in Table 2.

2 Model inputs

Table 1 presents an overview of all model input parameters and the values considered for them. They will be discussed in detail in the following subsections.

Table 1 Key model input parameters and settings. The models are evaluated for all possible combinations of all parameter values shown in black. Parameter values in grey are used only with select other parameter combinations
Table 2 List of abbreviations

Many model inputs that are properly described by a distribution rather than just a mean value are modelled using a shifted Gamma distribution, defined as

$$\begin{aligned} G(x;\mu,\gamma,\beta ) = \frac{ (\frac{x-\mu }{\beta } )^{\gamma -1} \exp (-\frac{x-\mu }{\beta })}{\beta \Gamma (\gamma )}, \end{aligned}$$
(1)

where \(x\geq \mu \). The gamma distribution can describe the shapes of and interpolate between several other probability distributions and is widely used to model skewed distributions for which the underlying true probability distribution is not known [19, 20]. As this distribution is flexible, we also use a discretized version \((n\in \mathbb {N} _{0})\)

$$\begin{aligned} \hat{G}(n;\gamma,\beta ) = G(n;0,\gamma,\beta ) \Biggl(\sum _{i=0}^{\infty }G(i;0,\gamma,\beta ) \Biggr)^{-1}. \end{aligned}$$
(2)

2.1 Social contact structure

Our deterministic models describe contacts between individuals in a population by the usual mass action law. The stochastic models allow for a more detailed investigation. Particularly, two different strategies are considered: For the first, in the following referred to as homogeneous population, we choose the contacts for each person randomly out of the entire population. The probability for a person to have n unique contacts close enough to transmit a respiratory virus on a given day is taken from the empirical distributions reported in [21]. These are well described by

$$\begin{aligned} P_{\mathrm{social}}(n) = \hat{G}(n;\gamma =2,\beta ={n_{c}}/2), \end{aligned}$$
(3)

where \({n_{c}}\) is the mean number of contacts per day. We also consider a more realistic contact pattern, in the following referred to as social graph population. The population is described by a social graph, were each individual is represented by a node and contacts are represented by edges. Each individual is given a fixed set of contact persons for the entire simulation. We employ a modified version of the Lancichinetti–Fortunato–Radicchi benchmark graph (LFR) [22] as shown in [23] (therein referred to as LFR-BA). In this model, the population is divided into communities with sizes distributed according to a power-law distribution. Node edges are constructed according to the linear preferential attachment model [24] under the constraint that an average fraction of \((1-\mu )\) of the edges of each node connect nodes within the same community. A graph constructed in this manner results in a power-law probability distribution with index \(a=2\) for the node degree n [24]:

$$\begin{aligned} P_{\mathrm{social;SG}}(n; a, n_{\mathrm{min}}) \sim a\cdot n_{\mathrm{min}}^{a} n^{-(a+1)}, \end{aligned}$$
(4)

where \(n_{\mathrm{min}}\) is the minimum node degree. Here, the mean number of contacts per day is given by \({n_{c}}= \frac{a\cdot n_{\mathrm{min}}}{a-1}\). We assume that each edge is active once per day, so that the node degree corresponds to the number of contacts per day.

2.2 Epidemiological parameters

Figure 1 schematically shows the probability distributions for symptom onset, transmission, and true positive test results. The parameters are explained in the following paragraphs.

Figure 1
figure 1

Course of the disease with probability distribution for the incubation period \({T_{\mathrm {Inc}}}\) (top) [25] and the fully correlated probability density function for the contagiousness \({T_{\mathrm {con}}}\) (bottom) [26]. The dotted vertical lines corresponds to the median of \({T_{\mathrm {Inc}}}\). The probability for a true–positive point-of-care (POC) test is displayed on the bottom left (grey line). The diamonds correspond to exposure (contact, black), end of latency/begin of contagious period (magenta) and symptom onset (teal)

Incubation period P(\({T_{\mathrm {Inc}}}\) ). The distribution of incubation periods is taken as

$$\begin{aligned} P({T_{\mathrm {Inc}}}) = G({T_{\mathrm {Inc}}}, \mu, \gamma, \beta ) \end{aligned}$$
(5)

(Fig. 1 upper curve) with shape parameters chosen to match the curve reported in [25], which has median and mean incubation periods of 7 days and 7.44 days, respectively, with a range from 0–23 days. The authors included a relatively large group of patients (\(n= 587\)) with a wide age (0–90 years) and symptom (asymptomatic to severe) range. Other studies, albeit with smaller number of patients and with a bias towards more severe symptoms, have reported lower medians [27, 28]. Therefore, in addition to the parameters matching [25] (printed in black in Table 1) we also model a curve with shorter median and mean incubation periods of 3.6 days and 4 days (printed in gray in Table 1).

Latent period \({T_{\mathrm {lat}}}\). The latent period for SARS-CoV-2 is shorter than the incubation period, meaning pre-symptomatic transmission can occur [1, 29]. The latent period is difficult to determine empirically, as it requires exact information about the time of exposure and contagiousness. As, to our knowledge, no reliable, large-scale studies have been published on the latent period of SARS-CoV-2 so far, we use the measured contagiousness relative to the incubation time as an auxiliary means to infer latency,

$$\begin{aligned} {T_{\mathrm {lat}}}= \max \{{T_{\mathrm {Inc}}}-2.5, 0\}, \end{aligned}$$

where the value of 2.5 days comes from the transmission probability curve discussed in the next paragraph.

The transmission probability curve \(P_{\mathrm{trans}}({\tau })\). The transmission probability is the probability that a contagious person infects someone they have contact with. This probability is often given as an average “infectivity per day”, \({\beta _{i}}\), even though it changes significantly as a function of the time since infection τ. The infectivity was measured as a function of the time since onset of symptoms by He et al. [26]. They find that carriers become contagious approximately 2.5 days before the onset of symptoms, and that approximately 44% of transmissions occur during this pre-symptomatic phase.

We take as the contagious period \({T_{\mathrm {con}}}\) the time from the end of the latent period until 99% of the cumulative transmission probability is reached; a person is considered to be recovered afterwards. Therewith, the transmission probability as a function of time since infection is given by a scaled and truncated Gamma distribution (Fig. 1 lower curve). After infection, an individual has a (random) latent period \({T_{\mathrm {lat}}}\), during which the transmission probability is zero. Afterward, for \({\tau }={T_{\mathrm {lat}}}+t\) (and \(t\geq 0\)), we have

$$\begin{aligned} P_{\mathrm{trans}}({T_{\mathrm {lat}}}+t) = {T_{\mathrm {con}}}{\beta _{i}}G(t;\mu, \gamma, \beta ) \chi (t< {T_{\mathrm {con}}}), \end{aligned}$$
(6)

where χ is 1 if the condition in the argument is met, and 0 otherwise.

The shape parameters shown in black in Table 1 are our defaults, taken to match the curve in [26]. Pre-symptomatic infectivity is strongly debated. Therefore, we model a second shape where only 18% of the transmission occurs during the pre-symptomatic phase (shape parameter values printed in gray).

The course of the disease for asymptomatic carriers is the same as that for symptomatic carriers as shown in Fig. 1, and the incubation time is the time when symptoms would have started.

Fraction of asymptomatic cases \(1-\alpha \). The proportion of asymptomatic carriers (\(1-{\alpha }\)) described in the literature ranges from ∼4% [30] to ∼40%[31] of all cases. Initial reports for (\(1-{\alpha }\)) derived from testing of specific cohorts (cruise ship, returning travellers) ranged from 17–31% [29, 32, 33], however larger studies suggest even higher numbers. Analysis of the mass screening of the full population of the municipality of Vo’, Italy [31] report that 41.1% of the confirmed SARS-CoV-2 infections were asymptomatic (as defined by the absence of fever and/or cough). Ferretti et al. [4] analyzed 40 selected transmission pairs and also derived a value of 40% for the proportion of asymptomatic infected individuals. We model several different values for α to cover the reported ranges.

Reduced asymptomatic transmission probability \({\eta _{as}}\). While initial studies assumed that asymptotic cases were less contagious [4, 34], newer reports indicate that the viral load of asymptomatic cases is similar to symptomatic cases, which suggests similar contagiousness [3540]. We nevertheless introduce the parameter \({\eta _{as}}\in [0,1]\), which scales the transmission probability for pre- and asymptomatic cases, and vary that parameter to explore its effects. We note that when we model with \({\eta _{as}}<1\), we apply the scaling to both pre- and asymptomatic phases; since there most likely is no difference in viral load, if asymptomatic transmission is suppressed, this is likely due to circumstantial factors, like lack of coughing, which apply to the pre-symptomatic phase as well.

The basic reproductive number \({R_{0}}\). If the contagiousness is independent of the symptom status, i.e. \({\eta _{as}}= 1\), the reproductive number is given by the number of contacts during the contagious period, \({n_{c}}{T_{\mathrm {con}}}\), times the average probability to transmit the infection in one contact, \({\beta _{i}}\). However, we need to distinguish between symptomatic, pre-symptomatic, and asymptomatic cases in order to allow for \({\eta _{as}}<1\). We find

$$\begin{aligned} {R_{0}}= {n_{c}}\biggl[ &{\eta _{as}}\int _{{T_{\mathrm {lat}}}}^{{T_{\mathrm {Inc}}}} P_{\mathrm{trans}}(t) \,dt + \bigl((1- {\alpha }) {\eta _{as}}+ {\alpha }\bigr) \int _{ {T_{\mathrm {Inc}}}}^{{T_{\mathrm {lat}}}+{T_{\mathrm {con}}}} P_{\mathrm{trans}}(t) \,dt \biggr]. \end{aligned}$$
(7)

Note that—though \(P_{\mathrm{trans}}\) is a random function—the random part of the function is a pure translational offset (the latency period), s.t. the integral is deterministic.

\({R_{0}}\) is expected to be different in different populations because \({n_{c}}\) is different by factors of up to 4 just within the populations of different European countries. Values of \({R_{0}}\) ranging from 1.4 to 6.5 have been reported [45], though it is not always clear whether or not the reported \({R_{0}}\) is for the case where symptomatic individuals are quarantined. Furthermore, \({R_{0}}\) is usually not corrected for the contact rate in the population where it is studied. We consider \({R_{0}}\) the reproductive number without any form of interventions. We take \({R_{0}}\) of 3 (approximately the median reported in [45]) as our default, but also run the models for \({R_{0}}\) of 2 and 4.

Following [46], we introduce the random reproduction number for an individual \(R_{0,i}\). \({R_{0}}\) is the expectation value over the \(R_{0,i}\). For simplicity (and since two of our models are not based on continuous time but on discrete time/days), we state the time-discrete formula for \(P(R_{0,i}=n)\). Assume that an individual did infect n persons. These n persons can be arbitrarily distributed over the contagious period. In a slight abuse of notation, let \(T_{c}\) denote the number of days that a person is contagious for. Furthermore, let \(C_{n} = \{\vec{n}\in \mathbb {N} _{0}^{T_{c}}: \sum_{i=1}^{T_{c}} \vec{n}_{i} = n\}\) be all possible ways to distribute the n infectees to \(T_{c}\) days. Then, for a homogeneous population,

$$\begin{aligned} P(R_{0,i} = n) = \frac{1}{ \vert C_{n} \vert } \sum_{\vec{n}\in C_{n}} \Biggl[ \sum_{i=1}^{{T_{\mathrm {con}}}} \operatorname{Binomial} \bigl(\vec{n}_{i}, P_{\mathrm{trans}}( {T_{\mathrm {lat}}}+i), m\bigr) \circledast P_{\mathrm{social}}(m) \Biggr], \end{aligned}$$
(8)

where the convolution ⊛ is over the parameter m. This distribution is shown for some parameter combinations in Fig. 2.

Figure 2
figure 2

The distribution of the number of people a carrier infects (Eq. (8)) for 3 combinations of \({n_{c}}\) (the mean number of contacts per day) and \({\beta _{i}}\) (the average transmission probability per day) that result in \({R_{0}}=3\) (see Tab. 1) (left) and for a fixed \({n_{c}}\) but different social structures (right). Default values for the infection probability curve are used, \({\eta _{as}}=1\), and no interventions are applied. Compared to a Poisson distribution with mean of 3, the distribution is over-dispersed

2.3 Intervention protocols

The interventions considered here are (1) DCT, (2) quarantining, (3) testing, and (4) social distancing. Reported symptomatic cases are quarantined starting right at the beginning of the epidemic. The remaining interventions are turned on once a fraction \({f_{i}}\) of the population has become exposed.

(1) DCT. We assume that a fraction \({ p_{\mathrm {app}}}\) of the population uses the DCT system and that we can trace all contacts between users of this system with time delay \({T_{\mathrm {delay}}}\). In case that both infector and infectee have a DCT device, the probability for successful tracing is \({\eta _{\mathrm {DCT}}}\), while tracing always fails if either infector or infectee do not use the system. \({\eta _{\mathrm {DCT}}}\) accounts for situations where cell phones run out of battery, are not with the owner at all times, Bluetooth is turned off, or where an alert is ignored. The DCT system (DCTS) identifies contacts within the past \({\Delta T _{\mathrm {trace}}}\) days.

In the literature, the overall tracing probability across the population is often taken as \(p_{\mathrm{trace}} = p_{\mathrm{app}}^{2} {\eta _{\mathrm {DCT}}}\) (e.g. in [4]). We note that this formula is not correct but becomes approximately right if \({ p_{\mathrm {app}}}^{2} {\eta _{\mathrm {DCT}}}\) is small (see the Appendix).

To become an index case for tracing, a person must be reported. We assume that from the group of symptomatic carriers, a fraction \({f_{m}}\) sees a doctor to get tested with a reliable laboratory test and is then reported. A fraction (\(1-{\alpha }\)) of cases will go unreported because they do not exhibit symptoms, unless they get tested due to being traced. A fraction \({\alpha }(1-{f_{m}})\) of symptomatic cases will go unreported due to lack of access to medical tests. In the case where \({\eta _{as}}= 1\), α and \({f_{m}}\) are degenerate.

First order tracing refers to a protocol where contacts of an index case are traced. DCT also allows immediate tracing of contacts-of-contacts. We refer to this as second order tracing. If the \({\Delta T _{\mathrm {trace}}}\) is big enough, DCT will identify the infector. Second order tracing then can trace not just the people infected by an index case, but also the people who were infected by the same infector as the index case.

A DCT system will identify all contacts, regardless of their infection status. Since many models assume perfect accuracy in identifying only contacts that became infected (e.g. [4, 5, 47]), we run all parameters both with (closer to reality) and without (to be comparable to other models) tracing of the uninfected contacts.

All traced people immediately go into quarantine. This is necessary to suppress the pre- and asymptomatic transmission rates.

(2) Quarantining. Quarantining refers to any intervention that reduces the transmission probability significantly; this includes self-quarantine at home as well as being hospitalized. We assume that all reported symptomatic patients are immediately quarantined, regardless of any other interventions. This already reduces the reproductive number to

$$\begin{aligned} R_{e,Q} ={}& {n_{c}}\biggl[ {\eta _{as}}\int _{{T_{\mathrm {lat}}}}^{{T_{\mathrm {Inc}}}} P_{\mathrm{trans}}(t) \,dt \\ &{}+ \bigl((1-{\alpha }) {\eta _{as}}+ {\alpha }(1-{f_{m}})\bigr) \int _{{T_{\mathrm {Inc}}}}^{ {T_{\mathrm {lat}}}+{T_{\mathrm {con}}}} P_{\mathrm{trans}}(t) \,dt \biggr]. \end{aligned}$$
(9)

Figure 3 shows \(R_{e,Q}\) for combinations of α, \({f_{m}}\), and \({\eta _{as}}\). In Fig. 3 (top), \({\eta _{as}}= 1\), so only the product of α and \({f_{m}}\) is relevant, and results are shown for \({R_{0}}= 3\) and for \({R_{0}}= 2\). In Fig. 3 (bottom), \({\eta _{as}}<1\), so both \({R_{0}}\) and \(R_{e,Q}\) depend on α and on \({f_{m}}\). For each combination of α and \({f_{m}}\), the transmission probability \({\beta _{i}}\) was adjusted to obtain \({R_{0}}= 3\).

Figure 3
figure 3

The effective reproductive number reached just from quarantining reported symptomatic carriers, \(R_{\mathrm{e;Q}}\), is shown for four different values of \({\eta _{as}}\) (asymptomatic infectivity scaling) as calculated from Eq. (9). Top panel: In the case of \({\eta _{as}}= 1\), \(R_{\mathrm{e;Q}}\) depends only on the product of α (symptomatic fraction) and \({f_{m}}\) (fraction reported and tested) and is shown for two values of \({R_{0}}\). Lower three panels: For each combination of \({\eta _{as}}\) and α, the infection probability was adjusted to obtain \({R_{0}}=3\)

In response to being reported or being traced, people are quarantined by default for 14 days. Symptomatic cases may leave quarantine 8 days after the symptoms start. Uninfected contacts can leave quarantine early following a testing protocol.

(3) Testing. We consider two types of tests. A reliable laboratory test for symptomatic carriers seeing a medical professional, and a fast point of care (POC) test that can be performed at home or at mobile testing stations. In either case, carriers only test positive while they have a high enough viral load. We assume the tests have \(p^{\mathrm{true\ positive}} = 0.0\) while the carrier is in the latent period, that is up to approximately 2.5 days before symptom onset. The viral load rises quickly after the end of the latent period. We further assume that the laboratory test then has a true positive rate of 100% until the carrier has recovered. The POC test on the other hand has \(p^{\mathrm{max}}_{\mathrm{true\ positive}} = 0.9\) until approximately 5 days after symptom onset.Footnote 2 After this time, the true positive rate falls at the same rate as the transmission-probability curve; the true positive rate as a function of days since symptom onset is shown in Fig. 1 (bottom gray curve).

All people who are traced must be tested for two reasons: (a) A positive test result is the only way for asymptomatic individuals to become index cases for tracing, and index cases are needed for tracing to be effective, and (b) so uninfected traced people can be released from quarantine. Keeping all traced people in quarantine for the full quarantining duration means that a large fraction of the uninfected population may end up quarantined on any given day of the outbreak. We use the following release protocol: All traced people go into quarantine and get tested with a POC test. Regardless of the test result, everyone stays in quarantine, because the person may still be in the latent period. Those who tested negative on the first day are re-tested \({\delta T_{\mathrm {re\text{-}test}}}\) days later. If both tests were negative, the person may leave quarantine, but is tested again after another \({\delta T_{\mathrm {re\text{-}test}}}\) days in case they were still in the latent period when the second test was done.

In addition to testing in response to being traced, we simulate the option of randomly testing a fraction \({f_{\mathrm {RT}}}\) of the population each day. This is done with a testing protocol assumed to have a negligible number of false positives.

(4) Social Distancing. Social distancing includes both a reduction of the total number of contacts per day \({n_{c}}\) to \({n_{c}}\cdot {f_{\mathrm {SD}}}\), and limiting the maximum number of contacts per day. In the absence of second-order effects, and if the upper limit does not change the distribution mean significantly, this reduces the reproductive number to

$$\begin{aligned} R_{e,\mathrm{SD}} = {f_{\mathrm {SD}}}R, \end{aligned}$$
(10)

where R is the reproductive number without social distancing.

3 Models

Epidemiological modelling is a well established scientific discipline and different approaches, including contact tracing, are described in the rich literature [5053]. Epidemiological models that account for CT date back to the 1980s [54]. The main challenge to modelling a CT system is the individual-based character of CT, and the handling of the resulting stochastic dependencies between individuals. Individual-based simulation models [55] readily describe this process. For the scope of this paper, we developed two deterministic and two individual-based models. While, for reasons of brevity, most of the results that will be presented here come from the individual-based models, the redundant modelling approach served to cross-validate and understand the results.

3.1 Deterministic models

The early phase of an outbreak can be quantitatively described with compartmental models based on ordinary differential equations (ODE) [56] or with age-since-infection models [57].

The deterministic models used here bridge the different scales utilizing the mathematical analysis of the underlying, microscopic stochastic branching process with contact tracing. The effect of contact tracing on the removal rates is determined. These effective removal rates are then used in the deterministic models. Our first deterministic, compartmental model explicitly predicts the status (exposed/infectious) for a newly infected person, when he/she will eventually be traced. Eventually traced and never-traced individuals go to different compartments. In that, the (exponentially distributed) waiting times can be readily adapted. Particularly, the model is close to standard SEIR-models (see Fig. 4), and is feasible to analytical analysis (Appendix A.1). In contrast, the second model, based on age since infection, does not explicitly formulate an exposed and an infectious period. The basic assumption is that the state of an individual is a function of his/her age of infection, that is the time that has passed since he or she became infected. The structure is less pronounced, but it is possible to use transition rates that are more realistic (supplemental materials A.2). At the present time, the analytical treatment of the interdependence of contact tracing and correlations between infected individuals at the plateau phase of an epidemic is not well understood. Therefore, both models focus on the onset of the epidemics, where the reduction of the number of susceptible people by quarantine or recovery does not play a central role. The main outcomes of these two models are the doubling time \(T_{2}\) and effective reproductive number \(R_{\mathrm{eff}}\) for interventions starting on the first day of the epidemic, though both models are able to predict in a heuristic way the total course of the epidemic.

Figure 4
figure 4

Simplified structure of the compartmental model. The model is based on a SEIR-type model from Ref. [58] and distinguishes between untraced (gray) and traced (blue) individuals. A detailed description of the model and its variables can be found in Appendix A.1

3.2 Individual-based models

We developed two independent individual-based models (IBMs), which use the Monte Carlo (MC) technique to simulate social interactions, the progression of the viral disease, and interventions, at the level of individual people. The code for the models is available from [59] and [60]. The MC simulations proceed through the outbreak in steps of one day. Each day of the outbreak, every infected person not in quarantine has contact with a number of other people randomly drawn either from \(P_{\mathrm{social}}(n)\) (for a homogeneous population structure) or from the person’s social graph. The probability to infect each contact is given by \(P_{\mathrm{trans}}({\tau })\). When a contact becomes infected, the incubation time is drawn from \(P({T_{\mathrm {Inc}}})\). The intervention protocols are implemented as described in Sect. 2.

Figure 5 shows a chain of infections from one of the simulation runs. Each box represents a person, and arrows between boxes represent infections and tracing.

Figure 5
figure 5

Spread and containment of SARS-CoV-2 using tracing and quarantine interventions in a Monte Carlo simulation including 1st and 2nd order tracing. The downward arrow indicates the time axis. Shown is a small sub-tree of the full infection tree. This sub-tree starts with simulated person P936, who was exposed on day 147, and includes the infection chains for the first three generations of infectees. The input settings for the simulation were: \({R_{0}}= 2.0\), \({\Delta T _{\mathrm {trace}}}= 7\) days, \({ p_{\mathrm {app}}}= 0.6\), \({\eta _{\mathrm {DCT}}}= 0.9\), \({\alpha }= 0.6\). Each box represents a person. The leftmost column in each box gives the person ID and whether or not they use the DCT app. The middle column indicates the days when the person was exposed, became infectious, and recovered. The rightmost column indicates if and when the person was traced, reported, and quarantined

In this example, P936 is exposed to the virus on day 147 of the simulated epidemic and has a latent period of 5 days, but never develops symptoms (light blue background) or tests positive and is therefore never reported (R–). He or she infects three others—P576 on day 152, P747 on day 154, and P277 on day 155. All three infectees develop symptoms (purple background). P576 sees a doctor on day 155, tests positive, and is reported. This triggers tracing of his infector, P936, and of the person he or she infected, P392. Tracing to P392 fails because this person does not use the app. Since P392 also does not develop symptoms, he or she is never reported or quarantined and infects three others. The backward trace from P576 to P936 puts P936 in quarantine on day 155 and thus prevents him or her from infecting more people after this time. P936 does not test positive (dashed outline of the box indicates the person was traced but never reported), so never becomes an index case him- or herself. However, since second order tracing is active in this simulation, the ‘siblings’ of P576 are identified. P747 is put in quarantine before he or she can infect anyone else, and tests positive the same day. This makes him or her an index case, so that the common infector, P936, is traced again. P277 does not use the app, therefore the trace fails. However, P277 happens to not meet many people on the first two days of infectiousness, then develops symptoms, sees a doctor, and is quarantined.

In this example, the chain of infections was stopped at P747 through second order tracing, the chain was stopped at P277 due to luck, but the chain could not be interrupted at P576 because the person he or she infected did not use the tracing app.

For a given set of input parameters, that is for a specific scenario, each run of the MC simulation represents one possible course of the epidemic. To find the most likely outcome for a scenario, the simulation is run 50 to 1000 times and the outcomes are averaged. As an example, Figs. 6 shows the course of the epidemic for 50 MC runs. The stochastic nature of the processes involved creates a spread in outcomes. Especially near the beginning of the epidemic where only few people are infected, statistical fluctuations cause large differences in the outbreak dynamics.

Figure 6
figure 6

Stochastic variation in outbreak dynamics. The results are from 50 runs of the MC simulation; each run has the same input parameters (\({ p_{\mathrm {app}}}= 0.9\), \({\alpha }\cdot {f_{m}}= 0.95\), \({\eta _{as}}= 1\), trace uninfected = true). Top: The fraction of infectious (green), quarantined (blue) and cumulative exposed (yellow) people for each day of the simulated outbreak. Curves for only 20 out of the 50 runs are shown to improve legibility. Outcomes from one selected run are drawn as bold lines. Middle: \({R_{e}}\) each day is shown for the MC run drawn in bold in the top plot. The red vertical line indicates the time when 0.4% of the population have been infected, which is when interventions (other than quarantining of reported symptomatic cases, which is enabled from the beginning) are turned on. To measure their effectiveness, \({R_{e}}\) is averaged over 18 days (red area), starting 10 days after interventions commence. Bottom: Outcomes from the 50 MC runs, such as the maximum fraction of the population quarantined, are histogramed to show the statistical variation more clearly

The \({R_{e}}\) shown for each day is given as the average number of people infected by everyone who recovered on that day. After the interventions are turned on, \({R_{e}}\) begins to decrease and in the absence of non-linear effects reaches a plateau after approximately 10 to 14 days. In runs where more than a few percent of the population has been exposed at that time, \({R_{e}}\) declines naturally due to an increasing chance that contact persons are already infected or recovered, and therefore cannot be infected again. When reporting the \({R_{e}}\) for a simulation run, \({R_{e}}(t)\) is averaged in the time span of 10 days to 28 days after interventions start, or from 10 days to the day more than 50% of the population has been exposed, whichever period is shorter. This time window is a compromise between being far enough away from the start of interventions for the effect of the interventions to fully manifest, and not getting to close to the region where \({R_{e}}\) changes naturally. The \({R_{e}}\) reported for a scenario is the average \({R_{e}}\) over all the simulation runs for that scenario.

We consider the following outcomes:

  • The fraction of the population exposed after one year of continuous interventions. The one year is counted from the day interventions start.

  • The fraction of the population sick on the day when most people are sick.

  • The average fraction of the population in quarantine each day over one year of continuous interventions.

  • The fraction of the population in quarantine on the day when most people are in quarantine.

  • The effective reproductive number after interventions.

  • The fraction of simulations that did not generate an outbreak, where an outbreak is defined as at least 0.4% of the population becoming exposed in runs where interventions do not start on day 0, and is defined as at least 50 people becoming exposed in runs where interventions start on day 0. These numbers are chosen since they represent a robust threshold separating simulation runs where exponential rise in the number of people infected (i.e. an outbreak) takes place from those where it does not.

4 Results

We highlight three outcomes for select scenarios and as function of the app coverage. Unless stated otherwise, the values printed in black in Table 1 are used for those parameters not explicitly varied in the figures or stated in the figure captions. The full set of outcomes for all scenarios is shown in the Appendix. Note that the size of the simulated population and the number of MC runs was chosen such that the uncertainties on the outcomes are very small. Hence the error bars on most points are smaller than the marker size.

4.1 The effect of instantaneous contact tracing on an ongoing epidemic

Figure 7 and Fig. 8 show three outcomes each for the four simulated symptom/reporting fractions and for \({R_{0}}= 3\) (Fig. 7) and \({R_{0}}= 2\) (Fig. 8). Results are shown for the realistic case where tracing identifies contacts regardless of their infection status, and for the case where only infected contacts are traced. The latter is included so that results can be compared to other models, and because the difference in the number of quarantined people between the two cases indicates how many healthy people are quarantined when uninfected contacts are also traced. \({R_{0}}=2\) is likely too optimistic, the results, however, are also valid in the situation where R was lowered to \(R= 2\) by other interventions, such as mask wearing, before tracing and quarantining starts.

Figure 7
figure 7

The effect of \({\alpha }\cdot {f_{m}}\) (tested symptomatic fraction) for different app coverages on the reproductive number after interventions (DCTS and quarantine) (top), the average of daily quarantined people as a percentage of the total population (middle) and the percentage of the population exposed after 1 year of continuous interventions (bottom). The points for 60% app coverage and 60% symptomatic fraction, outlined in red, will be studied further. This is for \({R_{0}}=3\) and \({\eta _{\mathrm {DCT}}}=1\)

Figure 8
figure 8

The same as Fig. 7 but for starting conditions where \(R= 2\). \(R= 2\) could be achieved by interventions other than DCT

The \({R_{e}}\) shown in the top panels of Fig. 7 and Fig. 8 should be compared to Fig. 3. For example for \({R_{0}}=3\) and \({\alpha }\cdot {f_{m}}=0.6\), just quarantining reported symptomatic cases yields R\(_{e;q}=2.2\), so DCT only lowers \({R_{e}}\) by another 0.5 (if tracing is independent of infection status), or 0.3 (if tracing finds only infected contacts).

Assuming that α is about 60% in European populations and that not everyone who has symptoms sees a doctor or is tested, the region between \({\alpha }\cdot {f_{m}}=0.4\) to 0.6 is likely relevant for Europe. If the reports of higher α in Asian countries are due to true differences in symptom fraction rather than to differences in study methods, the higher \({\alpha }\cdot {f_{m}}\) values simulated should be more relevant to Asian countries.

We will use \({R_{0}}=3\), \({ p_{\mathrm {app}}}=0.6\), \({\alpha }\cdot {f_{m}}=0.6\), and \({\eta _{\mathrm {DCT}}}=1\) (points outlined in red in Fig. 7) as defaults.

In Fig. 9, the lightest-coloured points correspond to these defaults. The other colors indicate what happens when the tracing efficiency is reduced. For the lower app coverages, the results are barely sensitive to \({\eta _{\mathrm {DCT}}}\) because DCT is not very effective to begin with.

Figure 9
figure 9

The effect of the tracing probability \({\eta _{\mathrm {DCT}}}\) for different app coverages (\({R_{0}}= 3\), \({\alpha }\cdot {f_{m}}= 0.6\))

In the realistic case where traced uninfected contacts are quarantined until two test results are negative (see Sect. 2.3), as many as 15% of the population are in quarantine on each day of the simulated outbreak, most of them healthy. Without a POC test to release healthy contacts, this number rises to 25%. At the peak of the outbreak, approximately 30% of the population is quarantined and half of those quarantined are actually sick.

People are not available to be infected while in quarantine, so the mean number of contacts per day, and with it the effective reproductive number, goes down and fewer people become exposed. The number of people quarantined rises with higher app coverage (because more people are traced in that case) and with a higher number of people exposed (because there are more index cases). A higher app coverage eventually leads to fewer exposed people though. Hence for a given reported symptomatic fraction, the number of people quarantined rises until an app coverage of approximately 75% (for lower reported symptomatic fractions) or 90% (for higher reported symptomatic fractions) and then falls sharply.

Contact tracing cannot reduce R below 1 in any of the simulations presented here except for \({\alpha }\cdot {f_{m}}\geq 0.8\) and \({ p_{\mathrm {app}}}\geq 0.9\) (if \({R_{0}}=3\)) or \({ p_{\mathrm {app}}}\geq 0.7\) (if R when tracing and quarantining is started is (2), and perfect tracing probability.

We note that in some cases, the fraction of the population exposed after 1 year is higher than the herd immunity level. The herd immunity level is defined as the fraction of the population that must be immune for the increase in new infections to not be able to grow exponentially, that is for \({R_{e}}\) to become 1. In an ongoing epidemic, many people are infectious when this point is reached, and the number of exposed people continues to rise until enough people are immune for \({R_{e}}=0\), therefore the curve overshoots herd immunity level.

4.2 Contact tracing in combination with random testing and social distancing

To control the epidemic, R must be reduced by additional measures. We simulated the effect of random testing (RT) and social distancing (SD). Figure 10 shows the outcomes for our standard scenario with the addition of RT of 5% and 20% of the population per day, and social distancing bringing \({n_{c}}\) to 0.8 and 0.6 of its original value. The reduction in contact rate is always connected to an upper limit in the number of contacts as shown in Table 1.

Figure 10
figure 10

The effect of social distancing (SD) and random testing (RT) for different app coverages in combination with CT (\({R_{0}}= 3\), \({\eta _{\mathrm {DCT}}}= 1\), \({\alpha }\cdot {f_{m}}=0.6\))

Random testing even at 20% of the population per day in combination with contact tracing can only achieve \({R_{e}}\leq 1\) for \({ p_{\mathrm {app}}}\geq 0.75\). It does however bring \({R_{e}}\) close enough to 1 to significantly reduce the fraction of the population that becomes exposed, even for lower app coverages.

Social distancing reliably reduces the reproductive number. Social distancing to just 80% of the contact rate does as well as randomly testing 20% of the population each day. Reducing the contact rate to 60% pushes \({R_{e}}\) below 1 for 60% app coverage.

4.3 The effect of reduced contagiousness of asymptomatic carriers

As we saw in Fig. 3, in a situation where reported symptomatic cases are quarantined, down-scaling the contagiousness of asymptomatic carriers reduces R significantly. Figure 11 shows the outcomes when DCT is then applied.

Figure 11
figure 11

Outcomes when the contagiousness of a- and pre-symptomatic carriers, \({\eta _{as}}\), is smaller than 1. Settings are \({R_{0}}= 3\), \({ p_{\mathrm {app}}}= 0.6\), \({\eta _{\mathrm {DCT}}}= 1.0\), trace uninfected contacts = false. The values printed for \({R_{e}}\) correspond to: (first number) the mean minus the standard deviation, and (second number) the mean plus the standard deviation, of the distribution of \({R_{e}}\) from 100 simulations (compare Fig. 6 bottom right panel), while the color of the field shows the mean. Where values are exactly 0.0, none of the 100 simulations had an outbreak (compare Sect. 4.5)

In the case where \({\eta _{as}}=0.1\) and all symptomatic carriers are reported when symptoms start, the simulations generate no outbreaks (fewer than 400 people become infected in total). When 75% of symptomatic cases are reported, \({R_{e}}\) has large fluctuations between simulation runs, and outcomes are very sensitive to the fraction of symptomatic cases.

4.4 The effect of timing, delays, and second order tracing

Figure 12 shows the outcomes as a function of tracing delay, that is the time in days between when an index case is identified and when his or her contacts are traced and quarantined. Outcomes are again grouped by whether or not uninfected contacts are identified by tracing. Results are also shown for both 1st order tracing and 2nd order tracing and for the two incubation time curves (the default one with mean incubation time (IT) of 7 days and the alternative one with the shorter IT = 4 days). The yellow marker uses the default incubation time curve with the alternate transmission probability curve (IC) where there is less pre-symptomatic transmission. In addition to the simulation results, the calculated \({R_{e}}\) from the age-of-infection model is shown for the settings with the default incubation time and transmission probability curves, and first order tracing. Approximations had to be made in the calculation, hence the absolute value is not expected to match the simulation results perfectly.

Figure 12
figure 12

The effect of incubation time (IT) and tracing order (TO) for different tracing delays. IT = 7 refers to the curve with mean incubation period of 7 days, and IT = 4 to the one with mean incubation period of 3.6 days. The yellow point shows results for the alternate transmission probability curve (IC) as shown in Tab. 1 (grey values). Predictions from the “age-of-infection” (AOI) model, where only infected contacts are traced, are shown as the dark green line for parameters IT = 7, TO = 1 - the result shown is not exact (see Suppl. Materials). This is for \({ p_{\mathrm {app}}}=0.75\), \({\eta _{\mathrm {DCT}}}=1\), and \({\alpha }\cdot {f_{m}}=0.6\)

Results are shown for an app coverage of 75%. Some of the dynamics are quite sensitive to the app coverage (see supplemental materials), and at 75% trends are clearer than at 60% app coverage.

The difference between first and second order tracing is small in all three outcomes (this changes in some situations for higher app coverages). For the default incubation time curve with mean of 7 days, tracing delays of up to 6 days have only a small effect, increasing the number of people exposed after 1 year from approximately 72% to 82%. When the mean incubation time is only 4 days, \({R_{e}}\) is more sensitive to tracing delays, with a 7 day delay increasing the number of people exposed from approximately 60% to 81%. The infection probability curve with less pre-symptomatic transmission probability improves the outcomes only very slightly, though the effect becomes bigger with larger tracing delay.

Second order tracing can find the infector and through him or her, the ‘siblings’ of the index case. The chance that the infection took place within \({\Delta T _{\mathrm {trace}}}\) is higher with a longer \({\Delta T _{\mathrm {trace}}}\). However even for \({\Delta T _{\mathrm {trace}}}= 14\) days, the outcomes are not significantly different. The look-back time must be balanced against the number of healthy people quarantined. People typically become index cases before they have recovered, and thus would have had a chance to infect others in the approximately 7 days prior. Looking back longer than that means one has a bigger chance of finding the infector, but it also means tracing many uninfected contacts.

When considering the realistic case where uninfected contacts are traced, second order tracing with a 7 day look-back time sends about 1.3 times as many people into quarantine on average over 1 year as first order tracing.

4.5 Outbreak probability

The results discussed so far consider situations where an outbreak is ongoing and interventions are started at some point into the outbreak. But not all simulation runs result in an outbreak. The stochastic nature of the outbreak means that there are large statistical variations at the beginning of the chain. For example, if patient zero happens to not infect anyone, no outbreak happens.

The chance for an outbreak to occur increases with R. The more people a case typically infects, the less likely it is that cases at the beginning of the infection chain do not infect anyone. Therefore, keeping interventions in place even in populations without an ongoing outbreak can be useful to decrease the probability that an outbreak will occur when a case is introduced into the population, for example through travel.

Figure 13 shows the outbreak probability as a function of the reproductive number when an infected person enters a fully susceptible population.

Figure 13
figure 13

The probability for an outbreak to start as a function of the reproductive number at the time when patient 0 enters a fully susceptible population. An outbreak here is defined as more than 50 people becoming infected. The error bars shown are statistical

4.6 Sensitivity of results to the social contact structure

The results presented so far assumed a homogeneous population of size \(1\times 10^{5}\) and a distribution of the number of contacts with infection potential per day from Eq. (3). We also ran some sets of parameters for different population sizes and for different contact structures. The results are shown in Fig. 14.

Figure 14
figure 14

The sensitivity of the outcomes to the size and the structure of the simulated population is shown. The three population structures (described in Sect. 2.1) correspond to the homogeneous population with a gamma distribution (Eq. (3)) describing the number of contacts per day, a homogeneous distribution using the power law (Eq. (4)) for the number of contact per day, and the social graph population. Settings are \({R_{0}}=3\), \({\alpha }\cdot {f_{m}}=0.6\), \({\eta _{\mathrm {DCT}}}=1\), trace uninfected = true. For the 10,000 people population, outcomes have large statistical fluctuations and error bars show the error on the mean. For the other points, the error bars are smaller than the marker size

The introduction of a social graph introduces non-linear effects that change \({R_{e}}\) on timescales much longer than what is captured by our standard analysis. In some cases, this means that fewer people are exposed after one year, even though \({R_{e}}\) is higher (see Fig. 21 in Appendix A.8).

5 Discussion

Contact tracing relies on index cases from which to trace. When there is a large fraction of mildly symptomatic and asymptomatic carriers who never go to the doctor or get tested, many carriers do not become index cases, so DCT does not have a large impact. Outcomes improve strongly the higher the fraction of reported symptomatic carriers. This is partially because DCT is more efficient, and partially because R is additionally reduced just from quarantining the index cases. Therefore it is crucial that every person with even the mildest symptoms has easy access to a COVID-19-test.

The extend to which pre- and asymptomatic carriers drive the outbreak depends on their contagiousness. If for some reason they are less contagious than symptomatic carriers, missing them as index cases does not worsen outcomes much. In the case where \({\eta _{as}}\) is 0.1, as proposed for example in [4], quarantining of index cases, without CT, reduces R from \({R_{0}}=3\) to \({R_{e}}<1\) even when just 40% of cases are symptomatic.

Randomly testing a fraction of the population regularly to find unreported carriers helps to make up for the large fraction of asymptomatic carriers. We find that a very large fraction of the population must be tested daily to significantly improve outcomes. For our default parameters, even when testing 20% of the population daily, at least 90% of the population would have to use the DCTS for \({R_{e}}\) to become smaller than one. Since typical PCR test capacities are much lower than these numbers, POC tests are likely the only realistic option for mass testing.

Reducing the contact rate (social distancing) by as little as 20% is as effective as testing 20% of the population every day while requiring fewer people to be quarantined.

Tracing delays of a few days do not significantly worsen the outcomes. Two studies, Ferretti et al. [4] and [5], indicate that a DCTS could control a SARS-CoV-2 outbreak (that is achieve \({R_{e}}<1\)) because it allows for contact tracing without delays. We find that the asymptomatic infectiousness scaling of \({\eta _{as}}= 0.1\) used by [4] is the main driver of their \({R_{e}}\) and given these starting conditions, DCT only has to lower R by a small amount to achieve outbreak control and is therefore then effective. Kretzschmar et al. [5] are more careful about the reduction in R achievable with DCT, but do confirm the improved outcomes with short tracing delays. However, [5] use a very short latency period. With the longer median latency periods consistent with recent large-scale studies, this effect is small. Therefore, the advantage of a DCT in the case of COVID-19 lies mostly in the possibility to scale tracing to a large number of cases without needing a large increase in the number of manual contact tracers.

Most models consider that contacts that were actually infected are traced with some probability. In reality, it is impossible to tell immediately whether or not a traced contact has been infected. Even if a test performed immediately on tracing is negative, it could just mean that the person is still in the latent period. Therefore, all traced contacts should be quarantined and tested multiple times. In principle, one could devise other schemes, such as testing each traced person every morning (e.g. with a POC tests that can be done at home and gives results within minutes) for a few days without requiring quarantine unless the test comes up positive. Right now, such frequent testing is not realistic in most countries.

We find that including the effect that quarantining of uninfected contacts has on the outbreak dynamics can lead to significantly different, typically more positive, outcomes compared to models where this effect is ignored. The improvement in outcomes is due to the large number of people quarantined even though they are healthy. Our simulations probably underestimate this number, because we use contact rates for the types of contacts that have a high chance of transmitting a respiratory virus. A DCT system will typically pick up many persons who were in spacial proximity to the index case, but not in a manner that was likely to transmit the virus, so the number of contacts traced per index case could be bigger in reality. For any serious large-scale use of a DCT system during an ongoing epidemic, dealing with these uninfected contacts in quarantine is going to be a major challenge, especially as the compliance of the population with quarantining procedures may decrease once someone has been traced and quarantined multiple times.

The statistical nature of virus transmission and contact rates leads to large variations in outbreak dynamics at the start of the outbreak. Sometimes, an infectious person entering a susceptible population does not start an outbreak. This becomes less likely the higher R is. This also means that under identical conditions, one population could have hundreds of cases within a week of the arrival of patient 0, while in another population the case number does not start rising for several weeks, just by luck.

Beside control of the outbreak, that is achieving \({R_{e}}\) below 1, an important outcome is how many people will have been exposed by the time a vaccine might be available. Due to the heavy social and economical burden imposed by virus control, some countries are aiming at \({R_{e}}\) around 1, rather than total control. Our simulations assume that the same interventions are applied throughout the epidemic, so the outcomes over 1 year are guidelines rather than realistic predictions for any real country. They do give a qualitative idea of what achieving a given \({R_{e}}\) means in terms of the number of people exposed (and with that, the number of fatalities).

For reasons of computing power, most simulations were run for a homogeneous population. In reality, society is organized into social units. Introducing such social units into the simulation means that an infectious person tends to meet the same people every day, exposing them again and again. This leads to slight changes in outcome, while the qualitative results remain the same.

5.1 Limitations

We assumed that all people, once they have recovered from the infection, are immune to a secondary infection. Whether and for how long a recovered person is immune remains to be answered. Studies show that neutralizing antibodies are produced during infection and to a higher degree in symptomatic carriers, but decline significantly 2–3 months after recovery [61, 62]. The minimal antibody titer to confer protection is, however, still unclear. Furthermore, memory T cells to SARS-CoV-2 have been found in patients including asymptomatic and mildly symptomatic ones, which likely contribute to protective immunity as well [63].

In our models, everyone adheres to quarantine protocols. That is, every time someone is alerted by the DCT system to having been in proximity of a contagious individual, this person must follow the quarantining and testing procedure. This is crucial to suppress pre- and asymptomatic transmission, but may be difficult to achieve in reality. We also assume that the fraction of symptomatic individuals who see a doctor/get tested do so the day they become symptomatic.

The transmission probability in our models changes with the time since infection, but not between individuals. Current research, however, suggests that COVID-19 is over-dispersed, meaning some individuals spread the virus to many others, in so-called “superspreading events”, while most do not transmit the virus at all or only to very few people [64]. Part of this over-dispersion is due to the random nature of the contact number—some people just meet more others, and is therefore included in our models (see Sect. 2.2 and Fig. 2).

We assume that no manual tracing is performed at all. Typically, the types of close contact persons to whom spreading the disease is most likely, that is friends and family, can be manually traced without much effort, hence the fraction of infected contacts traced could be larger in reality.

6 Summary

Many countries enforced a policy of ‘shelter–in–place’ and/or extreme social distancing, effectively putting most of the population into quarantine. This significantly slowed down the infection rate [65, 66], but came with large economical and social costs to society. World-wide, a lot of effort has been put into the development of CT systems, in the hope that large-scale CT could replace other public health measures at much smaller cost to society.

We modelled the effect of instantaneous DCT in combination with a testing and quarantining protocol, as well as random testing and social distancing, on an ongoing COVID-19 epidemic. Results were validated by running the scenarios with two independently developed individual-based models, which were further cross-checked by two types of deterministic models. We modelled many different parameter values for the still not well-known properties of SARS-CoV-2, COVID-19 and for the interventions, leading to well over 10,000 simulated scenarios. The goal was to find the regions in this parameter space where CT without additional interventions could lower the effective reproductive number enough to halt exponential growth.

Wherever modelling approximations had to be made, we chose defaults that lead to better outcomes, hence these results are likely on the optimistic side. Our results are stable under different simulated social structures and epidemiological parameters, with significantly different outcomes seen only when varying the fraction of asymptomatic individuals or down-scaling the contagiousness of pre- and asymptomatic cases.

We find that for large regions of the parameter space, including the currently most likely parameter values, an outbreak of COVID-19 cannot be fully controlled by DCT even if a large fraction of the population uses the system. Furthermore, if interventions are started once an outbreak is already ongoing, DCT causes a large fraction of the healthy population to be traced and quarantined.

DCT can be combined with other measures, such as face-mouth coverings, social distancing, and/or random testing, to achieve outbreak control.

The availability of fast testing, and coordination of test results with the DCT system, are crucial to allow symptomatic cases to become index cases for tracing, and to release traced healthy contacts from quarantine. Since SARS-CoV-2 symptoms are unspecific, everyone with even a slight cough of fever must be able to get a test (a) quickly, because the infection probability peaks just before symptom onset and then falls quickly and people who are not sure they are infected likely will not effectively quarantine themselves, and (b) easily, so that a large fraction of symptomatic cases do seek out testing. The gains of a DCT system in outbreak control quickly vanish if many symptomatic cases do not seek out testing, or if positively tested individuals do not become index cases.