Introduction

Everyday decisions are often marred by uncertainty1. Gathering new evidence through exploration is a useful and rational strategy to reduce the impact of uncertainty on subsequent decisions2. One of such exploratory strategies is checking, a procedure through which an agent repeatedly samples from the same source of information3,4. In a changing environment, checking can serve the useful purpose to update one’s, possibly outdated, internal model about the world5; for instance, checking one’s smartphone for possible novel notifications. In spite of its apparent simplicity, what drives checking in humans, and how it is regulated by observer’s uncertainty about the world, remains mostly unknown.

Deciphering the psychological drivers of checking is also key to understand how this, otherwise rational, behavior goes awry in certain pathological conditions such as obsessive–compulsive disorder (OCD). About 80% of OCD patients6,7 indeed exhibit excessive checking; not only for anxiety-related, but also neutral, stimuli8,9,10,11. In line with the hypothesized uncertainty-reducing role of checking, patients typically report that checking reduces their uncertainty about checked options12.

One hypothesis we explored here is thus that checking behavior depends on an uncertainty-evaluation, or metacognitive, process13,14,15,16,17. Traditional models have decomposed metacognitive processes into two categories: (i) explicit metacognition, understood as, at least partially, conscious evaluations and regulations of our own decisions, grounded in reflective processes involving introspection, and which enables humans to share metacognitive representations with other people18,19,20,21; and (ii) implicit metacognition understood as pre-reflective, automatic evaluations and regulations of our own decisions, which does not necessarily reach awareness18,22, and which are present in non-verbal young infant23 and non-human animals24. In explicit metacognition tasks, subjects are often asked to rate their confidence into some past decision20. By contrast, in implicit metacognition tasks, subjects do not explicitly rate their confidence but, instead, use it to guide decision-making, such as when choosing between several options from which to get rewarded25. In line with this taxonomy, here, we contrasted the contributions of both explicit and implicit metacognitive monitoring sensitivities to uncertainty-guided checking behavior, and explored the possibility that checking might only be related to explicit metacognition.

The propensity to check exhibits a wide spectrum from occasional to excessive forms even in the non-clinical population26. Previous studies provide converging evidence that self-beliefs (e.g. cognitive distrust) correlate with a general tendency to check, as assessed by questionnaires, in healthy subjects27,28. The mechanisms underlying this association are however still unknown. We further hypothesized that self-beliefs such as the self-reported overall propensity to compulsion and cognitive distrust, which varies across individuals, also in the non-clinical population, would impede uncertainty-guided checking behavior.

To address these questions, we capitalized on previous attempts to reproduce checking behavior in controlled experimental settings3,4,8,9,10,29. Notably, we leveraged cutting-edge experimental psychology paradigms allowing us to manipulate perceptual uncertainty through stimulus difficulty similarly across individuals, independently of overall performance. We then quantified the number of checks for different levels of perceptual uncertainty, and assessed to what extent they related to performance. We also measured the sensitivity of participants’ implicit and explicit metacognition using state-of-the-art measures30,31. Finally, we made the bridge with widely used clinical notions by relying on validated questionnaires to assess inter-individual differences in declarative self-beliefs.

We made the following hypotheses: (H1) we predicted that checking reduces participants' uncertainty about checked options; (H2) accordingly, we expected that checking allows individuals to increase task performance: performance should be better after checking than after immediate decisions; (H3) importantly, we predicted that checking behavior depends on an explicit uncertainty-evaluation (explicit metacognitive accuracy) rather than an implicit uncertainty-evaluation (implicit metacognitive accuracy) process; (H4) finally, we expected that, above and beyond inter-individual differences in metacognitive sensitivities, participants’ self-beliefs, such as self-reported compulsivity and low self-confidence, would impede checking behavior even in healthy subjects. In a nutshell, our findings show that uncertainty-guided checking behavior relates to both participants’ (explicit, but not implicit) metacognitive sensitivity as well as to self-beliefs.

Results

We report results for a total of 28 human healthy participants who were asked to rate the orientation (left or right) of random dot motion (RDM) stimuli. Difficulty levels (i.e. motion coherence) were adjusted at the single participant level thanks to an initial calibration phase (see Methods), to prevent inter-individual differences in visual perception to affect the results.

Checking behavior

Participants performed a checking task in which they were allowed to replay (an infinite number of times) the RDM stimulus before committing to the decision orientation (see Methods and Fig. 1a,b). We first assessed fluctuations of checking behavior, how checking is regulated by stimulus difficulty and whether it improves subsequent performance.

Figure 1
figure 1

Causes and consequences of checking behavior. (a) RDM stimuli. Stimuli used throughout the study were random dot motion (RDM) stimuli whose coherence level was manipulated. Participants were asked to judge the direction of motion (either left or right). (b) Checking task. Representative trial of the checking task: before committing to the decision, participants were allowed to replay the RDM stimulus an infinite number of times. Feedback was delivered at the end of the trial. (c) Increased checking after more difficult stimuli. Number of checks according to motion coherence (i.e. difficulty, in arbitrary units, a.u.), which is expressed in three terciles for the sake of averaging over participants (who have different perceptual sensitivity). (d) Decreased performance after checking. Accuracy of orientation judgments as a function of motion coherence and number of checks. Same binning method as in (c). Error bars correspond to standard error of the mean.

Firstly, in order to evaluate the impact of stimulus difficulty on checking (H1), we used a Poisson mixed effect model explaining trial-by-trial fluctuations in the number of checks (hereafter referred to as 1; see Methods and Table S1). We found that participants increased their checking frequency for more difficult stimuli (b = 0.90, P < 2 × 10–16; see Fig. 1c), in line with the hypothesis that checking aims at decreasing participants’ uncertainty about future decisions (H1).

Secondly, in order to evaluate whether checking behavior improved subsequent performance (H2), we used a multivariate logistic mixed-effect model explaining trial-by-trial fluctuations in RDM task performance (hereafter referred to as 2; see Methods and Table S2). Contrary to what we expected, we found that checking decreased performance on RDM stimuli of matched difficulty levels (b = –0.26, P = 4.56 × 10–8; see Fig. 1d). Furthermore, we found that the detrimental effect of checking on performance decreased with stimulus difficulty (number of checks by stimulus difficulty significant and positive interaction; b = 0.20, P = 6.27 × 10–5): checking more strongly reduced performance for high motion coherence RDM stimuli (i.e. easy trials). Although checking was promoted by perceptual uncertainty (H1), it thus did not appear to reduce the impact of perceptual uncertainty on subsequent decisions (H2); we later discuss possible reasons for this finding (see Discussion).

Contribution of metacognition to checking

Showing that our experimental settings induced uncertainty-guided fluctuations in checking behavior, we now turn to one of our study goals: assessing the contribution of inter-individual differences in metacognitive sensitivities in the tendency of participants to check (H3). In particular, we assessed a possible differential effect between implicit and explicit metacognitive sensitivities in the promotion of checking and its regulation by perceptual uncertainty.

In a well-established explicit metacognition task (see for example32), participants were asked to explicitly rate their confidence (on a scale ranging from 1, low confidence, to 6, high confidence) about their orientation judgment (see Fig. 2a). We assessed participants’ explicit metacognitive sensitivity using M-ratio, a well-established measure of (explicit) metacognitive sensitivity which quantify the amount of perceptual information (in signal-to-noise units) that is available to metacognitive processes33. Overall, participants had an M-ratio significantly higher than zero (two-tailed one sample Wilcoxon signed rank test; V-statistic = 406; P value = 4.003 × 10–6; see Fig. 2b), thereby showing that confidence judgments were predictive of actual performance.

In an implicit metacognition task, using an original design by Barthelmé and Mamassian34 and validated in further studies25,30,34, participants had to judge the direction of two successive RDMs, of different difficulty levels (i.e. motion coherence), before being finally asked to indicate the one for which they judged to be more likely correct (see Fig. 2c). We assessed participants’ implicit metacognitive sensitivity by computing the difference in sigmoid slopes (relating stimulus difficulty to performance) between the discarded and the chosen stimulus (divided by the average slope across both conditions), hereafter referred to as the confidence modulation index (CMI)25,30. Participants’ CMI was significantly higher than zero (two-tailed one sample Wilcoxon signed rank test; V-statistic = 351; P value = 3.814 × 10–4; see Fig. 2d), thereby showing that participants were able to leverage their metacognitive knowledge to improve decision-making.

We then assessed the independence between implicit vs. explicit metacognitive sensitivities by correlating them across participants. We found no significant correlation (r = 0.04, P = 0.853, BFnull = 2.83; see Fig. 2e & Table S3), thereby suggesting they reflect distinct (meta)cognitive processes and, possibly, relate differently to checking behavior. Accordingly, we also found a dissociation in the contributions of explicit vs. implicit metacognitive sensitivities to uncertainty-guided checking behavior (H3). On the one hand, the Poisson mixed effect model explaining trial-by-trial fluctuations in the number of checks (1) revealed a significant positive interaction between M-ratio and stimulus difficulty (1; b = 0.08, P = 0.028; see Fig. 3c), showing that checking was driven by perceptual uncertainty more strongly in participants with higher explicit metacognitive sensitivity. Conversely, the interaction between CMI (i.e. implicit metacognitive monitoring ability) and stimulus difficulty was negative and non-significant (1; b = –0.05, P = 0.210; see Fig. 3c). We then confirmed that the differential modulation of implicit vs. explicit metacognitive sensitivities on uncertainty-guided checking behavior was significant using a likelihood ratio test (LRT) between the original model (1) versus a restricted model which assumes the equality between the regression coefficients of the interactions of interest (i.e. interactions between CMI and stimulus difficulty, and M-ratio and stimulus difficulty; LRT: chi-square = 5.88, 1 df, P = 0.015). This analysis thus confirmed that checking is more strongly driven by perceptual uncertainty in participants with elevated explicit, but not implicit, metacognitive monitoring ability (H3).

Figure 2
figure 2

Independence of implicit and explicit metacognitive sensitivities. (a) Explicit metacognition task. Representative trial of the explicit metacognition task: after reporting their orientation judgment, participants were asked to report their confidence in this orientation judgment on a scale from 1 (guess) to 6 (certainty). Feedback was delivered at the end of each trial. (b) Sensitivity of explicit metacognition. The sensitivity of explicit metacognition was measured using the M-ratio, which quantifies the amount of decision information available to the confidence judgment, here depicted as the difference between type 1 (decision) and type 2 (confidence) ROC curves35. (c) Implicit metacognitive task. Representative trial of the implicit metacognition task: after reporting their orientation judgment for two successive RDM stimuli, participants were asked to report the stimulus for which they judged more likely to be correct. Feedback on the chosen stimulus was delivered at the end of each trial. (d) Sensitivity of implicit metacognition. The sensitivity of implicit metacognition was measured as the difference in slopes between psychometric functions corresponding to the chosen minus discarded stimuli, divided by the average slope across both conditions; a measure we refer to as CMI25,30. (e) Independence between explicit and implicit metacognition. Metacognitive sensitivities were estimated for each participant separately in the explicit and implicit metacognition tasks. Measures are not correlated across participants suggesting they relate to independent metacognitive processes (see Table S3). Stars indicate significance: *P < 0.05, **P < 0.01, and ***P < 0.001.

Figure 3
figure 3

Modulation of checking uncertainty-guided behavior. (a/b) Distribution of participants on the Padua inventory/cognitive mistrust subscales. Participants spread over a large spectrum of self-beliefs, which replicated previous findings28,36,37 (see Methods). (c) Modulation of uncertainty-guided checking. Arrow size are proportional to z-scores from the Poisson model of the number of checks (1, see Methods). Arrow color denotes the direction of the effect (green: positive, red: negative). Stars indicate significance: *P < 0.05, **P < 0.01, and ***P < 0.001.

Contribution of self-beliefs to checking

We hypothesized that, above and beyond inter-individual differences in metacognitive sensitivities, participants’ self-beliefs, such as self-reported compulsivity and self-confidence, would modulate checking behavior even in our non-clinical population (H4). To investigate this, we asked participants to answer questionnaires at the end of our experiment.

First, we measured the declarative lack of confidence in various cognitive domains (e.g. “my memory can mislead me at times”) using the cognitive mistrust subscale of the Metacognition Questionnaire28 (see Fig. 3b). This subscale was of particular interest here as a long series of studies (see27 for a review) consistently showed that persons suffering from compulsive checking typically exhibit higher levels of cognitive mistrust. Here, we tested, in particular, the hypothesis that cognitive mistrust may decrease the sensitivity of checking behavior to perceptual uncertainty. Accordingly, we found that cognitive mistrust significantly reduced the effect of stimulus difficulty on checking behavior (1; cognitive mistrust by stimulus difficulty significant negative interaction; b = –0.09, P = 0.039; see Fig. 3c).

Second, we measured self-reported obsessive–compulsive symptoms (OCS) using the Padua inventory scale36 which quantifies self-reported obsessions and rituals across four dimensions: mental control, checking, contamination obsessions and anxiety about motor control (see Fig. 3a). The Padua Inventory scale was slightly correlated with the cognitive mistrust subscale across participants (r = 0.42, P = 0.026; see Table S3), as previously observed28. We found, however, that, contrary to the cognitive mistrust subscale, the self-reported obsessive–compulsive symptoms increased the effect of stimulus difficulty on checking behavior (1; Padua inventory by stimulus difficulty significant positive interaction; b = 0.10, P = 0.019; see Fig. 3c).

Discussion

The aim of the present study was to investigate the psychological drivers of checking behavior in humans, in particular its regulation by environmental uncertainty. Our results provide evidence that human checking behavior is the outcome of an uncertainty-evaluation, or metacognitive, process (H1). Indeed, participants increased their checking frequency for more difficult stimuli, and thus more uncertain settings38. This result is in line with past findings in non-human primates4,39 which demonstrate that checking adaptively aims at decreasing one’s uncertainty about future decisions.

Contrary to what was expected, checking had a detrimental effect on performance: accuracy was worse after checking than after immediate decisions (H2). A first possible explanation of this unexpected result relates to the psychophysical properties of the RDM stimuli used in this study: although the difficulty of the RDM stimulus was kept constant across all replays (if any), each replayed RDM differed in the exact spatial locations of the dots. A possibility is that participants expected to see the exact same RDMs across all replays. The incongruence of replayed RDMs with subject’s expectations may have impaired subsequent decision-making processes. To investigate this hypothesis, future studies could manipulate the extent to which stimulus replays vary from the original stimulus and compare the impact on performance. Worth noting, above a given degree of stimulus dissimilarity, it is likely that the cognitive process at stake will relate more to ‘information sampling’ than genuine ‘checking behavior’11. Further theoretical work is highly needed to devise the conceptual delineations of those processes. A second, perhaps more interesting, explanation for the detrimental effect of checking on subsequent performance is that repeated checking of the RDM stimuli may hinder (perceptual) decision-making processes. This possibility echoes the results of a recent meta-analysis which revealed that repeated checking reduced memory accuracy of visual stimuli across experiments, in addition to the well-known reduction in memory confidence40. The mechanisms underlying this reduction in memory performance are not yet known. A possibility is that, in our study, the diminished perceptual decision-making performance relies on similar mechanisms. Further studies are necessary to replicate our unexpected result, and to investigate its possible underpinnings. Importantly, our results show that although detrimental on average, checking nevertheless remains adaptive in the sense that this detrimental effect seems to decrease with stimulus difficulty, thereby suggesting it is still more adapted to check after more difficult stimuli.

Our study also provides evidence that implicit and explicit metacognitive sensitivities are dissociable (H3) and that they relate differently to uncertainty-guided checking behavior. We found that, in the non-clinical population, uncertainty-guided checking behavior is promoted by a conscious, explicit high-level mental representation18,19,20, rather than by an unconscious, implicit low-level representation of perceptual uncertainty18,22. The role of this dichotomy in shaping checking behavior has received surprisingly little attention despite its important implications for disorders of checking behavior, including OCD. It has been suggested that excessive checking behavior in compulsive checkers corresponds to the pathological automatization of an otherwise deliberate behavior40,41,42,43. One could thus hypothesize that such an automatization relates to an insufficient contribution of explicit metacognitive processes to uncertainty-driven checking in compulsive checkers. This insufficient contribution could result from an alteration of the ability to explicitly introspect about one’s decisions (an alteration of the so-called metacognitive monitoring ability) as suggested by recent studies44,45. Another possibility is that the ability to explicitly introspect about one’s decisions is preserved, but the resulting information is not appropriately used for the regulation of checking behavior. To our knowledge such a metacognitive control impairment has never been investigated in OCD. In either case, compulsive checkers would lack an explicit, conscious termination cue, and would continue checking regardless of environmental uncertainty.

Over and above the contribution of core metacognitive processes operating at the scale of single decisions, our results suggest that uncertainty-guided checking behavior is also influenced by self-beliefs (H4), including the self-reported propensity to compulsions and cognitive mistrust. Previous studies provide converging evidence that self-reported overall confidence in memory is lowered by repetitive checking in healthy subjects40 and that OCD patients have lowered confidence in memory, as well as in other cognitive domains27. Nonetheless, the mechanism underlying the typical association between excessive checking and cognitive mistrust is still unknown. Our results bring a step forward in understanding by showing that cognitive mistrust impedes the regulation of checking behavior by perceptual uncertainty. It follows that the alteration of uncertainty-guided checking behavior may mediate the association between cognitive mistrust and obsessive–compulsive symptoms (OCS), a hypothesis which remains to be tested.

Conversely, our results show that beyond cognitive mistrust, self-reported OCS are associated with checking behavior better tuned to environmental uncertainty, operationalised here as stimulus difficulty. This goes against our hypothesis that the propensity to compulsion, conceived as a continuum from non-clinical OCS to OCD26, would impede uncertainty-guided checking behavior (H4). A possible explanation for this result is that the relation of compulsivity to uncertainty-driven checking behavior may depend on whether compulsivity falls within the clinical or non-clinical domain. The association between compulsivity and the impairment of uncertainty-guided checking behavior would thus appear clearly only for pathological compulsivity. Future studies in patients with OCD are necessary in order to test these hypotheses. To that regard, our well-controlled experimental setting might be relevant in assessing checking behavior in OCD patients.

Our findings should be relativized given the exploratory nature of our study and its limitations. In particular, we found two unexpected findings which we call for replications: a detrimental effect of checking on performance, and an association between increased self-reported OCS and improved uncertainty-guided checking behavior. A classic but major limitation of our study concerns our sample size. Although we collected a large number of trials to maximize within-subject reliability and sensitivity of our results, we believe it is important to confirm our results in larger sample sizes. We also note that given recent evidence reporting an impact of feedback onto self-reported measures46, it is possible that feedback provided in the course of the experiment may have influenced subsequent self-reports, including cognitive distrust and OCS. However, because performance was individually tailored (to ~ 70% accuracy), it is likely that the impact of task feedback onto subjects’ self-beliefs, if any, was uniform across subjects, and thereby could not by itself explain our results. In addition, in previous research, implicit metacognition is assessed in different ways and the ideal behavioural task for its assessment is still debated. Some tasks require subjects to place bets on their decisions or give subjects the possibility to opt out47. However, whether these implicit (non-verbal) uncertainty-monitoring tasks are metacognitive is a matter of debate21,48. In the present study, the implicit metacognition task required subjects to choose among two stimuli the one they judged the most likely to yield a correct response25,30,34. As Mamassian and colleagues showed, when doing so, participants' have to compare two levels of uncertainty related to two successive decisions34. This ‘comparison step’ makes the so-called implicit metacognition slightly more complex, possibly involving meta-representations, than the report of a single perceptual uncertainty quantity, which could be explained by entirely first-order processes48. Also, it is possible that in individuals’ everyday environment, in particular at home, the metacognitive processes involved in repetitive checking may differ from those elicited in the lab. Accordingly, it has been shown that personalized cues trigger increased responses in OCD, compared to non-specific ones49. Our hypotheses should thus also be tested in a more ecological environment, such as at participants’ homes. The development of information and communication technologies, and their recruitment for cognitive tasks, could make it possible to export research outside the laboratory and improve external validity of cognitive models.

In summary, our findings provide evidence for joint contributions of both explicit, but not implicit, metacognitive processes, and self-beliefs, including cognitive mistrust, to checking behavior guided by perceptual uncertainty. Our findings also open unexplored avenues to address and better understand how the otherwise adaptive checking behavior could go awry in OCD.

Methods

Participants

Sample size was determined a priori based on past studies of metacognitive sensitivity25,30, and constrained by participant availability. We initially aimed for a target sample size of 40 participants, and our initial sample (i.e. before outlier rejection) consisted of 37 healthy volunteers. We tested our hypothesis using multilevel regression models (see below;50, for which it remains difficult to perform classic power calculations, mostly because of the multiple sources of variance that must be taken into account51). Because the study did not concern biomedical research and no medical intervention was performed, it was, according to French law, waived from getting an approval by an external ethical committee. Yet, an internal review board, the ‘Comité de pilotage du Laboratoire d'Economie Expérimentale de Paris (LEEP)', assessed and approved the nature of the study and procedures, which preserved the full anonymity of the participants with respect to experimental data collection. The study was carried out in accordance with the Declaration of Helsinki52. All participants provided informed consent and received 15 euros for participating in the study plus a bonus reward (max = 5 euros) proportional to their performance during the behavioral experiment.

Behavioral experiment

The behavioral experiment, which lasted for 1h30min, consisted in 4 behavioral tasks, all derived from the well-established random-dot motion (RDM) task53,54 (see Fig. 1a). The RDM stimuli consisted of 236 white dots presented for 500 ms. Difficulty was manipulated by varying the degree of motion coherence (i.e. the proportion of dots moving in the same direction). On each trial, the direction of the dots moving coherently was randomly set to left or right. Participants reported the perceived motion direction on a keyboard; no time limit was imposed. Feedback was delivered at the end of each trial. First, participants went through a calibration phase (see below), followed by three tasks which were all explained in advance, and after a training phase of 15 trials was completed. The implicit metacognitive task and the checking task (see below) were intermixed in blocks of 50 trials for a total duration of 50 min, resulting in a mean number of trials of respectively 246.4 (median = 251, sd = 21, min = 204, max = 300) and 218.5 (median = 201, sd = 22.8, min = 201, max = 251). The explicit metacognition task (see below) was presented in one single block at the end of the experiment for a total duration of 15 min, resulting in a mean number of trials of 113.3 (median = 117.5; sd = 17.1; min = 79; max = 142). Data from all tasks are available at https://doi.org/10.17605/OSF.IO/2TAC4.

Calibration phase

Difficulty of RDM stimuli (i.e. motion coherence) was adjusted so as to evoke the same performance level across participants. This allowed to estimate inter-individual differences in metacognitive sensitivities independently of differences in low-level visual perception, which otherwise typically contaminate metacognitive measures20,31. For that aim, prior to the experiment, we estimated, for each participant, psychometric curves (inverse logit function) relating motion coherence levels to orientation judgment accuracy in a set of 120 RDM stimuli. During calibration, coherence levels were chosen online so as to maximally improve the model evidence of the psychometric function thanks to an adaptive variational Bayesian design optimization algorithm55. We discarded 9 participants from the analyses, for which either (i) the slope of the psychometric curve derived from the calibration task was more than 2 SE below group's mean slope, or (ii) the stimulus coherence level at the inflexion point of the psychometric curve was more than 2 SE above group average. Our final sample consisted of 28 participants aged 19 to 29 years old (mean = 23.6, sd = 3.0).

Checking task

Participants were asked to judge the orientation of a RDM stimulus they could replay an infinite number of times. Difficulty of the first RDM stimulus was drawn at random, and kept constant for the replayed RDM stimuli (if any). Dots’ spatial locations varied between successive presentations of the RDM stimulus.

Implicit metacognitive monitoring task

Participants were asked to judge the orientation of two RDM stimuli presented in succession, then asked to choose the stimulus for which they felt the most likely to be correct (see Fig. 2c)25,30,34. Difficulty of the two RDM stimuli were varied independently. Again, using participant-specific psychometric curves estimated during the calibration task, we chose coherence levels such that participants' performance ranged from 50 to 100% correct.

Explicit metacognitive monitoring task

Participants were asked to judge the orientation of a RDM stimulus as well as a to report a post-decisional confidence judgment20,31 using a discrete scale ranging from 1 (guess) to 6 (certainty; see Fig. 2a). Prior to the experiment, participants were instructed to use the entire length of the confidence scale. They used arrows keys to move a square indicating the selected confidence level; until they reached their desired confidence level, at which point they pressed the space bar to confirm their rating. The cursor starting point was set to a random position between 1 and 6. Using participant-specific psychometric curves estimated during the calibration task, we chose coherence levels such that participants' performance ranged from 50 to 100% correct.

Measures of metacognitive sensitivity

Implicit metacognition

We used the methodology previously implemented by de Gardelle and colleagues25,30. We fitted two psychometric curves (inverse probit function) describing the proportion of correct responses as a function of RDM difficulty separately for chosen versus discarded stimuli. We then quantified the so-called confidence modulation index (CMI) as the difference in sensitivity (i.e. slope) between the two psychometric curves, divided by the average sensitivity across of the two psychometric curves25,30,56.

Explicit metacognition

We used the M-ratio31,33,35, a well-established measure of metacognitive efficiency which maps the sensitivity of participants' confidence judgments about their performance to their actual objective performance, independently of task difficulty.

Questionnaires

Participants answered the ‘general cognitive mistrust’ and ‘obsessive–compulsive symptoms’ questionnaires.

Cognitive mistrust

The level of cognitive mistrust was assessed with the French version of the MetaCognition Questionnaires28. Participants’ mean cognitive mistrust subscale score (mean = 20.5, sd = 6.3) was similar to MCQ first validation study scores in a non-clinical population (mean = 17.9, sd = 6.0 for male, mean = 17.8, sd = 5.4 for female)37, as well as the MCQ French validation study scores (mean = 19.0, sd = 6.3 for male, mean = 18.4, sd = 5.8 for female) in a non-clinical population28.

Obsessive Compulsive Symptoms (OCS)

OCS were assessed using the Padua Inventory36, a 60-items Likert scale which measures obsessions and rituals (motor or mental). Participants’ mean Padua inventory score (mean = 57.6, sd = 28.8) was similar to Sanavio et al.’s study with non-clinical participants’ (mean = 53.6, sd = 27.7 for male, mean = 62.5, sd = 29.2 for female)36.

Statistical models

Models were implemented using the lme457 package in R version 3.4.158. Regression coefficients (b) and their statistics (z-scores and P-values), calculated using the Wald method57, as well as bootstrapped 95% confidence intervals, are reported (see Table S1 & S2). All predictors were standardized. For each regressor in the models, we checked for multicollinearity using the variance inflation factor (VIF; Shieh and Fouladi59). A VIF smaller than 3 indicates that collinearity is not an issue60.

Model #1 (1). The model relates trial-by-trial fluctuations of the number of checks to trial-by-trial changes in RDM difficulty, as well as trait-level predictors including implicit and explicit metacognitive sensitivities (i.e. CMI and M-ratio, respectively), questionnaires (i.e. Padua Inventory scale and MCQ’s cognitive mistrust subscale) as well as interactions between these predictors and trial-level stimulus difficulty. Practically, the model is a Poisson generalized linear mixed model (Nobs = 6119; Nsubj = 28) with participants as random intercept, and an observation level random effect used to account for the overdispersion of the dependent variable, as recommended by Harrison61. We computed VIF for all regressors and found an average VIF of 1.2 (sd = 0.1, median = 1.1, max = 1.4), demonstrating collinearity was not an issue in our model. This model allowed us to test hypothesis H1, H3, and H4.

Model #2 (2). The model relates trial-by-trial fluctuations in the accuracy of orientation judgments to trial-by-trial fluctuations in number of checks, RDM difficulty as well as trait-level predictors including implicit and explicit metacognitive sensitivities (i.e. CMI and M-ratio, respectively), Padua Inventory and MCQ’s cognitive mistrust subscale, as well as interactions between these predictors and trial-level number of checks. Practically, the model is a logistic linear mixed model (Nobs = 6119; Nsubj = 28) with participants as random intercept. Again, we found an average VIF of 1.13 (sd = 0.7, median = 1.1, max = 2.8), demonstrating collinearity was not an issue in our model. This model allowed us to test hypothesis H2.

Open data

All data from this study have been made publicly available via the Open Science Framework and can be accessed at https://doi.org/10.17605/OSF.IO/2TAC4.