Introduction

Perseverative cognition, such as excessive worry and rumination, has been recognized as a central pathogenic process in psychopathology (Harvey et al., 2004) and somatic illness (Brosschot et al., 2006). According to the perseverative cognition hypothesis (Brosschot et al., 2006), perseverative cognition prolongs the psychophysiological stress response, by prolonging the mental representation of stressful events beyond their actual presence. This prolonged stress response has been suggested to lead to somatic as well as psychological stress related pathology, including health complaints (Brosschot & Van Der Doef, 2006), cardiovascular disease (Kubzansky et al., 1997) and mood and anxiety disorders. A recent meta-analysis showed that perseverative cognition was related to a range of physiological concomitants including increased heart rate, blood pressure, cortisol, and lower heart rate variability (Ottaviani et al., 2016).

Despite the mounting evidence supporting this hypothesis, more studies are needed that precisely investigate how perseverative cognition contributes to psychopathology and somatic illness. For this purpose there are several questionnaires available that measure worry and the related construct of rumination, all of which show promising psychometric properties. These questionnaires include the Penn State Worry Questionnaire (PSWQ), assessing the pathological aspects of worrying, such as its uncontrollability and the Worry Domains Questionnaire (WDQ) which measures worry content (see Verkuil et al., 2007; Davey, 1993). To measure rumination, the Ruminative Response Scale (RRS) is often used (Treynor et al., 2003; Bagby et al., 2004). However, there might be much more evidence available in existing datasets. The Trait Anxiety subscale of the State Trait Anxiety Inventory (STAI-T) is one of the most frequently used instruments in studies of the effects of stress and anxiety on health outcomes (see for example Fisher and Durham (1999) for a short overview of studies using the STAI-T in evaluating effectiveness of GAD treatments).

The STAI-T contains five items that also seem to measure worry, especially its pathological aspects, i.e., worry perseveration (‘I worry too much over something that really doesn’t matter’; ‘I have disturbing thoughts’; ‘Some unimportant thought runs through my head and that bothers me’; ‘I take disappointments so keenly that I can’t put them out of my mind’ and ‘I get in a state of tension or turmoil if I think over my recent concerns and interests’). If these items can be used to reliably measure worry, the aforementioned datasets are theoretically open to explore relationships of worry with several health outcomes. It is not yet clear whether these five items form a scale that reliably predicts worry, although in a previous study it has been suggested that these items measure aspects of anxiety whereas the other remaining items of the STAI-T seem to tap into general negative affect or even depressive affect (Bieling et al., 1998). Most studies that have used the STAI-T have typically looked at anxiety as one construct, instead of investigating the independent effects of perseverative cognition (worry) and affect (anxiety) on health outcomes. Anxious apprehension (worry) and anxious arousal are considered separate constructs (Sharp et al., 2015), and worry, especially its perseveration is believed to be the central pathogenic element of anxiety and several other psychopathologies (Harvey et al., 2004; Davey & Levy, 1998; Borkovec et al., 1998). Moreover, in the somatic pathology domain, worry seems to have adverse effects on cardiac functioning independent of negative affect (Brosschot et al., 2007; Pieper et al., 2007; Ottaviani et al., 2016).

Previously, we provided evidence that of the PSWQ, the WDQ and the STAI-T, the PSWQ is the best predictor of worry in daily life. In the first study, using the same data plus additional subjects from subsequent waves of the study, we more specifically investigate the psychometric quality of the five worry items of the STAI-T, termed Brief Worry Scale (BWS), and whether the BWS can be used to measure trait worry (as measured by the PSWQ), as well as worry in daily life (as measured using ecological momentary assessments). Furthermore, the PSWQ was found to explain only 24% of the variance of daily worry. Therefore, a second goal was to examine whether such a brief, STAI-based questionnaire might in fact explain additional variance in daily worry behavior.

In the second study, we extend this further by examining if the five worry items of the STAI-T, taken together, are related to other measures assessing the same construct, and if it is unrelated to measures evaluating constructs that should have no relationship. Specifically, we expected that the BWS would be highly related to measures of pathological worry (PSWQ and the Generalized Anxiety Disorder Questionnaire for DSM-IV (GADQ-IV; Newman et al., 2002), but not correlated, or much weaker, to measures merely tapping into anxious-arousal and depressive affect. As the GADQ-IV can be used as a screening device to determine whether GAD diagnosis is likely present (Newman et al., 2002), we tested if higher scores on the BWS were associated with an increased likelihood of a GAD diagnosis. Additionally, the second study was completed across three time points, where the BWS measure was used across each time point, allowing us to examine the test-retest reliability of the BWS. Because the GADQ-IV was also collected across these three time points, we were also able to test the ability of the BWS to predict changes in GAD symptoms. Thus, the second study aimed to establish the convergent and divergent validity of the BWS, as well as the test-retest reliability.

In a third study, we tested if the BWS is associated with the uncontrollability of worry when this worrying is induced in an experimental setting. If the BWS indeed taps into pathological aspects of worry, it should be predictive of its uncontrollability, a cardinal feature of worry in GAD patients (Craske et al., 1989).

Given sufficient psychometric qualities established by the three studies, a brief worry scale would be available for many researchers, and a huge number of existing datasets might potentially be reanalyzed to examine the psychopathological and pathophysiological effects of worry.

Study 1

Participants

As part of a first year course in psychology, 1030 students were asked to complete the trait questionnaires and keep a log of their worries (see below). However, after the six-day registration period, 24.8% of the sample failed to return their worry logs. Complete data were available for 683 participants. We previously reported on the first 432 of the subjects in a different study. The mean age of the sample was 21.73 (SD = 5.83) and 76.7% of the sample was female. Participants received course credit for their participation.

Questionnaires

State-Trait Anxiety Inventory-Trait Form (STAI-T; Dutch version: Defares, Van der Ploeg, & Spielberger, 1980). To measure trait anxiety we administered the trait version of the State-Trait Anxiety Inventory. It consists of 20 self-report items that are rated on a Likert scale ranging from 1 (almost never) to 4 (almost always). Earlier use has shown good internal consistency and validity (van der Ploeg et al., 1980). For the purpose of this study we divided the STAI-T into two subscales: the Brief Worry Scale (BWS; by summing up the scores on items 9, 11, 17, 18 and 20; see Table 2) and the remaining negative affectivity items (NA).

Penn State Worry Questionnaire (PSWQ; Meyer et al., 1990; Dutch Translation: Van Rijsoort, Emmelkamp, & Vervaeke, 1999). This questionnaire consists of 16 self-report items. Items are directed at the excessiveness, duration and uncontrollability of worry as experienced in clients diagnosed with GAD, for example: “Once I start worrying, I can’t stop”. Each item is rated on a 1–5 scale, with anchors at 1 (Not at all typical), 3 (Somewhat typical) and 5 (Very typical). The total sum score on the PSWQ has demonstrated high reliability as well as high temporal stability and substantial validity in the assessment of trait worry (Meyer et al., 1990; van Rijsoort et al., 1999).

Worry Log. The worry log is a one-page A4 form validated in previous studies by Brosschot and Van Der Doef (2006), Verkuil et al. (2007) and Versluis et al. (2016). On this form an adapted version of Borkovec et al.’s (1983) working definition of worry is given. In the present study, all participants were instructed to register their worries during six days by tallying each worry episode. More specifically, they were instructed to register a worry episode whenever they noticed that they were worrying, or immediately after they had been worrying. At the end of each day they were asked to estimate the total number of worry episodes (daily worry frequency) and the total duration of these episodes (in minutes), based on their tallies. Each morning, they were requested to estimate the frequency and total duration of any nightly worry episodes (worry frequency and duration in the night-time). To obtain measures of worry perseveration, we decided to regress worry duration (total and during the nighttime) on worry frequency (total and during the nighttime) and to use the residuals of these models as indices of total worry perseveration and worry perseveration during the nighttime. The residuals from these analyses reflected the variance in worry duration that is not simply accounted for by frequently occurring worry episodes - frequent but short worry episodes might reflect successful problem solving - but more purely by their perseveration. Subsequently, these worry perseveration indices were regressed on the trait questionnaires.

Statistical Analyses

Data were screened for normality and outliers. The distributions of the worry duration and worry frequency variables were positively skewed and were transformed using square root transformations.

Psychometric aspects of the BWS were analyzed using reliability analysis (to test internal consistency) and a confirmatory factor analysis (CFA; to test for the proposed unidimensional factor structure of the BWS). To correct for deviations of multivariate normality of the item-data (which are scores on Likert scales), we used diagonally weighted least squares (WLSMV) to estimate the parameters of the CFA model, which is specifically designed for ordinal data. The comparative fit index (CFI), Tucker–Lewis index (TLI), the root mean square error of approximation (RMSEA), and the standardized root mean squared residual index (SRMR) were used to assess model fit. The cutoffs for good model fit were CFI and TFI values of 0.95 or greater (Hu & Bentler, 1998), RMSEA and SRMR values of 0.08 or lower (Browne & Cudeck, 1992). The predictability of worry duration, worry frequency and worry perseveration by trait questionnaires was analyzed with Pearson correlations and linear regression analyses. To test if the association between the BWS and the PSWQ was stronger than the association between the BWS and the NA items of the STAI-T, Steiger’s z-tests were used (Steiger, 1980). CFA analyses were conducted using R studio, version 1.3.1093 (RStudio Team, 2020), using the lavaan package (Rosseel, 2012). The remaining analyses were done using SPSS.

Results Study 1

Descriptive Statistics

Means and standard deviations of the main variables are presented in Table 1. Missing values resulted in the exclusion of some participants from some analyses; listwise exclusion of participants with missing values led to a sample size of 683 – results remained similar when using this dataset.

Table 1 Descriptive Statistics of the three samples

The mean scores on the PSWQ (M = 44.11, SD = 12.76) and the STAI-T (M = 37.31, SD = 8.95) are within the normal range for healthy subjects and are comparable with mean scores found in other studies (Brosschot & Van Der Doef, 2006; Tallis et al., 1994). On average, participants worried 144 min, which means 24 min on average per day, which is comparable to previous studies using momentary assessments of worry (Verkuil et al., 2007; Brosschot & Van Der Doef, 2006). The participants that did not return their worry logs did not significantly differ on the PSWQ and the STAI-T from those who did return their logs.

Psychometric Aspects of the BWS

The mean score on the BWS was 9.15 (SD = 3.00). The mean score on the remaining NA items of the STAI-T was 28.16 (SD = 6.59). To examine the internal consistency of the two subscales of the STAI-T, two reliability analyses were conducted. Results showed that Cronbach’s α of the BWS was .83, suggesting a satisfactory internal reliability. Removal of any of the five items led to decreases in reliability (alpha; see Table 2). In addition, Cronbach’s α of the NA items was .88.

Table 2 Reliability analysis and factor loadings of the Brief Worry Scale

To examine the structural validity of the BWS a CFA (using WLSMV to estimate the parameters) was conducted. A good fit was observed for the model where all items loaded on one latent factor (for factor loadings see Table 2; CFI = 0.999, TLI = 0.998, RMSEA = 0.016 [90% CI: 0.00–0.057], SRMR = 0.010).

Association of the BWS with Trait Worry and Daily Worry

Table 3 shows the correlations between the trait questionnaires. The BWS was significantly associated with the PSWQ (r(681) = .795, p < .001) and, to a lesser extent, also with the negative affectivity items of the STAI-T (r(681) = .698, p < .001). The negative affectivity items were also associated with the PSWQ (r(681) = .682, p < .001). When controlling for negative affectivity, the partial correlation between the BWS and the PSWQ was still substantial and significant (r(680) = .608, p < .001). To test whether the BWS was more strongly associated with the PSWQ than with the NA items, Steiger’s z tests were conducted (Steiger, 1980). The results indicated that the association between the BWS and the PSWQ was significantly stronger than the association between the BWS and the NA items (z = 5.40, p < .001). The difference between the association between NA and the BWS and the association between the NA items and the PSWQ was not statistically significant (z = .94, p = .34).

Table 3 Pearson correlations of trait questionnaires and worry in daily life

Next, we examined the predictive validity of the BWS by examining the associations with worry in daily life. Correlations between the BWS, PSWQ and the NA-items with worry frequency and worry duration in daily life are presented in Table 4. Results show that worry duration was associated with the PSWQ, the BWS and the NA-items respectively, whereas worry frequency was associated with the PSWQ, the NA-items, and the BWS respectively.

Table 4 Hierarchical regression analysis of daily worry

Linear forced entry regression analyses were conducted to examine whether the BWS predicted total worry duration in daily life, independent of negative affectivity and the PSWQ. We therefore entered NA and the BWS in the first step, followed by the PSWQ in the second step. Results are presented in Table 3. Total worry duration was predicted by the PSWQ (β = .24, p < .001), the BWS (β = .16, p = .008), and by NA (β = .13, p = .006). With respect to total worry frequency, the predictive strength of the BWS seemed to be accounted for by the PSWQ, as the BWS only predicted worry frequency in the first step (β = .17, p < .001), but not when the PSWQ was entered into the model. In the final model total worry frequency was predicted by the PSWQ (β = .20, p = .001) and NA (β = .19, p < .001), and not by the BWS (β = .05, p = .42).

To examine if the BWS predicted the worry perseveration indices, linear regression analyses were conducted. Results showed that total worry perseveration was best predicted by the BWS (β = .17, p = .009), the PSWQ (β = .13, p = .036), but not by the NA-items (β = .01, ns; see also Table 5). Furthermore, worry perseveration in the night was best predicted by the BWS (β = .14, p = .036), but not by the PSWQ (β = .10, ns) and the NA-items (β = −.02, ns).

Table 5 Hierarchical regression analysis of worry perseveration

Study 2

In a second study we aimed to extend these finding by further examining the convergent and divergent validity of the BWS. Specifically, we expected that the BWS would be highly related to measures of pathological worry (PSWQ and the Generalized Anxiety Disorder Questionnaire for DSM-IV (GADQ-IV; Newman et al., 2002), but not correlated, or much weaker, to measures merely tapping into anxious-arousal and depressive affect.

Participants

Participants were 143 undergraduate students at a large Mid-western university, ages 18–40 years old (M = 19.16, SD = 2.23). Data were collected across three sessions, at two 3-week (M = 20.15 days, SD = 3.71) intervals and one 6-week (M = 40.21 days, SD = 3.13) interval. Complete data were available for 120 participants. Participants were taking part in a larger study on the risk factors associated with depression. To increase the likelihood that the sample would include participants who were at elevated risk for depressive symptoms, potential participants were screened using the Persistence and Low Distraction subscale of the Effortful Control Scale (ECS-PLD; Lonigan, 1998) and the trait version of the Positive and Negative Affect Schedule (T-PANAS; Watson et al., 1988) and recruited based on their scores (see Vasey et al., 2014). Specifically, all individuals in the prescreening pool who scored above the median or in the lower quartile on negative affect (i.e., those high or low on negative affect) and in the upper or lower quartiles on ECS-PLD (i.e., those high or low in effortful control) and positive affect (i.e., those high or low in positive affect) were invited to participate along with a random subset of 10 % of the remainder of the sample. Thus, this sample varied widely in risk for depressive symptoms. Participants came to the laboratory near the beginning, middle, and end of the quarter and received course credit for their participation.

Questionnaires

State-Trait Anxiety Inventory-Trait Form (STAI-T; Spielberger et al., 1983)

To measure anxiety, a subscale of the STAI-T was utilized–the Brief Worry Scale (BWS; items 9, 11, 17, 18 and 20; see Table 2). The remaining items made up the negative affectivity subscale (STAI-NA). All items were rated by the participants on a Likert scale ranging from 1 (almost never) to 4 (almost always).

The Generalized Anxiety Disorder Questionnaire for DSM-IV (GADQ-IV; Newman et al., 2002)

The GADQ-IV is comprised of 9 items, with five yes/no questions assessing the occurrence of excessive, uncontrollable worry (e.g., “Do you find it difficult to control your worry (or stop worrying) once it starts?”), the number of endorsed worry themes and physical symptoms (e.g., muscle tension, irritability), and two questions assessing the amount of interference and distress caused by worry and its symptoms, scored on a 0 (none) to 8 (very severe) scale. The GADQ-IV has demonstrated good sensitivity (83%) and specificity (89%) for structure interview-based diagnosis of GAD, adequate test-retest reliability (κ = .67) as well as both convergent and discriminant validity (Newman et al., 2002). There are several ways to score the GADQ-IV. As described by Newman et al. (2002), items 7–9 are not scored for participants not endorsing the 6-month duration criterion (i.e., item 6). A cut-off score of 5.7 on this scale has been suggested to achieve optimal sensitivity (83%) and specificity (89%) when screening for the presence and absence of diagnosable GAD in student samples (Newman et al., 2002).

The Depression, Anxiety, and Stress Scale (DASS; Lovibond & Lovibond, 1995)

The DASS is a 42-item self-report questionnaire designed to measure the three related negative emotional states of depression, anxiety and stress. Each of the three DASS scales contains 14 items, rated on a 0–3 Likert scale based on how much or how often they applied during the past week (never - almost always). The Depression scale (DASS-D) assesses dysphoria, hopelessness, devaluation of life, self-deprecation, lack of interest/involvement, anhedonia, and inertia. The Anxiety scale (DASS-A) measures primarily symptoms of autonomic hyperarousal (i.e., anxious arousal). The Stress scale (DASS-S) captures difficulty relaxing, nervous arousal, irritability, and impatience. Scores for the Depression, Anxiety and Stress subscales are calculated by summing the scores for the relevant items. This scale has been shown to possess adequate psychometric properties (Lovibond & Lovibond, 1995). In contrast to the DASS-A and DASS-D, DASS-S scores are significantly elevated in GAD patients (Brown, Chorpita, Korotitsch, & Barlow, 1997).

Penn State Worry Questionnaire (PSWQ; Meyer et al., 1990)

This questionnaire consists of 16 self-report items, assessing the tendency to worry excessively and uncontrollably. Each item is rated on a 1–5 scale, with anchors at 1 (not at all typical), 3 (somewhat typical) and 5 (very typical). The PSWQ has demonstrated high reliability as well as high temporal stability and substantial validity in the assessment of trait worry (Meyer et al., 1990).

Statistical Analyses

To examine the internal consistency of the two subscales of the STAI-T, reliability analyses were conducted. To assess the test-retest reliability of the BWS across the three time points, Pearson correlations were computed. To test the convergent and divergent validity of the BWS, partial correlations were computed. To assess convergent validity, partial correlations were conducted to investigate the relationship between the BWS and the GADQ-IV, PSWQ, and DASS-S, controlling for the STAI-NA. To investigate divergent validity partial correlations between the BWS and the DASS-A and DASS-D, when controlling for the STAI-NA were computed. (Logistic) regression analyses were performed to test if the BWS was associated with changes in GAD symptoms across time and with diagnostic status. All analyses were conducted using SPSS.

Results Study 2

Descriptive Statistics

At T1, the mean scores on the PSWQ (M = 47.78, SD = 16.84) and the STAI-T (M = 42.48, SD = 12.07) were within the normal range for healthy subjects. However, consistent with a sampling strategy emphasizing risk for depressive symptoms, independent t-tests showed that these scores were significantly higher than in study 1 (PSWQ: t(801) = 2.75, p = .006; STAI-T: t(801) = 5.50, p < .001).

Reliability of the BWS

To examine the internal consistency of the two subscales of the STAI-T, reliability analyses were conducted for each time point. Results indicated a Cronbach’s α for the BWS of .84 at the first time point, .88 at the second time point, and .87 at the third time point. These findings suggest a satisfactory internal reliability.

To examine the test-retest reliability of the BWS assessed at the three data collection sessions across two 3-week intervals and one 6-week interval, correlations were computed. Results indicated strong test-retest reliability, with significant correlations between the BWS at session 1 and session 2 (r(118) = .830, p < .001); session 2 and session 3 (r(118) = .815, p < .001); and session 1 and session 3 (r(118) = .784, p < .001).

Convergent and Divergent Validity

To test the convergent and divergent validity of the BWS, zero-order and partial correlations were conducted. These correlations are presented in Table 6. In examining convergent validity, the relationship between the BWS T1 and the GADQ-IV (T1 and T3), PSWQ T1, and DASS-S T1 were assessed, controlling for STAI-NA T1. The results suggest that the BWS is uniquely and significantly positively correlated with the GADQ-IV, PSWQ, and DASS-S, indicating convergent validity. In assessing divergent validity, the association between the BWS and the DASS-D and DASS-A were examined, controlling for STAI-NA. As expected, the correlations between the BWS, DASS-D and DASS-A are weaker, with the BWS and DASS-D being uncorrelated, when controlling for trait NA. In contrast, the partial correlation between DASS-D and STAI-NA was r(117) = .602, p < .001, when controlling for the BWS.

Table 6 Zero-order and partial correlations of questionnaires and the BWS, controlling for STAI-NA

BWS and GAD Symptoms

To examine if the BWS was associated with changes in GAD symptoms, a regression analysis was conducted. This showed that the BWS at T1 significantly predicted GADQ-IV at T3 (β = .263, p = .006) above and beyond GADQ-IV at T1 (β = .598, p < .001). In contrast, the NA score did not (β = −.021, p = .807).

Having demonstrated a strong association between the BWS and the severity of GAD symptoms, we additionally tested if the BWS was associated with the likelihood of a GAD diagnosis. This was done to illustrate the potential clinical significance of the findings (Rutledge & Loh, 2004). Using the cut-off score of 5.7 on the GADQ-IV showed that in 25% (n = 30) of the participants a diagnosis of GAD was likely present. GADQ-IV based diagnostic status was used as dependent variable in a logistic regression, with the BWS as predictor. The results showed that every 1-point increase on the BWS was associated with a 76% increase in risk for receiving a GAD diagnosis (OR = 1.762, 95% CI 1.436–2.160, p < .001). Compared to the odds of participants scoring below the median of 11 on the BWS, the odds of a GAD diagnosis was 27 times higher in participants scoring above the median (OR = 27.818, 95% CI 7.688–100.657, p < .001). When looking at the likelihood of a GAD diagnosis at T3, a 1-point increase in the BWS at T1 increases odds of a diagnosis at T3 by 33% (OR = 1.331, 95% CI 1.111–1.594, p = .002) above and beyond what was predicted by diagnosis at T1 (OR = 7.028, 95% CI 1.993–24.776, p = .002).

Study 3

If the BWS indeed taps into pathological aspects of worry, it should be predictive of its uncontrollability, a cardinal feature of worry in GAD patients (Craske et al., 1989). In a third study, we therefore tested if the BWS is associated with the uncontrollability of worry when this worrying is induced in an experimental setting.

Participants

The sample consisted of 18 male and 35 female students, aged 17–50 (M = 24.4) that we have previously reported on (Verkuil et al., 2009). The main aim of the study was to compare cardiac responses to induced worry versus problem solving. Here, to further test the validity of the BWS, we tested the associations between the BWS, STAI-NA, PSWQ and the uncontrollability of worry experienced during the induced worry bout. Participants were recruited by advertisement. After the experimental conditions, the participants completed questionnaires and were paid 6 euros or received course credits.

Procedure

After giving informed consent, all participants took part in three experimental tasks: a worry induction, a cognitive problem solving task and a relaxation induction (we report only on the worry induction task). The experimental conditions were presented in counterbalanced order. Each condition lasted 10 min. During the whole experiment cardiac activity was recorded. Participants also completed the PSWQ and the STAI-T (as in the previous studies).

Worry Induction

Following the work of Borkovec and others (Lyonfields et al., 1995; Thayer et al., 1996), participants were asked to write down three personal worry topics, before receiving further instructions. To minimize participant’s social evaluative concerns about writing down a personal worry topic, they were notified that they could take home or destroy the paper on which they wrote their worry topic. Thereafter, participants were asked ‘to worry as you usually do’ for ten minutes.

(Un)Controllability of Induced Worry

After the worry induction, participants were asked to rate the extent to which they felt their worrisome thoughts were controllable, on a scale from 0 (not at all) to 10 (very much).

Statistical Analyses

Associations between the BWS, STAI-NA, PSWQ and the uncontrollability of worry were analyzed with Pearson correlations and linear regression analyses. For all analyses, SPSS was used.

Results Study 3

Descriptive Statistics

The mean scores on the PSWQ (M = 47.30, SD = 13.77) and the STAI-T (M = 40.65, SD = 10.85) are within the normal range for healthy subjects. The mean uncontrollability of the worrisome thoughts was 5.07 (SD = 1.91, range: 2–10).

Reliability of the BWS

The mean score on BWS was 9.73 (SD = 3.30). The mean score on the remaining NA items of the STAI-T was 30.90 (SD = 8.06). The reliability analyses indicated a Cronbach’s α for the BWS of .85, suggesting a satisfactory internal reliability.

BWS and Controllability of Worrying

Pearson correlations showed that controllability of induced worry was negatively associated with the BWS (r(51) = −.662, p < .001), the PSWQ (r(51) = −.645, p < .001) and the STAI-NA (r(50) = −.574, p < .001) respectively.

Linear forced entry regression analyses were conducted to examine whether the BWS predicted controllability of worry during the worry episode, independent of negative affectivity and the PSWQ. We therefore entered NA and the BWS in the first step, followed by the PSWQ in the second step. In step 1, worry intensity was predicted by the BWS (β = −.546, p < .01)), but not the STAI-NA (β = −.150, p > .37). In the second step, both the BWS (β = −.389, p < .05) and the PSWQ (β = −.337, p < .05) predicted controllability of worry, but the STAI-NA did not.

General Discussion

The results of these three studies make clear that five worry-related items of the STAI-T can be used to reliably measure trait worry and state worry in daily life, independent of negative affect. This implies that a huge number of existing datasets might be reanalyzed to examine how the different elements of anxiety (worry and negative affect) affect psychological and somatic illness. We also observed that the BWS was the best predictor of the worry perseveration index, that is, worry duration in daily life corrected for the total number of worry episodes, and it did so even better than the most often used measure of pathological worry, the PSWQ. Moreover, the BWS was the only one of the trait measures that predicted nightly worry. Worry perseveration – the prolongation of worry episodes regardless of its frequency – and nocturnal worry are perhaps the most pathological characteristics of worry, with respect to psychological as well as somatic health (Borkovec et al., 1983; Brosschot et al., 2006). The BWS therefore seems particularly suited to measure the pathological aspects of worry. However, future studies are warranted to replicate this finding, as our measure of worry perseveration is not yet a standardized measure.

In this first study, we also found that the BWS predicted the frequency of worry episodes, although it did so to a lesser extent than the NA items. Moreover, after taking the PSWQ into account, the association between the BWS and worry frequency was not significant. It is perhaps not surprising that the PSWQ, consisting of 16 items, is sensitive to a broader array of features of worry – including frequency - than the shorter BWS. However, as argued above, frequency of worry seems less important as a pathogenic factor. This is also consistent with a study by Thielsch et al. (2015) who found, in a sample of patients suffering from GAD, that momentarily assessed uncontrollability ratings of worry (i.e., pathogenic negative metacognitions about worry) did not predict subsequent worry frequency but did predict worry duration. Frequent but short worry episodes might reflect successful problem solving and seems less detrimental to health, while duration of worry reflects difficulty in stopping worry and involves prolonged stress responses (Brosschot & van der Doef, 2006; Brosschot et al., 2007).

In addition, in the second study we observed a pattern of associations that was consistent with convergent validity. The BWS positively correlated with the GADQ-IV and the PSWQ. These two questionnaires are established measures of pathological worry. The BWS also predicted GADQ-IV scores after 6-weeks above and beyond GADQ-IV at baseline (which the NA items did not). The GADQ-IV is a reliable questionnaire measuring symptoms of GAD, including items on its excessiveness, uncontrollability, and the amount of impairment due to the symptoms. Yet, not every participant scoring above the clinical cut-off of 5.7 will endorse all symptoms of GAD, and a GADQ-IV-based analog diagnosis is probably over-inclusive. That is, even in a sample stratified for levels of depressive symptoms, a 25% base rate of GAD in this sample is considerably higher than would be expected based on base rates in the general population (Somers et al., 2006). Still, the finding that with every 1-point increase in BWS scores there was an associated 76% increased risk for a current GAD diagnosis, and a 33% increased risk for a diagnosis 6 weeks later, over and above what is predicted by current diagnostic status, expresses the potential clinical significance of the BWS.

Out of the three DASS subscales, the BWS most strongly related to the DASS-S subscale, which has previously been shown to be the DASS subscale most closely related to measures of worry and GAD (Brown et al., 1997). The BWS was less strongly related to the DASS-A subscale. The DASS-A subscale primarily measures symptoms of anxious arousal (trembling, difficulty breathing; although one item is focused on worry about having a panic attack). The association between worry and anxious arousal should be modest on average (see Vasey et al., 2017).Footnote 1 In addition, when controlling for negative affect, the association between the BWS and the DASS Depression subscale was not significant anymore. This is indicative of divergent validity, as we would expect a measure of depression to be less related to an index of worry when negative affectivity is partialled out. Taken together these findings demonstrate a pattern of correlations that provide support for the BWS, and its convergent and divergent validity.

With regard to the third study, several studies have demonstrated that people suffering from GAD experience their worries as more uncontrollable than healthy controls (Craske et al., 1989; Hoyer et al., 2001). In the third study, we observed that the BWS predicted the (un)controllability of worrying, experienced during an experimental worry bout, whereas negative affect did not. Even though this was a relatively small non-clinical sample, these data also suggest that the BWS items tap into this pathological aspect of worry. What’s more, the BWS was an equally strong predictor of uncontrollability as the standard measure of pathological worry, the PSWQ, which provides further evidence for the validity of the BWS.

Limitations

There are several methodological limitations. First, all studies were conducted in non-clinical and largely female samples, and one could raise the question to what extent the BWS is useful in measuring pathological worry, as it is observed in clinical populations, especially in GAD patients. Although we did not screen for GAD diagnoses using clinical interviews, several studies suggest that student samples are suitable to investigate worry on the full severity range (Borkovec & Roemer, 1995; Roemer et al., 1995; Ruscio, 2002) and indeed all samples seemed to cover the full worry range. Moreover, Ruscio (2002) showed that although high levels of worry are a main characteristic of GAD, a large group of people who show high levels of worry do not receive a full GAD diagnosis. Still, further validation studies might be conducted that use clinical populations and try to elucidate whether the BWS indeed is a good predictor of pathological worry. Studies with clinical populations could also examine whether the combination of the BWS with the PSWQ leads to a more sensitive measurement of GAD worry and whether this also leads to a better screening instrument for the detection of GAD. Furthermore, although momentary assessments of worry are less prone to retrospective bias than trait questionnaires, the end of day measurements of worry duration that we used in the first study could still lead to overestimations in time spent worrying. This hypothesis could be addressed in future momentary assessment studies using time based random sampling, for example with handheld computers. Finally, the first study was formally restricted to worry, while we assumed that it related to more general perseverative cognition that is central in many psychopathologies. However, we did not include related measures of perseverative cognitions, such as rumination, that is, experiencing repetitive negative thoughts about one’s negative mood (Nolen-Hoeksema, 2000), or measures of intrusive / obsessional thoughts. On the other hand, rumination and intrusive thoughts are closely associated with worry, and may share the same underlying processes (Watkins et al., 2005; Langlois et al., 2000; Segerstrom et al., 2003).

Future Directions

Given that these findings provide preliminary evidence for the reliability and validity of the BWS, existing datasets might potentially be reanalyzed to examine the effects of worry on mental and somatic health. Since the BWS seems to be particularly associated with worry perseveration, the BWS might be a valuable instrument for studying the shared cognitive, neural and affective processes that constitute general perseverative thinking processes.

Recently, during the review process of this paper, a study by Zsido and colleagues was published, in which they set out to develop a short version of the STAI-T. They used a data-driven item-response theory approach to search for 5 items from the 13 non-reversed scored items of the STAI-T that could measure trait anxiety. Four of the final items that were selected (9, 17, 18 and 20) are also present in the BWS. The distinction in the proposed scales is made by a fifth item, which in the BWS is “I have disturbing thoughts” and in the brief STAI-T “I feel that difficulties are piling up so that I cannot overcome them”. These items pertain to different aspects of anxiety, with the former being more clearly directed towards the cognitive aspect of anxiety: worry. This suggests that the BWS might be a more consistent measure of worry, when compared to the brief STAI-T. This idea was backed up by an additional CFA on the brief STAI-T in the first sample, in which the fifth brief STAI-T item – loaded less on the latent factor (.62) than the fifth BWS item (.74). For researchers aiming to specifically measure worry, the BWS might therefore be preferred.

Conclusions

In short, we provided evidence that a short, easy to administer subscale of the STAI-T can be used to measure worry in daily life, especially worry uncontrollability, duration and worry perseveration, which are believed to be central processes in mental as well as somatic health. A questionnaire that reliably predicts perseverative thinking in daily life seems to be of great potential importance. We hope that the findings of the present studies will encourage researchers to inspect existing datasets that include the STAI-T to test the effects of worry perseveration on mental and somatic health outcomes.