Selective attention is a key mechanism for cognitive control. By focusing attention on relevant stimulus features and ignoring irrelevant features, the cognitive system has some control over what information is processed while pursuing a task goal. However, limitations on control are evident when task performance is influenced by irrelevant information. In various cognitive control tasks that involve selective attention, such as the flanker task (Eriksen & Eriksen, 1974) and the Stroop task (Stroop, 1935), performance is worse when relevant and irrelevant stimulus features are associated with different responses (incongruent) than with the same response (congruent). These congruency effects indicate that irrelevant features are often attended and processed to some extent, reflecting suboptimal cognitive control. By studying congruency effects and how they are modulated by different experimental variables, researchers have sought a better understanding of control via selective attention. In the present study, I investigate the generality of the modulation of congruency effects by alertness, which is important for understanding the relationship between alertness and cognitive control—two major components of the human attention system (Posner & Boies, 1971; Posner & Petersen, 1990).

This research is motivated by an unusual finding that originated with variants of the Attention Network Test (ANT; Fan, McCandliss, Sommer, Raz, & Posner, 2002). At its core, the ANT involves an arrow flanker task: Subjects make a spatially compatible response to classify the direction of a target arrow presented among distractor arrows. The target and distractor arrows point in different directions (incongruent) or in the same direction (congruent). The frequent finding of congruency effects indicates that subjects are unable to focus attention solely on the target, resulting in distractor processing that influences task performance. Different types of cues can be presented to manipulate alertness or orienting of attention in the ANT (Fan et al., 2002). Of particular relevance in the present context, alertness is manipulated by briefly presenting alerting cues (e.g., visual signals at the potential positions of the target arrow) shortly before the arrow stimuli.Footnote 1 Response times (RTs) are usually shorter on these alert trials than on no-alert trials, suggesting that the cues temporarily boosted alertness (Posner, 2008). Curiously, this overarching benefit of heightened alertness comes with a cost: Congruency effects are often larger on alert trials than on no-alert trials in variants of the ANT (e.g., Callejas, Lupiáñez, Funes, & Tudela, 2005; Fan et al., 2002; J. W. MacLeod et al., 2010; McConnell & Shore, 2011; Redick & Engle, 2006; Schneider, 2018a, 2018b, 2019b; Weinbach & Henik, 2012). This alerting–congruency interaction, which is the focus of the present study, suggests that increased alertness is associated with impaired selective attention. More generally, it indicates that different components of attention do not seem to operate independently in some situations, making it important to ascertain how they function together as parts of the attention system.

The alerting–congruency interaction implies that alertness is connected to cognitive control, but the nature of the connection is unclear. Several hypotheses have been proposed for the interaction (for an overview, see Seibold, 2018), some of which I discuss later, but none can account for all the available data (see Schneider, 2019b). A major challenge is to explain why the interaction occurs in some cognitive control tasks, but not in others. As noted earlier, the alerting–congruency interaction is routinely found in the arrow flanker task at the core of the ANT.Footnote 2 Three other cognitive control tasks in which alertness has been manipulated are the Simon, global/local, and color Stroop tasks.

The Simon task (Simon & Small, 1969; for a review, see Lu & Proctor, 1995) involves making a spatial response to classify a feature (e.g., color) of a stimulus that appears at a lateralized position (e.g., a blue square presented left of fixation). Stimulus position is irrelevant, but performance is worse when it mismatches the side of the response (incongruent) than when it matches (congruent). This congruency effect indicates that irrelevant position information is processed and influences task performance. I am aware of four experiments in which alertness was manipulated in the Simon task, and alerting–congruency interactions were found in all of them (Böckler, Alpay, & Stürmer, 2011; Fischer et al., 2010; R. M. Klein & Ivanoff, 2011; Soutschek, Müller, & Schubert, 2013). These results indicate that the interaction generalizes beyond the arrow flanker task.

The global/local task (Navon, 1977) involves making a manual response to classify either the global form of a stimulus or its local elements. For example, if the stimulus is a large arrow composed of multiple small arrows, then the global task is to classify the direction of the large arrow, and the local task is to classify the direction of the small arrows. For each task, the other dimension is irrelevant, but performance is worse when it mismatches the relevant dimension (incongruent) than when it matches (congruent). This congruency effect indicates that irrelevant dimension information (which is spatial information for arrow stimuli) is processed and influences task performance. Weinbach and Henik (2014) manipulated alertness in an experiment with an arrow version of the global/local task, and typical alerting–congruency interactions were found when the irrelevant dimension was salient (see also Weinbach & Henik, 2011), whereas reversed interactions were found when the relevant dimension was salient. These results indicate that the interaction generalizes in a conditional way to the global/local task.

The color Stroop task (Stroop, 1935; for a review, see C. M. MacLeod, 1991) involves making a vocal or manual response to classify the ink color of a color word (e.g., the word GREEN printed in red ink). Word meaning is irrelevant, but performance is worse when it mismatches the ink color (incongruent) than when it matches (congruent). This congruency effect indicates that irrelevant semantic information is processed and influences task performance. Schneider (2019a) manipulated alertness in eight experiments using variants of the color Stroop task, and the typical alerting–congruency interaction was not found in any of them, replicating and extending the findings of two previous experiments (Soutschek et al., 2013; Weinbach & Henik, 2012). These results indicate a potential constraint on the generality of the interaction.

Why does the alerting–congruency interaction occur in the arrow flanker task, the Simon task, and conditionally in the global/local task, but not in the color Stroop task? Recently, I suggested that there is “a spatial attention constraint on the relationship between alertness and cognitive control, part of which might involve having a task goal associated with spatial information processing” (Schneider, 2019a, p. 119). A distinction can be made between spatial information processing associated with stimuli and responses. Depending on the stimuli, perceptual classification involves spatial information processing (e.g., discriminating between line orientations) or it does not (e.g., discriminating between colors). Depending on the responses, selection of a motor program involves spatial information processing (e.g., choosing between left and right key-press responses) or it does not (e.g., choosing between vocal responses). I suggest that the alerting–congruency interaction might arise when spatial information processing is needed to accurately classify the relevant stimulus feature, prior to mapping that classification to a response.

From this perspective, it is potentially fruitful to consider the involvement of spatial attention and spatial information processing in the aforementioned cognitive control tasks. Spatial attention is needed in the arrow flanker task because the target is spatially separated from the distractors. Spatial information processing occurs because the task is to classify the direction of the target arrow. Spatial attention is needed in the Simon task because attention has to shift to the lateralized position of the stimulus in order to classify the relevant stimulus feature. Whether spatial information processing occurs for classification of the relevant feature in the Simon task depends on the stimuli.Footnote 3 It is unlikely that spatial information is needed for classification of colors (e.g., red vs. green lights; Craft & Simon, 1970). However, some spatial information processing occurs when classifying shapes. For example, in two of the experiments in which alertness was manipulated in the Simon task (Böckler et al., 2011; Fischer et al., 2010, Experiment 2), subjects classified the shape of the stimulus, which was a square or a diamond. Shape perception involves processing multiple spatial properties of stimuli (Palmer, 1999), and the only difference between a square and a diamond is a spatial transformation (rotation). Thus, even though shape classification (square vs. diamond) does not yield position information, it does involve spatial information processing. Spatial attention is needed in the global/local task because attention has to focus on either the global form of the stimulus or its local elements. Spatial information processing occurs in the arrow versions of the global/local task studied by Weinbach and Henik (2011, 2014) because the task is to classify the direction of either the large, global arrow or the small, local arrows.

In contrast, spatial attention and spatial information processing play little or no role in the color Stroop task. Subjects have to attend to the stimulus display to perform the task, but spatial attention does not have to be shifted (if the stimulus is always presented centrally) and it cannot be used to distinguish between relevant and irrelevant features (if they are spatially integrated; e.g., the word GREEN printed in red ink). Schneider (2019a) did not obtain typical alerting–congruency interactions in the color Stroop task even when the relevant and irrelevant features were spatially separated (e.g., a red rectangle flanked above and below by the word GREEN). Spatial information is not needed for classifying the relevant feature in the color Stroop task. Moreover, Schneider (2019a) did not obtain typical alerting–congruency interactions in color Stroop-like tasks in which the irrelevant feature conveyed spatial information (e.g., a red rectangle flanked above and below by the word LEFT), ruling out the notion that irrelevant spatial information processing is sufficient for producing the alerting–congruency interaction.

The preceding analysis seems consistent with the idea that spatial information processing of the relevant stimulus feature is an important determinant of whether the alerting–congruency interaction is obtained in a specific cognitive control task. Spatial information processing of the relevant feature definitely occurs in the arrow flanker task and in arrow versions of the global/local task, whereas it occurs to varying degrees in the Simon task. Alerting–congruency interactions have been obtained with all of those tasks. Spatial information processing of the relevant feature arguably does not occur in the color Stroop task, and alerting–congruency interactions have not been obtained with that task. However, if the Stroop task were modified such that spatial information was needed to achieve the task goal, then alerting–congruency interactions should emerge. This possibility can be explored by manipulating alertness in the spatial Stroop task.

There are many variants of the spatial Stroop task, which differs from the Simon task by having both the relevant and irrelevant stimulus features convey spatial information (Lu & Proctor, 1995). For example, Logan and Zbrodoff (1979) studied a spatial Stroop task in which subjects classified the meaning of a word (ABOVE or BELOW) that was positioned above or below fixation. Stimulus position was irrelevant, but performance was worse when it mismatched the word’s meaning (incongruent) than when it matched (congruent). Similar results have been obtained with classification of nonlinguistic stimuli, such as arrows (e.g., Clark & Brownell, 1975). The congruency effect in the spatial Stroop task is similar to that of the Simon task, in that irrelevant position information is the source of the effect. However, the spatial Stroop task involves the explicit goal of classifying spatial information conveyed by the relevant stimulus feature, whereas the Simon task does not. The congruency effect in the spatial Stroop task is analogous to that of the color Stroop task, except that the irrelevant semantic information influencing task performance is spatial rather than color information.

I am aware of only one experiment in which alertness was manipulated in the spatial Stroop task. Fischer et al. (2010, Experiment 1) studied a spatial Stroop task in which 26 subjects classified the direction of a left-pointing or right-pointing arrow that was positioned to the left or right of a central fixation sign.Footnote 4 The arrow’s direction and position either matched (congruent) or mismatched (incongruent). Trials were divided into prime–probe pairs, with an alerting cue (a tone) presented before the arrow on half of the prime trials. A main finding was that congruency effects on prime trials were larger with alerting cues than without them. This result represents initial evidence that the alerting–congruency interaction generalizes to the spatial Stroop task. However, it is unknown whether Fischer et al.’s finding can be replicated with a more typical experiment design that involves visual alerting cues and no prime–probe pairing of trials.

For the present study, I conducted a preregistered, multiexperiment investigation of alerting effects in variants of the spatial Stroop task.Footnote 5 In each experiment, subjects made a spatially compatible key-press response to classify the spatial meaning of a stimulus presented at an irrelevant position. Stimulus meaning and position either matched (congruent) or mismatched (incongruent) on each trial. Stimulus type (arrows or words) and spatial dimension (horizontal or vertical) were varied between experiments. Figure 1 shows examples of incongruent stimulus displays for all experiments, as well as the responses for each spatial dimension. I manipulated stimulus type and spatial dimension to assess the replicability and generality of the results across different task conditions, which is important in the context of concerns about the replicability of findings in psychology (e.g., R. A. Klein et al., 2018; Open Science Collaboration, 2015).

Fig. 1
figure 1

Top and middle rows: Example stimulus displays (drawn to scale) as a function of stimulus type (arrows or words) and spatial dimension (horizontal or vertical). All examples depict incongruent stimuli. Bottom row: Designated response keys (shaded in gray) on schematic displays of the response pad as a function of spatial dimension

In Experiment 1 (arrow stimuli; horizontal), subjects pressed a left or right key to classify the direction of a left-pointing or right-pointing arrow, positioned left or right of fixation (e.g., top-left display of Fig. 1), similar to the task in Fischer et al. (2010). In Experiment 2 (arrow stimuli; vertical), subjects pressed an up or down key to classify the direction of an up-pointing or down-pointing arrow, positioned above or below fixation (e.g., top-right display of Fig. 1). In Experiment 3 (word stimuli; horizontal), subjects pressed a left or right key to classify the meaning of the word LEFT or RIGHT, positioned left or right of fixation (e.g., middle-left display of Fig. 1). In Experiment 4 (word stimuli; vertical), subjects pressed an up or down key to classify the meaning of the word UP or DOWN, positioned above or below fixation (e.g., middle-right display of Fig. 1).

Besides congruency, each experiment also included an alerting manipulation. On no-alert trials, the stimulus was presented after a variable fixation interval, which is a standard feature of the ANT (Fan et al., 2002) and several previous studies on the alerting–congruency interaction (e.g., Fischer et al., 2010; Schneider, 2018a, 2018b, 2019a, 2019b; Seibold, 2018; Soutschek et al., 2013). On alert trials (see Fig. 2), visual alerting cues (squares) were briefly presented at the two potential stimulus positions, 500 ms before stimulus onset. These alerting cues are similar to the double cues used to manipulate alertness in the ANT (Fan et al., 2002), and the 500-ms alerting interval is a standard feature of the ANT and my previous studies on the alerting–congruency interaction (Schneider, 2018a, 2018b, 2019a, 2019b).

Fig. 2
figure 2

Example sequences of displays (not drawn to scale) on alert trials for each experiment. The sequences were identical on no-alert trials, except that only the central fixation cross appeared in the second display (i.e., the square alerting cues were absent)

The alerting manipulation enabled me to assess the presence or absence of an alerting–congruency interaction in each experiment. Based on my earlier analysis, if spatial information processing of the relevant stimulus feature is an important determinant of whether the interaction is obtained in a cognitive control task, then the interaction should be present in all variants of the spatial Stroop task investigated here. In each case, spatial information processing is needed to achieve the task goal, regardless of the spatial dimension (horizontal or vertical) and whether the relevant spatial information is conveyed by linguistic or nonlinguistic stimuli (words or arrows, respectively). The reliable occurrence of the alerting–congruency interaction in the spatial Stroop task would help to clarify its generality across tasks and inform theorizing about the relationship between alertness and cognitive control.

Method

Subjects

A total of 240 undergraduate students from Purdue University participated for course credit. There were 60 subjects per experiment (Experiment 1: 36 female, 53 right-handed, mean age = 19.3 years; Experiment 2: 30 female, 51 right-handed, mean age = 19.2 years; Experiment 3: 31 female, 54 right-handed, mean age = 19.6 years; Experiment 4: 33 female, 52 right-handed, mean age = 19.3 years). The preregistered sample size for each experiment was based on a power analysis (calculated using G*Power 3.1; Faul, Erdfelder, Lang, & Buchner, 2007) that indicated 59 subjects would provide 90% power to detect an alerting–congruency interaction with an effect size of d = 0.43, which is the overall effect size reported by Schneider (2019a) from a meta-analysis of nine experiments involving arrow flanker tasks. The sample size was rounded up to 60. Data were excluded from one additional subject (in Experiment 1) based on a preregistered exclusion criterion (mean RT exceeded four standard deviations of the group mean). All subjects reported having normal or corrected-to-normal vision.

Apparatus

The experiments were conducted using E-Prime 3 (Psychology Software Tools, Inc.) on computers that displayed stimuli on monitors at a viewing distance of approximately 50 cm. Responses were registered from Cedrus RB-540 response pads (Cedrus Corp.), which have five keys in a plus-sign spatial configuration (see bottom displays in Fig. 1; the circular central key was not used in any experiment).

Task, stimuli, and responses

Each experiment used a variant of the spatial Stroop task, which involved making a spatially compatible key-press response to classify the spatial meaning of a stimulus while ignoring its position. Congruency was defined by the match (congruent) or mismatch (incongruent) between the spatial meaning of the stimulus and its position. The set of spatial Stroop tasks used in the experiments reflected a factorial manipulation of stimulus type (arrows or words) and spatial dimension (horizontal or vertical). Figure 1 shows example stimulus displays as a function of those variables, as well as the designated response keys for each spatial dimension (left and right keys for the horizontal dimension are shaded gray in the bottom-left display; up and down keys for the vertical dimension are shaded gray in the bottom-right display). Stimuli were displayed in white on a black background. Word stimuli were presented in 18-point Calibri font.

Experiment 1

The stimuli were arrows and the spatial dimension was horizontal (left–right). Subjects pressed a left or right key (see Fig. 1, bottom-left display) to classify the direction of a left-pointing or right-pointing arrow (1.15° × 0.46°), respectively, positioned 1.72° (center-to-center distance) to the left or right of a central fixation cross (0.23° × 0.23°). A congruent stimulus was an arrow that pointed in a direction that matched its position relative to fixation (a left-pointing arrow positioned left of fixation, or a right-pointing arrow positioned right of fixation). An incongruent stimulus was an arrow that pointed in a direction that mismatched its position relative to fixation (a left-pointing arrow positioned right of fixation, or a right-pointing arrow positioned left of fixation; the latter is shown in Fig. 1, top-left display).

Experiment 2

The stimuli were arrows and the spatial dimension was vertical (up–down). Subjects pressed an up or down keyFootnote 6 (see Fig. 1, bottom-right display) to classify the direction of an up-pointing or down-pointing arrow (0.46° × 1.15°), respectively, positioned 1.72° above or below a central fixation cross. A congruent stimulus was an arrow that pointed in a direction that matched its position relative to fixation (a down-pointing arrow positioned below fixation, or an up-pointing arrow positioned above fixation). An incongruent stimulus was an arrow that pointed in a direction that mismatched its position relative to fixation (a down-pointing arrow positioned above fixation, or an up-pointing arrow positioned below fixation; the latter is shown in Fig. 1, top-right display).

Experiment 3

The stimuli were words and the spatial dimension was horizontal (left–right). Subjects pressed a left or right key (see Fig. 1, bottom-left display) to classify the meaning of the word LEFT (1.15° × 0.40°) or RIGHT (1.66° × 0.40°), respectively, positioned 1.72° to the left or right of a central fixation cross. A congruent stimulus was a word with a meaning that matched its position relative to fixation (LEFT positioned left of fixation, or RIGHT positioned right of fixation). An incongruent stimulus was a word with a meaning that mismatched its position relative to fixation (LEFT positioned right of fixation, or RIGHT positioned left of fixation; the latter is shown in Fig. 1, middle-left display).

Experiment 4

The stimuli were words and the spatial dimension was vertical (up–down). Subjects pressed an up or down key (see Fig. 1, bottom-right display) to classify the meaning of the word UP (0.67° × 0.40°) or DOWN (1.72° × 0.40°), respectively, positioned 1.72° above or below a central fixation cross. A congruent stimulus was a word with a meaning that matched its position relative to fixation (DOWN positioned below fixation, or UP positioned above fixation). An incongruent stimulus was a word with a meaning that mismatched its position relative to fixation (DOWN positioned above fixation, or UP positioned below fixation; the latter is shown in Fig. 1, middle-right display).

Procedure

Subjects were tested individually in private rooms after giving written informed consent for a study protocol approved by the Purdue University Institutional Review Board. Instructions were presented on-screen and read aloud by the experimenter. Subjects were instructed to classify the direction of the arrow (Experiments 1 and 2) or the meaning of the word (Experiments 3 and 4) on each trial, ignoring stimulus position. They were instructed to respond quickly and accurately by using their left and right index fingers to press the designated response keys, which were labeled on a photo of the response pad that appeared on-screen below the instruction text (the photo was not present after the instructions).Footnote 7 During the instructions, subjects completed eight example no-alert trials (two trials for each possible stimulus display) with accuracy feedback. Afterward, they completed 12 blocks of 32 trials per block without accuracy feedback.

Alert and no-alert trials occurred equally often in each block and involved similar sequences of events. Figure 2 shows example sequences of displays on alert trials for each experiment. The first display consisted of a central fixation cross presented for a randomly selected interval of 400 ms; 800 ms; 1,200 ms; or 1,600 ms. The second display consisted of the central fixation cross and two alerting cues (white-filled squares, each 0.44° × 0.44°) presented at the two potential stimulus positions for 100 ms. In Experiments 1 and 3, which involved the horizontal dimension, the alerting cues appeared to the left and right of the central fixation cross. In Experiments 2 and 4, which involved the vertical dimension, the alerting cues appeared above and below the central fixation cross. The third display consisted of the central fixation cross by itself for 400 ms. The fourth display consisted of the central fixation cross and the stimulus at a selected position, which remained on-screen until a response key was pressed. Afterward, the screen was cleared, and the first display for the next trial started immediately.

The sequences of displays on no-alert trials were identical to those shown in Fig. 2, except that the second display consisted of the central fixation cross by itself for 100 ms (i.e., the square alerting cues were absent). From the subject’s perspective, there appeared to be a single, uninterrupted fixation display until stimulus onset on no-alert trials.

The fixation and alerting intervals were identical to those used in Schneider (2019b). The fixation intervals for the first display were randomized independently of the other variables and occurred equally often in random order within every block. All possible combinations of alerting and stimulus conditions occurred twice in random order within each consecutive subset of 16 trials.

Results

Data trimming and analyses followed the preregistered protocol. The first block of trials was excluded as practice for each experiment. Trials with RTs exceeding three standard deviations of the mean in each condition for a given subject were excluded (<1.7% of trials in each experiment). Error trials were excluded from the RT analyses. Mean RTs and mean error rates appear as a function of alerting (no alert or alert) and congruency (incongruent or congruent) for each experiment in Table 1. The results of repeated-measures analyses of variance (ANOVAs) with those variables as factors are summarized in Table 2. All effects were significant (α = .05) in the RT data. Fewer effects were significant in the error data because mean error rates were generally low (<2.7% in each experiment). With two exceptions (noted later), the error data patterns were consistent with the RT data patterns; therefore, I focus on the latter. Mean RTs are presented visually in Fig. 3.

Table 1 Mean response times and mean error rates
Table 2 Summary of individual experiment analyses of variance
Fig. 3
figure 3

Mean response times as a function of alerting (no alert or alert) and congruency (incongruent or congruent) for each experiment. Numbers above bars indicate congruency effects. Error bars represent 95% confidence intervals (Masson & Loftus, 2003)

The ANOVA results are supplemented by Bayes-factor analyses of the interactions between alerting and congruency (scaled JZS Bayes factors with r = 1; Rouder, Speckman, Sun, Morey, & Iverson, 2009). Each Bayes factor (BF) quantifies the strength of evidence for the presence or absence of an alerting–congruency interaction. Following the notation and categorization used by Schneider (2019a, 2019b), BFalt indicates the data are more likely under the alternative hypothesis that an interaction is present, whereas BFnull indicates the data are more likely under the null hypothesis that an interaction is absent.Footnote 8 The BF subscript is determined by the data, but one can convert from BFalt to BFnull, or vice versa, by inverting the BF value (e.g., BFnull = 1/BFalt). The BF value indicates how many times more likely the data are under one hypothesis than the other, which can be categorized as weak (1–3), moderate (3–10), strong (10–30), very strong (30–100), or decisive (>100) evidence. By expressing BF values in terms of the hypothesis (null or alternative) under which the data are more likely, one avoids fractional values that might be difficult to interpret (Siegler & Lortie-Forgues, 2017).

Individual experiment analyses

Experiment 1 (arrow stimuli; horizontal)

Subjects were faster on alert trials (mean RT of 466 ms) than on no-alert trials (498 ms), and they were slower for incongruent stimuli (508 ms) than for congruent stimuli (456 ms), resulting in significant main effects of alerting and congruency. As shown in Fig. 3 (top-left panel), congruency effects were larger on alert trials (56 ms) than on no-alert trials (48 ms), resulting in a significant interaction (BFnull = 1.1).

Experiment 2 (arrow stimuli; vertical)

Subjects were faster on alert trials (489 ms) than on no-alert trials (519 ms), and they were slower for incongruent stimuli (538 ms) than for congruent stimuli (471 ms), resulting in significant main effects of alerting and congruency. As shown in Fig. 3 (top-right panel), congruency effects were larger on alert trials (71 ms) than on no-alert trials (63 ms), resulting in a significant interaction (BFalt = 1.2).Footnote 9

Experiment 3 (word stimuli; horizontal)

Subjects were faster on alert trials (486 ms) than on no-alert trials (506 ms), and they were slower for incongruent stimuli (513 ms) than for congruent stimuli (479 ms), resulting in significant main effects of alerting and congruency. As shown in Fig. 3 (bottom-left panel), congruency effects were larger on alert trials (44 ms) than on no-alert trials (25 ms), resulting in a significant interaction (BFalt = 1,864.2).

Experiment 4 (word stimuli; vertical)

Subjects were faster on alert trials (481 ms) than on no-alert trials (507 ms), and they were slower for incongruent stimuli (521 ms) than for congruent stimuli (467 ms), resulting in significant main effects of alerting and congruency. As shown in Fig. 3 (bottom-right panel), congruency effects were larger on alert trials (61 ms) than on no-alert trials (46 ms), resulting in a significant interaction (BFalt = 3,605.7).

Combined experiment analyses

Drawing on the factorial structure of the set of experiments, preregistered secondary analyses for all experiments combined were conducted by performing 2 (stimulus type: arrows or words) × 2 (spatial dimension: horizontal or vertical) × 2 (alerting: no alert or alert) × 2 (congruency: incongruent or congruent) mixed ANOVAs, with the first two factors between subjects. The ANOVA results are summarized in Table 3. To avoid redundancy with the individual experiment analyses, I focus on effects involving stimulus type and spatial dimension.

Table 3 Summary of combined experiment analyses of variance

Alerting effects were larger for arrow stimuli (31 ms) than for word stimuli (22 ms), resulting in a significant interaction between stimulus type and alerting. As shown in Fig. 3 (top vs. bottom panels), congruency effects were larger for arrow stimuli (59 ms) than for word stimuli (44 ms), resulting in a significant interaction between stimulus type and congruency. Also shown in Fig. 3 (left vs. right panels), congruency effects were larger for the vertical dimension (61 ms) than for the horizontal dimension (43 ms), resulting in a significant interaction between spatial dimension and congruency. Alerting–congruency interactions (differences in congruency effects between alert and no-alert trials) were larger for word stimuli (17 ms) than for arrow stimuli (8 ms), resulting in a significant three-way interaction between stimulus type, alerting, and congruency.Footnote 10

Discussion

Cognitive control over information processing can be implemented by selective attention, but it is often suboptimal, as indicated by congruency effects arising from processing of irrelevant stimulus features. Research has revealed that congruency effects in some tasks are modulated by alertness, but in an unusual way: Congruency effects are often larger when subjects are more alert, despite an overall performance benefit associated with heightened alertness (e.g., Callejas et al., 2005; Fan et al., 2002; J. W. MacLeod et al., 2010; McConnell & Shore, 2011; Redick & Engle, 2006; Schneider, 2018a, 2018b, 2019b; Weinbach & Henik, 2012). The mechanism underlying this alerting–congruency interaction is unclear, in part because the interaction is obtained with some cognitive control tasks (e.g., arrow flanker, Simon, and global/local tasks), but not with others (color Stroop task). Drawing on the idea that spatial attention and spatial information processing might be important for eliciting the alerting–congruency interaction (Schneider, 2019a), the purpose of the present study was to determine whether the interaction reliably occurs in the spatial Stroop task.

I conducted a set of four experiments in which alertness was manipulated in variants of the spatial Stroop task. In each experiment, subjects classified the spatial meaning of a stimulus presented at an irrelevant position that was either congruent or incongruent with the meaning. Stimulus type (arrows or words) and spatial dimension (horizontal or vertical) varied between experiments (see Fig. 1). Alerting cues preceded the stimuli on half of the trials in each experiment (see Fig. 2). The RT data patterns were highly consistent across the four experiments (see Fig. 3). Subjects responded slower to incongruent stimuli than to congruent stimuli, replicating congruency effects found in previous studies involving the spatial Stroop task (e.g., Clark & Brownell, 1975; Logan & Zbrodoff, 1979; Palef & Olson, 1975). The congruency effects were larger when classifying arrows instead of words, and when the spatial dimension for the task was vertical instead of horizontal. Subjects responded faster on alert trials than on no-alert trials, replicating alerting effects found in previous studies with visual alerting cues (e.g., Fan et al., 2002; Redick & Engle, 2006; Schneider, 2018a).

Critically, there was a significant alerting–congruency interaction in every experiment (see Table 2). Bayes-factor analyses indicated that the evidence in favor of an interaction was weak for arrow stimuli, but decisive for word stimuli. The finding of an alerting–congruency interaction in Experiment 1, which involved arrow stimuli in a horizontal version of the spatial Stroop task, replicates the results of Fischer et al. (2010), but with visual alerting cues and more than double their sample size. The alerting–congruency interactions found in Experiments 2–4 go beyond Fischer et al.’s results by providing novel evidence that the interaction generalizes to word stimuli and to vertical versions of the spatial Stroop task. To my knowledge, the present study is the first preregistered, multiexperiment demonstration of reliable alerting–congruency interactions in a cognitive control task other than the arrow flanker task.

Comparing Stroop tasks and stimuli

The present results fit with the idea that spatial attention and spatial information processing are important sources of the alerting–congruency interaction (Schneider, 2019a). Spatial attention is needed in the spatial Stroop task because attention has to shift to the position of the stimulus in order to classify the relevant stimulus feature. Spatial information processing occurs because the task is to classify the direction of an arrow or the spatial meaning of a word. In contrast, spatial attention and spatial information processing play little or no role in the color Stroop task, and typical alerting–congruency interactions have not been obtained with it (Schneider, 2019a; Soutschek et al., 2013; Weinbach & Henik, 2012).

A reviewer astutely noted that a significant effect for one task and a nonsignificant effect for another task do not necessarily imply a significant difference in effects between tasks (Gelman & Stern, 2006; Nieuwenhuis, Forstmann, & Wagenmakers, 2011). However, the apparent difference in the alerting–congruency interaction patterns for the spatial and color Stroop tasks receives statistical support when data from the two tasks are analyzed together. I conducted an exploratory analysis involving the spatial Stroop task data from the present Experiments 3 and 4 (N = 120), and the color Stroop task data from Experiments 1–3 (N = 258) of Schneider (2019a). These experiments were selected for this analysis because they all used word stimuli for which the relevant and irrelevant stimulus features were spatially integrated. In the present Experiments 3 and 4, a spatial word (e.g., LEFT) was the relevant feature and its position was the irrelevant feature. In Experiments 1–3 of Schneider (2019a), a color word (e.g., RED) was the irrelevant feature and its physical color was the relevant feature.

I submitted the RT data from both sets of experiments to a mixed ANOVA with task (spatial Stroop or color Stroop) as a between-subjects factor, and alerting (no alert or alert) and congruency (incongruent or congruent) as within-subjects factors. The key result was a significant three-way interaction, F(1, 376) = 17.10, p < .001, ηp2 = .043, which reflects the different alerting–congruency interaction patterns across tasks: The mean difference in congruency effects between alert and no-alert trials was 17 ms for the spatial Stroop task and −1 ms for the color Stroop task.Footnote 11 The different interaction patterns in different versions of the Stroop task represent complementary evidence supporting a role for spatial information processing in producing the interaction.

For the variants of the spatial Stroop task in the present study, the alerting–congruency interaction was larger for word stimuli than for arrow stimuli (compare the top and bottom panels of Fig. 3). Interestingly, the overall alerting effect—the difference in RTs between no-alert and alert trials—was smaller for word stimuli than for arrow stimuli. Considering that the same alerting cues were used with both types of stimuli (see Fig. 2), the quantitative difference in the alerting–congruency interaction patterns must reflect some aspect of how alertness influences stimulus processing.

Experimental and modeling work supports the idea that increased alertness makes stimulus encoding faster (e.g., Bundesen, Vangkilde, & Habekost, 2015; Matthias et al., 2010; Petersen, Petersen, Bundesen, Vangkilde, & Habekost, 2017; Rolke & Hofmann, 2007; Seifried, Ulrich, Bausenhart, Rolke, & Osman, 2010). Based on this idea, Nieuwenhuis and de Kleijn (2013) modeled the alerting–congruency interaction in the flanker task by assuming that a shorter stimulus encoding time allows stimulus classification and response selection to start sooner, when attention is relatively unfocused. However, the process of extracting spatial information from the encoded stimulus for classification depends on the type of stimulus, and there is evidence that spatial information is processed differently for spatial words and arrows (e.g., Gibson & Kingstone, 2006; Hommel, Pratt, Colzato, & Godijn, 2001). If the time course of spatial information processing differs for words and arrows, and if alertness affects the time when such processing commences, then it is possible that the resulting alerting–congruency interaction will vary in magnitude across stimulus types. Computational modeling could be a fruitful approach for investigating this possibility.

Implications for previous hypotheses

The recurrent finding of a reliable alerting–congruency interaction in the spatial Stroop task has important implications for previous hypotheses about the interaction, most of which were based on results from the arrow flanker task. One hypothesis is that increased alertness is associated with a more diffuse focus of spatial attention, resulting in more distractor processing that produces larger congruency effects on alert trials than on no-alert trials in the flanker task (McConnell & Shore, 2011; Nieuwenhuis & de Kleijn, 2013; Weinbach & Henik, 2012). However, the diffuse attention hypothesis has not been supported in recent tests (Schneider, 2018b; Seibold, 2018), and it is unclear how it would explain alerting–congruency interactions in the spatial Stroop task or the Simon task, where the irrelevant feature (stimulus position) is not spatially separated from the relevant feature. Weinbach and Henik (2012, p. 1538) suggested that “broadening the attentional beam could increase the [cognitive] system’s sensitivity to the spatial location of the target” in the Simon task. They did not elaborate on the nature of this increased sensitivity to spatial location, but one possibility is that a wider focus of attention is more likely to encompass both the target stimulus and a central fixation sign, with the latter serving as a reference point for easier coding of stimulus position. This idea would be consistent with referential coding accounts of Simon task performance (e.g., Hommel, 1993).

As an alternative to diffuse attention, Schneider (2018b) hypothesized that increased alertness might be associated with a higher probability of spatially grouping the distractors with the target in the flanker task, thereby producing an alerting–congruency interaction. However, the spatial grouping hypothesis was not supported in recent experiments in which spatial grouping was disrupted by misaligning stimuli in the flanker task (Schneider, 2019b), and it also cannot account for alerting–congruency interactions in the spatial Stroop task or the Simon task, where spatial grouping does not occur.

An idea that seems consistent with the present data is the facilitated response activation hypothesis of Fischer, Plessow, and Kiesel (2012; see also Böckler et al., 2011). They proposed that increased alertness facilitates a direct route for response selection involving established stimulus–response associations. When stimuli are strongly associated with specific response codes (e.g., arrow stimuli and spatial responses in the arrow flanker task), heightened alertness is hypothesized to boost activation along these direct links, amplifying the contribution of distractors to response selection and producing an alerting–congruency interaction. In support of the hypothesis, Fischer et al. (2012) found that the interaction occurred reliably in a word flanker task only when the distractor words also served occasionally as target words, which allowed the formation of direct stimulus–response associations for the distractors. Beyond the word flanker task, the facilitated response activation hypothesis can explain data from the Simon and spatial Stroop tasks if alerting modulates the activation of established links between the irrelevant stimulus positions and spatial response codes.

However, it is less clear how the hypothesis can account for the data in Schneider (2019a) from color Stroop-like tasks involving distractors with preexisting spatial associations (e.g., making a left or right key-press response to classify the color of a rectangle flanked above and below by the word LEFT or RIGHT). According to the facilitated response activation hypothesis, increased alertness should boost activation from a distractor (e.g., LEFT) to its associated spatial code (“left”) that overlaps with a response code (left key press), thereby increasing congruency effects. I obtained reliable congruency effects in three experiments involving color Stroop-like tasks (Experiments 6–8 in Schneider, 2019a), indicating that the distractors influenced response selection, but the congruency effects were not larger on alert trials than on no-alert trials.

Considering dimensional overlap

A reviewer suggested that the nature of the dimensional overlap between stimuli and responses might play a role in determining whether the alerting–congruency interaction will be obtained for a given task. Kornblum and colleagues proposed a taxonomy of stimulus–response ensembles for classifying the overlap of relevant and irrelevant stimulus dimensions and response sets (e.g., Kornblum & Lee, 1995; Kornblum, Stevens, Whipple, & Requin, 1999). In their taxonomy, all variants of the spatial Stroop task in the present study involve Type 8 ensembles because the relevant stimulus dimension (e.g., arrow direction), irrelevant stimulus dimension (e.g., arrow position), and response set (e.g., pressing a left or right key) all overlap with each other. The arrow versions of the flanker and global/local tasks also involve Type 8 ensembles. Alerting–congruency interactions have been consistently found with these tasks (e.g., Callejas et al., 2005; Fan et al., 2002; J. W. MacLeod et al., 2010; McConnell & Shore, 2011; Redick & Engle, 2006; Schneider, 2018a, 2018b, 2019b; Weinbach & Henik, 2011, 2012, 2014).

In contrast, the color Stroop task with left–right manual responses involves Type 4 ensembles because there is overlap between the relevant and irrelevant stimulus dimensions (stimulus color and word meaning, respectively), but neither overlaps with the response set. Alerting–congruency interactions have not been found with this task (Schneider, 2019a; Soutschek et al., 2013; Weinbach & Henik, 2012). The color Stroop task can be modified to involve Type 8 ensembles if vocal responses (color names) are used instead of manual responses. If the alerting–congruency interaction routinely occurs with Type 8 ensembles, then it might emerge in the color Stroop task with vocal responses. A caveat to this prediction is that previous research has revealed that vocal responding produces larger congruency effects than does manual responding, as well as a different pattern of performance across a range of stimulus types (e.g., Kinoshita, De Wit, & Norris, 2017; Sharma & McKenna, 1998). Consequently, a change in response modality would likely alter Stroop task processing (in addition to changing the type of stimulus–response ensemble), and the implications for finding an alerting–congruency interaction are unknown.

There is mixed evidence concerning alerting–congruency interactions for Type 4 ensembles in non-Stroop tasks. For example, many flanker tasks with nonarrow stimuli (e.g., letters) and left–right manual responses involve Type 4 ensembles. Reliable alerting–congruency interactions have been obtained in some experiments with nonarrow flanker tasks, but not in others (for a summary, see Schneider, 2019a). One potentially important factor might be whether the nonarrow stimuli have directional associations that can be mapped onto the spatial dimension used in responding (Kahan & Zhang, 2019).

Even if the alerting–congruency interaction turns out to be prevalent with Type 8 ensembles (and present under specific circumstances with Type 4 ensembles), there is already evidence that it is not restricted to those ensembles. The Simon task involves Type 3 ensembles because there is overlap between the irrelevant stimulus dimension (stimulus position) and the response set (pressing a left or right key), but neither overlaps with the relevant stimulus dimension (e.g., stimulus shape). Alerting–congruency interactions have been consistently found with the Simon task (Böckler et al., 2011; Fischer et al., 2010; R. M. Klein & Ivanoff, 2011; Soutschek et al., 2013). However, other tasks involving Type 3 ensembles have not yielded the interaction. For example, the Stroop-like tasks in Experiments 7 and 8 of Schneider (2019a) involved Type 3 ensembles because there was overlap between the irrelevant stimulus dimension (spatial word or arrow distractor) and the response set (pressing a left or right key), but neither overlapped with the relevant stimulus dimension (target stimulus color). An alerting–congruency interaction was not found in either experiment, despite high statistical power.

The preceding taxonomic analysis indicates that the alerting–congruency interaction is not limited to one specific type of stimulus–response ensemble. However, the available evidence suggests that it is difficult to predict whether the interaction will be obtained with a given ensemble, with the possible exception of Type 8 ensembles, which involve complete overlap between the relevant and irrelevant stimulus dimensions and the response set. Future research focusing on the circumstances under which the interaction is present or absent for an ensemble might lead to new or refined hypotheses of the alerting–congruency interaction. This is important because the collective results from my laboratory’s experiments involving the arrow flanker task (10 experiments: Schneider, 2018a, 2018b, 2019b), color Stroop task (eight experiments: Schneider, 2019a), and spatial Stroop task (four experiments: present study) cannot be fully accommodated by any existing hypothesis of the interaction. My previous findings led me to suggest that spatial attention and spatial information processing are important sources of the interaction (Schneider, 2019a), and this suggestion is reinforced by the results from the spatial Stroop task in the present study. Future empirical and theoretical work that focuses on spatial information processing, especially in the context of different tasks and with different types of stimulus–response ensembles, might contribute to more progress in understanding how alertness is related to cognitive control.