Reading is a relatively recent linguistic activity, invented and implemented by humans in the last few millennia. Its neural bases have been studied since at least the late 19th century (Dejerine, 1892), and a key area was traced to the left occipitotemporal cortex, a region later termed the visual word form area (VWFA; Cohen et al., 2000; Dehaene & Cohen, 2011; Martin, 2006; Warrington & Shallice, 1980). It is puzzling how a localized cortical area that evolved with humans can serve such a novel, specialized function such as reading. To solve this problem, Dehaene proposed a hypothesis called neuronal recycling, suggesting that new functions could arise in cortical areas that originally serve another purpose, but match the requirements of the new one (Dehaene & Cohen, 2007). The hypothesized reason for the occipitotemporal localization of the VWFA is twofold: firstly, here, the visual input reaches the level of processing required for orthographic decoding, and secondly, this area has sufficient connectivity with frontotemporal language networks to serve its function (Bouhali et al., 2014; Kim, Kanjlia, Merabet, & Bedny, 2017; Reich, Szwed, Cohen, & Amedi, 2011).

One of the most accepted frameworks of visual word recognition is the local combinations detector (LCD) model. This states that the hierarchical organization of the ventral visual stream can be applied to the processing of written words: line segments are combined into small local patterns, those into letters, groups of letters, and then finally, words (Dehaene, Cohen, Sigman, & Vinckier, 2005). Invariance to text size, position, and other aesthetic parameters like case and font can arise along this stream, as receptive field size and preferred stimulus complexity grows. Subsequent areas of the stream deal with increasingly complex information in a parallel manner (i.e., letters of a word are recognized at a single glance and then are fed forward in a combinatorial system, the VWFA, that outputs the most likely word that the letters code; Grainger, 2018; Norris, 2013). This ventral parallel processing allows well-trained readers to read quickly with ease. Evidence suggests, however, that when words are observed out of their usual appearance, the quick parallel processing and decoding are lost and auxiliary mechanisms through the dorsal pathway must salvage recognition in a serial, letter-by-letter manner (Cohen, Dehaene, Vinckier, Jobert, & Montavont, 2008). In-plane rotation is considered such a transformation, and even though letter recognition is not affected by rotation (Koriat & Norman, 1989; Perea, Vergara-Martínez, Marcet, Mallouh, & Fernández-López, 2020; but see Risko, Medimorec, Chisholm, & Kingstone, 2014), word recognition is heavily impaired by it (Cohen et al., 2008). The common explanation is that rotation breaks the experience-based functions of the VWFA, and it takes an increased cognitive load to mentally normalize the visual image back to its normal orientation. Yet we might argue that perception of orientation must come afterward shape is recognized, since without knowing what the shape is, we cannot possibly tell how it should be oriented (Corballis & Armstrong, 2007), and cases of isolated orientation agnosia also hint at this conclusion (Harris, Harris, & Caine, 2001). Indeed, evidence has been found of an early, orientation-invariant priming effect in object recognition (Harris, Dux, Benito, & Leek, 2008), but its validity in reading has not yet been studied in great detail; although some specific studies in recent years pointed out that priming is possible using marquee and 90° rotated words (Perea, Marcet, & Fernández-López, 2018; Witzel, Qiao, & Forster, 2011) and rotated letters (Perea et al., 2020).

To understand the detailed neurophysiological background of reading, it is essential to find the limitations and requirements of the system that carries it out. The goal of this study was to find out whether orientation-invariant priming exists for word recognition and also to investigate which aspects of written words are necessary to elicit such an effect.

Experiment 1: Rotated word priming

To gather detailed data on rotated word processing, we set out to implement the rotated repetition priming design of Harris et al. (2008), using words instead of images. In this paradigm, primes are rotated and targets are presented upright, highlighting the automaticity of orientation invariance in recognition. We chose to keep the naming task from that work, as it seemed more direct and thus more sensitive than a lexical decision.

Materials and methods

Procedure

Participants were seated 57 cm from the viewing screen, with a microphone in front of them. Their task was to read target words presented to them out loud, as quickly as possible. A total of 224 trials were presented in a single run, lasting approximately 10 minutes. Each trial started with a central fixation cross presented for an irregular interval between 1,500 and 2,000 ms. This was followed by a lowercase prime stimulus that varied in its identity and orientation and was of a different duration for subject groups (discussed in the Design section). Following the prime, a mask, composed of scattered characters, was presented for 100 ms. Finally, the uppercase target stimulus was displayed until response. The computer recorded voice onset and paced the experiment automatically. Reaction times (RT) and reading accuracy were later validated off-line by the experimenter, based on voice recordings. Trials with inaccurate utterances, late microphone triggering, and thus incomplete recording or RT above 1,000 ms were excluded from the analysis. Trials with premature triggering were fixed by resetting the measured RT to the actual start of the response.

Design

The experiment design (Fig. 1) was based on the first experiment by Harris et al. (2008). The within-subject factors were prime identity and prime orientation. Primes could either be of the same identity as the target they preceded (“same” condition: vonat → VONAT) or could be an unrelated word (“different” condition: kalap → VONAT). Primes could be presented in any orientation, ranging from 0° to 360° in 30° steps. The duration of the prime stimulus was a between-subject factor and was one of the following: 25, 50, 75, 100. The prime–target stimulus-onset asynchrony (SOA) was therefore 125, 150, 175, and 200 ms for these groups, with the priming duration added to the fixed-length backward mask.

Fig. 1
figure 1

The design of Experiment 1 (detailed in text)

Word order and prime–target pairings were randomized according to Harris et al. (2008), with each pair being unique and each target only occurring once per experiment. For each prime, an orientation (0°–180°) and a direction (±) were chosen in a pseudorandom manner. This way, in every orientation, 8 “same” and 8 “different” trials occurred (except for 0° and 180° orientations, given the ± equality, these occurred 16–16 times).

Stimuli

The stimuli were the 336 most frequent (17–196/million) Hungarian nouns, five to six characters in length (Oravecz, Váradi, & Sass, 2014). Words were drawn on light gray (#C8C8C8) background using Lucida Console, a monospaced typeface, in black color, and 40-pt font size. Words made up for a ~3.5 × 0.7° visual angle. To minimize visual similarity while retaining the orthographic code, primes were presented in lowercase, whereas targets were capitalized.

Masks were designed to disguise both the prime word and its orientation, and hence we had to use a circular mask, composed of visually similar elements as the primes. These were generated at the start of each run by scattering 300 Hungarian characters uniformly on a 400 × 400-pixel area and cropping the texture to a circle (~7° visual angle). Five masks were generated and drawn randomly for each trial at random rotation angles to minimize familiarization to masking.

Equipment

The experiment was carried out in the psychophysics laboratory of our department, in a dark and quiet environment. The experiment was programmed and executed in MATLAB R2016a with Psychophysics Toolbox Version 3.0.14 running under Microsoft Windows (Kleiner, Brainard, & Pelli, 2007). Stimuli were presented on an Asus PG248Q monitor with 120 Hz refresh rate and responses were captured with a Rode NT-USB microphone.

Subjects

Fifty-three healthy young adults (34 females, 25.6 ± 4.6 years) participated in the study all of whom had normal or corrected to normal vision, reported no reading difficulties, and were naive to the aims and means of the experiment. All participants gave written consent. The procedures were all approved by the Regional Research Ethics Committee of the University of Szeged and were applied in concordance with the World Medical Association Helsinki Declaration.

Group sizes: In a pilot study, we found a relatively large priming effect at normal orientation (~70 ms) that decayed to zero with increasing orientation. To calculate appropriate sample sizes, we simulated data sets with variable effect sizes at different orientations and variances based on the pilot model and performed the same analyses that we planned to do on the real data. These calculations showed that to achieve 80% power for priming effects above 20 ms, at any point of the design, we would need 20 participants per group. The COVID-19 outbreak and the consequent regulations, however, prevented us from reaching this goal. Based on the same simulations, we concluded that even with 10 subjects, we would still have the 80% power margin at around a 30-ms priming effect size, and with 15 subjects at 25 ms. As the systematic steps in priming duration and orientation are fitted into separate effects, any smoothness in the pattern of these would further increase the reliability of the results. We concluded that the available subject numbers (10, 17, 16, and 10 in the four groups, respectively) yield sufficient power in this paradigm.

Analysis

Analyses were performed in R (Version 3.6.2) using the lme4, and emmeans packages. Separate generalized linear mixed-effects models (GLMM) with inverse Gaussian function and log link were fitted to the RT data from each experimental group (Bates, Mächler, Bolker, & Walker, 2015; Lenth, Singmann, Love, Buerkner, & Herve, 2018). Fixed effects were evaluated with Wald tests, and random-effect significance was analyzed with likelihood ratio tests of reduced models. The priming effect for all conditions was calculated as estimated marginal mean contrasts between “same” and “different” conditions. Plots were made with the ggplot package (Wickham, 2016).

Results

A total of 11,351 RT observations were included in the analysis, and 521 (~4% of total) were excluded on the grounds stated earlier. In the four priming groups 10, 17, 16, and 10 subjects were included, respectively.

Separate GLMMs per group were built with prime identity and orientation as discrete fixed-effects with interactions. The natural logarithm of trial number was added as an independent continuous fixed effect. Random intercepts were fitted by subject and target word, and a random effect of prime identity was fitted by subject. All models converged with the bobyqa method included in the lme4 package.

Wald test results of fixed effects significance are summarized in Table 1. Likelihood ratio tests of diminished models justified the random effects structure across the models (see Table 2).

Table 1 Wald test results on the importance of the denoted fixed effects in the models of Experiments 1 and 2
Table 2 Likelihood ratio test results of diminished models lacking the denoted random effect, compared with the full models of Experiments 1 and 2

Marginal means were calculated for “same” and “different” priming conditions in each orientation and priming duration (see Table 3). Priming effect was calculated as the difference between conditions and multiple comparisons were adjusted with the Tukey method. The orientational pattern of priming effect was different across priming durations (see Fig. 2). With the shortest priming duration, it is merely present at 0° and −30°, and its size only reaches a few tens of milliseconds. This extends with longer priming durations, and we see a stable and comparable priming effect from −60° to 60° of a size around or above 50 ms. The effect peaks at normal orientation. Curiously, a counterclockwise bias stabilizes across all groups with effects being stronger for rotations in that direction (possibly related to resting head position (Risko et al., 2014), as this was not controlled in our experiments). Also, a priming tendency can be seen at −150° with the 50 ms primes that grows into a significant priming effect at 75-ms duration. Another incidental feature is that with the longest duration there is a relative reduction in the priming effect size around 0°, and much larger variance (even when compared with the 25-ms group that has the same number of subjects).

Table 3 RT marginal means and their differences in ms, across prime orientations and conditions in Experiments 1 and 2
Fig. 2
figure 2

Priming effect as a function of prime orientation in Experiment 1. RT difference between marginal means of “same” and “different” conditions in ms ±95% confidence intervals. The dashed circle is the zero line, meaning no effect, and positive priming effects are observed outside of this. Note that measurements were only made in the orientations denoted with ticks, and the lines between are interpolated for easier visual interpretation. Orientation labels also serve as exemplars for the indicated rotation

Experiment 2: Reversed letter order

After Experiment 1, we wanted to confirm, that the rotated priming effect is caused by orthographic similarity and not by overlap at lower levels of processing. For this reason, we decided to repeat the experiment using reversed primes with 50-ms duration, as this was the shortest, where the wider orientation-specific pattern arose.

Materials and methods

Procedure and design

Experiment 2 was an exact duplicate of the 50 ms prime duration group of Experiment 1, with one modification: the letter order of prime words was reversed (e.g., tanov → VONAT).

Subjects

Fourteen people (nine females, 21.7±2.7 years) matching the same criteria as in Experiment 1 participated under the same ethical license.

Results

The model was built from 3,037 RT data points from 14 subjects, 99 trials (~3% of total) were rejected. The model structure and fitting methods were identical to the one used in Experiment 1. Wald test and likelihood ratio test results are summarized in Tables 1 and 2.

Tukey corrected marginal means were contrasted similarly to Experiment 1 (Table 3) and priming effects are plotted in Fig. 3. Only a small inverse priming effect was detected at 0°; otherwise, there was no difference between priming conditions with the reversed prime words.

Fig. 3
figure 3

Priming effect as a function of prime orientation in Experiment 2. RT difference between marginal means of “same” and “different” conditions in ms ±95% confidence intervals. The dashed circle is the zero line, meaning no effect, and positive priming effects are observed outside of this. Note that measurements were only made in the orientations denoted with ticks, and the lines between are interpolated for easier visual interpretation. Orientation labels also serve as exemplars for the indicated rotation

Discussion

In the paper that our study was mostly based on, Harris et al. (2008) found that in object recognition, the size of the priming effect increased with priming duration, most apparent above 70 ms and that it was independent of prime orientation. This orientation invariance is in sharp contrast with our findings, but may be explained by the exceptional role of letters in human visual recognition (Grainger, 2018) and the “unlearning” of mirror-generalization during learning to read (Dehaene et al., 2005).

Repetition priming is believed to occur because the prime, at least partly, preactivates the processing network (Dehaene et al., 2001; Holcomb & Grainger, 2007). In the case of our paradigm, this network includes the phonological and articulatory systems besides the orthographic and visual ones, as the responses were vocal (Fig. 4). In trials of effective priming, the prime had elicited some activation along the pathway that later processed the target and gave an output. This means that in the conditions where a priming effect was detected, the processing of the prime words was successful, even if they were rotated. The results of Cohen et al. (2008) show that in an unprimed paradigm, lexical decision was markedly slowed by rotation of 90° (>200 ms for words of length of four to six characters), implying that the recruitment of the dorsal pathway for alternative reading strategies is rather time-consuming. In our experimental design, however, alternative reading strategies are unlikely to be implemented during the short prime–target SOA, and the same applies to conscious mental rotation or perspective-taking as orientation normalization strategies. The randomness of the consequent rotations also excludes the possibility of anticipation-driven orientation normalization, and the small main effect of orientation in the Wald tests also suggests the lack of time-dependent mechanisms. Note that in our experience, the subjects only occasionally reported seeing the primes, mostly in the 100-ms group, solely in the case of “different” primes, and never spotted that orientations other than normal were involved (unregistered data). Based on these findings we conclude, that the priming effect seen in Experiment 1 arises through the highly automated VWFA.

Fig. 4
figure 4

Schematic model of word form and lexical processing in our experiments, based on the LCD model (Dehaene et al., 2005) and the figures of Ramus (2004) on reading acquisition. The VWFA, trained for orthographic regularities, extracts the code in an automated processing step. When this is incapacitated, decoding is aided by slower, more attentive processes

Importantly, the earlier attempts at examining a priming effect with vertically oriented words, employed block designs, with all primes and targets being in the same orientation within a block (Perea et al., 2018; Witzel et al., 2011). This way the subjects are prepared for the orientation of the target, and this anticipation could already affect the processing of the prime since they appear in the same manner. Furthermore, in this setting, individual letters of the prime and target words could spatially overlap in the “same” condition, thus permitting a priming effect at the level of letter detectors. With our method, this only stands for the normal orientation, as the targets were always presented horizontally, but higher orthographic processing is necessary for rotated primes to have an effect.

We regard that the pattern of rotated priming in Experiment 1 does not conflict with most predictions of the LCD model, but some details call for amendments. Cohen et al. (2008) suggest that the involvement of the VWFA in word form processing has a binary pattern, it is either on or off, and its function breaks after rotation reaches a certain limit (~45°). The narrow effect at 25-ms priming duration could be attributed to priming at the level of case-invariant letter detectors, as there was a spatial overlap between the letters of the prime and target in the “same” condition. However, as duration increased, so did the priming effect and an orientation invariance emerged between ±60° rotations with a diminished effect up to ±90°. These results point at a rapid word-level abstraction of rotated primes, attributed to WVFA, meaning that the orientation-invariant boundary could be less sharp and at a larger degree of rotation than expected by earlier evidence (Cohen et al., 2008; Dehaene et al., 2005; but challenged by Perea et al., 2018; Witzel et al., 2011). As for the −150° priming effect at 75-ms duration and the upside-down effect in 75 and 100-ms durations, we cannot yet provide feasible explanations.

These discrepancies with the LCD model might point us to different models of word recognition. The SERIOL model for instance predicts that degradation, in our case rotation and probably the shortness of the presentation, affects the letter-level representation and limits the number of characters that can be extracted during an oscillatory cycle (the temporal quantum of word processing, proposed at 40 Hz; Whitney, 2002). The prime durations tested in our first experiment coincide with 1, 2, 3, and 4 of such cycles, and it would be interesting to examine whether this model could account for the priming effects we measured.

Experiment 2 yielded two important supplementary findings. Firstly, the effects seen in the 50-ms group of Experiment 1 cannot be explained by the repeated exposure to the same letters alone, as there was no facilitative priming at all when the letter content was unchanged but the code was broken. Secondly, it was shown earlier that individual letter identification is unaffected by extreme rotations (≥90°; Perea et al., 2020); if we assume that a string of rotated letters can be processed similarly by the decoding system, we would expect to see some priming effect with upside-down presented reversed primes (reversal with 180° rotation fixes the relative letter order). As this was not the case, we propose that internal cues of letter orientation could also serve as cues of word orientation, a potentially important piece of the code. To find the letter order, one must also find the start of the word and extract ordinal information along the axis defined by it. The default axis depends on the writing system, but also on the reference frame that the observer perceives, which depends on multiple factors (body position, visual background, and direction of gravity, according to Dyde, Jenkin, & Harris, 2006). The prevalence of accented characters in the Hungarian alphabet could be an important source of information on word orientation, as accents are only present on top of letters (approximately one-third of the words used as stimuli in this study contain such characters).

Regarding the inverse priming effect observed with reversed primes in the normal orientation, we think it might be explained by the inhibitory effect of “reversed anagram” priming, observed by Morris and Still (2012). In their case, a similar prime can elicit an inhibitory effect, when despite the similarity, there is strong evidence for a different word identity. Although our reversal retains the letter content, and maybe even the inner part of the code (Davis & Lupker, 2017), the mismatch at the more weighted starting and final letter positions could create competition instead of facilitation.

Finally, from a network point of view, we hypothesize that at all times, multiple systems attempt to make sense of the same sensory inputs, but only one perception can enter the consciousness: the one predicted with the highest confidence. When all systems are pointing at the same result, they increase fidelity. However, if they are conflicting, the perception could be unstable, and if certainty is low, more time is needed for slower systems to reach a decision boundary. This could be the reason why we see a continuous slowing with rotated words when the targets are manipulated (Cohen et al., 2008), but a stable effect in our priming paradigm. In this case, no matter how unsure the VWFA is in its prediction about the identity of the prime, if it was made, a priming effect follows. We propose that our results reflect the limitations of the VWFA as a separate system more precisely, whereas the results of Cohen et al. (2008) are more descriptive of the whole reading network.

Conclusions

We found that it is possible to obtain a significant priming effect with rotated prime words in a reading task. The most important explanatory factor of RT was found to be the interaction of prime identity and orientation. This is in contrast with the rotated priming effect in object naming, which was found to be invariant over orientation (Harris et al., 2008).

The extent of rotation that the effect resists is somewhat dependent upon priming duration. However, with priming durations above 50 ms, priming has a significant effect up to ±60° of in-plane rotation and in some cases even above. This suggests that the automated mechanisms responsible for the decoding of visual words could be more robust than previously suggested (Cohen et al., 2008).

When using a reversed letter order in the primes, the effects vanished, so the same letter content is not sufficient to elicit effective priming in any orientation. We conclude that the effects seen with normal prime words come from higher orthographic processing levels, even when the primes are rotated.

There is a certain limitation to how isolated word recognition experiments can be translated to the more complex activity of natural reading. In recent years, there were some exemplary studies using rotated letter words in both primed and unprimed settings of natural reading (Blythe, Juhasz, Tbaily, Rayner, & Liversedge, 2019; Fernández-Lopez, Mirault, Grainger, & Perea, in press). In our future work, we hope to follow them in extending our findings to a more native context.