Unifying the Ability-as-Compensator and Ability-as-Enhancer Hypotheses

Kühl, Tim; Fehringer, Benedict C. O. F.; Münzer, Stefan

doi:10.1007/s10648-021-09650-5

Unifying the Ability-as-Compensator and Ability-as-Enhancer Hypotheses

Intervention Study
Open access
Published: 06 January 2022

Volume 34, pages 1063–1095, (2022)
Cite this article

Download PDF

You have full access to this open access article

Educational Psychology Review Aims and scope Submit manuscript

Unifying the Ability-as-Compensator and Ability-as-Enhancer Hypotheses

Download PDF

2849 Accesses
2 Citations
2 Altmetric
Explore all metrics

Abstract

Spatial abilities have been found to interact with the design of visualizations in educational materials in different forms: (1) spatial abilities enhanced learning with optimized visual design (ability-as-enhancer) or (2) spatial abilities compensated for suboptimal visual design (ability-as-compensator). A brief review of pertinent studies suggests that these two forms are viewed as mutually exclusive. We propose a novel unifying conceptualization. This conceptualization suggests that the ability-as enhancer interaction will be found in the low-medium range of a broad ability continuum whereas the ability-as-compensator interaction will be found in the medium-high range. The largest difference in learning outcomes between visual design variations is expected for medium ability. A corresponding analytical approach is suggested that includes nonlinear quadratic interactions. The unifying conceptualization was confirmed in an experiment with a consistent visual-spatial task. In addition, the conceptualization was investigated with a reanalysis of pooled data from four multimedia learning experiments. Consistent with the conceptualization, quadratic interactions were found, meaning that interactions depended on ability range. The largest difference between visual design variations was obtained for medium ability, as expected. It is concluded that the unifying conceptualization is a useful theoretical and methodological approach to analyze and interpret aptitude-treatment interactions that go beyond linear interactions.

An Argument for Visualization Technologies in Spatial Skills Assessment

An empirical study of the effect that a computer graphics course has on visual-spatial abilities

Article Open access 05 November 2019

A meta-analysis of the impact of virtual technologies on students’ spatial ability

Article 21 January 2022

Introduction

There is abundant evidence that visualizations can foster learning in educational contexts (cf. Levie & Lentz, 1982; Mayer, 2021). In processing and understanding visualizations, learners’ spatial abilities are involved (cf. Hegarty & Waller, 2005). Correspondingly, substantive correlations between spatial ability measures and learning outcomes for learning with visualizations have been found, suggesting that spatial abilities play an important role in learning with visualizations (for reviews see Hegarty & Kriz, 2008; Höffler, 2010).

However, spatial abilities do not only correlate with learning outcomes, but they may interact with the design of the visualization, such as animations compared to static pictures or two-dimensional compared to three-dimensional visualizations (cf. Höffler, 2010). In this regard, two aptitude (spatial ability) × treatment (design of visualization) interaction hypotheses are stated (cf. Mayer & Sims, 1994): the ability-as-enhancer hypothesis and the ability-as-compensator hypothesis. According to the ability-as-enhancer hypothesis, a certain amount of spatial ability is necessary to profit from an optimized visualization. This means that the potential beneficial effect of an optimized visualization compared to a suboptimal visualization will not unfold when learners’ spatial abilities are low. The ability-as-compensator hypothesis posits that only learners with higher spatial abilities, but not learners with lower spatial abilities can compensate for a suboptimal (e.g., static) visualization, while this compensating effect of spatial abilities is not necessary when learners receive an optimized (e.g., dynamic) visualization.

The ability-as-enhancer and ability-as-compensator hypotheses are viewed as mutually exclusive. In contrast, we assume on a conceptual level that the two forms of the aptitude-treatment interactions are not necessarily exclusive but can be united — by considering them as two ends of an ability continuum. We introduce an analytical approach that corresponds to our conceptualization. We then present an original empirical study and a reanalysis of data from four multimedia learning studies that show first empirical evidence for the unifying view.

In the following, we will first provide a brief review concerning the empirical evidence with respect to the different roles spatial abilities can play in learning with visualizations, specifically with respect to the ability-as-enhancer and the ability-as-compensator hypotheses. Thereafter, we will introduce the unifying conceptualization of the two hypotheses.

A Brief Review of Empirical Results Concerning the Role of Spatial Ability in Learning with Visualizations

Visualizations are an important source of information in instructional materials. Visualizations can either be the only source of information or they can be combined with other sources, such as verbal explanations. In multimedia instructions, learners benefit from the combination of visualizations with text compared with learning from text alone (multimedia effect, cf. Levie & Lentz, 1982; Mayer, 2021), because visualizations convey information that cannot be easily understood or inferred from text (e.g., Kühl et al., 2018; Larkin & Simon, 1987). In order to understand and profit from a visualization, the visualization needs to be adequately processed by a learner.

Learners’ spatial abilities are involved in processing visualizations. Even though a comprehensive definition of spatial ability is not available, it is widely agreed on that spatial abilities are related to mentally performing processes such as apprehending, encoding, remembering, or mentally transforming visual-spatial objects (cf. Hegarty & Waller, 2005; Uttal et al., 2013). Spatial abilities are not considered as a unitary construct but consist of a set of related but different abilities (cf. Carroll, 1993; Hegarty & Waller, 2005). On the one hand, there is a factor-analytic tradition that is rooted in intelligence testing. On the other hand, there is a cognitive research tradition, including investigations of the structure and processes of working memory (see Hegarty & Waller, 2005, for a review of both traditions). In the factor-analytic tradition, different sub-factors of spatial ability have been described. Lohman (1988) distinguished three spatial factors: (1) “speeded rotation” of simple items, (2) “spatial orientation” (involving perspective change), and (3) “spatial visualization” (referring to complex materials and sequences of transformations). Carroll (1993) identified five main factors based on extensive data sets: (1) ”visualization” involving complex and multi-step spatial transformations, (2) ”spatial relations” requiring mental rotation with simple figures under speeded instruction, (3) ”closure speed” requiring quick recognition of incomplete figures, (4) ”closure flexibility” involving identification of hidden figures in complex spatial patterns, and (5) ”perceptual speed” requiring speeded comparisons of simple figures. Measures addressing the factor “visualization” (complex materials, multiple transformation steps) measure a rather broad notion of spatial ability.

In the cognitive-processing tradition, measures of working memory (WM) capacity are commonly used as predictors of individual differences. Such measures can be specific for different codes of information, such as verbal and visual-spatial information — which is for instance accounted for in Baddeley’s working memory model (Baddeley, 1986) that distinguishes a phonological loop and a visuo-spatial sketchpad (i.e., visuo-spatial working memory). Measures of visuo-spatial working memory predict performance in visuo-spatial tasks (e.g., Miyake et al., 2001). Individual differences in mental spatial processing, to which we refer with the broad term of spatial ability in the following, can be addressed with measures both from the factor-analytic tradition as well as from the cognitive-processing tradition (cf. Hegarty & Waller, 2005).

Spatial abilities play a crucial role in many domains in which the understanding of visualizations is important, such as in STEM domains (science, technology, engineering, and mathematics), in architecture, or in geography. Accordingly, studies have repeatedly demonstrated that individual differences in spatial abilities play a decisive role as a predictor of learning success with visualizations, for instance, in STEM domains (Allen et al., 2019; Wai et al., 2009), in medical training (Cohen & Hegarty, 2007; Garg et al., 1999; Keehner et al., 2006), or in route and layout learning about large-scale spaces from virtual environments and videos (Münzer & Stahl, 2011; Münzer & Zadeh, 2016; Hegarty et al., 2006; Waller, 2000).

Visualizations may demand higher or lower cognitive engagement in mental spatial processing, depending on their design. For instance, static pictures often — but not necessarily always — require inferring spatial transformations mentally, whereas dynamic visualizations (animations) can show spatial transformations explicitly. In such cases, dynamic visualizations compared to static visualizations may facilitate spatial understanding because the external presentation can support otherwise effortful mental visual-spatial processing (supplantation, Salomon, 1994; facilitation function, Schnotz & Rasch, 2005). Under such circumstances, the dynamic visualization may be considered as the optimized visualization and the static visualization as the suboptimal visualization. However, it should be noted that dynamic visualizations cannot not automatically be considered as the optimized and the static visualization as the suboptimal design; rather, which visualization format can be considered as the optimized one, and which as the suboptimal one, depends on the processes that dynamic and static visualizations require and support, respectively (Ploetzner et al., 2020).

Höffler (2010) reviewed the role of spatial abilities in multimedia learning with visualizations, considering 27 experiments from 19 primary studies published between 1994 and 2009. The meta-analytic review suggested a general advantage of higher spatial ability learners over lower-ability learners. Moreover, in this review, the potentially moderating role of spatial abilities in learning with different visualization formats was investigated. It was found that learners with lower spatial ability profited from dynamic visualizations (animations) compared to static visualizations, whereas differences between dynamic and static visualizations were less pronounced for learners with higher spatial ability. These results are in line with the ability-as-compensator hypothesis (Höffler, 2010). It should be noted though that in five of the reviewed studies, no significant interaction between spatial ability and treatment condition was found (Hannafin et al., 2008; Hegarty et al., 2003; Massa & Mayer, 2006; Wender & Muehlboeck, 2003; Westerman, 1997), and that in four primary studies the ability-as-enhancer interaction was actually obtained (Diaz & Sims, 2003; Huk, 2006; Huk & Steinke, 2007; Mayer & Sims, 1994).

More recent studies not yet included in this meta-analytic review have been similarly inconclusive. On the one hand, several studies found empirical evidence that is consistent with the ability-as-compensator hypothesis. In these studies, empirical evidence was observed (on at least one learning outcome measure) that animations compared to static pictures supported especially individuals with lower spatial abilities in multimedia learning, but not with higher spatial abilities (e.g., Kühl et al., 2018b; Barrett & Hegarty, 2016; Berney et al., 2015; Bogomolova et al., 2020; de Back et al., 2020; Höffler & Leutner, 2011; Lee & Wong, 2014; Sanchez & Wiley, 2014). Similarly, in studies examining spatial transformations needed for the understanding of three-dimensional spatial structures from two-dimensional visualizations, empirical evidence was found that students’ performances were less dependent on the respective spatial ability if animations of the spatial transformation were shown, compared to static pictures (Münzer, 2012, 2015). In these studies, the visualizations were not accompanied by text. The visual designs and the corresponding spatial ability tests focused on only one selected spatial transformation process.

On the other hand, there were also at least two studies where the pattern of results corresponded to the ability-as-enhancer interaction (on at least one learning outcome measure). In one study about spatial anatomy comprehension, particularly participants with higher spatial abilities profited from the sequence “first static, then dynamic visualizations” compared to the sequence “first dynamic, then static visualizations,” but not participants with lower spatial abilities (Wu et al., 2013). In another study in the domain knot tying (de Koning et al., 2019), only students with higher spatial abilities performed better when they received a dynamic visualization compared to a static visualization, while students with low spatial abilities performed equally well when receiving a dynamic compared to a static visualization. In the first study by Wu et al. (2013), the visualizations were accompanied by text, whereas the visualizations in the study by de Koning et al. (2019) were not accompanied by text.

With the meta-analytic review by Höffler (2010), there were also studies that explicitly investigated the interaction between spatial ability and static compared to dynamic visualizations, but observed no interaction (Kühl et al., 2011b; Brucker et al., 2014; Chen et al., 2015; Imhof et al., 2011; Imhof et al., 2012; Wang & Tseng, 2020). It should also be noted that in several studies, it was not reported (since it was not in the focus of the research questions) whether an interaction between spatial abilities and type of visualization was observable (e.g., Kühl et al., 2011a; Kühl et al., 2012; Kühl et al., 2018a; Kühl & Münzer, 2019; Castro-Alonso et al., 2014; Castro-Alonso et al., 2018; Garland & Sanchez, 2013; Paik & Schraw, 2013; Stebner et al., 2017).

Summing up, the empirical evidence regarding the interplay of spatial abilities in learning with different visualization formats is mixed. It seems far from obvious why one would predict either an ability-as-compensator or an ability-as-enhancer-interaction. In current research, the two hypotheses are viewed as mutually exclusive.

A Unifying Conceptualization of the Ability-as-Enhancer and Ability-as-Compensator Hypotheses

We argue that the two hypotheses are not necessarily mutually exclusive, but propose a unifying conceptualization of the ability-as-enhancer and the ability-as-compensator hypotheses as two different interaction forms (Fig. 1): These two different forms of interaction will be found if comparisons are made between different points of the ability continuum. At the point of low ability, the potential of an optimized visualization might not unfold, because learners lack resources to profit from an optimized visualization. If the level of ability, however, is appropriate relative to task demands such that attempts to perform the task can be successful (medium ability), then the optimized visualization might have the intended supportive effect. The ability-as-enhancer interaction would result from the comparison between these two ability points (i.e., between low and medium ability). That is, an ability-as-enhancer interaction will be obtained if low-ability learners are compared with medium-ability learners.

The medium-ability learners, in contrast to the low-ability learners, can make use of the optimized design to perform successfully. Ability is thus thought to function as an enhancer given the optimized visualization. Thereby, the regression line for the relation between ability and performance is steeper for the optimized visualization compared to the suboptimal visualization. In our illustrations of the conceptualization provided in Fig. 1, this comparison can be found between the “low” and the “medium” ability levels. For all illustrated forms of the unifying conceptualization depicted in Fig. 1a–e , it is true that the regression line for the optimized visual design is steeper than the regression line for the suboptimal visual design between the low and medium ability levels. This reflects the ability-as-enhancer interaction.

If the ability increases more (high ability), then a learner may have appropriate mental resources available to perform successfully, may the visual design be optimized or not. The ability-as-compensator hypothesis will be obtained if high-ability learners are compared with medium-ability learners. Here, the comparison includes learners who perform well regardless of the design (high ability) and learners for whom the optimized design is supportive (medium ability). In particular, high-ability learners perform the task well also in the suboptimal design variant, in contrast to the medium-ability learners who experience difficulties with the suboptimal design. High ability is thus considered to be compensating for the suboptimal design. Thereby, the regression line for the relation between ability and performance is flatter for the optimized visualization compared to the suboptimal visualization. In our illustrations of the conceptualization shown in Fig. 1, this comparison can be found between the “medium” and the “high” ability levels. For all illustrated forms of the unifying conceptualization depicted in Fig. 1a–e , it is true that the regression line for the optimized visual design is flatter than the regression line for the suboptimal visual design between medium and high ability levels. This reflects the ability-as-compensator interaction.

To conclude, the two different forms of the aptitude-treatment interaction between spatial ability and visual design are not mutually exclusive but can be united. The present conceptualization integrates the at first-glance exclusive ability-as-enhancer and ability-as-compensator hypotheses into one superordinate model. The unifying conceptualization implies that a difference between an optimized design and a suboptimal design would be most pronounced for medium-ability learners, but less accentuated for low-ability or high-ability learners.

Analytical Approach for the Unifying Conceptualization

The unifying conceptualization is reflected in a corresponding analytical approach. The regression model treats the aptitude variable as a continuous predictor (our verbal descriptions and the illustrations in Figure 1 refer to low, medium, and high ability ranges for illustrative purposes only). The challenge is to define one single model that is able to cover all corresponding regression lines. Without violation of the underlying principle of the conceptualization, the regressions of the suboptimal and optimized ability design can be linear, u-shaped, or inverted u-shaped in different combinations (see Fig. 1). The regression model to cover a linear regression line for the suboptimal visual design and an inverted u-shaped optimized visual design regression line (Fig. 1a) should contain a dummy variable coding “suboptimal” as “0” and “optimized” as “1” as well as a quadratic interaction term:

$$\mathrm{perf}={b}_0+{b}_1\mathrm{abi}+{b}_2\mathrm{dumOpt}+{b}_3\mathrm{abi}\times \mathrm{dumOpt}+{b}_4{\left(\mathrm{abi}\times \mathrm{dumOpt}\right)}^2$$

(1)

with “perf” indicating performance, “abi” ability, “dumOpt” the dummy variable, and b_i the regression weights. In this case, b₁ would indicate the slope of the suboptimal linear regression. Since the interaction term itself is not linear (there is a positive interaction from low to medium and a negative interaction form medium to high), the interaction has an inverted u-shaped progression, and this should be indicated by a negative b₄ coefficient.

In the case of a linear progression with the optimized visual design and a u-shaped progression with the suboptimal visual design (Fig. 1c), the dummy variable should be reverse coded with “0” indicating “optimized” and “1” indication “suboptimal”:

$$\mathrm{perf}={b}_0+{b}_1\mathrm{abi}+{b}_2\mathrm{dumSub}+{b}_3\mathrm{abi}\times \mathrm{dumSub}+{b}_4{\left(\mathrm{abi}\times \mathrm{dumSub}\right)}^2$$

(2)

In contrast to the previous case, b₄ should be positive indicating a u-shaped interaction (a negative interaction from low to medium and a positive interaction from medium to high).

However, the three other cases (Fig. 1b–e) could not be covered with this multiple regression equation as in these cases both (suboptimal and optimized) conditions have non-linear progression lines. To solve this issue, it was decided to include two dummy variables in the regression equation, one indicating the optimized visual design (as dumOpt, see Eq. 1) and one indicating the suboptimal visual design (as dumSub, see Eq. 2):

$${\displaystyle \begin{array}{c}\mathrm{perf}={b}_0+{b}_1\mathrm{abi}+{b}_{21}\mathrm{dumOpt}+{b}_{22}\mathrm{dumSub}+{b}_{31}\mathrm{abi}\times \mathrm{dumOpt}+\\ {}{b}_{32}\mathrm{abi}\times \mathrm{dumSub}+{b}_{41}{\left(\mathrm{abi}\times \mathrm{dumOpt}\right)}^2+{b}_{42}{\left(\mathrm{abi}\times \mathrm{dumSub}\right)}^2\end{array}}$$

(3)

Since dumOpt and dumSub are perfectly colinear, the coefficients b₂₁ and b₂₂ as well as b₃₁ and b₃₂ are redundant. Therefore, one of each pair can be removed from Eq. 3. Note that the quadratic terms express the non-linear regression specific for each visual design and they are, therefore, not redundant. After removing (arbitrarily) the b₂₂ and b₃₂ terms, the final multiple regression model results:

$${\displaystyle \begin{array}{c}\mathrm{perf}={b}_0+{b}_1 abi+{b}_2\mathrm{dumOpt}+{b}_3\mathrm{abi}\times \mathrm{dumOpt}+\\ {}{b}_{41}{\left(\mathrm{abi}\times \mathrm{dumOpt}\right)}^2+{b}_{42}{\left(a\mathrm{bi}\times \mathrm{dumSub}\right)}^2\end{array}}$$

(4)

The relevant regression coefficients are b₄₁ and b₄₂ indicating the specific progression of the regression lines presented in Fig. 1: (a) b₄₁ < 0 and b₄₂ = 0; (b) b₄₁ < 0 and b₄₂ > 0; (c) b₄₁ = 0 and b₄₂ > 0; (d) b₄₂ > b₄₁ > 0; (e) b₄₁ < b₄₂ < 0. However, these five different combinations of the two visual design regression lines all correspond to the same underlying principle of the unifying conceptualization and can be summarized by the simple hypothesis “b₄₂ > b₄₁” indicating a higher benefit from the optimized visual design than from the suboptimal visual design in the medium ability range compared to the low and high ability range.

The Present Study: Aims and Research Questions

The aim of the present study is to empirically substantiate the theoretical conceptualization of a unifying view of the ability-as-compensator and the ability-as-enhancer hypotheses. Therefore, an experiment (Study 1) was conducted as well as data from a larger research project with four experiments were reanalyzed (Study 2).

Study 1 focused on performing a specific spatial task with static and dynamic visual designs. The task requires to mentally integrate symbolic information in a spatial structure. This task resembles a basic spatial process in learning about complex spatial structures from visualizations with different zoom levels. The variation of visual design was realized using a within-subjects experimental design. For this task, the dynamic visual design can be considered as the optimized and the static visual design as the suboptimal visual design (for more details, see the “Study 1: Experimental Approach” section). No additional text accompanied the visualizations. A measure of spatial ability was selected that is rooted in the cognitive-processing tradition and measures the capacity of visual-spatial working memory (VSWM). This measure requires to integrate spatial information and varies the amount of to-be-rehearsed information in VSWM as a span measure. These features match mental processes of the experimental task. This experiment was thus designed to study the interactions between spatial ability and visual design variants in a focused and controlled way.

However, the advantage of high internal validity of the experiment might come along with lower ecological validity for learning about more complex, meaningful phenomena. Therefore, we reanalyzed in Study 2 pooled data from four published experiments with the same multimedia learning materials and examined whether the proposed unifying conceptualization would also be observable in multimedia learning with meaningful learning materials. The four experiments had been conducted in the context of a larger research project on learning with static and dynamic visual designs, at which the dynamic visual design can be considered as the optimized visual design (for more details, see the “Study 2: Reanalysis of Four Multimedia Experiments” section). In all of the experiments, multimedia learning materials consisting of a visualization and written text were presented, and in all of the experiments, the same variation of visual design (static picture vs. animation) was realized. Spatial ability was measured with the same ability test in all four experiments. This ability test is rooted in the factor-analytic tradition and measures the broad spatial factor of visualization. The measure is most commonly used in multimedia learning experiments (cf. Höffler, 2010). The data were collected in laboratory setting as well as school settings. The reanalysis thus contributes to examining the ecological validity of the unifying conceptualization.

The main hypothesis for the experiment as well as the reanalysis of the existing multimedia learning data addressed the unifying conceptualization of the ability-as-enhancer and -compensator hypotheses: The regression lines reflecting the relation between ability and task performance will not only differ for the suboptimal (here: static) visual design and the optimized (here: dynamic) visual design, but also change as a function of ability. In particular, the regression line for the optimized dynamic design will be steeper than the regression line for the suboptimal static design in the low-to-medium ability range, whereas in the medium-to-high ability range, the regression will be flatter for the dynamic compared to the static design. As a consequence, the largest difference between suboptimal static design and optimized dynamic design will be found for medium ability. If learners’ spatial abilities are low, then their attempts to perform the task might not be very effective, regardless of the design of the visualization. If learners’ spatial abilities are in a medium range, then their attempts to perform the task will be supported through the optimized, dynamic visual design, compared with the static design. A consideration of learners in the low-medium ability range would thus result in an ability-as-enhancer interaction. If learners’ spatial abilities are high, then performing the task will be successful, regardless of the design of the visualization. Thus, a consideration of learners in the medium-high ability range would result in an ability-as-compensator interaction.

Study 1: Experimental Approach

The goal of this experiment is to examine the aptitude-treatment interaction between spatial ability and static versus dynamic visual designs with a visual-spatial task. This task is termed mental integration task. It requires to link symbolic detail information with spatial locations, as well as to memorize and to integrate these combinations in a simple spatial structure. The mental integration task is thought to reflect common visual-spatial processes when learning about spatial structures required in domains such as anatomy, mechanical systems, architecture, or geography. Information obtained from two-dimensional visualizations of complex spatial structures depends on the zoom level. If the zoom level is adjusted such that details can be viewed, then the overall structure is not visible at the same time. If the zoom level is adjusted such that the overall structure can be overviewed, then the details are not visible. Thus, a learner has to mentally integrate detail-location combinations in order to know about the details at particular spatial locations.

The dynamic variant of the present mental integration task supports the formation of the link between the detail information and its location through an animation. In the static variant, the link has to be formed mentally from two separate static pictures. Thus, the dynamic visual design is considered to be the optimized visualization, and the static visualization as the suboptimal visualization. In both variants of the task, learners have to maintain a varying number of such links in integrated form. The integration is particularly important because memory is tested with regard to the integrated form in the mental integration task (see the “Method” section of Study 1 for details).

Corresponding to the mental integration task, the capacity of visual-spatial working memory (VSWM) is the spatial ability measure in the experimental study. VSWM functioning and capacity is considered an important and central facet of spatial ability (Hegarty et al., 2006; Hegarty & Waller, 2005; Miyake et al., 2001). More specifically, VSWM has been appraised as a key factor in learning about spatial configurations and structures, because different parts of the configuration have to be maintained and mentally integrated (Münzer, 2012; Hegarty et al., 2006). Accordingly, the present measure of VSWM capacity requires to imagine putting subsequently presented symbols (consecutive letters of the alphabet) into adjacent cells of an imagined spatial matrix. The separately presented pieces of information form an integrated, meandering sequence through the matrix. In previous studies with samples drawn from the same student population, this measure of VSWM capacity showed a wide range of individual differences (Münzer, 2012; Münzer et al., 2018). In a previous study, this measure of VSWM capacity was related to the static variant of the mental integration task with r = .40 (Münzer et al., 2018). Furthermore, this measure of VSWM capacity predicted spatial layout learning from direct navigation experience in a real environment (Münzer et al., 2012).

The items of the mental integration task vary with respect to the number of to-be-integrated detail-location-combinations in order to provide some fluctuation in demands between items. However, all items require the same mental processes. Performance in the mental integration task is ultimately indicated by the mean accuracy across all presented items.

Method

Participants

All participants (N = 79) were students of a German University. They received course credit for their participation. Two participants were excluded because they were not able to solve the easiest level of the measure of VSWM capacity (the mental pathway span task, see below). The final sample consisted of N = 77 (57 female, 20 male) participants. Their mean age was M = 21.51 years (SD = 2.38 years) and ranged from 18 to 32 years.

Materials

The capacity of visual-spatial working memory was measured by means of a mental pathway span task, a variant of the Brooks task (Brooks, 1967; Quinn & Ralston, 1986) with computer-based administration. The task required the mental imagination of an empty 5 × 5 matrix. Participants listened to a sequence of verbal-auditory assignments via headphones. These assignments described a path of adjacent cells through the imagined matrix (Fig. 2). The indicated cells should mentally be filled with characters in alphabetical order, with the start-cell always located in the second column, second row (from top-left) containing an “A.” The assignments thus instructed participants to imagine putting the characters in adjacent cells of the matrix (e.g., “in the cell to the left put a ‘B’”; “in the cell above put a ‘C’”). After the description of the path, participants clicked the filled cells on a 5 × 5 matrix shown on the computer screen. (The particular letters themselves did not play a particular role in the task and had not to be memorized, because they followed the alphabetical order.) The number of assignments to be imagined (“span level”) varied between five and 13 (maximum level). The present task does not change the size of the matrix, but only the length of the sequence (in contrast to variants used by Bosco et al., 2004, and Vecchi & Cornoldi, 1999). For each span level, there were three trials. The test started with a span level of eight, corresponding to average performance found in previous studies (Münzer, 2012). If at least two out of three trials of a level were correct, then the level was increased. If only one trial out of three trials was correct, the level was decreased. The test terminated if none of the three trials of the level was correct. Particular span levels were only tested once. That means if someone failed in a specific level or was successful, this participant would not repeat this level again. This adaptive procedure aimed at finding the participant’s highest span level that she/he could achieve, starting with the level corresponding to average performance. The mental pathway span of a particular participant corresponds to the level in which at least two out of three paths were correctly remembered.

In the mental integration task, participants were asked to mentally integrate a series of detail and location information pieces within a larger spatial structure. The spatial structure had the simple form of a 3 × 3 matrix, with each cell representing a room. Within a room, a symbol could be found on a wall (Fig. 3a). These symbols were not visible when looking at the 3 × 3 matrix structure. The combination of a certain location (a cell in the matrix) and the detail information to be found there (the symbol on the wall) was provided as two separate static pictures (static visual design condition) or as an animation (dynamic visual design condition). In the static visual design condition, the cell was indicated in the matrix by a red square and the symbol was shown separately (Fig. 3a). In the dynamic visual design condition, an animation showed a continuous zoom starting from an “aerial” view on the 3 × 3 matrix, then focusing on the location cell and finally “flying” into the cell to view the wall with the symbol on it. The animation thus started with the “location” picture and ended with the “detail” picture of the static version. It linked them with continuous zoom and camera movement. One item consisted of two to five such location-symbol combinations. These were shown subsequently in the study phase. Participants were asked to memorize all locations with their corresponding detail information. After the study phase of a trial, a test picture appeared which showed an integrated partial structure (part of the matrix) with visible symbols. This partial structure (Fig. 3b) always consisted of three adjacent cells (rooms) and contained two of the locations and symbols presented in the study phase. In addition, four alternative pictures showed different possible positions of the partial structure within the complete structure (matrix). Only one of the four alternatives matched simultaneously all the location-detail information presented in the study phase. Participants indicated as accurately and quickly as possible which of the four alternatives was the position of the partial structure within the overall structure. Static and dynamic conditions were blocked. In each visual design condition (static, dynamic) and for each span length (2, 3, 4, 5), there were eight items, resulting in 32 (4 × 8) items per visual design condition. Participants thus received a block of 32 static tasks and a block of 32 dynamic tasks. The order of these blocks varied between participants. Within a visual design condition, items were randomized. The presentation was system-controlled. Participants were not able to stop or repeat the presentation. The presentation time was 6 s for the presentation of one link between a location–detail combination of the spatial structure. Correspondingly, one dynamic zoom video had a duration of 6 s. Because of the zooming-in, the complete spatial structure was visible when the video started but disappeared during the animation. In the static condition, there were two static pictures, one showing the location in the spatial structure and one showing the detail. Each picture was shown for 3 s. Thus, presentation times were the same in both conditions. The mental integration task was presented by the experimental software E-Prime Version 2.0 (Psychology Software Tools Inc., 2012).

Procedure

First, the mental pathway span task was conducted. Then, the mental integration task followed. Finally, demographic data were collected. The participants were tested in groups of up to six persons in an experimental laboratory with separation panels between the desks.

Analytical Approach for Study 1

Spatial ability was measured with the mental pathway span task which is a measure of visual-spatial working memory capacity. The analyses included the mental pathway span as a continuous predictor variable. It was z-standardized across all participants. Visual design (static vs. dynamic) was included as a categorical within-subject factor in the analyses using two dummy variables (see the “Analytical Approach for the Unifying Conceptualization” section). The two-factor interaction was tested with a multi-level-model (MLM) with two levels (visual design was on Level 1 and subjects on Level 2).

According to the main hypothesis, the relationship between spatial ability (measured with the mental pathway span) and performance in the mental integration task (mean accuracy of the location-detail combinations integrated and remembered correctly over all trials) should correspond to the following pattern in this experiment: The benefit of the dynamic visual design compared to the static visual design should increase with mental pathway span in the low-to-medium range of mental pathway span, meaning that the slope of spatial ability and mental integration performance would be comparatively steeper for the dynamic compared to the static visual design (cf. left sides of Fig. 1a–e). However, this benefit should decrease for greater mental pathway span (i.e., in the medium-to-high range of mental pathway span), meaning that the slope of the relation of spatial ability and mental integration performance would be comparatively flatter for the dynamic compared to the static visual design (cf. right sides of Fig. 1a–e). As a result, differences between static and dynamic visual designs should be highest for participants with a medium mental pathway span compared to participants with either low or high mental pathway spans.

To account for different possible linear and non-linear forms of the regression lines that all correspond to the unifying conceptualization (illustrated in Figure 1a–e), the regression model included two dummy variables for coding the visual design. As explained above (in the “Analytical Approach for the Unifying Conceptualization” section), coding the treatment with one treatment variable (e.g., static = 0, dynamic = 1) and including a corresponding quadratic interaction term would make specific predictions about the linear and the non-linear regressions. With the coding static = 0, dynamic = 1 for the dummy variable, a linear regression would be predicted for the static visual design and a non-linear would be predicted for the dynamic visual design (with a coding static = 1, dynamic = 0, reverse predictions would be made). Therefore, a second dummy variable for visual design was included (as described above, Eq. 4): The two dummy variables for coding the visual design condition were “dumOpt” (0 = static, 1 = dynamic) and “dumSub” (0 = dynamic, 1 = static).

The linear interaction was considered by the interaction term of the first dummy variable with mental pathway span. Note that the second dummy variable was only considered for its corresponding quadratic interaction term, but not for the main effect or for the linear interaction term, as both dummy variables are perfectly complementary (see the “Analytical Approach for the Unifying Conceptualization” section).

Follow-up illustrative analyses were conducted comparing performances in the mental integration task between the static and the dynamic visual designs for low-ability, medium-ability, and high-ability participant groups separately. For these analyses, three distinct ability groups were built: a group of low-ability participants with a span < 8, a group of medium ability participants with a span of 8 and 9, and a group of high-ability participants with a span > 9. These span values were based on the known distribution of the mental pathway span from two previous studies. One study (Münzer et al., 2018) comprised university students, as in the present study. The other study (Münzer, 2012) involved N = 152 younger students from secondary schools who had participated in two experiments on real-world spatial learning. These students thus belonged to a population with a presumably broader distribution of spatial ability. Data from both studies show that mental pathway spans of 8 and 9 formed the medium ability category.

Results and Discussion

Table 1 shows the descriptive results of the mental pathway task (span) and the mental integration task in the static and dynamic visual design (accuracy). Apparently, there was no general advantage of the dynamic visual design (mean accuracy = 0.76) for the accuracy in the mental integration task, compared with the static visual design (mean accuracy = 0.75). Results show a moderate right-skewed distribution of the mental pathway span task and left-skewed distributions of the mental integration task with static and dynamic visual design. The maximum span of the mental pathway task of 13 was only reached by two participants.

Table 1 Descriptive statistics of the mental pathway span and the mental integration task (separated for static and dynamic visual design). Acc = accuracy. RT = reaction times (in milliseconds)

Full size table

Table 2 shows the results of the fixed effects of the analyzed multi-level model (MLM, see the “Method” section for Study 1). The significant quadratic terms for the interaction of mental pathway span with visual design, dynamic and static, b [(mental pathway span × dumOpt)²] = −.08, p < .001 and b [(mental pathway span × dumSub)²] = −.04, p = .036 (Table 2), indicate non-linear relations with negative quadratic trends for both presentation forms. The negative sign indicates an inverted U-progression (see also Fig. 1e). However, the effects were significantly different (z = −2.60, p = .009), indicating that the non-linear and inverted U-progression was more strongly accentuated for the dynamic visual design compared to the static visual design. Correspondingly, this reflects a higher mental integration performance gain for the group in the medium ability range with the dynamic compared to the static visual design. The main effect of mental pathway span indicates the positive relation between mental pathway span and mental integration performance, b (mental pathway span) = .08, p < .002.

Table 2 Results for accuracy of the fixed effects of the MLM

Full size table

For illustrative purposes, participants were assigned to three ability groups, as described above. There were n = 35 in the low ability group (span < 8); n = 22 in the medium ability group (8 ≤ span ≤ 9); and n = 20 in the high-ability group (span > 9). Fig. 4 illustrates the differences in the progressions for static vs. dynamic visual design with regard to the relation between mental pathway span and mental integration accuracy for the three ability groups. For the dynamic visual design, the relation between mental pathway span and mental integration accuracy is best described with an inverted U-shaped, nonlinear progression, while for the static visual design, this nonlinear progression is far less pronounced (and hardly discernible in Fig. 4). This pattern corresponds to the principle of the unifying conceptualization. There was a steeper accuracy gain from the low-to-medium ability group in the condition of the dynamic visual design compared to the static visual design. This pattern reflects the ability-as-enhancer effect. The ability-as-compensator effect can be found between the medium and high-ability group. There was a lower accuracy gain in the condition of dynamic visual design compared to the static visual design (Fig. 4).

Additional analyses compared the accuracy in the mental integration task between the static and dynamic design for each ability group separately (the descriptive data are depicted in Fig. 4). For the low-span group, performance did not differ between the static and the dynamic visual design, t(34) = 0.73, p = .765, Cohen’s d = 0.09, CI = [−0.39, 0.56]. The same was true for the high-span group, t(19) = 0.17, p = .433, Cohen’s d = 0.03, [−0.61, 0.67]. The difference between static and dynamic visual design just failed to reach significance for the medium-span group, t(21) = 1.60, p = .062, Cohen’s d = 0.20, [−0.41, 0.81]. For the medium-span group, a descriptively higher difference between visual design conditions in terms of effect size was found, compared to the low-span or the high-span groups.

The results of the experiment — and especially our analytical approach to examine the overall pattern with nonlinear regressions — revealed first empirical evidence for our proposed unifying conceptualization of the ability-as-enhancer and the ability-as-compensator hypotheses. Dividing the sample in three ability groups served primary for illustrative purposes: Also, these results speak — admittedly only descriptively — for the notion that the optimized visual design especially supported performing the mental integration task if learners’ ability was in the medium range, but not if learners’ ability was in the lower or higher ability range. However, the effect for learners in the medium ability range failed to reach statistical significance and was only marginal. The fact that we observed an overall significant effect with the nonlinear regression approach but failed to illustrate this effect in terms of significance by means of dividing the sample in subgroups and conducting t-tests may be attributable to a power problem (Irwin & McClelland, 2003). It is important to note that the latter results of the t-tests for each subgroup do not speak against the unifying conceptualization (but descriptively support it), since these t-tests were primarily conducted for illustrative purposes. The major analysis that drives the claim the data support the unifying conceptualization is the nonlinear regression approach. Hence, it can be concluded that the unifying conceptualization (including both the enhancer and the compensator interaction) was confirmed.

In Study 2, we wanted to substantiate our findings from Study 1, and examine whether the proposed unifying conceptualization of the aptitude-treatment-interaction would also be observable with more complex and meaningful multimedia learning materials that include visualizations and text. Meaningful multimedia learning materials that require learners to form a cognitive schema given inter-related pieces of information presented in visual and verbal codes might change the role of spatial ability. The predictive value of spatial abilities for learning with visualizations might be less pure and somewhat overshadowed when the visualizations are accompanied with text, since in this case spatial abilities also need to account for the comprehension of the text and the ability to integrate text with visualizations (Hegarty & Sims, 1994). Thus, in contrast to Study 1, which adopted an experimental approach that allowed us to focus on visual processing, the role of spatial ability might be an indirect one with complex and meaningful multimedia materials. Therefore, a higher number of participants representing a wide range of spatial ability was included to look at general trends for the relation between learning outcomes and ability in treatment conditions that varied the visual design.

Study 2: Reanalysis of Four Multimedia Experiments

We reanalyzed pooled data obtained in a larger research project comprising four experiments (Kühl, 2021; Kühl & Münzer, 2019; Kühl et al., 2018a, 2018b) utilizing the same instructional materials. The instructional materials were typical multimedia presentations, i.e., a visualization was accompanied with explanatory verbal information (written text). Each of the four experiments included specific learning conditions. However, the learning content, the visual design variants (static picture vs. animation), and the measure of spatial ability were the same in all four experiments. In all four experiments, there were retention knowledge questions and transfer knowledge questions on the learning content with only small changes on specific questions between the experiments. Spatial ability was assessed with a shortened form of the Paper Folding Test (PFT; Ekstrom et al., 1976) in all four experiments. The PFT is a measure of the broad “visualization” factor of spatial ability rooted in the factor-analytic tradition and is the most used spatial ability measure in multimedia learning research (cf. Höffler, 2010).

The instructional material used in the four experiments was about the topic of Kepler’s second law. This law describes how the velocity of a planet changes while orbiting the sun on an ellipse depending on the changes of the distance between sun and planet. The dynamic visualization (animation) possessed a clear informational advantage over the static picture, since only the animation depicted the changes in the planet’s velocity directly. The static picture only depicted the ellipse, the sun, and the planet as well as graphic indications of different spatial regions of the ellipse corresponding to the distance between the sun and the planet. This graphical indication was provided both in the picture and in the animation. It was intended to support the understanding of the relation between distance and velocity considering differences between the indicated spatial regions. All four single experiments have shown a main effect of visualization with an advantage of the dynamic visual design compared with the static visual design, especially for transfer tasks and partly also factual knowledge scores (Kühl, 2021; Kühl & Münzer, 2019; Kühl et al., 2018a; Kühl et al., 2018b). Thus, the dynamic visualization is demonstrably the optimized visual design for this topic and the static picture is demonstrably the suboptimal visual design.