Images are an excellent medium to convey information, simple or complex. They serve as stimuli in a wide range of research areas, such as emotion, attention, or aesthetics research, among others (e.g., Bradley & Lang, 2007; Huston et al., 2015; Lindsay, 2020). In the present article, we introduce an open access database of normed Bicolor Affective Silhouettes and Shapes (BASS). Each of the BASS images consists of 300 × 300 black and white pixels, providing a computationally economical and visually uniform layout.

New freely available normed images enrich research by providing a greater diversity of stimulus choices, including potential replications of findings with different kinds of images to ensure generalizability. Additionally, participants may become habituated to the images in existing databases, meaning they may process them differently on repeated exposures (Baker et al., 2010; Foa & Kozak, 1986; Ramaswami, 2014). The present BASS database may be used, for instance, for conceptual replications and novel research in such diverse areas as space-valence congruence effects (e.g., Meier & Robinson, 2004), priming of evaluations (e.g., Fazio, 2001), affective priming (e.g., Hermans et al., 2001), or emotional facilitation (e.g., Schupp et al., 2003).

More importantly, the BASS also has very specific advantages compared to other available stimulus sets: Silhouette images allow for the study of meaning-related processing without many of the confounds that are present in words and pictures. Whereas words carry inherent differences for which one would have to control, such as their length and their phonological transparency (or orthographic depth; see Aro & Wimmer, 2003; Frost et al., 1987; Schmalz et al., 2016), these confounding differences are absent in silhouettes. Likewise, whereas pictures differ from one another in terms of color heterogeneity, depth cues, spatial perspective, or feature complexity, silhouettes are more uniform and less complex, and can therefore be more easily equated for these factors without corrupting their meaning entirely.

The main purpose of the present research article is to demonstrate that the BASS images evoke a very wide range of affective representations in participants, despite the images' relative simplicity. We demonstrate this primarily by collecting valence and arousal ratings in a large and nationally representative US (Western) sample. We provide further evidence with analogous ratings from an extensive (Eastern) sample from Mainland China and by demonstrating characteristic valence-color congruence effects in an example response time experiment using BASS images (in an Austrian sample).

This methodological approach validates several important characteristics of the BASS images, relevant to but often not investigated in image sets used in psychological research. First, normative ratings of affective picture databases like the Geneva Affective Picture Database (GAPED), the Open Affective Standardized Image Set (OASIS), the International Affective Picture System (IAPS), or the Nencki Affective Picture System (NAPS) (Dan-Glauser & Scherer, 2011; Kurdi et al., 2017; Lang et al., 1997; Marchewka et al., 2014, respectively) are typically based on mono-cultural samples, with ratings predominantly collected in Western populations only. Moreover, within Western populations, samples were often restricted to student participants. Yet, ideally, behavioral image or stimulus sets should be applicable across different cultures in order to allow investigation of universal psychological phenomena and topics. The IAPS, for example, has generally been found to be valid among Western cultures (Deák et al., 2010; Grühn & Scheibe, 2008; Verschuere et al., 2001, to name a few), but less so among Asian cultures (Gong & Wang, 2016; Huang et al., 2015). Therefore, we intended from the start to collect normative rating data for the BASS images from at least two large, representative samples of different cultural origin: one Western (US) and one Eastern (Mainland China) sample. Silhouette images theoretically carry potential for more culturally neutral affective representations than photographic images, due to the latter's culture-specific details (such as depictions of regionally different vegetation, landscapes, food, people, architecture, traffic, etc.), which are often subtle and difficult to control for. We tested this notion by directly comparing the consistency of the ratings from our Western and Asian samples.

Second, the bicolor silhouettes tackle one particularly virulent problem posed by online studies: accurate color rendition of heterogeneous colors on different physical displays. By virtue of their high realism, color photos of natural scenes carry substantial color heterogeneity; in fact, their realism relies in part on this color heterogeneity (cf. Hansen & Gegenfurtner, 2009). It has indeed been shown that differences in the display of photographs, such as varying brightness or resolution, can substantially affect the evaluation of emotional content (e.g., Lakens et al., 2013; Mould et al., 2012). However, the huge variety of physical devices used to display images (e.g., smartphones, tablets, and laptops by different manufacturers), together with large variance in idiosyncratic monitor settings and lighting conditions on the viewer's side, compromises accurate color rendition, especially in online studies (Anwyl-Irvine et al., 2020); this problem grows with the complexity of the image's color palette. In contrast, the black-and-white silhouettes of the BASS database were created and selected (from existing silhouettes) under the premise of having to convey their affective meanings largely by their contours alone and less by their exact color rendition. In this way, the BASS images reduce the heterogeneity of the necessary color palette and the problems associated with this color heterogeneity. We demonstrated this by collecting participants' affective ratings using the BASS images in inverse colors (pilot study).

However, this is not to say that colors could not be manipulated and used to study their effects (e.g., on affective ratings) in studies with BASS images. For photographs, manipulations of color rendition can easily corrupt image meaning altogether (cf. Oliva & Schyns, 2000; Mould et al., 2012). Such manipulation requires complicated technical procedures (e.g., Orzan et al., 2007), and, even more fundamentally, one can hardly be sure how a given color manipulation in a photograph will affect its emotional content: Setting up a proper balance of hue, saturation, and brightness is difficult enough when calibrating the display of a single color (Wilms & Oberfeld, 2018), let alone for a photograph with thousands of interacting colors. In contrast, the BASS images were created with the intention of keeping an unambiguous meaning by way of their contours, even if presented in varying colors. Thus, BASS images offer an extremely easy way to manipulate, study, or control color: They can be converted effortlessly to any other two colors, and, as long as the two colors are discernible from each other, the content remains unambiguous. Below, we also demonstrate how such color manipulations of the BASS images could be used in research by investigating known color-valence congruence effects using variously colored BASS images in an example experiment.
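
To illustrate how simple such a conversion is, the following R sketch remaps a bicolor silhouette to any two target colors by thresholding pixel values. The filename and color choices are hypothetical; this is not the authors' original script, merely a minimal sketch using the png package.

```r
# Hypothetical sketch: recolor a bicolor BASS silhouette to any two colors.
# Assumes a local file "bass_001.png"; the target colors are arbitrary.
library(png)

recolor_bass <- function(infile, outfile,
                         fg = c(0, 0.6, 0),       # new foreground (black -> green)
                         bg = c(0.5, 0.5, 0.5)) { # new background (white -> gray)
  m <- readPNG(infile)                       # pixel values in [0, 1]
  if (length(dim(m)) == 3) m <- m[, , 1]     # one channel suffices for bicolor
  out <- array(0, c(dim(m), 3))
  for (k in 1:3) out[, , k] <- ifelse(m < 0.5, fg[k], bg[k])
  writePNG(out, outfile)
}

recolor_bass("bass_001.png", "bass_001_green_on_gray.png")
```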

Third, large photographic images are generally not optimal for fast decision tasks because of their diverse visual characteristics and complex semantics: Various depicted details and their many subtle visual properties might affect each participant very differently, influencing processing time and potentially confounding results (especially where small effects are expected and processing-time differences of fractions of a second can affect outcomes). In contrast, the content of the less ambiguous (few or single) objects in each BASS image is easy to grasp quickly and better suited for a task involving fast decisions, unaffected by visual or semantic noise. In addition, their simple and abstract nature makes BASS silhouettes more resistant than photographs to incidental but potentially meaning-corrupting influences such as display size or viewing distance (Anwyl-Irvine et al., 2020). Large file sizes of photographic images used in experiments typically pose technical problems too, especially in connection with online research, which is becoming more and more important in psychological science. For one, common browsers are not optimized for precisely timing image presentation, and physical display times can be strongly affected by loading large image files (e.g., Garaizar & Reips, 2019). For another, some participants might be reluctant (or even unable, where infrastructure is weak) to download many large files. It is not unusual for an experiment to require several hundred different stimuli (especially when each stimulus is presented repeatedly in different forms, e.g., varying hue; e.g., Huang et al., 2015; Kawai et al., 2020; Oliva & Schyns, 2000). Taking the average file size of IAPS images as a benchmark (262 KB; cf. averages of 92 KB for OASIS and 789 KB for GAPED), just 100 images amount to about 26 MB, already a notable hurdle for potential participants. Participant dropout due to inadequate internet speed could even introduce confounds via selective attrition (Zhou & Fishbach, 2016). With an extremely economical average file size of 2.2 KB (max. 6.0 KB), the BASS images minimize the issues of display timing and download. Our follow-up experiment on color-valence congruence effects also serves to demonstrate these usability advantages in fast decision tasks online.

Fourth, silhouettes, contours, and shapes themselves constitute a field of scientific interest (e.g., Bar, 2007; Biederman, 1987; Marr, 1982), and the affective qualities associated with geometric contour properties may be of interest to future researchers (e.g., Leder et al., 2011), although they have not been a major focus of research on contours so far. This could be particularly interesting when comparing human object recognition with artificial object recognition (e.g., Rajalingham et al., 2018): While objects as simple as silhouettes may be processed by machines similarly to humans, affective responses are (currently) only evoked in humans.

Creating the BASS

Before carrying out the large-scale ratings for the BASS, we assessed several issues in a pilot study (n = 180, students of the University of Vienna). Detailed information on the pilot study can be found in the online supplementary material available at https://osf.io/anej6/. First, previous researchers suspected, but never tested, possible contamination of affective ratings when participants had to rate valence and arousal in one experiment (Kurdi et al., 2017), since prior studies showed that question order in surveys can influence judgments (Lau et al., 1990; Wilcox & Wlezien, 1993). We put this assumption to the test and split participants into two groups: one rating only valence (or only arousal) and another rating images on both valence and arousal. Results indeed confirmed contamination when two affective ratings were required in the same experiment. Thus, we decided to use only one rating measure (either valence or arousal) per participant in the main study. In the pilot study, we simultaneously tested the relative color-independence of the silhouette ratings by studying possible effects of color inversion of silhouettes (black-on-white vs. white-on-black) on ratings. High positive correlations between the (original) black-on-white images and the inverted white-on-black images suggested that there is no apparent preferred color mode for ratings of our silhouettes (clarity ratings: r[618] = .866, 95% CI [.845, .885]; valence: r[618] = .918, 95% CI [.905, .930]; arousal: r[618] = .719, 95% CI [.678, .755]). The ratings from the pilot study, given by Austrian psychology students, also helped us to decide on a reasonable exclusion threshold for intrarater correlation, which we subsequently used as an individual reliability measure for data quality control in the main study. Since intrarater correlation turned out to be lower in the arousal than in the valence rating condition, we determined different exclusion thresholds depending on the rating condition (see Data Exclusion).

Clarity ratings provided by the first as well as a second online prestudy (n = 50, Chinese participants) helped to identify unclear or ambiguous images in the original set of 620. After excluding those with the lowest clarity ratings, the subset that was used in the large-scale ratings in the US and China comprised a total of 583 silhouettes.

Method

Participants

For the US rating, a total of 806 participants were recruited via Prolific (www.prolific.co). The study was divided into two parts: one sub-study comprised the valence rating task (n = 402), the other the arousal rating task (n = 404). The sample was collected as a "representative sample for the United States of America" (sex, age, ethnicity, according to the Simplified US Census; excluding people under the age of 18), and participants received £2.17 upon study completion.

The experiment had to be completed at a desktop computer. Participants could read all relevant information on the welcome page and consented to participate by clicking the consent button at the bottom.

After exclusion (see Data Exclusion), valid data from 777 participants remained (arousal rating: 386; valence rating: 391), of whom 402 were female and 375 were male. Ages ranged from 18 to 78 years, with a mean age of 45.19 years (SD = 16.09). Completion took an average of 17 min (SD = 4 min, median = 15 min).

For the Chinese sample, the recruitment (and compensation) of online participants was managed by Dynata. In the supplementary materials, we provide more information on Dynata's incentive and compensation system. Chinese participants were asked to provide demographic data after consenting to the study. We only allowed participation for those who indicated (1) an age between 18 and 95 years, (2) Mainland China as their country of residence, and (3) any gender information. A total of 1341 registered participants submitted their rating data, of which 472 had to be excluded due to poor data quality (for more details see Data Exclusion). We aimed for a representative sample (sex, age; excluding people under the age of 18 years) for the People's Republic of China, but since our study required PC and Internet access, relatively younger age groups are over-represented.

After exclusion, valid data from 869 participants remained (arousal rating: 455; valence rating: 414), of whom 419 were female and 450 were male. Ages ranged from 18 to 81 years, with a mean age of 34.5 years (SD = 9.4). Completion took on average 25 min (SD = 116 min, median = 14 min).

An overview of the demographic data from all included participants (US and China) is provided in Table 1 (Appendix A).

Materials

The majority of the silhouettes were acquired via Google Images (images.google.com) under Creative Commons licenses, by searching either directly for silhouettes or for photographic pictures that we edited into black-and-white silhouettes using the GNU Image Manipulation Program (The GIMP Development Team, 2019) and R (R Core Team, 2020).

We restricted our search to images labeled “available for reuse with modification.” Most silhouettes are from pixabay.com (n = 444), cleanpng.com (n = 33), and svgsilh.com (n = 29). Eighteen additional silhouettes were created for this project by a professional illustrator. We collected a wide range of images, depicting humans, animals, objects, and scenes, which we indicated by a category column in the database for easier stimulus selection. Special focus was put on collecting images of a wide range of valence (positive, negative, and neutral) as well as arousal (low, medium, and high) levels. Every image was scaled and/or cropped to a size of 300 × 300 pixels. All images were checked (or converted) to contain only fully black (RGB: 0, 0, 0) and fully white (RGB: 255, 255, 255) pixels.
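
A minimal R sketch of such a bicolor check (assuming the png package and a hypothetical local folder bass/; this is not the original conversion script) could look as follows:

```r
# Hypothetical sketch: verify that images contain only fully black (0) and
# fully white (1) pixel values. Assumes PNG files in a local "bass/" folder.
library(png)

is_strictly_bicolor <- function(file) {
  m <- readPNG(file)
  if (length(dim(m)) == 3) m <- m[, , seq_len(min(3, dim(m)[3]))]  # drop alpha
  all(unique(as.vector(m)) %in% c(0, 1))
}

files <- list.files("bass", pattern = "\\.png$", full.names = TRUE)
all(vapply(files, is_strictly_bicolor, logical(1)))  # TRUE if all pass
```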

Procedure

Online Rating Study – US

After giving their informed consent, participants saw the instruction page with an explanation of the terms valence and/or arousal (see Figs. 11 and 12 in Appendix A), depending on the participant's rating condition, and three example images (selected based on the pilot studies as average in both arousal and valence; none of these images were used in the subsequent task for the given participant). The experiment started as soon as the "Start" button at the bottom of the instruction page was pressed. Each black-and-white silhouette stimulus (300 × 300 px) was presented on a grey background (RGB: 128, 128, 128) for 2 s. When this time had elapsed, the image disappeared and was replaced with the rating scale, a line ranging from "very low" on the left to "very high" on the right, with nine equally spaced tick marks (see Fig. 13 in Appendix A). Participants could enter their rating by clicking on the scale and submit it by pressing a button that appeared below, or they could skip the rating. After confirming their choice, the next image appeared on the screen, and so on. A total of 145 or 146 silhouettes were presented to each participant. Additionally, we incorporated two attention checks. Hyperlinks to the original experimental websites are available via https://osf.io/anej6/.

Attention checks

After rating every stimulus in the list, participants were shown a black-and-white image and had to indicate what was depicted on it from a list of answers (this was the same item for all participants: a clearly discernible black-and-white image of a car). In addition, we assessed participants' attention via intrarater reliability. Following the first attention check, each participant was presented with the five images they had rated highest and the five they had rated lowest (in randomized order). We calculated Spearman's correlation coefficient between the first and second ratings for these ten items.
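
For illustration, this index reduces to a single Spearman correlation per participant; a minimal R sketch with hypothetical rating vectors:

```r
# Hypothetical example: one participant's original and repeated ratings of
# the ten re-presented items (five rated highest, five rated lowest).
first_rating  <- c(9, 8, 9, 8, 8, 1, 2, 1, 2, 3)
second_rating <- c(8, 9, 9, 7, 8, 2, 1, 1, 3, 2)

cor(first_rating, second_rating, method = "spearman")  # intrarater reliability
```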

Online Rating Study – China

The procedure of the experiment was identical to that used for the US sample, with the exception that Chinese participants were asked to provide demographic data (gender, age, country of residence) after giving consent to the study. The experimental website, including the informed consent and instruction pages, was translated into Mandarin Chinese by a native speaker (a psychologist) and independently double-checked by another native speaker (a linguist).

Data exclusion

Participants were excluded according to the following criteria: (1) the rate of skipped answers was above 25%; (2) no intrarater correlation could be calculated; (3) for arousal ratings, a failed Attention Check 1 and an intrarater correlation of less than .77, or a passed Attention Check 1 but an intrarater correlation of less than .67; (4) for valence ratings, a failed Attention Check 1 and an intrarater correlation of less than .80, or a passed Attention Check 1 but an intrarater correlation of less than .70. In addition, in the Chinese rating, data from participants were excluded if they did not provide a personal identification code (generated by Dynata).
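
Expressed as code, these rules might look like the following R sketch; the data frame and its column names are hypothetical and not part of the original materials.

```r
# Hypothetical sketch of criteria (1)-(4): one row per participant, with
# skip_rate, intrarater_r (NA if not computable), passed_check1 (logical),
# and condition ("valence" or "arousal").
exclude_participant <- function(d) {
  threshold <- ifelse(d$condition == "arousal",
                      ifelse(d$passed_check1, .67, .77),
                      ifelse(d$passed_check1, .70, .80))
  d$skip_rate > .25 | is.na(d$intrarater_r) | d$intrarater_r < threshold
}

d <- data.frame(skip_rate = c(.05, .30, .10), intrarater_r = c(.90, .80, NA),
                passed_check1 = c(TRUE, TRUE, FALSE),
                condition = c("valence", "arousal", "valence"))
exclude_participant(d)  # FALSE TRUE TRUE
```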

Surprisingly, the exclusion rate for the China data (472/1341 = 35.2%) was much higher than that for the US data (29/806 = 3.6%). This may have happened for various reasons, but one major difference is that Prolific (US data) specializes in collecting data for scientific purposes, whereas Dynata serves mainly as a provider of market research data.

Results

Each image received between 91 and 119 ratings. For each country, we computed mean valence and mean arousal ratings per image.

US rating

Reliability

For valence and arousal ratings separately, we generated 1000 random split halves of our sample and calculated the Spearman–Brown reliability coefficient of the mean ratings. The mean of the correlation coefficients in the valence dimension was \(\overline{R}_{val}\) = .986 (SD = 5.51 × 10^-5, range: \(R_{min}\) = .985 and \(R_{max}\) = .986), and in the arousal dimension \(\overline{R}_{aro}\) = .921 (SD = 3.50 × 10^-4, range: \(R_{min}\) = .920 and \(R_{max}\) = .929), demonstrating extremely high reliability in both affective dimensions.
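
The procedure amounts to repeatedly splitting raters into halves, correlating the two sets of per-image means, and applying the Spearman–Brown correction; a self-contained R sketch with simulated toy data (the real rating matrices are not reproduced here):

```r
# Hypothetical sketch with toy data: 200 raters x 583 images, ratings 1-9.
set.seed(1)
ratings <- matrix(sample(1:9, 200 * 583, replace = TRUE), nrow = 200)

split_half <- replicate(1000, {
  half <- sample(nrow(ratings), nrow(ratings) %/% 2)
  r <- cor(colMeans(ratings[half, ]), colMeans(ratings[-half, ]))
  2 * r / (1 + r)                          # Spearman-Brown correction
})
c(mean = mean(split_half), min = min(split_half), max = max(split_half))
```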

Relationship between valence and arousal

Mean valence ratings from the US participants lie between 1.11 and 8.35 (M = 5.64, SD = 1.76); mean arousal ratings lie between 2.51 and 7.23 (M = 5.00, SD = 1.02). The correlation between valence and arousal ratings was significant, with r(581) = – .323, 95% CI [– .394, – .248], p < .001, BF10 = 5.76 × 10^12. However, the Pearson correlation coefficient shows a relatively weak linear relationship, and a look at the distributions of the US mean ratings, as shown in Fig. 1, suggests a U-shaped relationship between valence and arousal in the BASS images: arousal is lowest for neutral images and increases toward both the negative and the positive end of the valence scale.
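
A minimal R sketch of this analysis, with per-image mean vectors simulated here to mimic a U-shape (the real means ship with the database):

```r
# Hypothetical toy data: 583 per-image means with a U-shaped relation.
set.seed(2)
val_mean <- runif(583, 1, 9)
aro_mean <- 4 + (val_mean - 5)^2 / 8 + rnorm(583, sd = 0.5)

cor.test(val_mean, aro_mean)                     # Pearson r, 95% CI, p value
plot(val_mean, aro_mean, xlab = "Valence", ylab = "Arousal")
lines(lowess(val_mean, aro_mean), col = "blue")  # LOWESS trend as in Fig. 1
```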

Fig. 1

Mean ratings (95% CI error bars) per image for valence and arousal from US participants. To illustrate the tendency of the relation, the blue line depicts a locally weighted regression (LOWESS)

Effects of demographic variables

Prior research has identified differences in emotion processing based on factors like gender (Bradley et al., 2001; Proverbio et al., 2009; Sabatinelli et al., 2004; Wrase et al., 2003) and age (Bradley & Lang, 2007; Grühn & Scheibe, 2008; Pôrto et al., 2010). To investigate potential gender-related differences in our affective ratings, we calculated the per-image mean valence and arousal ratings separately for male and female US participants.

The correlation between the mean ratings made by male and female participants was very high, for both valence, r(581) = .972, 95% CI [.967, .976], p < .001, BF10 > 10^200, and arousal, r(581) = .877, 95% CI [.857, .895], p < .001, BF10 = 2.31 × 10^182.

For the age-wise comparison, we performed a median split on the demographic variable age in both of our US subsamples (valence, arousal) and correlated the respective mean ratings of the halves (a sketch of this analysis follows below). For the valence rating group (median age = 46 years), the correlation between the younger (< 46 years) and the older (≥ 46 years) age group was very high, with r(581) = .976, 95% CI [.972, .979], p < .001, BF10 > 10^200. The same was true for the arousal rating group (median age = 44.5 years), with a correlation between the younger (< 45 years) and the older (≥ 45 years) age group of r(581) = .906, 95% CI [.891, .920], p < .001, BF10 > 10^200. Finally, Fig. 2 shows that the valence-arousal pattern is very similar for all age groups (divided into five brackets following Prolific's age stratification, see Table 1; see also Fig. 14 in Appendix A).
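
A sketch of the median-split agreement analysis in R, using hypothetical long-format toy data (participant, age, image, rating); not the original analysis script:

```r
# Hypothetical sketch: per-image agreement between median-split age groups.
set.seed(3)
d <- data.frame(participant = rep(1:100, each = 50),
                age         = rep(sample(18:78, 100, replace = TRUE), each = 50),
                image       = rep(1:50, times = 100),
                rating      = sample(1:9, 5000, replace = TRUE))

med   <- median(d$age[!duplicated(d$participant)])
young <- aggregate(rating ~ image, subset(d, age <  med), mean)
old   <- aggregate(rating ~ image, subset(d, age >= med), mean)
m     <- merge(young, old, by = "image")
cor.test(m$rating.x, m$rating.y)     # agreement between the two age groups
```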

Fig. 2

Mean valence by mean arousal ratings per picture by age group from US participants, with corresponding LOWESS lines

Chinese rating

Reliability

The mean of the correlation coefficients in the valence dimension was \(\overline{R}_{val}\) = .980 (SD = 7.64 × 10^-5, range: \(R_{min}\) = .979 and \(R_{max}\) = .980), and in the arousal dimension \(\overline{R}_{aro}\) = .921 (SD = 2.87 × 10^-4, range: \(R_{min}\) = .917 and \(R_{max}\) = .923). Interrater reliability for both affective dimensions was again extremely high and almost identical to the US sample.

Relationship between valence and arousal

Mean valence ratings from the Chinese participants lie between 1.47 and 7.84 (M = 5.46, SD = 1.37); mean arousal ratings lie between 3.65 and 6.75 (M = 5.31, SD = 0.64). Unexpectedly, and in contrast to the results in the US sample, mean arousal and mean valence ratings showed a strong positive correlation, with r(581) = .805, 95% CI [.774, .832], p < .001, BF10 = 2.33 × 10^129. The distributions of the Chinese mean valence and arousal ratings are shown in Fig. 3, which illustrates that negative images received relatively low arousal ratings while positive images received relatively high arousal ratings (as opposed to the US ratings, where arousal ratings were distributed fairly evenly between positive and negative images).

Fig. 3

Mean ratings (95% CI error bars) per image for valence and arousal from Chinese participants, with the LOWESS line in blue

Effects of demographic variables

As in the US sample, mean ratings of Chinese male and female participants were highly correlated, for both valence, r(581) = .979, 95% CI [.975, .982], p < .001, BF10 > 10^200, and arousal, r(581) = .856, 95% CI [.832, .876], p < .001, BF10 = 4.11 × 10^163.

For the age-wise comparison, we again performed a median split for age on both Chinese subsamples (valence, arousal). For the valence rating group (median age = 32 years), the correlation between the younger (< 32 years) and the older (≥ 32 years) age group was extremely high, with r(581) = .974, 95% CI [.970, .978], p < .001, BF10 > 10^200. For the arousal rating group (median age = 33 years), the agreement between the younger (< 33 years) and the older (≥ 33 years) age group was still very high, although somewhat lower, r(581) = .785, 95% CI [.752, .814], p < .001, BF10 = 3.41 × 10^118. Note that the median age was much lower in the Chinese compared to the US sample: The Chinese sample was split in the early 30s while the US sample was split in the mid-40s.

As for the US sample, five age brackets were created. However, the ratings of the Chinese age group "58+" are hardly reliable, since this subsample consists of a mere 14 participants; they are therefore omitted from the present figures. (They are nonetheless largely in line with the rest of the data; see the supplementary figures at https://osf.io/anej6/.) Most interestingly, the different age groups show somewhat different patterns (see Fig. 4), with ratings from younger age groups, especially the youngest (ages 18–27), tending to resemble the US sample and the U-shape often observed in Western samples more closely.

Fig. 4

Mean valence by mean arousal ratings per picture by age group from Chinese participants, with corresponding LOWESS lines

Comparison of US and Chinese ratings

Mean ratings in the valence category from the US sample showed an extremely high positive correlation with those from the Chinese rating, r(581) = .936, 95% CI [.925, .946], p < .001, BF10 > 10^100 (see Fig. 5). A paired t test showed that Americans rated images more positively than Chinese participants, with a statistically significant but practically negligible difference of 0.18, 95% CI [0.12, 0.23] (US: 5.64 ± 1.76, China: 5.46 ± 1.37), t(582) = 6.35, p < .001, d = 0.26, 95% CI [0.18, 0.35], BF10 = 1.21 × 10^7. Both of these findings are very similar for all age groups and both sexes (see Fig. 6; Table 2).
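
The per-image comparison can be reproduced along these lines (toy vectors; the BayesFactor package supplies the BF10, and the analysis choices here are our illustrative assumptions, not the original script):

```r
# Hypothetical sketch: paired comparison of per-image valence means.
library(BayesFactor)
set.seed(4)
us_val <- runif(583, 1, 9)
cn_val <- 0.9 * us_val + 0.7 + rnorm(583, sd = 0.5)  # toy, highly correlated

t.test(us_val, cn_val, paired = TRUE)           # t, df, 95% CI of the difference
mean(us_val - cn_val) / sd(us_val - cn_val)     # Cohen's d for paired data
ttestBF(x = us_val, y = cn_val, paired = TRUE)  # Bayes factor (BF10)
```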

Fig. 5

Relationship between US and Chinese mean valence ratings (95% CI error bars) per image, with the LOWESS line in blue

Fig. 6

Relationship between US and Chinese mean valence ratings per image by age group, with corresponding LOWESS lines

The mean ratings for arousal showed only a weak, albeit significant, correlation, r(581) = .282, 95% CI [.206, .355], p < .001, BF10 = 2.30 × 10^9 (see Fig. 7). The lower correlation in the arousal (as compared to the valence) dimension relates to the fact that, as mentioned above, the data from US raters showed a U-shaped valence-arousal relationship, whereas the Chinese average affective ratings displayed a linear relationship. However, just as the ratings by younger age groups showed visually more similar valence-arousal patterns (see Fig. 4), they also demonstrated substantially higher correlations with US arousal ratings, most clearly reflected in the youngest age group (see Fig. 8; r = .53 as opposed to .22, .17, and – .07 for the other three age groups, see Table 2; p < .001 for all z-test comparisons; Diedenhofen & Musch, 2015). This pattern of age group influence is similar for both males and females, though the correlation was generally higher for males (r = .39 vs. .20; Table 2; see also all figures via https://osf.io/anej6/ with male and female ratings in separate panels).
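
Such z tests can be run with the cocor package by Diedenhofen and Musch (2015); a sketch using two of the reported coefficients, where treating the age groups as independent samples of the 583 image means is an assumption on our part:

```r
# Hypothetical sketch: compare the youngest group's US-China arousal
# correlation (r = .53) with another age group's (r = .22), n = 583 images.
library(cocor)
cocor.indep.groups(r1.jk = .53, r2.hm = .22, n1 = 583, n2 = 583)
```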

Fig. 7

Relationship between US and Chinese mean arousal ratings (95% CI error bars) per image, with the LOWESS line in blue

Fig. 8

Relationship between US and Chinese mean arousal ratings per image by age group, with corresponding LOWESS lines

A paired t test showed that arousal ratings by Americans were lower than those by Chinese, again a statistically significant yet very small difference, − 0.31, 95% CI [− 0.40, − 0.23] (mean rating US: 5.00 ± 1.02, China: 5.31 ± 0.64), t(582) = − 7.30, p < .001, d = − 0.30, 95% CI [− 0.39, − 0.22], BF10 = 4.95 × 10^9. This relation is similar for all age groups and both sexes (Table 2).

All in all, we may conclude that the consensus for valence ratings was generally higher than for arousal ratings, both within a sample of a given culture (see correlation tests for interrater reliability, gender, and age) and between the two cultures tested here.

Example experiment: Color-valence congruence effects in the BASS

We also included an example experiment, in which we tested the suitability of the BASS images for online research. Here, we manipulated color to demonstrate one major advantage of the BASS images: their easy rendition in a different color. Regarding content, it has been repeatedly shown that the stimulus valence of words interacts with perceptual features like color and brightness: Positive stimuli are associated with green (rather than red) and with white (rather than black; Kawai et al., 2020; Kuhbandner & Pekrun, 2013; Lakens et al., 2012; Meier et al., 2004, 2015; Moller et al., 2009). To validate the suitability of the BASS for online research, we tested in a web-based fast decision task whether the BASS silhouettes can induce similar effects. Details about the experiment can be found in the online supplementary material. Below we provide only a brief summary.

Method

We selected a subset of 60 positive and 60 negative silhouettes from the BASS (a list of filenames is provided in Table 4 in Appendix C). These 120 images were presented to 90 students (age = 23.0 ± 3.5 years; 27 male) from the University of Vienna in two experimental blocks: a "color" block, containing each silhouette once in red and once in green, and a "brightness" block, containing each silhouette once in black and once in white (with the silhouette background always in gray; see Appendix C, Fig. 15, for examples). The task was to categorize the valence of each silhouette via key press as either positive or negative. For each block, we ran a repeated-measures analysis of variance (ANOVA) on the correct mean response times (RTs) and another on the mean error rates (ERs); a sketch of such an analysis follows below.
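
A minimal R sketch of such a 2 × 2 within-subjects ANOVA using the afex package, with simulated per-cell mean RTs (toy data and column names are our assumptions, not the original analysis script):

```r
# Hypothetical sketch: 2 (Color) x 2 (Valence) within-subjects ANOVA on RTs.
library(afex)
set.seed(5)
rt <- expand.grid(id      = factor(1:90),
                  color   = c("red", "green"),
                  valence = c("negative", "positive"))
congruent  <- (rt$color == "green") == (rt$valence == "positive")
rt$mean_rt <- 600 - 20 * congruent + rnorm(nrow(rt), sd = 40)  # toy effect

aov_ez(id = "id", dv = "mean_rt", data = rt,
       within = c("color", "valence"),
       anova_table = list(es = "ges"))   # generalized eta squared
```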

Results

In line with expectations, the two (Color: red vs. green; within) × two (Valence: positive vs. negative; within) ANOVA in the color block showed a significant interaction between color and valence for RTs, F(1, 89) = 91.46, p < .001, \(\eta_p^2\) = .507, 90% CI [.383, .595], \(\eta_G^2\) = .022, BF10 = 3.10 × 10^9, and also for ERs, F(1, 89) = 83.67, p < .001, \(\eta_p^2\) = .485, 90% CI [.359, .576], \(\eta_G^2\) = .106, BF10 = 2.30 × 10^15. Positive silhouettes were categorized faster and more accurately when presented in green rather than red, and, inversely, negative silhouettes were categorized faster and more accurately when presented in red rather than green. Also, in the brightness block, the silhouettes elicited the expected brightness–valence interactions in both RTs, F(1, 89) = 18.68, p < .001, \(\eta_p^2\) = .173, 90% CI [.068, .286], \(\eta_G^2\) = .005, BF10 = 65.74, and ERs, F(1, 89) = 8.76, p = .004, \(\eta_p^2\) = .090, 90% CI [.017, .191], \(\eta_G^2\) = .011, BF10 = 7.55. Positive silhouettes were categorized faster and more accurately when presented in white rather than black, and, inversely, negative silhouettes were categorized faster and more accurately when presented in black rather than white.

Means and SDs for RTs and ERs are illustrated in Figs. 9 and 10, and the means can be found in Table 5.

Fig. 9

Mean reaction times for positive and negative silhouettes in the color block (left) and the brightness block (right). Error bars indicate SEMs

Fig. 10

Mean error rates for positive and negative silhouettes in the color block (left) and the brightness block (right). Error bars indicate SEMs

Discussion

For the current study, we created an affective silhouette database with the specific goal of providing an easily and openly accessible, lightweight stimulus pool capable of reliably eliciting affective representations across a wide range of valence and arousal levels, validated by extensive samples from two of the largest cultural groups studied in psychology. In the US sample, we found that the images cover a very wide range of affective representations, from very negative to very positive (valence) and from very calm to very intense (arousal), indicating various potential applications (see Introduction). The Chinese sample was remarkably similar in valence ratings, covering a wide range with an extremely high correlation with the US sample.

However, the Chinese arousal ratings were only weakly correlated, and comparatively limited in range. This relates to the fact that data from the US participants demonstrated a U-shaped relationship between valence and arousal, while data from the Chinese participants, instead, displayed a more linear valence–arousal relationship (strong positive correlation between valence and arousal ratings) on average.

Studies that examine affective ratings for emotional words in Chinese find a U-shaped valence-arousal relation similar to the one apparent in the US ratings (Ho et al., 2015; Liu et al., 2018; Yao et al., 2017). It is noteworthy that participants in the affective word studies were high school or university students. Our Chinese sample was more diverse: The age-group-wise analysis shows that younger Chinese participants display a more quadratic relationship, which becomes more linear with increasing age. This provides a plausible reconciliation of these previous findings with our results, in that it is only the relatively older Chinese generations whose perception substantially differs from that of Western populations, while the younger generations perceive images more in accordance with the Western population. We can only speculate about the reasons, but the more similar ratings of young US and Chinese participants could reflect a larger shared sphere of visual experiences, for instance, through digital social media (e.g., TikTok).

However, two studies that compared affective ratings for emotional photographs (a subset of the IAPS) between Chinese and Western samples interestingly seem to indicate the opposite phenomenon: Gong and Wang (2016) found a high correlation of valence and arousal ratings between Chinese and German older adults, and Huang et al. (2015) argued that Chinese young adults display a different rating behavior from US young adults for IAPS pictures.

Generally, more empirical data on cultural differences in emotion representation or judgments is necessary to establish how and when Chinese rating behavior differs from that of a Western culture. One possibility might be, for instance, that the lower arousal for negative affective silhouettes, which explains most of the difference in our BASS ratings (and the lack of U-shape among Chinese participants, especially with increasing age), might be a result of conflict avoidance or emotion control as a more generally accepted strategy among Chinese than Westerners (Gong & Wang, 2016; Tjosvold & Sun, 2002).

We also demonstrated how the BASS images can be used in online experiments. In our example experiment, we showed the typical congruence effects of stimulus color on the speed and accuracy of judgments about image valence: Positive images were categorized faster and more accurately if presented in green than in red, and if presented in white than in black, while the opposite held true for negative images. This result replicates prior research and demonstrates at least one concrete use of the BASS images in scientific research.

All in all, the quality of our samples' data (as measured by the respective interrater correlations) is outstanding and, where comparable, its general characteristics are in line with effects observed in notable affective databases like the IAPS, OASIS, ANEW, and BAWL-R (Võ et al., 2009), to name a few.

The BASS, however, aims only to complement, not to replace, realistic picture databases: It is intended for research in which realistic images do not provide ideal stimuli for any of the reasons detailed in the Introduction (e.g., complications of color manipulation, file size, etc.).

While the simplicity of silhouettes can be advantageous for some research designs, their innate lack of context information may simultaneously lead to more (or less) ambiguity in their interpretation between participants than is the case for pictures that are rich in detail. Heterogeneous cultural backgrounds between participant groups could add to differences in content evaluation if contexts interact with objects in photorealistic image sets, but not, or at least less so, with silhouettes, where context is sparse. The direct interpretation of some of the silhouettes may thus be more or less ambiguous than that of a photographic image of the corresponding content (e.g., some participants may perceive a coyote's silhouette simply as a dog when it is presented without a fitting habitat as context). However, in the current study, the affective ratings of the silhouettes generally demonstrated very high consistency (very high interrater correlations, low SDs, and, in the case of valence, very high intercultural correlation), which attests to the silhouettes' emotional unambiguousness. Furthermore, several numeric measures in our BASS database, such as the rating differences between the US and China per image, the corresponding BFs or residuals, or simply the SDs of the ratings in either sample, allow researchers to choose images of the emotional clarity or culture-neutrality best suiting their needs.

Nonetheless, and although the rating data of our BASS images correspond well with the data obtained for photographic databases, we cannot rule out that the silhouettes evoke smaller emotional responses than contextually richer photographic pictures when other measures are used (e.g., skin conductance, heart rate, facial muscle activity). Do photorealistic pictures elicit stronger emotion-related physiological responses than silhouettes, and if so, what are the driving forces in such a case (object detail, color, depth and perspective, etc.)? We hope that future research, facilitated by the BASS, will help to address these questions.

Conclusions

The BASS offers a novel database of visual silhouettes for psychological research, optimally suited for the study of image-elicited affective representations in online experiments and validated by normative ratings from representative Western and Asian samples.