Introduction

Associative learning, in which an initially neutral cue (the conditioned stimulus, CS) is associated with a stimulus of innate positive or negative valence (something ‘good’ or ‘bad’, an unconditioned stimulus, US), is critical to most animals’ survival. Indeed, associative learning is so important that it is almost ubiquitous in the animal kingdom, and beyond (van Duijn 2017).

Insects can form associative memories with very few exposures, and maintain these memories for long periods. Fruitflies (Drosophila melanogaster) can form an association between an odour and a reward in one exposure, which may last up to 24 h (Krashes and Waddell 2008). Honeybees and ants can learn to associate a sucrose reward with a colour, odour, or direction after only one exposure, and retrieve this memory several days later (Menzel 1968; Grüter et al. 2011; Piqueret et al. 2019; Oberhauser et al. 2019; Villar et al. 2020). Ants have even been reported to return to memorised locations after multiple months in winter diapause (Salo and Rosengren 2001). Lasius niger ants can form multi-modal associations, combining visual and odour cues, possibly even after a very limited number of exposures (De Agrò et al. 2020). However, not all social insects perform equally well in odour-association tasks. While Apis mellifera honeybees can learn an odour-reward association in a single proboscis extension response trial (Menzel 1993; Henske et al. 2015; Villar et al. 2020), some stingless bee species, such as Meliponula bocandei, seem to form only weak associations under identical circumstances (Henske et al. 2015). While L. niger ants reach a choice accuracy of 71% after one round of differential exposure to two odours associated with a reward and a punishment (Oberhauser et al. 2019), Camponotus mus seem to require up to eight such visits to reach comparable accuracy (Dupuy et al. 2006). Moreover, the nature of the neutral stimulus and the valance of the unconditional stimulus can influence learning performance. For example, Desmedt et al. (2017) found that Camponotus aethiops retained an association between a negative stimulus (heat) and 1-Hexanol 10 min after exposure, while they failed to retain a similar association to Octanal. Drosophila larvae successfully learn to associate a reward (fructose) with either light or dark, but seem unable to learn to associate punishments (table salt or quinine) with light levels (Gerber et al. 2004).

Discrimination learning is the ability to learn to associate multiple different conditioned stimuli with different unconditioned stimuli. For example, one odour may be associated with a reward (e.g. sugar) and another with a punishment (e.g. quinine solution), or two odours may be associated with two different reward qualities (e.g. low and high molarity sucrose). Differential association experiments, in which researchers attempt to teach animals to discriminate between stimuli, are the main method by which advanced cognitive abilities such as concept learning, numerosity, transitive inference, and meta-cognition are tested (Perry and Barron 2013; Avarguès-Weber and Giurfa 2013; Tibbetts et al. 2019; Bortot et al. 2020). However, differential conditioning is usually restricted to only two cue associates, one positive and one negative. Experiments with differential conditioning to two cues associated with the same valence, e.g. two rewards of different strength, are less well explored. L. niger ants can form such differential positive associations, as demonstrated in incentive contrast studies (Wendt et al. 2019), but how rapidly such associations can be formed is unknown. In addition, very few studies have attempted to form more than two differential associations. Brembs and Wiener (2006) successfully trained Drosophila flies to four different compound stimuli consisting of combinations of two colours and two shapes, using four 2-min training trials. Each combination of colour and shape predicted one of two US (punishment or no punishment). Tibbetts et al. (2019) successfully trained Polistes paper wasps to associate four pairs of five colours with a choice for one of the colours in each pair. In each colour pair, one of the colours offered safety from electric shocks. Using six 30-s training trials per pair, plus one refresher trial, the wasps reached just under 70% accuracy. In a test of transitive inference, the wasps were also capable of choosing untrained pairs, implying that they were learning more than simple conditional responses (if A do X, if B do Y), although non-transitive mechanisms were not ruled out. Similarly, Benard and Giurfa (2004) successfully trained honeybees to four pairs of five pattern cues using free-flying bees and rewards for correct choices, although above-chance choice on multiple pairs required 40 trials. Such extensive training prohibits more complex experimental designs.

Identifying an invertebrate model which can rapidly learn multiple (more than two) differential associations in a single modality would open the way to a variety of complex tests of cognition, and especially of economic preference behaviour. The ant Lasius niger is capable of learning odour, visual, and directional cues in only one learning trial (Grüter et al. 2011; Oberhauser et al. 2019; De Agrò et al. 2020) and of learning differential associations within one valence (Wendt et al. 2019). Here we ask whether it can form multiple differential associations, with the same valence, in a limited number of learning trials.

Methods overview

For a detailed methodological description, see supplement S1.

Eight queenless Lasius niger colonies were studied, which were deprived of food for 4 days prior to testing. The aim of the experiments was to train an ant to associate a certain number of odours (2, 3, or 4) with a certain sucrose molarity. After training, pairwise preference between the odours were then tested on a Y-maze. Four experiments were run (see Table 1), all following the same design: An individual ant was allowed to find a scented food source at the end of a linear runway scented to match the food source. After feeding, the ant returned to the nest, unloaded the food, and was allowed to return to the apparatus. While unloading, the apparatus was switched so that the reward molarity offered, the reward scent, and the matching runway scent, were changed. Ants were allowed to make repeated visits to the linear runway, experiencing each odour/quality combination between 1 and 4 times.

Table 1 Experiment overview

In experiment 1 we trained two odour/reward combinations, with four exposures to each combination. This experiment reproduces part of Oberhauser et al. (2019), and acts as a demonstration that using food flavourings (as opposed to essential oils) results in efficient learning. Experiment 2 linked three molarities (0.2, 0.6, 1.8) to three odours, with three exposures to each combination. Experiments 3 and 4 linked four molarities (0.2, 0.4, 0.8, 1.6) to four qualities, with two and one exposure to each combination, respectively.

After training, the linear runway was replaced by a Y-maze offering a pair of the trained odours. The choice of the ant was noted, the ant was returned to the start of the apparatus, and a new odour pair was presented. The odour pairs tested in each experiment are shown in Fig. 1.

Fig. 1
figure 1

Proportion of ants choosing the branch of a Y-maze scented with an odour associated with the higher molarity. Shown are all binary choices faced by the ants. Bars are absolute proportions, whiskers are two-tailed exact binomial 95% confidence intervals

Pairwise preferences were tested using exact binomial tests, after finding no colony-level effects using mixed-effect models. A detailed description of the statistical analysis method is given in S1. The entire statistical analysis workflow, with code and output, is provided in S2. Innate odour preference controls showed no strong preference between any of the odour pairs (see S1).

Results

The ants successfully chose the odour associated with the higher quality in all but two of the pairwise choices (Table 1 and Fig. 1). Specifically:

In experiment 1 (two odour four training visits), most of ants chose the odour associated with the higher molarity (16/17, 94%, P = 0.00027, 95% CI 0.71–0.999).

In experiment 2 (three odour three training visits), in the 0.2 M vs 0.6 M test most of the ants (22/29, 76%, P = 0.0081, CI 0.56–0.90) chose the odour associated with 0.6 M. Similarly, in the 0.6 M vs 1.8 M test most ants (23/29, 79%, P = 0.023, CI = 0.6–0.92) chose the odour associated with 1.8 M. Finally, in the 0.2 M vs 1.8 M test, 27/29 (93%) chose the odour associated with 1.8 M (P < 0.0001, CI = 0.77–0.99).

In experiment 3 (four odour two training visits), in the 0.4 M vs 0.8 M test, 30/40 (75%) of ants chose the odour associated with 0.8 M (P = 0.0022, CI = 0.59–0.87). In the 0.8 M vs 1.6 M test 28/40 (72%) chose the odour associated with 1.6 M (P = 0.0095, CI = 0.55–0.85). In the 0.2 M vs 0.4 M test, 26/40 (65%) chose the odour associated with 0.4 M (P = 0.08, CI = 0.48–0.79).

Finally, in experiment 4 (four odour one training visit), in the 0.4 M vs 0.8 M test 32/43 (74%) of ants chose the odour associated with 0.8 M (P = 0.0019, CI = 0.59–0.86). In the 0.8 vs 1.6 M test 33/43 (77%) of ants chose the odour associated with 1.6 M (P = 0.0006, CI = 0.61–0.88). However, ants showed no preference in the 0.2 M vs 0.4 M test, with 24/43 (56%) of ants choosing the odour associated with 0.4 M (P = 0.54, CI 0.40–0.71).

We found no strong evidence of a side bias in experiment 2, with neither correct side, nor its interaction, showing any clear effect (GLMM, Z < 1.17, P > 0.24), see Fig. on page 12 of supplement S2). However, in experiment 3 ants showed a left-bias in the 0.2 M vs 0.4 M test, with a strong tendency to make correct decisions when the correct direction was left, and a weaker tendency to make incorrect decisions when the correct decision was right (see Figure in S2). This can be seen in an interaction between correct side and the binary test, with ants in the 0.2 M vs. 0.4 M test making fewer correct decisions than in the 0.4 M vs 0.8 M test when the correct direction was right, but not when it was left (Z = − 2.17, P = 0.03). Side biases are commonly reported in ants (Hunt et al. 2014; Oberhauser et al. 2020).

Similarly, we found a left bias in experiment 4 (Z = − 2.33, P = 0.026). Most ants chose the correct side in all pairwise choices, but this preference was much stronger when the correct side was left (see Figure in S2).

Discussion

Ant demonstrated very rapid discrimination learning. Even given just one exposure to each of four odour/molarity combinations (experiment 4), ants were able to correctly choose the higher-molarity associated odour of the intermediate pair (0.4 M vs 0.8 M) 72% of the time, and between the two best molarities (0.8 M vs 1.6 M) in 77% of trials. We can thus definitively claim that they can make at least two odour/molarity associations given only one exposure to each combination.

It is highly likely that the ants can make three such associations given only one exposure. However, as the ants did not reliably distinguish between the 0.2 M and 0.4 M associated odours, we cannot rule out the possibility that neither the 0.2 M nor the 0.4 M associations were formed. We cannot distinguish a choice between 0.8 M and nothing, and between 0.8 M and 0.4 M. We thus refrain from claiming that the 0.4 M association was formed. Nonetheless, we think it highly likely that it was, given that when allowed two exposures to each cue/molarity combination, ants make 65% correct choices (95% CI 0.48–0.79).

Why did the ants fail to correctly choose between 0.2 M and 0.4 M associated cues? It is possible that a 0.2 M solution is near the detection limit of the ants, and as the detection limit is approached, discrimination ability also suffers, in line with basic psychophysical principles (Gescheider 1997). In a previous experiment, L. niger ants successfully learned to distinguish between 0.1 M and 0.3 M after three visits to each molarity (De Agrò et al. 2019). The ants thus likely struggled to distinguish 0.2 M and 0.4 M with so few exposures. That 0.4 M vs. 0.8 M and 0.8 M vs. 1.6 M were distinguished equally well is also well in line with basic psychophysics, which maintains that differences are sensed in relative terms, as opposed to as absolute deviations from a fixed point.

Examining the speed of differential learning is often hindered in two-combination experiments, as even given a preference for the correct cue, one cannot rule out the possibility that the worst option is simply not learned. This can be demonstrated by further training trials, in which the unrewarded cue is first responded to (due to generalisation) but after several trials is differentiated from the rewarded cue (e.g. (Menzel 1993; Roussel et al. 2012). To demonstrate discrimination learning in just one exposure to each cue, non-extreme cues in a set must be compared, as in experiments 3 and 4. To our knowledge, this is the first conclusive evidence of such discrimination learning with only one exposure to each associative combination in an insect.

Our results suggest L. niger as an ideal invertebrate model for studying complex choice situations involving multiple options. Economic preference experiments, for example looking into menu-effects, is a clear example of a field requiring multiple differential learning. Decoy effects, a subcategory of menu effects, have been studied in ants and honeybees along two trade-off dimensions (Sasaki and Pratt 2011; Tan et al. 2014), but extending into multiple trade-off dimensions has so far been limited by the prohibitive training requirements. L. niger offers itself as an ideal model here, as not only can it learn rapidly, but can also rapidly make multi-modal associations (De Agrò et al. 2020), and can be induced to make many tens of repeat visits in a limited time period (Oberhauser et al. 2020).