Introduction

Apple (Malus × domestica Borkh.) is the most widely grown permanent fruit crop of temperate climates. In Europe, apple was cultivated on an area of 473,500 ha in 2017 and it represented the most common fruit tree species (Eurostat 2019). The domestic apple is an interspecific hybrid originating from Central Asia that had been cultivated since antiquity (Janick 2005; Cornille et al. 2014). In the course of centuries, a multitude of apple varieties had arisen by spontaneous crossing and the trees with the most desirable properties have been selected by humans and further clonally propagated by grafting. Some of the cultivars were grown locally, while others were distributed over larger geographic areas in Europe (Hartmann 2015). Furthermore, trees, scionwood or apple seeds from European countries were disseminated to other continents during the colonisation of the New World, where the selection of new cultivars had been continued (Janick 2005; Volk and Henk 2016).

There are estimates that more than 10,000 apple cultivars had been documented (Janick et al. 1996; Rieger 2006). Europe saw the highest cultivar diversity of apple in the late 19th century, when many different local cultivars were grown in numerous little orchards (Luby 2003). The number of known cultivars in England most probably exceeded 2500, while more than 6000 cultivars were assumed on the territory of the former Soviet Union (Juniper et al. 1998). In the United States, representing a secondary centre of apple cultivar diversification (Janick 2005), more than 7000 apple cultivars were listed between 1804 and 1904 (cited by Volk and Henk 2016). This number, however, may also include synonymous names.

Even though scientific breeding programmes initiated at the beginning of the 20th century, some of the globally most widely grown apple cultivars, such as ‘Golden Delicious’, ‘Red Delicious’ or ‘Granny Smith’ arose as chance seedlings (Brown and Maloney 2003). Only in the recent decades, cultivars obtained by systematic breeding have been gaining more importance on the market. However, many of the breeding programmes relied on a small number of progenitors, posing a risk of reduced genetic diversity of new apple cultivars (Noiton and Alspach 1996; Bannier 2011). Therefore, conservation of the remaining cultivar diversity in germplasm repositories is of utmost importance in order to prevent a future narrowing of the genetic base of apple (Way et al. 1990).

A prerequisite for efficient germplasm management is an exact determination of the cultivars maintained in a collection. Apple cultivars can be described and determined based on phenotypic characters, which mainly focus on the external and internal traits of fruit (Morgan and Richards 2002). Although in the past, there were strong attempts to implement a systematics and a key for determination of apple cultivars, this initiative had failed because cultivars of the domestic apple do represent the same botanical species. Furthermore, phenotypic traits can be modulated by environmental factors and thus render pomological determination more difficult. In addition, a sufficient number of typical fruit has to be available. Finally, the pomological determination requires experienced specialists, which are becoming scarce, as pomology has been widely excluded from the curricula of horticultural science education (Hartmann 2015).

In the last two decades, molecular tools have been implemented as an alternative or additional means to the characterisation of apple cultivar diversity (Guilford et al. 1997; Hokanson et al. 1998). Such methods rely on the direct analysis of DNA, which can be isolated from plant tissue independent of the phenological stage of the tree and which is not affected by environmental influences. The most commonly applied technique is the analysis of simple sequence repeats (SSRs) or microsatellite markers, which, due to their robustness, reproducibility and high-throughput potential, have been employed to describe the genetic resources of apple in many countries (e.g. Guarino et al. 2006; Pereira-Lorenzo et al. 2007; Routson et al. 2009; van Treuren et al. 2010; Garkava-Gustavsson et al. 2013; Ferreira et al. 2016; Gasi et al. 2016; Urrestarazu et al. 2016; Larsen et al. 2017; Testolin et al. 2019).

The main objective of the present study was to establish a database with molecular genetic fingerprints of reference cultivars that could be used for determination of a large number of unidentified or misidentified apple cultivars. A thorough validation of the dataset comprising historic, old and recent apple cultivars was considered crucial in order to assure a high degree of reliability of the database, making it a suitable tool for characterisation of genetic resources but also for other applications.

Materials and Methods

Samples

The present study comprises genotyping data of 1621 accessions of apple trees that were sampled in 37 public and private cultivar collections in different European countries and that can be attributed to 600 different genotypes (see summary in Tables 1 and 2, and extensive information in Supplementary Table S1). The key criterion for including a molecular genetic profile into the reference database was that at least two accessions of the same cultivar of different provenances generated an identical result. Accessions belonging to 570 cultivars were indeed derived from two or more distinct cultivar collections, whereas 19 cultivars were obtained from a single collection, but comprised at least two accessions of different origin. Eleven cultivars were sampled from a single accession, however, in ten instances their cultivar assignment was supported by a parentage analysis or by molecular genetic data derived from other studies (Table 2). The crab apple species M. floribunda 821, which is the progenitor of scab resistance (Rvi6) and was therefore included in the database, was only available from one cultivar collection.

Table 1 List of germplasm collections and organisations, from which accessions of apple cultivars were collected or obtained for the present study
Table 2 Summary information on the 600 apple cultivars included in the reference database

German and English language pomological literature was searched for information about the country and year of origin of each cultivar as well as their parent cultivars (if known). The principal sources of information on old apple cultivars were the books of Morgan and Richards (2002), Silbereisen et al. (2015), Mühl (2001), Votteler (2014), Hartmann (2003, 2015), Bartha-Pichler et al. (2005), Smith (1971), Rolff (2001), Bernkopf et al. (2003) and Bernkopf (2011), cited in the order of consultation significance. For the more recent cultivars, the publications of Sansavini et al. (2012), Brown and Maloney (2003) and Evans et al. (2011) were consulted as were the Community Plant Variety Office (CPVO) Database (https://cpvo.europa.eu/en/applications-and-examinations/cpvo-varieties-database), the United States Patent and Trademark Office (USPTO) online database (http://appft.uspto.gov/netahtml/PTO/index.html) and the UPOV PLUTO: Plant Variety Database (http://www.upov.int/pluto/en/). The information about the origin of the rootstocks investigated in the present study was taken from Webster and Wertheim (2003), NIAB-EMR (2016) and Petzold (1984). In case that the references/databases mentioned above did not provide information about a given cultivar, literature searches specifically targeting that particular cultivar were conducted and cited in Table 2. Based on information about the year of origin, cultivars were classified as (i) historic, if they arose before 1900, as (ii) old, if they originated between 1900 and 1950 and as (iii) recent, if they were bred after 1950.

Nucleic Acid Isolation and Microsatellite DNA Analysis

Nucleic acid was isolated from leaf discs (approximately 100 mg) by direct homogenisation of fresh plant tissue in lysis buffer PL1 of the NucleoSpin Plant II Kit (Macherey-Nagel, Düren, Germany), using a 3-mm tungsten carbide bead (Qiagen, Hilden, Germany) and a Mixer Mill MM300 (Retsch GmbH, Haan, Germany). The remaining steps of the DNA isolation process were performed exactly according to the kit’s manual. Quality and quantity of DNA isolates were controlled by electrophoresis on 1% agarose gels stained with ethidium bromide. DNA isolates were maintained at −20 °C until analysis. Aliquots of DNA isolates were obtained of rootstock samples from East Malling Research (UK) and part of the samples from Haidegg Research Station (Austria).

Each DNA sample was analysed at 14 microsatellite DNA loci described by Liebhard et al. (2002) in four multiplex reactions (Table 3). For each reaction pool, a 10 µl reaction mix was prepared, which contained 200 µM dNTPs, the appropriate concentration of each primer (see Table 3), 1 × GeneAmp PCR Buffer II (Life Technologies, Carlsbad, CA, USA), 1.5 mM MgCl2 (Life Technologies) and 0.5 Units of AmpliTaq Gold Polymerase (Life Technologies). To each 10 µl aliquot of the master mix, 2 µl DNA isolate were added (approximately 10 ng) and amplified on a GeneAmp PCR System 2700 (Life Technologies) under the following conditions: 10 min initial denaturation at 94 °C, 35 cycles with 20 s at 94 °C, 20 s at 57.5 °C and 45 s at 72 °C, followed by 10 min of final extension at 72 °C.

Table 3 List of microsatellite loci (Liebhard et al. 2002) analysed in the present study in four multiplex reactions

Amplified microsatellite DNA products were separated and visualised on a CEQ 8000 Genetic Analysis System (Beckman Coulter, Fullerton, CA, USA) for 35 min at 7.5 kV. Sizing of fragments relative to the internal CEQ DNA Size Standard 400 (Beckman Coulter) and assignment to specific allele classes were performed by applying the Fragment Analysis Software version 9.0 of the same manufacturer. Each electropherogram was carefully visually inspected for binning accuracy before data were exported. A cross-tabulation matrix in Microsoft Access was employed in order to compare all the molecular genetic profiles at 14 microsatellite loci to each other and to identify exact matches (Baric et al. 2009). All synonymous profiles were represented by a unique genotype (each assigned a distinctive profile number) and included into the final dataset (Supplementary Table S2).

Data Analysis

As the complete dataset comprised diploid and triploid genotypes, the number of alleles per locus and their size ranges were determined by applying the software SPAGeDi version 1.4 (Hardy and Vekemans 2002). In addition, the software CERVUS 3.0.7 (Kalinowski et al. 2007) was employed on a dataset including only diploid genotypes of apple. More precisely, allele frequency and identity analyses were performed on (i) a dataset including all diploid genotypes and the 14 microsatellite loci analysed in this study, (ii) a dataset excluding locus COL, and thus containing data at 13 microsatellite loci, (iii) a dataset containing data at six microsatellite loci (CH01f02, CH02b10, CH02c09, CH02c11, CH02d08, CH01h01) and (iv) a dataset containing data at five microsatellite loci (CH01f02, CH02c09, CH02c11, CH02d08, CH01h01). The latter two reduced sets of microsatellites overlapped with part of the loci employed in the studies of Bus et al. (2012) and Urrestarazu et al. (2016) (see Table 3). The aim of the latter analyses was to calculate the average exclusion probabilities of identical genotypes by using different numbers of loci and to test whether the reduced number of microsatellites would provide sufficient resolution to distinguish genotypes of different apple cultivars, when used as a tool for determination of unknown apple genotypes.

In order to get an impression about the accuracy of genotyping data, parentage analyses were carried out. In a first step, the analysis involved the dataset at 14 microsatellite loci and focused on 85 diploid genotypes of old and recent cultivars, and rootstocks that arose after 1900, for which information about at least one parent cultivar was available and the molecular genetic profile of the assumed parent was also present in the database. This dataset also included the cultivars ‘Golden Delicious’ and ‘James Grieve’ that arose before 1900, but their parentage was confirmed by genotypic data in the studies of Evans et al. (2011) and Salvi et al. (2014). This analysis, however, excluded other historic cultivars, for which information about parentages provided in the literature may be less reliable as they were not derived from systematic breeding activities. The analysis was performed with the software ML-Relate (Kalinowski et al. 2006), which can calculate the maximum likelihood estimates for each of the four relationships: unrelated, half siblings, full siblings and parent/offspring. In the course of these analyses, it was noticed that seven assumed parent-offspring combinations showed mismatches at locus COL, likely due to the presence of null alleles. Consequently, this locus was excluded from the dataset and the analyses were repeated with the remaining 13 loci. Furthermore, locus COL was omitted from all further parentage analyses.

In a second step, the analysis targeted 116 diploid genotypes, again comprising old and recent cultivars, and rootstocks, for which information about both parents was available from the literature and which genotypes were also present in the dataset. The software CERVUS 3.0.7 was employed in order to identify the statistically most likely candidate parent pairs. A simulation of parentage analysis was performed in order to determine the critical LOD values for parent pairs without known sexes at a strict confidence of 95% and a relaxed confidence of 80%. The simulation was based on the following parameters: N offspring: 20,000; N candidate parents: 1000; proportion of candidate parents sampled: 0.30; proportion of loci typed: 0.99; proportion of loci mistyped: 0.01; and error rate in likelihood calculations: 0.01. The confidence was determined using the log-likelihood ratio or LOD score. The parent pair analysis without known sexes was run based on the output of the parentage simulation and by allowing for typing errors.

In order to use the here described microsatellite dataset as a reference database for determination of unknown apple cultivars and to handle the presence of diploid and triploid genotypes, a new “Apple Fingerprint Identifier” script was generated in Microsoft Access (Baric and Radmüller, unpublished). This programme allows comparing the genotype of an unknown variety to all the 600 entries present in the reference database including the full set of 14 microsatellite loci. The programme permits manually inserting the genotypic data of the target cultivar or to import a list of cultivars from an Excel file comprising three columns per locus. For each target genotype, the alleles at each locus are compared to each entry of the database. Finally, a ranked list of the genotypes from the database with the highest similarity to the target is generated, which is based on the number of matches. In order to visually facilitate identification, all matching alleles between the target genotype and the genotypes from the database are highlighted in green, whereas mismatching alleles are highlighted in red. In this way, the genotype of an unknown apple variety can be assigned to the reference cultivar, if it shares all alleles with the reference genotype or displays a tolerated number of mismatches. The Access programme can also be used with a lower number of microsatellite loci, if the set of microsatellite loci analysed in different projects or laboratories is only partially overlapping.

Finally, a preliminary trial was performed to assess the potential of combining genotyping data obtained in the present study with published microsatellite data (Urrestarazu et al. 2016). First, the allele lengths of five common cultivars (‘Fuji’, ‘Gala’, ‘Golden Delicious’, ‘Granny Smith’ and ‘Red Delicious’) were compared at a set of five microsatellite loci that were analysed in both studies in order to determine conversion values necessary to align the allele lengths (Table 4). After conversion of the allele lengths in the dataset of Urrestarazu et al. (2016), a set of 20 cultivars that were present in both datasets were randomly selected and their allele lengths were compared.

Table 4 Conversion values used to align the allele lengths of the dataset of Urrestarazu et al. (2016) with the dataset obtained in the present study at five common loci. Depending on the length of alleles in the Urrestarazu et al. (2016) dataset (that are given in parentheses), two conversion values may be required for the same locus. The conversion values need to be subtracted from the allele lengths derived from the dataset of Urrestarazu et al. (2016) in order to achieve an alignment with the here presented dataset (compare Supplementary Table S6)

Results

The dataset includes a total of 600 genotypes at 14 microsatellite loci, of which 533 are diploid and 67 are triploid. The ploidy was determined exclusively based on molecular genetic data and the genotypes identified as triploids displayed three different alleles at three or more microsatellite loci. Of the 600 genotypes included in the database, two belonged to the crab apples Malus baccata and M. floribunda, while the remaining were 24 rootstocks and 574 scion cultivars of M. domestica. Of the latter, 264 were classified as historic, 87 as old and 223 as recent cultivars. Of the historic cultivars, 51 (19.3%), of the old cultivars, 9 (10.3%) and of the recent cultivars only 7 (3.1%) were triploid (Fig. 1).

Fig. 1
figure 1

Composition of the database as regards the proportion of molecular genetic profiles belonging to scion cultivars, rootstocks and crab apples (a). The proportion of historic (derived before 1900), old (originated between 1900 and 1950) and recent (bred after 1950) apple cultivars is shown exclusively for scion cultivars (b). The small black-and-white pie charts indicate the proportion of diploid and triploid genotypes within the entire dataset (a) or within each category of scion cultivars (b). The percentages provided as numbers in the small pie charts specify the share of diploid cultivars

The complete dataset of 600 genotypes comprised a mean number of 19.4 alleles per locus, ranging from 12 to 32 (see Table 3). The average number of alleles per locus was slightly smaller in the dataset with 533 diploid genotypes and amounted to 19.0. Six alleles at five different loci were exclusively present in triploid genotypes. The number of missing genotypic information in the dataset was observed at a very low frequency; for cultivar ‘Vista Bella’ genotypic information was missing at locus Ch02c02a, while for the rootstocks ‘M.4’ and ‘M.6’ genotypic information was partially missing at locus Ch02d08 and for the rootstock ‘M.11’ at locus CH02h11a.

The dataset comprising 533 diploid genotypes was used to determine the combined non-exclusion probability of identity, which amounted to 2.6 × 10−20, to 3.4 × 10−19, to 8.8 × 10−10, and to 5.0 × 10−8 in the dataset with 14, 13, six and five microsatellite loci, respectively, showing that the probability of finding identities by chance increases with the decrease of the number of loci employed. In fact, the identity analysis, which compared 533 individuals in 141,778 pairwise comparisons, did not result in any matching genotype in the datasets containing 14 and 13 loci, whereas in the datasets reduced to six and five microsatellite loci one matching genotype combination was found. The dataset containing six or five microsatellite loci did not allow distinguishing the genotypes of the cultivars ‘Jeanne Hardy’ and ‘Adersleber Kalvill’ as they showed an exact match at all these loci. The probability of identity and the probability of sib identity for the exact match of these two genotypes were 5.8 × 10−11 and 9.3 × 10−4, respectively, when using the dataset with six loci.

The initial analysis aiming to validate the dataset indicated the presence of null alleles at locus COL. In 7 out of 85 presumed parent-offspring combinations tested, a mismatch was found at this locus (see Supplementary Table S3). A closer look at the genotyping data revealed that six of these parent-offspring combinations were homozygous for different alleles at locus COL, which strongly suggests the presence of null alleles. This assumption is further supported by the low observed heterozygosity found at this locus, which amounted to 0.72 (data not shown). In comparison, the average observed heterozygosity for the remaining loci was 0.84 (±0.03 S.D.). When omitting locus COL from the dataset and repeating the parentage analysis with software ML-Relate, no further indication for a genotyping error was noticed. Of the 85 diploid cultivars tested, for which one documented parent was included in the dataset, the highest likelihood for a parent-offspring relationship was found in 77 instances. In three cases, the highest likelihood supported a full sibling relationship, however, the log likelihoods for the parent-offspring relationship were only 0, 0.56, 0.85 lower, respectively (Supplementary Table S3). For the cultivars ‘Collina’, ‘Pimona’, ‘Prime Gold’ and ‘Reglindis’, the parent cultivar suggested from the literature could be excluded (see Table 2), while another cultivar with the highest likelihood for a parent-offspring relationship was identified (Supplementary Table S3). Moreover, while for ‘Prime Gold’ the parentage of ‘Golden Delicious’ was excluded, a parent-offspring relationship was identified for ‘Morgenduft’ and ‘Red Delicious’. In the case of ‘Lombarts Kalvill’, ‘Weißer Winterkalvill’ present in our dataset could be excluded as a parent.

The next analysis focused on a dataset with 116 diploid genotypes, for which information about both parents was available from the literature or other sources, and the molecular genetic profiles of the suggested parent cultivars were also present in the dataset. This analysis confirmed the parentage for 108 genotypes. While 104 offspring genotypes showed an exact match with both parent genotypes at all 13 loci, a single mismatch was observed for the offspring genotype and one of its parents in four cases (Supplementary Table S4). In each mismatch, a different locus was involved, which results in a mean genotyping error rate of 0.28% per locus. For five apple cultivars, ‘Millicent Barnes’, ‘Nevson’, ‘Plumac’, ‘Strauwaldts Goldparmäne’ and ‘Tunda’, one of the assumed parents could be excluded, while another parent was found as the most probable parent (see Supplementary Tables S4 and S5). All confirmed or identified parent-offspring combinations showed positive LOD scores that ranged from 17.01 to 45.31, with an average value of 27.97 (±4.79 S.D.). All LOD scores but one exceeded the critical LOD value of 17.56 (Supplementary Table S4). For all but three cultivars, the parentage assignment was made at the strict confidence level of 95%. For the cultivar ‘Melrose’, which is an offspring of ‘Jonathan’ × ‘Red Delicious’ (Silbereisen et al. 2015), a higher likelihood was found for the combination ‘Akane’ × ‘Red Delicious’, ‘Akane’ being an offspring of ‘Jonathan’. Similarly, for the cultivar ‘Rubinola’, an offspring of ‘Prima’ × ‘Rubin’ (Salvi et al. 2014), a higher likelihood was found for the combination ‘Prima’ × ‘Topaz’, ‘Topaz’ being an offspring of ‘Rubin’. In case of the cultivar ‘Dalilight’, one mismatch was found for the parent combination ‘Elstar’ × ‘Cripps Pink’ and the LOD score was below the critical threshold of 17.56. For this cultivar a higher LOD score, also below the threshold, was found for the combination ‘Elstar’ × ‘Golden Delicious’, ‘Golden Delicious’ being a parent of ‘Cripps Pink’.

For two cultivars, ‘Blushing Golden’ and ‘Sinta’, the presumed parent pairs (see Table 2) could not be confirmed by the analysis. While in both cases ‘Golden Delicious’ was confirmed as one of the parents, the second parent suggested by the literature was rejected. For these cultivars, it was not possible to identify a genotype in our dataset that may represent the second most likely parent (Supplementary Table S5). In addition, the two possible parents of ‘Prime Gold’, identified in the ML-Relate analysis, ‘Morgenduft’ and ‘Red Delicious’ (Supplementary Table S3), were further confirmed as the probable parent pair with a LOD score of 28.89.

The comparison of molecular genetic profiles obtained in the present study with a set of data of common cultivars that were randomly derived from the dataset of Urrestarazu et al. (2016) showed a very good comparability at the five microsatellite loci that were in common. For four microsatellite loci, it was necessary to apply a conversion value in order to align the allele lengths (Baric et al. 2008). For three of these loci, it was even necessary to apply a second conversion value for longer alleles as a shift from odd to even allele lengths (or from even to odd numbers) occurred in the longer allele range of the dataset of Urrestarazu et al. (2016). Nevertheless, after applying the conversion values shown in Table 4 and comparing the molecular genetic profiles of the five cultivars used to determine the conversion values and the 20 randomly derived cultivars, an exact match between the data of the two studies could be found (see Supplementary Table S6).

Discussion

In the last years, there has been an increasing interest in the characterisation and maintenance of heirloom cultivars of apple at the regional and national level of many countries. Multilocus microsatellite analysis has become a standard tool for the assessment of genetic variability and the identification of duplicate accessions in order to contribute to a more efficient management of germplasm collections (e.g. Guarino et al. 2006; Pereira-Lorenzo et al. 2007; Routson et al. 2009; van Treuren et al. 2010; Garkava-Gustavsson et al. 2013; Ferreira et al. 2016; Gasi et al. 2016; Urrestarazu et al. 2016; Larsen et al. 2017; Testolin et al. 2019). Furthermore, molecular data can provide a complementary or alternative means for the determination of apple cultivars based on phenotypic traits. However, while the comparison of molecular genetic profiles of accessions present in a collection or geographic area can help to disclose identical genotypes, such an approach does not automatically lead to a correct cultivar determination. Frequently, trees are maintained in germplasm collections under their local names and without the possibility to compare their molecular genetic profiles to standard cultivars, they could be regarded as local even if they in fact represented widely grown international cultivars (Urrestarazu et al. 2016; Testolin et al. 2019). In addition, several studies have pointed to the problem of misclassified accessions preserved in germplasm collections, which can occur due to phenotypic determination errors, due to mix-up of grafting and/or planting material or due to documentation mistakes (Hokanson et al. 1998; Baric et al. 2009; van Treuren et al. 2010; Urrestarazu et al. 2016).

With regard to using multilocus microsatellite data for determination of unidentified or misidentified apple varieties, a database with molecular genetic fingerprints of well-determined reference cultivars needs to be available (Thomas et al. 1994). Based on the results of a genotyping study of Dutch genetic resources of apple, van Treuren et al. (2010) presented a database with 121 confirmed or verified multilocus microsatellite profiles to identify apple cultivars. The aim of the present study was to set up a more comprehensive reference database comprising 600 unique multilocus microsatellite profiles that could be applied for the identification of historic, old and recent apple cultivars, with focus on the Central European cultivation area. In order to generate a representative dataset, a targeted sampling procedure involving 37 European cultivar collections and more than 1600 accessions of apple trees was implemented. One of the intended applications of the database was to characterise and determine the local genetic resources of apple from the regions of Tyrol in Austria and South Tyrol in Italy that were collected in a common research project (Baric et al., in preparation). Therefore, the apple cultivars to be sampled and genotyped for the establishment of the database were selected based on the study of pomological references. A large part of the database covers apple cultivars that were grown in the Central European area in the past. The database includes 72 out of 74 common German apple cultivars described by Silbereisen et al. (2015), 24 out of 27 cultivars considered as widespread in South Tyrol (Northern Italy) in the first half of the 20th century (Amonn and Meier 1934), and approximately 50% of the cultivars listed in the Austrian-Hungarian Pomology published in 1888 (Stoll 1888), to name but a few. Another utilisation potential of our database was seen as a tool for the support of breeding programmes or the production of propagation material, as the database also comprises a considerable proportion of recent cultivars, many of which are still in the process of technical examination and variety testing.

In order to diminish the inclusion of molecular genetic profiles belonging to erroneously assigned cultivars into the reference database, an important goal of the present study was to sample at least two independent accessions of distinct provenances of each cultivar (Baric et al. 2009; Evans et al. 2011). If two or more (up to eight) accessions carrying the same cultivar name displayed identical molecular genetic fingerprints, they were considered as confirmed and the multilocus microsatellite genotypes were included in the reference database. This objective was reached for 98% of the apple cultivars present in the database. Three or even more independent accessions were analysed of one third of the cultivars present in the database. Among the historic cultivars raised before 1900, the percentage of molecular genetic profiles that were verified by three or more accessions was considerably higher and amounted to 56%. Nevertheless, as discussed by Urrestarazu et al. (2016), an identical molecular genetic profile of two accessions with different origins does not always prove the trueness-to-type, as graft cuttings have been exchanged for centuries on a large geographical scale and erroneously determined accessions could have been maintained and transmitted from one germplasm collection to another. This may indeed be the case for some historic cultivars, as was shown for ‘Edelborsdorfer’, one of the oldest German apple cultivars documented in the 16th century. In a study employing the same set of microsatellite markers as used in the present work, several distinct genotypes denominated ‘Edelborsdorfer’ were found in different cultivar collections (Storti et al. 2013). Among these were accessions with an identical triploid genotype that by the integration of genotyping data and pomological characterisation could successively be identified as the cultivar ‘Seebaer Borsdorfer’, which is a synonym of ‘Fromms Renette’, and also an offspring of the true-to-type ‘Edelborsdorfer’ (Storti et al. 2013). Therefore, the presence of a small number of misidentified cultivars in the here described reference database cannot be completely ruled out and may be rectified in the course of future pomological evaluations of the germplasm collections sampled. For this reason, whenever using the database to determine an unknown apple variety, it is recommended to provide the cultivar name followed by an indication about the germplasm collection where accessions with a matching multilocus genotype can be found. In this way, a subsequent comparative pomological assessment based on reference material from defined cultivar collections could be performed.

In order to acquire high quality multilocus microsatellite fingerprints of each apple cultivar, every sample obtained from a different accession was analysed independently by using an anonymous sampling code. Before data was exported, each electropherogram was visually assessed for binning accuracy. The comparison of the multilocus microsatellite genotypes in a cross-tabulation matrix finally allowed identifying identical or highly similar molecular genetic profiles. In case of incongruences between samples supposed to represent the same apple cultivar, the data were reviewed and, if necessary, the DNA analyses were repeated. Each cultivar was finally represented by a unique genotype that was included into the reference database.

Due to the rigorous quality control performed during laboratory and data analyses, we feel confident that the here presented dataset is reliable and robust. Nevertheless, a parentage analysis performed on a set of multilocus microsatellite genotypes of apple cultivars with previously known parent-offspring relationships permitted to get an impression about the rate of genotyping errors present in our dataset. Genotyping errors affect all kinds of datasets, but the rate of their occurrence is generally addressed by few studies (Hoffman and Amos 2005; Pompanon et al. 2005). Hoffman and Amos (2005) found that one of the most common errors affecting microsatellite data is the misinterpretation of allele banding patterns due to confusion between homozygote and heterozygote genotypes with adjacent alleles. In particular, dinucleotide microsatellite repeats that are prone to “stutter” peaks are affected by this type of genotyping error (Litt et al. 1993). However, a study comparing microsatellite profiles of apple cultivars at a set of five microsatellite loci obtained independently by two laboratories showed that the most common type of discordance among laboratories was ‘dropout of longer alleles’ due to preferential amplification of shorter alleles during PCR, followed by mis-scoring of “stutter” patterns as homozygous or heterozygous, and complete allele mismatch (Baric et al. 2008). Another issue, that can in particularly affect the success of parentage analyses, is the occurrence of ‘null alleles’ that are caused by a non-amplification of alleles due to a mutation at the primer binding site (Hoffman and Amos 2005). In the present study, one of the loci analysed (COL) was found to be considerably affected by null alleles and consequently, this locus had to be excluded from parentage analysis. Other genotyping incongruences between parent-offspring trios with known parentages were observed at very low rates, resulting in a mean genotyping error rate of 0.28% per locus, which points to a high reliability of the dataset. Nevertheless, as the presence of genotyping errors cannot be completely excluded, it is recommended to allow for at least one mismatch when using the database for cultivar identification or for parentage analysis. The parentage analysis used to verify the dataset included apple cultivars that were derived after 1900 and for which information about parent-offspring relationships were provided by different references. Multilocus microsatellite genotypes of more than 100 scion cultivars and rootstocks could be compared to the genotypes of both documented parents, while more than 80 could be compared to one of their parents. This means that approximately one third of the cultivars included in the reference database are supported by the matching of their multilocus microsatellite profiles with one or even two parental genotypes suggested by the pomological literature. Among these is a small number of cultivars for which only one accession could be analysed, but that were still included in the reference database because their multilocus microsatellite profiles matched with the documented parents. The remaining cultivars, which multilocus microsatellite profiles are based on the analysis of a single accession, were not supported as offspring but as a parent, such as ‘Otava’ and ‘Minnewashta’ in the group of recent cultivars. ‘Wyken Pippin’, a British cultivar from the early 1700s could only be obtained from the National Fruit Collection in Brogdale (UK), but was inserted in the database as it is listed as a parent of some historic and old apple cultivars (Smith 1971).

Parentage analyses revealed a smaller percentage of cultivars, which documented pedigrees were not supported by multilocus microsatellite data, as was observed in previous studies (e.g. Evans et al. 2011; Moriya et al. 2011; Salvi et al. 2014). The cultivar ‘Nevson’ was patented as a controlled cross between ‘Gala’ and ‘Red Delicious’ (United States Patent PP12,415; USPTO Database of the United States Patent and Trademark Office), while for ‘Plumac’ the cultivars ‘Fuji’ and ‘Braeburn’ were assumed as the “… probable parents based on their characteristics and their proximity to the new cultivar in the area of discovery.” (United States Patent PP23,418; USPTO Database of the United States Patent and Trademark Office). For each of the two cultivars, multilocus microsatellite data refuted one of the proposed parents and indicated that both are likely to be the offspring of the same cultivar pair, ‘Gala’ × ‘Braeburn’, and can thus be considered sister cultivars. Similar, one of the parents proposed for the cultivar ‘Tunda’, which was described as an offspring of ‘Delcorf’ × ‘Liberty’ (Sansavini et al. 2012) could not be supported by molecular data and instead the parentage of ‘Delcorf’ × ‘Alkmene’ was proposed. The American cultivar ‘Prime Gold’, introduced in 1965, was assumed to be a seedling of ‘Golden Delicious’, also because of their resemblance (Morgan and Richards 2002). However, the most likely parent pair found in our database and supported by matching of multilocus microsatellite data was ‘Red Delicious’ × ‘Morgenduft’, while ‘Golden Delicious’ was excluded as a possible parent. Based on parent-offspring analysis involving only one of the potential parents, new inferences were possible for the cultivars ‘Pimona’ and ‘Reglindis’. Both cultivars were derived from the Dresden-Pillnitz breeding programme (Germany) in the 1960s. ‘Pimona’ was documented as a cross of ‘Clivia’ × ‘Undine’ (Mühl 2001) and microsatellite analysis performed by Reim et al. (2009) confirmed the maternal parent, while ‘Undine’ was excluded as the possible pollen donor. The parent-offspring analyses based on our database identified ‘Tumanga’ as an alternative cultivar that could represent the second parent of ‘Pimona’. The reported parentage of ‘Reglindis’ as ‘James Grieve’ × ‘Antonowka’ F2 was put in doubt in previous studies (Reim et al. 2009; Bus et al. 2012). It was suggested that two different genotypes of ‘Reglindis’ existed and that the commercially grown cultivar did not represent the original selection. The commercialised genotype in fact carried a rare allele at the CH-Vf1 locus, which is analysed as a marker for scab resistance. As the rare allele was known to occur in ‘Geheimrat Dr. Oldenburg’ and its progeny, it was speculated that ‘Apollo’, ‘Helios’ or ‘Alkmene’ may be a parent of the commercialised ‘Reglindis’ (Bus et al. 2012). Indeed, the parent-offspring analysis using our database identified ‘Apollo’ as a parent of ‘Reglindis’, while the parentage of the latter two cultivars was not supported. The parent-offspring relationships of ‘Pimona’ and ‘Reglindis’ proposed by our analyses appear sound as both ‘Apollo’ and ‘Tumanga’ (Syn. ‘Auralia’) were used as ancestors and are confirmed parents of a number of cultivars derived from the Dresden-Pillnitz breeding programme (Reim et al. 2009).

It is likely that our database may have revealed a higher number of parent-offspring relationships among recent apple cultivars, for which no documented information was found. However, this group of cultivars was excluded from parentage analysis because many modern apple cultivars were derived from a limited number of founding clones and their pedigrees may be interconnected (Noiton and Alspach 1996). Consequently, genetically and genealogically closely related cultivars could be mistakenly assigned a parent-offspring relationship, especially if the number of analysed microsatellite loci is restricted (Salvi et al. 2014). This was also evident from the parentage analysis performed in the present study involving cultivars with known pedigrees. For the cultivars ‘Melrose’, ‘Rubinola’ or ‘Dalilight’ higher likelihoods of parentage were found for siblings or grandparents than for the established parents. Whether this is the case for the parent-offspring relationship of the cultivar ‘Collina’, could not be fully clarified in the present study. ‘Collina’ was described as a cross of ‘Priscilla-NL’ × ‘Elstar’, but its multilocus microsatellite profile in our database was not supported as a direct progeny of ‘Elstar’, but of ‘Santana’. The latter is an offspring of the same parent pair (‘Elstar’ × ‘Priscilla-NL’), but the maternal and paternal parent being inversed. These cultivars originated from the Dutch breeding programme and their pedigree was previously analysed by a set of 80 microsatellite markers (Evans et al. 2011). While the postulated pedigree of ‘Santana’ was fully supported by molecular genetic data, that of ‘Collina’ showed two mismatches, leading to the conclusion that “… this pedigree is not yet fully correct, but nevertheless close.” (Evans et al. 2011). Indeed, our dataset comprising a much smaller set of loci also revealed two mismatches between ‘Collina’ and ‘Elstar’ at loci CH02b10 and CH02h11a, while ‘Collina’ and ‘Santana’ showed an exact match at one of the two alleles of all analysed loci. Therefore, it could be speculated that ‘Collina’ (a cultivar from the 2000s) and ‘Santana’ (a cultivar from the 1990s) are not in a full-sibling but in a parent-offspring relationship. However, this speculation would need to be tested by analysing a larger set of microsatellite loci and including the second parent, ‘Priscilla-NL’, in the analysis, which was not available in the present study.

Hundreds of microsatellite markers have been described so far (Guilford et al. 1997; Liebhard et al. 2002; Silfverberg-Dilworth et al. 2006), leading to a high diversity of laboratory protocols and marker sets that have been employed by different laboratories (Sehic et al. 2011). Nevertheless, in the last years there has been a tendency towards harmonisation of microsatellite marker sets employed by different studies in order to allow exchange of data (Larsen et al. 2017; Lassois et al. 2016; Urrestarazu et al. 2016; Testolin et al. 2019). Five of the microsatellite loci analysed in the present study overlapped with the marker set analysed by Urrestarazu et al. (2016), who investigated a large number of accessions from different European germplasm collections. The comparison of microsatellite data obtained in different laboratories is, however, not straightforward, as allele lengths do not represent absolute measures, but can be affected by different capillary electrophoresis systems and chemistries (Haberl and Tautz 1999; Delmotte et al. 2001). Therefore, locus- and laboratory-specific conversion values need to be applied to make datasets directly comparable (Baric et al. 2008). Another issue to be considered in this regard is the occurrence of ‘allelic drift’, which can affect the accuracy of allele binning in automated electrophoresis systems (Idury and Cardon 1997). This is the reason, why we had to apply two conversion values for some of the loci in order to align our dataset with the one of Urrestarazu et al. (2016). In the latter, a shift deviating from the expected pattern for dinucleotide microsatellite repeats with even and odd fragment lengths was observed for three of the five loci compared. Nevertheless, the application of the conversion values and the comparison of 25 cultivars analysed in both studies finally resulted in an exact correspondence of the microsatellite profiles and demonstrated that our data are comparable with those obtained in a different study. This comparison also confirmed that the triploid accessions of ‘Winesap’ that were sampled from two cultivar collections in fact represented the cultivar ‘Stayman Winesap’. The triploid genotype, which is an offspring of ‘Winesap’, seems to be maintained under the name of its parent in several European germplasm collections (Ordidge et al. 2018). In addition, the molecular genetic profiles of the cultivar ‘Wyken Pippin’ at five loci that were in common to the present study and the microsatellite dataset of Ordidge et al. (2018) showed a perfect match. Similarly, by comparing genotyping data at six microsatellite loci that overlapped among our dataset and that of Bus et al. (2012), it was possible to confirm the identity of three ‘Antonowka’ cultivars (Storti et al. 2012). However, it needs to be considered that the probability of finding identities by chance increases with the decreasing number of microsatellite loci included in the analysis. While for the dataset with 14 microsatellite loci, the combined non-exclusion probability of identity and sib identity amounted to 2.6 × 10-20 and 2.7 × 10−7, respectively, these values were considerably higher, 5.0 × 10−8 and 3.9 × 10−3, for the dataset with five microsatellite loci. Indeed, all the 600 genotypes included in the database could be distinguished from each other with the set of 14 and 13 microsatellite loci. When reducing the number of microsatellites to five and six loci, it was still possible to distinguish 599 genotypes, but not the cultivars ‘Jeanne Hardy’ and ‘Adersleber Kalvill’ that showed identical profiles at these loci. Consequently, a reduced number of microsatellite loci needs to be employed with great attention, especially if the dataset comprises data of closely related genotypes. In such a situation, the resulting identities should be used as a first indication for follow-up confirmation studies of the correctness of the molecular genetic determination.

The full set of microsatellite markers analysed in the present study showed a high degree of variability and a good resolution allowing to distinguish all 600 apple genotypes present in the database. As demonstrated in previous studies, the application of microsatellite loci was not suitable to distinguish clones or spurs of the same cultivar (e.g. Patzak et al. 2012; Gasi et al. 2016; Larsen et al. 2017). Accordingly, the microsatellite data did not permit to differentiate the common and the red clones of the cultivars ‘Gravensteiner’ and ‘Jonathan’. In addition, the same genotype was identified for the accessions denominated ‘Tiroler Spitzlederer’ and ‘Plattlederer’ indicating that this local cultivar may display a considerable degree of phenotypic plasticity. It also appears probable that the accession ‘Plattlederer’ had a falsely assigned name and that the true genotype was in fact not analysed in the present study. It is intriguing that the book “Bozner und Meraner Obstsorten” (Amonn and Meier 1934), which describes the 27 most important apple cultivars grown in the area of South Tyrol in the first half of the 20th century, only lists ‘Spitzlederer-Apfel’ (Syn. ‘Tiroler Spitzlederer’), but does not mention the cultivar ‘Plattlederer’. If ‘Tiroler Spitzlederer’ and ‘Plattlederer’ indeed represented two distinct cultivars, the former was obviously more common in the last century and could thus be preserved, while the latter, less widespread cultivar, may have become lost over time. Our database furthermore allowed identifying several synonym names, such as the synonymy of ‘Köstlicher von Zallinger’, ‘Napoleone’ and ‘Carla’ or the synonymy of ‘Stina Lohmann’ and ‘Korbiniansapfel’. Finally, the here presented database proved useful at several occasions to identify unknown or misidentified apple cultivars (Baric 2012). Therefore, apart from its application to characterise genetic resources or to manage germplasm collections, the database could serve as an important tool for quality control. This instrument could prove useful for the selection of parents and the confirmation of crosses in breeding programmes, for the confirmation of true-to-typeness during the production of propagation and planting material in nurseries or to verify the cultivar declaration in the retail process of fresh and/or processed apple fruit.