Introduction

Regulation (EU) 1169/2011 [1] requires the provision of information on 14 priority allergens when intentionally incorporated in the food. While the current EU legislation does not contain specific provisions related to the possible and unintentional presence of these allergens, the General Food Law (Regulation (EC) No 178/2002 [2]) obliges food business operators to implement appropriate risk assessment and risk management procedures to assure that food considered unsafe is not placed on the market. Therefore, many food business operators use a precautionary allergen labelling (PAL) to alert consumers on the inadvertent presence of allergens. In order to avoid the unnecessary use of PAL statements, that may reduce consumer choices, quantitative risk assessments are recommended [3]. These can only be performed on the basis of reliable measurement results. However, a major challenge for allergen analysis consists in achieving reliable and comparable measurement results across different measurement procedures addressing different molecular species or parts thereof.

To facilitate the harmonisation and standardisation of food allergen analysis, a European Network for Food Allergen Detection Laboratories (ENFADL) has recently been established. One of the first actions of this network was to evaluate the equivalence of measurement results reported by ENFADL members for total cow’s milk protein. This ingredient was selected since it is known to cause most allergies in childhood [4] and it causes most of the allergen-based notifications in the Rapid Alert System for Food and Feed (RASFF) [5]. Therefore, the Joint Research Centre (JRC) of the European Commission (EC) organised a feasibility interlaboratory comparison (ILC) for the determination of the mass fraction of total cow’s milk protein in food in support of Commission Regulation 1169/2011 [1].

Unlike commercially available proficiency testing schemes for allergen analysis, which provide a consensus value for each ELISA test kit used and allow participants to report in different units, this first of a kind feasibility ILC defined a common measurand as “mass fraction of total cow’s milk protein in a baked cookie”, independently from the methods used. This allowed an assessment of the equivalence of measurement results obtained by the different laboratories applying different measurement procedures. Since no adequate certified reference material (CRM) was available, a tailor-made test material was prepared and characterised. The corresponding “indicative” assigned value was established using the reference method developed by Martinez-Esteso et al. [6]. This value did not rely on the results reported by the participants.

This manuscript describes the outcome of this unique ILC round.

Experimental

Preparation of the ILC test item

Incurred cookies were prepared at the JRC by mixing fat, sugar and flour. Diluted skim milk powder, sodium bicarbonate, ammonium bicarbonate and salt were added to this mixture resulting in a homogeneous dough. Thirty-gram balls were then prepared, flattened to a thickness of 1.5 mm and cut into circular cookies. The raw cookies were placed on greaseproof paper and baked in an industrial oven for 14 min at 155 °C. The resulting cookies were ground twice: first in a Retsch mill followed by an ultraturax mixer in liquid nitrogen. Three gram portions of the resulting dry powder were finally vacuum-packed in moisture-proof laminated sachets.

Homogeneity and stability

Homogeneity and stability studies were performed at the JRC. Two independent commercially available ELISA kits (from the Morinaga Institute of Biological Science, Inc., Yokohama-Shi, Japan) were used to measure the mass fractions of casein (CAS) and β-lactoglobulin (LACB) in the incurred baked cookie powder. The studies were performed after the test items had been packed into the final sachets, but before sending them to the participants.

A sufficient number of sachets (10 out of the 200 produced [7]) were randomly selected for the homogeneity assessment. These samples were analysed in duplicates using two ELISA test kits under repeatability condition using 1.0 g test portions. The results were evaluated according to ISO 13528:2015 [8]. The contribution from inhomogeneity (uhom) to the standard measurement uncertainty of the indicative assigned value u(xILC) was calculated using the SoftCRM software [9]. Since no standard deviation for proficiency assessment was established, the following criterion for adequate homogeneity was set: the standard deviation of the measurement results—obtained under repeatability conditions (sw) and expressing a random variability within sachets—should be smaller than or equal to the standard deviation for measurements between sachets (sb). This would imply that the uncertainty contribution due to inhomogeneity could not be distinguished from the random variability. This criterion was fulfilled for the marker proteins monitored (CAS and LACB); hence, the test item was considered to be adequately homogeneous.

Based on previous experience, the selected ELISA test kits were considered to have a fit-for-purpose repeatability. It is assumed that, once adequate homogeneity was demonstrated for the selected milk proteins, identical adequate homogeneity would be obtained using LC–MS for the same proteins.

The assessment of stability was performed by keeping 3 sachets at 4 °C for 8 weeks to cover the complete period of the ILC from value assignment till the deadline for reporting. Three other sachets were stored at the reference temperature of −80 °C. At the end of the storage period, all samples were measured under repeatability conditions. The uncertainty contribution due to stability (ust) was calculated using SoftCRM [9]. Since the slope of the linear regression curve, relating the measured value over time, could not be distinguished from zero, the test item was considered adequately stable at 4 °C for the testing period. Hence, the standard measurement uncertainty due to instability was set to zero (ust = 0) [8].

Value assignment and uncertainty estimation

The indicative assigned value (xILC) of the mass fraction of total cow’s milk protein (TCMP) in the baked cookie was determined by applying the single-laboratory validated LC–MS method described by Martinez-Esteso et al. [6], and providing results traceable to the International System of Units (SI). It is based on the measurement of selected marker peptides by mass spectrometry and the conversion of the corresponding MS signals to the not-directly measurable TCMP mass fraction. In brief, proteins were extracted from the cookie test material and an aliquot of the cookie extract was fortified with a solution of stable-isotope-labelled isotopologues of the target peptides. The cookie extract, containing the stable-isotope-labelled isotopologues, was digested and cleaned-up. From the final solution, eleven peptides specific to the five constituent proteins of TCMP were measured by LC–MS: four peptides for αS1-casein (CASA1), two for αS2-casein (CASA2), two for β-casein (CASB), one for κ-casein (CASK) and two for β-lactoglobulin (LACB).

Similarly, calibration blends were prepared from a baked cookie free of TCMP and fortified with a reference solution of the above-mentioned non-labelled peptides. The purities of the non-labelled peptides were determined by amino acid analysis [10]. All aliquots and fortifications were prepared gravimetrically using an analytical calibrated balance and masses were recorded.

According to Gellrich et al. [11], the sum of the individual mass fractions of the five selected marker proteins contributes to 92 % of the total mass fraction of the TCMP. Thus, a compositional factor (fs) taking into account the part of TCMP, which was not covered by the five marker proteins, was set to fs = 1.087 (=1/0.92). Finally, a value of 11.8 mg kg−1 was obtained by combining measurements for the five proteins into a single result (approach A4 described in [6]). This value was set as the indicative assigned value (xILC).

The associated standard measurement uncertainty of the assigned value, u(xILC), was calculated combining the standard uncertainty due to characterisation (or value assignment), homogeneity and stability (uchar, uhom and ust, respectively) as prescribed by ISO 13528:2015 [8]:

$$u\left( {x_{{{\text{ILC}}}} } \right) = \sqrt {u_{{{\text{char}}}}^{2} + u_{\hom }^{2} + u_{{{\text{st}}}}^{2} }.$$
(1)

A relative measurement uncertainty associated with the LC–MS measurement (uchar,rel) of 4 % was estimated by Martinez-Esteso et al. applying the approach outlined above [6] (which includes an uncertainty component associated with the compositional factor fs). A conservative relative uncertainty contribution due to inhomogeneity (uhom,rel) of 2.4 % was derived from the homogeneity study, while ust was set to zero. Hence, a combined standard uncertainty of the informative assigned value u(xILC) of 0.55 mg kg−1 (k = 1) was obtained (=11.8 \(\times \sqrt {0.04^{2} + 0.024^{2} }\)).

Results and discussion

The laboratory performance was expressed in terms of the percent difference (Di ) according to ISO 13528:2015 [8]:

$$D_{i} = 100 \left( {x_{i} - x_{{{\text{ILC}}}} } \right)/x_{{{\text{ILC}}}},$$
(2)

where xi is the measurement result reported by a participant and xILC is the indicative assigned value.

A total of 25 results were reported by 22 ENFADL laboratories. Three laboratories registered twice in order to submit results obtained by different measurement procedures. Laboratory L25 reported a truncated value (“higher than”), while L10 reported a total casein result without converting this value to total milk protein. The vast majority of the reported results (23 out of 25) were obtained applying different ELISA test kits from various manufacturers.

Figure 1 displays the graphical representation of the reported results including their associated expanded measurement uncertainties (see error bars) as reported by the participants (some of them did not report their measurement uncertainties). Most of the participants submitted values below the indicative assigned range (11.8 mg kg-1 ± 0.6 (k = 1) mg kg−1). This is further confirmed by the significantly lower consensus range of 7.3 mg kg-1 ± 0.8 (k = 1) mg kg−1 calculated using Algorithm A and applying Eq. 6 of ISO 13528:2015 [8].

Fig. 1
figure 1

Measurement results including the associated expanded uncertainties (error bars) as reported by the participants. The following symbols are used: filled circles, for a specific commercial ELISA test kit targeting TCMP; open circles, for other ELISA test kits targeting TCMP; open triangles, for ELISA test kits targeting caseins; stars, for the LC–MS method. L25 reported a “higher than” value. The horizontal line depicts the indicative assigned value

A large scatter of results has been observed (with Di ranging from −71 % to +38 %), even when the same ELISA test kits, produced by the same manufacturer targeting total milk, were used (see Fig. 1, filled circles). Also, different standard measurement procedures are prescribed by the respective test kit manufacturer, while the composition of their calibrants that must be used is seldom disclosed. In addition, specificity and cross-reactivity are poorly documented and should be thoroughly investigated by the laboratories performing the measurement. One may also question the reliability of any conversion factors used (to be applied or imbedded in the calibration curve) to convert the measurement results obtained for the targeted protein into the total cow’s milk protein content in the food sample. The low ELISA values reported may be attributed to the use of extraction buffers without reducing agents, which may have hindered the denaturing and release of LACB due to its disulfide bonds. This highlights the importance of understanding the nature of the matrix which has to be analysed and the limitations of each test kit. More information should be provided by the kit manufacturers on the type of extraction buffers used and their suitability for different types of matrices.

Additional experimental information was provided by the participants in a dedicated questionnaire. It included technical questions related to (i) the analytical techniques used by the participants including a separate set of questions regarding the measurement system used by the participant (ELISA, LC–MS), (ii) how the participants ensured the quality of their results (their quality management system) and (iii) their experience analysing these particular targets. Hence, the information collected allowed to identify possible reasons for the diverging results.

In the absence of appropriate certified reference materials (CRM), several laboratories mentioned using the standard reference material NIST 1549a. This SRM is a whole milk powder supplied by the National Institute of Standards and Technology (USA), which provides only an informative (non-certified) protein value. However, this SRM was not intended for allergen detection and may contain “uncommon” ratios or process-related modifications (e.g. lactosylation) of the caseins and whey proteins for a milk product. The processing during the RM production may have affected the total protein and/or marker protein contents as well as their conformation or solubility. Hence, the use of this RM to calibrate or validate a method for the determination of TCMP is deemed questionable.

Two laboratories used LC–MS for the quantification of TCMP (Fig. 1). In order to achieve accurate results by this technique, matrix effects (e.g. ionisation suppression/enhancement, interferences) must be properly accounted for. In addition, a sufficient equilibration between the matrix, the incurred and the added analytes have to be ensured when applying standard addition. Laboratory L07 applied the standard addition approach using skimmed cow’s milk powder, instead of adding stable-isotope-labelled analogues of the analytes. L07 also digested the proteins with trypsin for 60 min only. This very short duration is likely to result in an incomplete digestion (see Nitride et al. [12] for a detailed description), resulting in the lower value reported.

Laboratory L26 used milk protein in a rice matrix for the calibration of their method and an incurred cookie of 16 mg TCMP/kg to convert the measured mass fractions to the reported mass fraction. The protein content of rice is only a fraction of that of wheat and the protein content can vary widely between different wheat varieties. Even though L26 reported the use of six marker peptides for three milk proteins and overnight digestion with trypsin, it is unlikely that the chosen approach is capable to account properly for the matrix effects in the test material. This could explain the highly overestimated reported value.

Conclusions

In order to ensure accurate measurement results produced by competent laboratories using appropriate measurement procedures for the determination of food allergens in food, this ILC implemented a common well-defined measurand, namely the “mass fraction of total cow’s milk protein in baked cookies”. A large scatter of reported results was observed even when the same commercially available ELISA test kit was used. This clearly identifies the need for improving the standard operating procedures of the ELISA test kits to ensure the unambiguous interpretation of their applicability. In addition, test kit manufacturers should verify the quality of the calibrants they deliver and should provide a transparent chain for the metrological traceability of the property values carried by the calibrant.

The provision of appropriate CRMs and proficiency testing (PT) schemes, with metrologically traceable assigned values using a validated reference measurement procedure, is essential in improving the accuracy of routine allergen measurements. Currently, the divergent results between different ELISA kit manufacturers and between different measurement principles, even when requested to report results for the same measurand, have proved challenging for PTs offering participant-derived consensus values.

The need for improvements has also been identified for the LC–MS methods. Due care is recommended to achieve proper equilibration after the addition of standards, a complete extraction and a complete digestion of proteins required for an equimolar release of peptides from the allergenic proteins.

The ENFADL will pursue its efforts in the harmonisation and validation of measurement procedures to achieve a reliable and consistent detection of allergens in food products among the EU Member States. The JRC envisages to develop a dedicated certified reference material that could be used as a common calibrant to allow comparable results from ELISA test kits and the LC–MS methods.