1. Introduction
Nuclear reactors produce a clean and intense flux of electron antineutrinos and thus are very good sources for experiments in neutrino physics. The
spectrum is composed of thousands of spectral components formed by the
decay of the fission products of four main isotopes,
U,
U,
Pu, and
Pu (see [
1,
2,
3] for comprehensive reviews). Calculating this spectrum is not an easy and well-defined task. Rather sophisticated (decade ago) calculations [
4,
5] yielded a net 3–3.5% upward shift in the predicted spectrum-averaged event rate with respect to the previously expected flux used in the earlier short baseline (SBL) reactor experiments (ILL [
6,
7,
8], SRP [
9,
10,
11,
12,
13,
14], Gösgen [
15,
16], Krasnoyarsk [
17,
18,
19,
20], Rovno [
21,
22,
23,
24,
25,
26], Bugey [
27,
28,
29,
30,
31,
32,
33]), as well as in the medium and long baseline (MBL, LBL) experiments Palo Verde [
34,
35,
36,
37], CHOOZ [
38,
39,
40], and KamLAND [
41,
42]. The flux normalization uncertainty in the calculation by Mueller et al. [
5] was declared to be
%. This implies that the measured event rates in the reactor experiments is about 6% less than previously thought. This deficit known as the “reactor antineutrino anomaly” (RAA) still remains an unresolved problem of particle and nuclear physics.
Figure 1 illustrates the current state-of-the-art of the RAA; the early and more recent reactor data in this figure are compared with the prediction based on the
flux by Mueller et al. [
5]. Here, and below, we use the best-fit values for the neutrino mass-squared splittings and mixing angles from the recent global analysis of the neutrino oscillation data by Esteban et al. [
43]; here and below, we assume the normal neutrino mass ordering and no
violation (irrelevant to the issue under consideration). The inverse
decay (IBD) cross section is calculated by using the results of [
44] (see
Section 6.3 for more details). In
Figure 1 and similar plots shown below, all the curves correspond to a reactor with pure
U fuel; in all subsequent calculations, we explicitly take into account the particular fuel composition in each experiment, although the corresponding effect is very small (see
Section 6.5 for explanation). The original data (references are listed in the caption to the figure) are rescaled according to Mention et al. [
45] and then renormalized to the new world average value of the neutron mean life [
46], as explained in
Section 6.1. Note that the RENO and Daya Bay datasets from, respectively, [
47,
48], are relative measurements; they are simply normalized to the curve to demonstrate that these measurements are in excellent agreement in shape with the standard
oscillation scenario. The newer high-precision measurements [
49,
50] are absolute (in
Figure 1; they are placed at the effective flux-weighted baselines). In order not to complicate the figure, we do not show very recent relative SBL measurements of Neutrino-4 (SM-3 HEU research reactor, Dimitrovgrad), STEREO (ILL HEU research reactor, Grenoble) [
51], PROSPECT (HFIR HEU research reactor, Oak Ridge) [
52,
53], and DANSS (a
GW
LEU industrial reactor, Kalinin NPP) [
54,
55], all of which, however, are critical for the present study and will be discussed in detail in
Section 6.
It can be seen from the figure that the theory is in very poor agreement with most of the absolute measurements: on average, only about 94–95% of the emitted antineutrino flux was detected at short and medium baselines and maybe even less at very short baselines (the measurements at m, where L is the distance between the reactor core and detector). Several exceptions (Nucifer, SRP-II, Palo Verde, CHOOZ) do not formally contradict the general trend within the data uncertainties. Consequently, both early and new measurements at different baselines indicate either “new physics” or incorrect inputs, primarily related to the reactor antineutrino flux calculations. Both these possibilities are currently under intensive experimental and theoretical studies, with the search for light sterile neutrinos as the most popular explanation of the anomaly.
In this article, which is a continuation of our previous studies [
61,
62,
63], we discuss an explanation of the RAA alternative (fully or partially) to the sterile neutrino hypothesis and complementary to the more prosaic one, associated with the difficulties in the
flux modeling. Our hypothesis is based on the possibility of violation of the classical (geometric) inverse-square law (ISL) at short but macroscopic distances between the (anti)neutrino source and the detector. The ISL violation (ISLV) is one of the predictions of a field-theoretical approach to neutrino oscillations, which, however, cannot predict the spatial scale at which the ISLV effect may become measurable. We therefore use the reactor data to determine or confine this scale (assuming that the ISLV is related to the RAA). We emphasize that the present study is not a test of the ISLV as a mathematical statement, but only an elucidation of whether the ISLV scale (not determined by the theory) can be related to the SBL reactor data. Since, as we hope to show, the answer to this question is positive, we are also trying to apply the same hypothesis to another still unsolved puzzle of neutrino physics—the so-called “gallium neutrino anomaly” (GNA). However, that issue is somewhat more speculative and the conclusions are less reliable.
2. Extra Neutrinos or Miscalculated Flux?
Most if not all efforts to explain the reactor anomaly rely on the hypothesis of the existence of one or more light (eV mass scale) sterile neutrinos that are fundamental neutral fermions with no standard model interactions, except those induced by mixing with the standard (active) neutrinos (see [
64,
65,
66,
67,
68,
69] for reviews and further references). The active-to-sterile neutrino mixing would lead to a distance-dependent spectral distortion and overall reduction of the reactor
flux.
In
Figure 2, we show, as an example, the results of calculations carried out in the framework of the simplest “3 + 1” phenomenological model with one sterile (anti)neutrino,
, by using the three pairs of the
mixing parameters,
, listed in the legend of the figure.
These values were derived in [
70] from detailed statistical analyses of all the neutrino oscillation data available to date. The “SBL rates only” fit includes the SBL reactor data except the points “Krasnoyarsk-IV” and “Rovno 92” and very new data from Nucifer (at the OSIRIS research reactor, CEA-Saclay), and MiniCHANDLER (Mobile Neutrino Lab deployed at the North Anna NPP), see
Figure 1. The “SBL + Bugey 3 spectrum” fit includes the same SBL data set and spectral data from Bugey 3 [
32]. The “Global
disappearance” fit involves the data from the reactor experiments (including the spectral data from Bugey 3), as well as solar neutrinos (261 data points from Homestake, SAGE, GALLEX/GNO, Super-Kamiokande, and SNO experiments), radioactive source experiments at SAGE and GALLEX, and the LSND/KARMEN
disappearance data from
C scattering (see [
70] for the full list of references and further details). It should be mentioned that these fits operate with somewhat lower (to within roughly 1%) values for the IBD event rates and with a bit different covariance matrix, as compared to those used in the present analysis (see
Section 6.1). Moreover, some of the inputs are a little out of date (see, e.g., [
71] for a more recent global analysis). This is, however, not very important for our purposes, since the considered “3 + 1” model is not used in further analysis and is only needed to demonstrate that it leads to an effect very similar to a banal renormalization of the
flux.
The solid curve in
Figure 2 represents the same
oscillation prediction as in
Figure 1, but shifted down by the normalization factor
derived from a fit to all the reactor data. In this fit, we take into account all known correlations between the data, including the overall normalization uncertainty, which is taken to be 2.7% [
45]. The details of such fits are discussed below, in
Section 6.5.1. The obtained normalization factor
(
) does not contradict the adopted flux uncertainty within ≈2
but is somewhat different from the results of previous calculations [
44,
45,
72], which used different data samples and input parameters. All curves in
Figure 2 but “SBL rates only” are in agreement, within the errors, with the new absolute measurements of RENO and Daya Bay, but are in tension with several data points. The “SBL rates only” curve is expectedly in agreement with most of the SBL data but is in slight tension with the Palo Verde, CHOOZ, RENO, and Daya Bay absolute data points.
As first stated in [
73], the true uncertainty in the
flux predictions may be as large as
, and the spectral shape uncertainties may be much larger due to a poorly known structure of the forbidden decays. This finding has been in essence confirmed by several new precision measurements and theoretical studies. In the recent MBL experiments RENO [
74], Daya Bay [
75,
76], and Double Chooz [
77], using well-calibrated detectors and fairly different industrial reactors, an unexpected excess (so-called “5 MeV bump” or “shoulder”) was observed in the IBD events at
energies within the 4.8–7.3 MeV. Similar excess has been also seen in the reactor SBL (
m) experiment NEOS (Hanbit-5 LEU reactor, Yeonggwang) [
78]. Post factum, the same bump was recognized in the data from the earlier experiment, carried out at three distances from the
GW reactor at Gösgen NPP [
79], and also in a series of experiments performed at five (shorter) distances from the
GW WWER-440 reactor at Rovno NPP [
22] (see also [
80]). The statistical significance in all mentioned experiments is beyond doubt and thus the observed excess appears to be baseline independent. It is currently unclear which physics are responsible for this bump, although several possibilities have been proposed to explain it [
81,
82,
83,
84] (see also [
85,
86,
87] for further discussion and references). This problem complicates the study of the IBD spectrum distortions mediated by sterile neutrinos but might have little or no relation to the reactor rate anomaly. The study of the spectral reactor data obtained in the earlier and new experiments goes far beyond the scope of this work. However, it can be inferred from the above that the efforts to explain the reactor anomaly with the sterile neutrino hypothesis may be somewhat premature due to still unsolved (or yet unidentified) problems related to nuclear physics rather than “new physics”.
In this connection, model-independent methods that do not require knowledge of the non-oscillating
spectrum at the reactor should be mentioned. Such methods were, in particular, used in the experiment Neutrino-4 [
88,
89] (see also [
90,
91,
92] for earlier results and further references) and in the recent combined analysis of RENO and NEOS data [
93] (since the RENO and NEOS detectors share the same reactor complex, this analysis removes the
source dependency in the previous NEOS sterile neutrino search [
78]). The results of these experiments provide intriguing evidence in favor of the “3 + 1” scenario, although they appear to be in deep conflict with each other. In the two consecutive analyses of the Neutrino-4 data [
88,
89] (eprint version 7), the following allowed values for the mass-squared splitting and mixing angle were obtained by using the so-called coherent data summation method:
These results do not agree with the constraints based on the combined analysis of the MINOS, Daya Bay, and Bugey-3 data [
94], and with the limits obtained in the recent SBL experiments DANSS [
95,
96,
97], STEREO [
51], NEOS [
78] (see also [
98], and PROSPECT [
53]; this issue is now broadly discussed in the literature [
99,
100,
101,
102,
103]. On the other hand, the
C.L. allowed the region with the best fit of
obtained in [
93] formally (with minor reservations), does not contradict the mentioned constraints.
Figure 3 shows a comparison of the ratios similar to those shown in
Figure 2, but with the sterile neutrino parameters obtained in [
88,
89,
93].
It is seen that the Neutrino-4 results contradict most of the data on the integrated event rate, and especially dramatically differ from the precision absolute measurements of Double Chooz [
59], Daya Bay [
50], and RENO [
49]. Comparison with
Figure 2 suggests that the Neutrino-4 results disagree with the outcome of the global-fit analyses of all oscillation experiments. Quite to the contrary, the “RENO + NEOS” best-fit parameters are in agreement with the “global
disappearance”
fit. In addition, lastly, both the “RENO + NEOS” and (to a much greater extent) Neutrino-4 allowed regions that are in strong tension with the cosmological constraints based on “3 + 1” model analyses of the modern CMB and BAO data [
104,
105].
One can in particular see from
Figure 2 and
Figure 3 that the proper renormalization of the
flux is hardly distinguishable from the “global
disappearance”
fit and almost fully coincides with the “SBL + Bugey 3 spectrum” and “RENO + NEOS” best fits. This coincidence is in fact accidental because the normalization factor is highly dependent on the
spectrum model. The shapes of the oscillation curves also depend on the shape of the spectrum and, therefore, on the model of the spectrum, but to a much lesser extent. What is more important is that mixing with the sterile neutrinos of mass
eV is very similar to, if not indistinguishable from, an overall downward shift of the curve
(relative to the standard
curve), except very short baselines, where the fine structure of the “3 + 1” curves becomes potentially measurable. To put it differently, the expected effect from sterile neutrinos for the integrated event rate is very similar to an overall
flux renormalization, and the latter is defined by the mixing angle
. Below, the overall normalization factor will be used as an adjustable parameter and will have a double meaning: as a factor compensating the uncertainty in the reactor antineutrino spectrum calculations, or (if desired) as an indicator of the sterile neutrino effect.
It may be even more interesting that the steady decrease of the event rate at very short baselines, mentioned in our previous works [
62,
63], if real, cannot be described by a flux renormalization alone, as well as by the eV-scale sterile neutrinos with the parameters following from the “global
disappearance” fits. Thus, it seems pertinent to consider an alternative or complementary explanation, namely the ISLV hypothesis. It will be shown below that this explanation is almost insensitive to the features of the
energy spectra (of course, when we deal only with the integrated event rates) and thus we can (perhaps temporarily) decouple the nuclear physics problems from possible allusions to the effects of new physics. Finally, we note that even the existence of the light sterile neutrinos does not at all exclude the possibility of the presence of the ISLV effect as well, since both effects may work together.
3. Why Quantum Field Theory?
Before digging into the inverse-square law violation that is based on the QFT approach to neutrino oscillations, let us briefly discuss the standard quantum-mechanical (QM) approach. According to the latter, the probability of the transformation of the neutrino with flavor
to the neutrino with flavor
(
) on the way from source to detector, separated by distance
L, is described by the following well-known expression:
Here,
,
are the elements of the neutrino flavor mixing matrix in vacuum (“PMNS” matrix) defined to link the neutrino mass eigenstates
(
) with definite masses
and momenta
to the neutrino flavor eigenstates
(
) with definite lepton numbers,
(both sets of the states are orthonormal:
,
), and
is the neutrino energy. From here on, we use natural units with
. To obtain Equation (
1), one has to use several assumptions that seem intuitively obvious, bordering on commonplace:
- (i)
Massive neutrino states originating from reaction or decay of any kind have the same (definite) momentum: .
- (ii)
Neutrino masses are so small that, in essentially all experimental circumstances (or, more precisely, in a wide class of reference frames), the neutrinos are ultrarelativistic () and hence ; this is a cornerstone of the QM theory of neutrino oscillations. One can nevertheless neglect that the flavor states (assumed to be “physical”) have no definite masses and momenta and suppose they have (can be characterized by = can be interpreted as) the common energy .
- (iii)
Due to the same reason (ultrarelativism), the travel time T of the neutrino from the source to the detector can be replaced by the distance between them, .
Formula (
1) is widely used in the analyses of all earlier and current experiments studying vacuum neutrino oscillations. However, from the theoretical point of view, the assumptions used for its derivation are rather doubtful.
First of all, if a quantum state has definite momentum, its spatial position is completely undefined owing to Heisenberg’s uncertainty principle. Thus, assumption (i) implies that the spatial coordinates of the neutrino production () and detection () are fully uncertain and, as a result, the distance is uncertain too. The same is true for the travel time T, although more cumbersome reasoning is required here. In a more sophisticated theory, the neutrino momentum uncertainty must be at least the minimum of the inverse characteristic dimensions of the source and detector devices along the neutrino beam. Consequently, more realistic neutrino states should rather be wave packets (albeit with a small momentum dispersion), depending in the general case on the quantum states of the particles involved in the neutrino production and detection processes, but not the states with definite momentum. Well, one may view this inference as a formalistic cavil, and the definite momentum assumption as a reasonable approximation.
Is the “equal momenta assumption” another reasonable approximation? No, this key assumption is unphysical as it is reference-frame dependent. Indeed, according to the Lorentz transformations,
if the equality
holds true (for all
j) in some reference frame, then it is not true in another frame moving with the velocity
relative to the first one, namely,
Hence, for relativistic frames (typical in, e.g., astrophysical environments), violation of condition (i) may lead to a large phase shift, considering that the oscillation phase is approximately invariant () as and approximations (ii) and (iii) are valid. Neutrino masses are so small that even very relativistic boosts () leave neutrinos ultrarelativistic, that is, condition (ii) is satisfied, but condition (i) is clearly violated. Of course, this violation is negligible if the velocity is nonrelativistic, but, in general, this is not the case. Interpreting the Lorentz transformation as active, we see that condition (i) cannot be satisfied simultaneously for all neutrinos emitted due to inelastic collisions or decay of particles moving at different velocities (from subrelativistic to ultrarelativistic), as, e.g., in a broad-spectrum pion beam from an accelerator, relativistic astrophysical jet, or cosmic-ray collisions with the Earth atmosphere. As a result, we conclude that the equal momenta assumption is, strictly speaking, meaningless.
Moreover, even if condition (i) is satisfied approximately in some circumstances (e.g., for a narrow-energy beam of parent particles), the velocities of neutrinos of different masses are different:
During the time
T, the neutrino
travels the distance
. As the neutrino velocities are different, there must be a spread in distances of each neutrino pair:
where we put (denote)
. If, for example,
AU and
MeV, then
cm. Is such a spread small or large to preserve the quantum coherence necessary for neutrino flavor transitions? If one considers the
physical neutrino states as wave packets, it is obvious that the oscillations’ pattern must disappear when the neutrino packets diverge far enough from each other. However, how far? In other words: how the effective size of the neutrino wave packet depends on energy and on features of the neutrino production and detection processes? The standard QM approach does not provide information on this.
Another objection to the QM approach relates to hypothetical heavy (sterile or superweakly interacting) neutrinos. Consider a toy experiment in which a neutrino beam is generated by decay at flight of very high-energy particles P whose mass is less than the mass of heavy neutrino and, therefore, the latter cannot be produced in decays of particles P (regardless of the value of their coupling with P). However, according to the naive QM approach, which ignores the physics of neutrino production and detection, the heavy neutrinos can appear in the beam through mixing with the ordinary light neutrinos and, moreover, be ultrarelativistic. The Lorentz boost into the rest frame of particle P immediately shows the impossibility of such a phenomenon, but the standard QM theory does not prohibit this. It is worth noting that this objection is not in the least directed against the existence of heavy neutrinos.
There are many other, sometimes interrelated questions that cannot be answered within the QM approach, for example:
- •
Do charged leptons oscillate, and if not (as experiment suggests), then why?
- •
Do relic (“CB”) neutrinos (some of which are definitely nonrelativistic) oscillate? Or, in a more general form: how to extend the theory to cover nonrelativistic neutrinos?
- •
How does the relative motion of the neutrino source and detector affect the survival and transition probabilities?
Further discussions of some of these and related issues and numerous references can be found in [
106,
107,
108,
109]. In the following consideration, we adhere to an approach based on perturbative quantum field theory (QFT), which explicitly accounts for the neutrino production and detection processes, does not use the ad hoc assumptions of the QM approach and is free of paradoxes. It allows one to interpret the QM formula (
1) properly, determine its area of applicability, and derive corrections to this formula. It also makes it possible to answer the above questions, predicts potentially observable effects (e.g., loss of coherence and dispersion distorting the standard QM neutrino oscillation pattern), and leaves room for further generalizations and extensions. Various aspects of the QFT approach have been discussed by many authors [
109,
110,
111,
112,
113,
114,
115,
116,
117,
118,
119,
120,
121,
122,
123,
124,
125,
126,
127,
128,
129,
130,
131,
132,
133,
134,
135,
136,
137,
138,
139,
140,
141,
142,
143] (see also references in these article), who used rather different methods and approximations, leading to similar (although not always identical) conclusions and sometimes to disputes (see, e.g., [
144,
145,
146]). The experimental activity in the field is still at the very beginning [
147], but the reactor experiments (main subject of the present article) have the potential to study the decoherence and dispersion effects predicted by the QFT approach [
148,
149,
150,
151].
Below, we will very briefly review the essentials of the covariant “diagrammatic” formalism proposed in [
134,
135] (see also [
109] for a more detailed presentation) and dwell on another prediction of the theory, not related to decoherence or dispersion. The key ingredient of the formalism are covariant wave packets used to describe the quantum states of the initial and final particles involved in the neutrino production and detection processes. Many studies have been devoted to the development of this aspect of the theory (see, e.g., [
152,
153,
154,
155,
156,
157] and references therein), which extends well beyond the narrow topic of neutrino oscillations.
4. A Sketch of the QFT Approach
The “neutrino oscillation” phenomenon in the S-matrix QFT approach is nothing else than a result of interference of the macroscopic Feynman diagrams perturbatively describing the lepton number violating processes with the massive neutrino eigenfields that are internal lines (propagators) connecting vertices of interactions in which the neutrino is produced (together with a charged lepton) and absorbed (producing another charged lepton). In other words, the massive neutrinos, (), are treated as virtual intermediate states, while the neutrinos with definite flavors, (), do not participate in the formalism at all, although they can be used to compare the predictions with those from other approaches, in particular, with the QM approach based on the concept of flavor mixing. The spatial interval between the Feynman diagram vertices (hereinafter referred to as “source” and “detector”) can be arbitrarily large.
A generic example of such a macrodiagram is shown in
Figure 4, which also introduces notation used below. The external lines of the macrodiagrams are assumed to be asymptotically free quasi-stable wave packets (WP) rather than the conventional one-particle Fock’s states
with definite 3-momenta
and spin projections
s.
If the states
and
consist exclusively of hadronic WPs and
, the lepton charges
and
are not conserved in the process
(where sign “⊕” indicates that the regions of interaction of the corresponding WPs are macroscopically separated in space and/or time) that is only possible via exchange of massive neutrinos (no matter whether they are Dirac or Majorana particles). Formally, the sets of initial (
I) and final (
F) states interacting in the vertices of the macrodiagram may include any number of WPs (particles or nuclei), but more often the
I states include one or two WPs as, e.g., in the process
. The particular “decryption” of the neutrino production/absorption mechanism in
Figure 4 assumes the standard model charged current interaction of quarks and leptons, although this is not necessary for the formalism under discussion; the more general case can include nonstandard interactions (e.g., flavour-changing neutral currents), new generation neutrinos and charged leptons, and so on.
According to [
135], the free external WP states are constructed as covariant space-time point dependent linear superpositions of the one-particle states,
(where
p and
k are the 4-momenta;
,
, and
), satisfying the
correspondence principle which demands that
turns into
in the plane-wave limit (PWL),
, which is equivalent to the following condition for the relativistic invariant form factor function
:
The detail properties of the WP states Equation (
2) are discussed in [
109,
153]. Very shortly, the function
can parametrically depend on a set of constants or Lorenz-invariant combinations of “hidden” variables (4-momenta of the WP states participated in the creation of the state Equation (
2)), but in the simplest case, the momentum spread of the packet Equation (
2) can be characterized by a single parameter
(momentum spread) defined in such a way that the limit Equation (
3) occurs as
. Thus, it is natural to assume
to be small, namely
(WP states for massless particles require a more special consideration). In this simplest case, the WP is spherically symmetric in its center of inertia frame of reference (where
) and, in arbitrary RF, the vector
has the physical meaning of the most probable 3-momentum.
In our approach, the amplitude of the process
is constructed as ordinary,
but the standard Feynman rules have to be modified to account that the standard initial and final one-particle states with definite momenta have to be replaced by the WP states Equation (
2). Then, after applying several natural model assumptions and technical simplifications, it is proved [
134,
135] that the neutrino-induced event rate in an ideal detector can be expressed (somewhat symbolically) in the following form:
Here,
is the detector exposure time,
is the neutrino energy,
is the QFT generalization of the standard quantum-mechanical neutrino flavor transition probability, and the differential form
represents the differential cross section of the neutrino scattering from the whole detector device;
is the differential neutrino flux incident on the detector from a stationary source device (e.g., a fission reactor core). The integrations in Equation (
5) are over the source (
) and detector (
) fiducial volumes
and
. In the conventional ultrarelativistic approximation, the generalized “flavor transition probability” includes the standard QM oscillation phase factor and corrections of different kinds to it, but the derivation of these results does not require the unphysical assumptions discussed in
Section 3. The main requirements explicitly used in the derivation of Equation (
5) are, instead, rather natural and apparent:
- (i)
spatial dimensions of and along the (anti)neutrino beam are small compared to the distance between them but large compared to the effective dimensions () of all WPs colliding, decaying, or appearing in and ;
- (ii)
the statistical distributions of the incoming WPs over their mean 3-momenta, discrete quantum numbers, and mean spatial coordinates in and can be described by stationary one-particle distribution functions or, more generally, by density matrices;
- (iii)
the spatial distributions of the incoming WPs are smooth, that is, the scale of their variations within and is much larger than the effective dimensions of the WPs themselves;
- (iv)
the edge effects can be neglected, and the time intervals needed to turn and on and off are small compared to the periods of their steady operation;
- (v)
the experiment catches only the secondaries b arising in , whereas the background events caused by the (long-range) secondaries falling into from can be fully ignored;
- (vi)
the neutrino oscillation lengths are large compared to .
Not all of these constraints are mutually independent and are not equally important. It should also be emphasized that essentially all of them are not necessary in the general formalism but are usually obeyed (or are believed to be obeyed) in most (anti)neutrino oscillation experiments (see, however,
Section 7).
The theory explicitly predicts that, at sufficiently long spaces between the neutrino production and absorption points
and
, the neutrino flux decreases with increasing
in compliance with the classical inverse-square law (ISL):
This reasonably expected result is unrelated to the lepton numbers violation; it was derived by using a very general theorem, called the Grimus–Stockinger (GS) theorem [
112], which defines the asymptotic behaviour of the amplitude Equation (
4) at
, and this is the crucial point in the context of the problem under investigation. As it follows from the formalism, the
L-dependence of the amplitude described by the macrodiagram, like that shown in
Figure 4, is defined by the neutrino propagator modified (“dressed”) by the external WPs,
where
,
and
are the 4-momentum transfers,
are the most probable (on-shell) 4-momenta of the external packets
, and
is the mass of the neutrino
mass eigenfield. The functions
and
are the “smeared”
functions parametrically dependent on the (most probable) 4-momenta
, masses
(
), and momentum dispersions
of the external in and out WPs; it is assumed that
. The explicit form of the functions
is determined by the Lorenz-invariant form factor functions
which describe the external WPs, but, regardless of the explicit form of the functions
, in the plain-wave limit (
,
), the functions
should turn into the ordinary
Dirac
functions,
thus leading to the exact energy–momentum conservation in the vertices of the macrodiagram shown in
Figure 4; as a result, the function Equation (
7) becomes, up to a multiplier, the standard fermion propagator.
If, however, the momentum spreads
are finite, and the space-time behavior of the function Equation (
7) is nontrivial. In particular, its spatial dependence at sufficiently large distances
L is given by the above-mentioned GS theorem [
112], according to which
as
. This offers the QFT explanation of the ISL behavior Equation (
6) but does not, however, provide the spatial scale above which the distance
L may be considered as “sufficiently large”. It is assumed that the complex-valued function
itself and its first and second derivatives decrease at least like
as
and
. In [
61], an extended version of the GS theorem has been proved, which parametrically defines such a scale by using the asymptotic expansion of the integral
in terms of inverse powers of
L at large
L. To be more precise, the theorem in its simplest form states that, for any function
in the Schwartz space
where
are recursively defined differential operators in the momentum space; the lowest order operators are
In [
158], a closed formula for the
expansion of
was obtained, which is equivalent to Equation (
9) but is a bit more mathematically transparent. We do not discuss this result, since below we will only use the leading order (LO) terms of expansion Equation (
9), which is sufficient for our present purposes. An analysis of Equation (
9) shows that the
behavior of the amplitude Equation (
4) (and thus the ISL behavior of the event rate) is violated at the distances
, where
and the function
represents a cumulative effect of the overlapping of the external (in and out) states
dependent on the mean velocities, masses, and momentum spreads of these states, and on the mean neutrino momentum
, defined by these quantities. The explicit form of the function
can be found after specification of a particular model for the external WP states. A simple example is discussed in [
61] within the so-called contracted relativistic Gaussian packet (CRGP) model [
152,
153]. It is in particular shown that the “overlap function”
is defined through the space-time and transverse (with respect to the neutrino propagation direction
) components of the so-called inverse overlap tensors (
and
), which determine the effective space-time (
) overlap volumes of the in and out WP states around the impact points
and
in the vertices of the macrodiagram Equation (
Figure 4). The most general formulas for the tensors
are derived in [
61] (see also [
109] for more details). It is significant that they are nearly independent of the masses of ordinary neutrinos (assuming these masses to be small with respect to the neutrino energy and thus
). Within the CRGP model, it can also be shown that the magnitude of
is strongly affected by the hierarchy of the momentum spreads (
) of the external WSs
, but in a simple (though not very realistic) case, when all these spreads are similar in order of magnitude, the overlap function
can also be of the same order, up to the Lorentz factors of the most relativistic particles. Generally, the spatial scale Equation (
10) can be macroscopically large for sufficiently small values of the function
and/or for sufficiently high neutrino energies, thus leading to a potentially measurable ISL violation (ISLV). Verification of this possibility is the main goal of the subsequent analysis of the reactor antineutrino data.
It is shown in [
152] that the series Equation (
9) modifies the formula for the event rate Equation (
5) in such a way that the differential neutrino flux Equation (
6) is multiplied by the asymptotic series in even powers of
L,
with the coefficient functions
explicitly defined from the expansion Equation (
9). A critical property of the series Equation (
11) is that its first term is
negative. The proof of this property is not very simple, and one of the most complicated steps in this proof is the proper integration with respect to
, which is required to obtain the modified neutrino propagator
defined by Equation (
7). Using the CRGP model and the
third order saddle-point asymptotic expansion (see, e.g., [
159]), it can be proved that
where
and
are the components of the aforementioned inverse overlap tensors in the source and detector vertices; the purely transversal term
in Equation (
12) is the first-order contribution; the second and third order corrections are not in general small and describe nontrivial effects of the in-in, out-out, and in-out WP overlaps in time and space in both source and detector vertices. It should be pointed out that the scale of the functions
is defined not only by the momentum spreads of the external WPs,
, but also by their masses and momenta, some of which can be ultrarelativistic. Therefore, the value of the function
may generally differ from any of
by orders of magnitude. We also note that expression Equation (
12) is derived for the coordinate system whose third axis is directed along the neutrino velocity and thus the boost covariance is not explicit. Note that the Lorenz-invariant function
represents (at long but not very long distances) the momentum spread of the effective WP of neutrino
, which, in turn, determines the effects of decoherence (see [
61,
134] and also [
154] for rigorous consideration). This, in particular, means that the function
is only mediately related to the neutrino momentum dispersion and thus it cannot be measured at long distances. The ISLV effect is ultimately associated with incomplete overlap in space and time of the noncollinearly interacting incoming and outgoing WPs (e.g., parent and daughter nuclei) involved in the processes of neutrino production and absorption, which must lead to a decrease of the cumulative probability of the WP interaction. The negativeness of the coefficient function
is therefore quite expected.
Let us add that a similar reason (partial overlap of wave packets) is basically responsible also for the very long-range decoherence. In this case, due to a kind of duality, the behavior of the neutrino propagator is naturally translated into the language of the effective neutrino WPs; the effect consists of a distortion of the standard oscillation pattern, and—in the very long distance limit—to the disappearance of the oscillations. The explanation in this language is rather apparent: WPs of neutrinos with different masses move at different mean velocities and, therefore, the probability of their overlap decreases with time and distance. Simultaneously, the neutrino WPs spread with time, thereby increasing the probability of their mutual overlapping. These two competing processes govern the evolution of the oscillation pattern at very long spatial ranges and definitely must work at extraterrestrial (astrophysical) distances.
At short distances (as in the short baseline reactor experiments), the ordinary neutrino oscillations by themselves do not play any role and therefore the ISLV effect has nothing to do with the neutrino masses and mixing, as well as, and even more so, with the overlap or spreading of the effective neutrino WPs. The single macrodiagram Equation (
Figure 4) describes the cumulative, “already accomplished” result of the interaction of the external WPs in its vertices, and the counting rate Equation (
5) is the result of averaging of the squared amplitude Equation (
4) over the periods of operation of the source and exposure of the detector, and these periods bear no relation to the time of neutrino propagation from the source to the detector (this is, by the way, one of the main differences of our formalism from others). Consequently, the spatial scale of the ISLV does not have to be comparable with the effective size of the neutrino WP.
Using Equation (
11) in the first approximation (thereby assuming that
and hence the ISLV correction is small), substituting it into Equation (
5), and taking into account the inequality
, we arrive at the modified formula for the neutrino event rate,
which represents the phenomenological signature of the ISLV effect. Needless to say, at present, the function
cannot be derived from the first-principle calculations, but it can be measured (if it is not too small) in the spectral measurements in the experiments with appropriately short but macroscopic baselines. In further analysis, we assume that the decoherence and dispersion effects that are present in the general QFT expression for the survival probability
are insignificant at the distances under consideration, and thus the standard QM formula (
1) is applicable. Additional simplifications and concretization of formula (
13) are discussed below in
Section 6.3.
5. Antineutrino Energy Spectra
Reactor antineutrino fluxes are one of the main ingredients and the main source of uncertainties of subsequent analysis. In a typical nuclear reactor, almost all (>99%) antineutrinos are produced through thousands of
decay branches of fission fragments from
U,
U,
Pu, and
Pu. There are two main approaches to calculate the antineutrino fluxes produced by these isotopes. One can either employ the ab initio method [
81,
160,
161,
162,
163,
164,
165,
166,
167,
168,
169,
170,
171,
172,
173] by a direct summation of the
energy spectra form the
decay of each fission fragment using information from the relevant nuclear databases, or apply a conversion procedure [
4,
174,
175,
176,
177,
178,
179,
180,
181,
182,
183] based on measurements of the integral electron energy spectra for the fissionable isotopes; the conversion-based models have relatively little dependence on the nuclear databases. Both approaches can be used in a “summation + conversion” combination, as in [
5,
184]. In [
80], a direct method for measuring the IBD reaction products was applied (using the high-statistics data collected with the high-aperture RONS spectrometer at the Rovno NPP [
185]).
It might be good to point out here that, if the ISLV hypothesis turns out to be true, then the conversion method based on the
-spectrum measurements at very short distances from the reactor core may be either inapplicable at all or of limited utility. Unfortunately, our formalism is not yet capable of making even qualitative predictions for the very small baselines (
), where the LO ISLV correction is obviously insufficient. Thus, let us postpone this problem for later. In any case, in this article, we did not set ourselves the daunting task of studying all the competing models for the reactor antineutrino spectra. Instead, we take a pragmatic approach, using a representative set of the
energy spectra calculated by different methods, namely we use the spectra calculated by Huber [
4], Mueller et al. [
5], Fallot et al. [
166], and Silaeva and Sinev [
173], and compare these with the original conversions of the
-spectrum measurements performed with the spectrometers BILL (at the ILL in Grenoble) [
176,
177] and RONS (at the Rovno NPP) [
80]. In addition, we combine theoretical or conversion-based spectra with the cumulative
spectrum from
U fission obtained with the scientific neutron source FRM II in Garching [
180], instead of the corresponding author’s contributions. Thus, we use ten different models in total.
Comparisons of the models under consideration are shown in
Figure 5,
Figure 6 and
Figure 7, where the spectra are plotted as the ratios to the Mueller parametrization
where
and the coefficients
are given in the following matrix:
The corresponding spectra are plotted in
Figure 8. To facilitate visual comparison of the models displayed in
Figure 5,
Figure 6 and
Figure 7 with each other, we also show, as a reference model, the parametrization of the spectra calculated by Vogel and Engel [
165] (of course also normalized to the Mueller parametrization). To simplify and speed up the subsequent calculations, we use the second degree B-spline interpolations of the spectra from all the models being discussed. Some examples of this interpolation are shown in
Figure 5,
Figure 6 and
Figure 7. In case of the FRM II data, the spline is smoothly sewn with Mueller’s parameterization outside the measurement region. We have verified that the use, when available, of the author’s parameterization (similar to Equation (
14)) has practically no effect on the results of the fits.
We emphasize that our goal is not at all to find out which of the spectrum models or method is the best according to one or another criterion, but it is to study the sensitivity of the parameters
and
to variations of the spectra. As can be seen from the figures, the spectra predicted in different models can differ by an amount that sometimes exceeds the declared uncertainties of the models. The recent Silaeva–Sinev calculation [
173] demonstrates a spectral feature that resembles the notorious 5 MeV bump for the
fluxes from
U and
Pu and thus it can be used in particular for studying the interplay between the overall renormalization (dependent on the bump) and the ISLV effect.
In the present analysis, we do not take into account the correlated and uncorrelated uncertainties of the spectra predicted in each individual model, but instead we accumulate all the uncertainties in the single normalization factor
. We also neglect differences in the detection thresholds for the
energy, adopted (explicitly or implicitly) in different experiments and use integration from the IBD reaction threshold. Our estimates suggest that this simplification would only lead to marginal effects in our numerical analysis, when the experimental cuts are not too far from the kinematic boundaries. Similarly, it would be also an excess of accuracy to take into account the
decay branches from
Rh and
Pr and antineutrinos from nonfuel materials in reactors [
188], which contribute slightly above the IBD threshold. On the other hand, large discrepancies between the high-energy spectral shapes usually have a little effect on the outputs because of very small contributions of these tails. It should be, however, pointed out that the coefficients in the asymptotic expansion Equation (
9) are functions of all kinematic variables of all particles and nuclei involved in the processes of (anti)neutrino production and detection, and, as a consequence, the effective parameter
is in general sensitive to the detection threshold. Thus, the differences in the experimental detection thresholds and, more generally, in the experimental cuts add uncontrollable uncertainty to the global analysis. Considering that the expected ISLV effect itself is very small, all of these issues must be the subject of future investigations.
In addition, there are other sources of uncertainty that are potentially relevant to our analysis. One of them is the change in the IBD detection rate arising from evolution of the fuel content in the reactor, which may be especially important for the experiments based on commercial LEU reactors experiencing significant changes in fission rates during their fuel cycles (see, e.g., [
189,
190,
191,
192] for extended discussion). Therefore, it is important to study the robustness of the parameters
and
, extracted from the fits, also to this effect. To simplify this job, we just compare the dependencies of the IBD yields calculated for several fuel compositions. Looking ahead, we note that the corresponding effect turns out to be either small or negligible at the current level of accuracy.
The stability of
to these kinds of uncertainties is quite expected: the main influence of the ISLV effect is associated with short baselines, where the standard three-flavor oscillations are completely insignificant and nonstandard distortions (in particular, due to the hypothetical eV-scale sterile neutrinos) are, according to our assumption, either absent or small. As a result, the shapes of the
energy spectra and fuel composition are almost or fully inessential in the ratio Equation (
16). In contrast, the normalization factor
is equally sensitive to the data measured at all baselines. Moreover, the high-precision measurements from Daya Bay and RENO provide essential contributions to
. Since the two fitted parameters are strongly correlated, they both must be in general sensitive to the above-mentioned effects. The sensitivity to the subleading effects mentioned is relatively low, and our simplifications are adequate to current accuracy of the experimental data and theoretical inputs. However, as measurements and calculations improve (and if our hypothesis will not be disconfirmed), it will be important to use the most accurate up-to-date models of the antineutrino energy spectra and take into account all potentially significant sources of uncertainty, including bin-to-bin correlations.
7. Gallium Neutrino Anomaly
The gallium neutrino anomaly (GAA) is a deficit in the number of events caused by electron neutrinos from intense artificial
Cr and
Ar sources and measured by the gallium-based solar neutrino detectors GALLEX [
209,
210,
211] and SAGE [
212,
213,
214]. The aims of these measurements were to calibrate the detection efficiency of the detectors and to test the experimental procedures, including chemical extraction, counting, and analysis techniques. The results of the measurements are summarized in
Table 2.
As explained in [
214], the ratios of the measured-to-predicted rates,
, given in
Table 2 for the two GALLEX experiments have been revised due to changes in counter efficiencies, the solar neutrino subtraction, and the
Rn cut inefficiency subtraction (see [
215] for more detail). The mean ratio formally combined from the four measurements is
. The quality of the fit to the average value is characterized by
and the goodness-of-fit
. The ratios shown in
Table 2 have been calculated with respect to the rates estimated using the best-fit values of the cross section of the detection process Equation (
30) calculated by Bahcall [
216]. Haxton’s calculation [
217] reduces the ratios by about 10%. In recent years, these results have been reconsidered by many authors (see, e.g., [
218,
219,
220,
221,
222,
223] and references therein). In
Table 3, we show the results of five such reanalyses [
216,
217,
220,
222,
224].
Isotope
Cr decays by electron capture to the ground state of
V (90.1%) and to the 320 keV excited level (9.9%), and
Ar decays 100% by electron capture to the ground state of
Cl. Taking into account the atomic levels to which the transitions can occur, the chromium (argon) neutrino energy spectrum consists of four (two) lines. Detection of neutrinos is achieved through the charged-current neutrino capture reaction
which produces a
nucleus in one of four (for
Cr neutrinos) or five (for
Ar neutrinos) lowest excited states, including the ground state. The neutrino spectra along with the pertinent fractions and gallium cross sections are listed in
Table 4 taken from [
224]. The cross sections are model dependent (see, e.g., [
222]).
If the ISLV mechanism really works for reactor’s
s, it must also work for the Cr and Ar
s albeit on a different spatial scale (considering the differences in the neutrino energies and production/detection processes). From the extended GS theorem, it follows that the corresponding scale is of the order of
where
is an effective parameter similar to that for the reactor data and
is the mean neutrino energy (defined through the neutrino energy lines, decay fractions, and capture cross sections listed in
Table 4). The value of
does not have to be the same (even in order of magnitude) as the value extracted from the reactor data. Moreover, it can be in general different for the chromium and argon
sources. However, in order to check, at least at a rough qualitative level, the possibility that the GNA can be relevant to the ISLV, let us speculate that, for both
sources,
is of
the same order of magnitude as the reactor
. Then, using Equations (
27) and (
31), the effective length can be crudely estimated to be
where the errors are estimated at
C.L. Surprisingly, these ranges are comparable in order of magnitude with the average neutrino paths in the gallium detectors. Therefore, it makes sense to undertake a bit more detailed analysis.
Figure 21 schematically shows a sectional view of the detectors GALLEX and SAGE (both are cylindrically symmetrical) and their dimensions, including the dimensions of the cylindrical openings with radioactive sources. Obviously the approximation Equation (
13) is not applicable for the zone of the fiducial volume,
, close to the source (the L0 ISLV correction is clearly insufficient for
and the neutrino path is not much larger than the size of the neutrino source). Moreover, the general expression for the count rate Equation (
5) becomes incorrect since the trivial but important condition (i) (
Section 4) does not hold in these zones. Therefore, a bit more general formulas and much more technically complex analysis are required to study the gallium anomaly in terms of the ISLV effect, as well as (it is pertinent to emphasize here) to quantitatively explain the anomaly in terms of the sterile neutrino hypothesis. The latter circumstance is simply ignored (without any reservations) in the related studies based on the standard QM approach. Below, we restrict ourselves to very crude estimates, remaining within the same framework as in the case of the reactor anomaly.
Let us roughly represent as a sphere of radius which is determined by the same effective spatial scale 20–30 cm responsible for the ISLV effect. Thus, although we cannot estimate the ISLV effect in the whole fiducial volume V of GALLEX or SAGE, we can do it for the volume with the cut-out sphere surrounding the source region. Clearly, the larger the radius of the sphere , the more accurate the estimate. While this will not be a complete solution to the problem (especially for the small detector SAGE), it will allow us to check the consistency of the assumptions made.
In this formulation and assuming no mixing with sterile neutrinos, we may expect the following measured-to-predicted ratio due to the ISLV effect:
Here, the origin is positioned at the center of the source; the shape of the volume
in the GALLEX and SAGE detectors is clear from
Figure 21. Let us parametrize the radius of the sphere
as follows:
In the calculations, we use
eV (this is the lower bound for the reactor
) and
(this is just an artificial parameter that makes it easier to visualize the dependency of the ratio Equation (
32) on the size of the cut volume).
Figure 22 shows the comparison of the ratios
as functions of
, evaluated with several values of the radius
. An increase of the radius leads to a decrease of the volume
and, as a consequence, to an increase of the neutrino mean path and the ratio
. The correction becomes ∼1 (which is meaningless for the first term of the asymptotic series) at the inner edge of the remaining fiducial volume for the given range of
. Thus, the calculations extending around and below these borders are very rough extrapolations of the LO calculations. The curves for the G1 and G2 experiments are indistinguishable in the physical regions since the only difference in the calculations for these experiments is in the positions of the chromium source (
and
in
Figure 21). Since the SAGE tank is much smaller compared to the GALLEX one, the mean neutrino path is shorter and therefore the ISLV effect is larger. On the other hand, the volume
in SAGE is also very small, which makes comparison with the SAGE data more uncertain.
The SAGE and GALLEX data are represented in
Figure 22 by the wide shaded bands; the upper and lower boundaries on the measured ratios are model-dependent and are defined here as, respectively,
and
, where the maximum and minimum are taken over the
uncertainties of the values of
listed in
Table 3; the lower bound is given by the Haxton model [
217] and the upper bound—by the model of Kostensalo et al. [
222]. Semi-transparent rectangles in the figure enclose the allowed range of
obtained from our global analysis of the reactor data (see Equation (
26),
Section 6). It is indicative that it is in this range that an apparent correlation between the predicted curves and gallium data is observed. Considering that the ISLV effect is very sensitive to variation of
, this observation a posteriori validates our assumption that this parameter may be of the same order of magnitude in the reactor and gallium experiments. However, it is quite possible that we are dealing with just a coincidence here.
Sadly, the forced roughness of our estimations does not allow us to draw more definite conclusions. It is hoped that the new-generation experiments with artificial neutrino sources, like BEST/BEST-2 [
225,
226], SOX/CeSOX [
227,
228,
229,
230], and CeLAND [
231,
232], as well as a more rigorous analysis of the data, will make it possible to test the ISLV hypothesis more directly and conclusively. Although the primary aim of these experiments is to clarify the sterile neutrino puzzle [
64], they are quite suitable for this purpose as well. However, of course, a dedicated experimental setup (large sectioned detector) would be highly desirable.