Abstract
We derive rigorously the leading order of the correlation energy of a Fermi gas in a scaling regime of high density and weak interaction. The result verifies the prediction of the random-phase approximation. Our proof refines the method of collective bosonization in three dimensions. We approximately diagonalize an effective Hamiltonian describing approximately bosonic collective excitations around the Hartree–Fock state, while showing that gapless and non-collective excitations have only a negligible effect on the ground state energy.
Similar content being viewed by others
1 Introduction and main result
In the last thirty years, the study of the quantum many-body problem has made tremendous progress, in particular for weakly interacting regimes where the validity of mean-field theory (or slightly more generally the quasi-free approximation) as an effective theory can be proved. In particular for bosonic systems the mathematical results have been very rich. Just to name some: in the beginning of the 2000s the Gross–Pitaevskii functional for the ground state energy of dilute Bose gases was derived [85, 88]. Later the time-dependent Gross–Pitaevskii equation was derived [46, 47]; bounds on the rate of convergence were obtained by [11, 31]. In 2011 validity of the quasi-free approximation for the excitation spectrum of Bose gases in the mean-field regime was proven [100], thus obtaining also the next-to-leading order of the ground state energy. In contrast, for dilute gases, the quasi-free approximation is not sufficient for obtaining the second order of the energy, although it can be used to derive the leading order with optimal rate of convergence [8, 10, 95]. Very recently, results going beyond the quasi-free approximation were obtained: the excitation spectrum for dilute Bose gases was derived [7, 9]; the Lee–Huang–Yang formula for the second order of the ground state energy was proven [57]; and nonlinear classical Gibbs measures were derived as an approximation at positive temperature [55, 84].
Compared to the development in the theory of bosonic systems, the mathematical progress in the derivation of effective theories for fermionic systems has been lagging behind. For fermions, the mean-field or quasi-free theory leads to the Hartree–Fock approximationFootnote 1 which is widely used in computational physics and chemistry. The validity of the Hartree–Fock approximation was established for the ground state energy of Coulomb systems in a number of seminal works [5, 56, 63]. Rigorous results taking this analysis beyond the quasi-free effective theory have been notably absent, except for a second-order bound [74] on the many-body correction (called correlation energy) to the ground state energy, inspired by [65, 70]. In the present paper we derive an optimal formula for the correlation energy.
Our proof is based on a non-perturbative framework which we started to develop in [23]. The central concept of our approach is that the dominant degrees of freedom are particle-hole pairs which are delocalized over patches on the Fermi surface in momentum space in such a way that they behave approximately as quasi-free bosons. In [23], by means of a trial state, we proved that the formula known as the random phase approximation (RPA) in physics is an upper bound to the correlation energy of a three-dimensional Fermi gas in the mean-field scaling regime (i. e., high density and weak interaction) with a regular interaction potential. In the present paper, we again start from the interacting many-body Hamiltonian and prove the matching lower bound, thus completely validating the random-phase approximation for the ground state energy of the three-dimensional Fermi gas in the mean-field scaling regime.
The problem of calculating corrections to the Hartree–Fock approximation has a long history in theoretical physics. Already in the early days of quantum mechanics the computation of the correlation energy was attempted using second order perturbation theory [69, 104] for a Fermi gas with Coulomb interaction (the electron gas); however, this approach leads to a logarithmically divergent expression due to the long range of the Coulomb potential. It was then noticed [90] that perturbation theory with Coulomb potential becomes even more divergent at higher orders and suggested that a resummation might cure this problem. Then in their seminal work [26], Bohm and Pines developed the RPA: they argued that the Hamiltonian can be partially transformed into normal coordinates which describe collective oscillations screening the long-range of the Coulomb potential, and thus leading to a better behaved perturbative expansion. However, they had to introduce additional bosonic collective degrees of freedom by hand. This was somewhat clarified by [98, 99], who showed that the collective modes can be understood as a superposition of particle-hole pair excitations. The formulation of the RPA due to Sawada et al. has in fact been an important inspiration for our work. Ultimately it was discovered that the RPA can be seen as a systematic partial resummation of perturbation theory; following this line, one even obtains a more precise result [59]. These works have been very influential in the establishment of theoretical condensed matter physics.
The particle-hole pair bosonization of Sawada et al. found application in many settings, for example to describe nuclear rotation and calculate moments of inertia of atomic nuclei [4, 94]. A bosonization method considering only the radial excitations of the Fermi surface was developed by [89]; similar methods applied to systems with square Fermi surface [58, 102]. Later, the bosonization of collective excitations of the Fermi surface became an important tool in the context of renormalization group methods [16, 34, 68, 72, 73]. The collective aspect was further emphasized in the operator-formalism by [33, 35]. In the functional-integral formalism [48, 49, 76,77,78, 80] bosonization was established as a Hubbard–Stratonovich transformation. Despite this popularity, difficulties in judging the quality of the bosonic approximation have been pointed out [79]: “For example, scattering processes that transfer momentum between different boxes on the Fermi surface and non-linear terms in the energy dispersion definitely give rise to corrections to the free-boson approximation for the Hamiltonian. The problem of calculating these corrections within the conventional operator approach seems to be very difficult.” As far as the mean-field scaling regime is concerned, with our result we quantify such corrections as being of subleading order.
A different mathematical approach to the fermionic many-body problem has been developed by employing rigorous renormalization group methods to construct convergent perturbative expansions. This allowed the construction of Gibbs states or ground states for two main classes of interacting fermionic models.
The first class concerns models in the Luttinger liquid universality class (which was first proposed by Haldane [66, 67]), such as interacting fermions or quantum spin chains in one dimension and some two-dimensional models. These models show universal properties agreeing with those of the Tomonaga–Luttinger model which is solvable in one dimension by an exact bosonization method [92]. These predictions of bosonization have been verified rigorously, starting from [16, 17] to [1, 14, 15, 20,21,22, 62, 93]; the proofs however are by detailed analysis of the fermionic theory instead of justifying directly the bosonization. One justification of a bosonization method was achieved by [6], showing equivalence of the massless sine-Gordon model for a special choice of the coupling constant and the massive Thirring model at the free fermion point.
The second class concerns fermions in two or three dimensions at low temperature. In this context, the use of sectors on the Fermi surface, very similar to the construction of patches we use, has been introduced in [53] for the program of proving existence of superconductivity [54]. There, bosonization was implemented as a Hubbard–Stratonovich transformation of sectorized collective excitations. While this ambitious program has not been completed, the sector method was later used to prove Fermi liquid behavior of fermions in two dimensions with uniformly convex Fermi surface at exponentially small positive temperatures (and non-Fermi liquid behavior for fermions with flat Fermi surface) [2, 3, 18, 44, 45, 96]. It furthermore lead to a proof of convergence for the zero-temperature perturbation theory in a special two-dimensional fermionic model with an asymmetry condition of the Fermi surface; this is a series of eleven papers an overview of which is given in [52]. Partial results have been obtained for fermions in three dimensions at positive temperature [43]. We see our approach, while sharing the ‘sectorization’ or ‘patches’ concept, as providing a complementary point of view on related physical problems, based on different, non-perturbative ideas.
Finally, our result should also be contrasted to the study of two-dimensional models that have been constructed to be exactly bosonizable. This goes back to a proposal of [91], who was motivated by high-temperature superconductivity. The analysis and variants of the model were developed by [39,40,41,42, 81, 82]. Furthermore, one may also see similarities (such as the limitation of the number of bosons that can occupy a single bosonic mode) in the bosonization concept to methods such as the Holstein–Primakoff map [12, 36, 37] for spin systems.
1.1 Many-body Hamiltonian in the mean-field regime
To describe N spinless fermionic particles on the torus \({\mathbb {T}}^3 := {\mathbb {R}}^3/(2\pi {\mathbb {Z}}^3)\), the Hilbert space is the space of totally antisymmetric \(L^2\)-functions of N variables in \({\mathbb {T}}^3\),
The Hamiltonian is defined as the sum of Laplacians describing the kinetic energyFootnote 2 and a pair interaction, i. e., a multiplication operator defined using a function \(V:{\mathbb {R}}^3 \rightarrow {\mathbb {R}}\),
The positive parameters \(\hbar \) and \(\lambda \) adjust the strength of the kinetic energy and interaction operator, respectively.
In this paper, we assume the interaction potential V to be smooth. Thus the Hamiltonian is bounded from below and its self-adjointness follows from the Kato–Rellich theorem or using the Friedrichs extension. Here we are interested in the infimum of the spectrum (the ground state energy)
In full generality, the computation of \(E_N\) is clearly out of reach, simply because the model is too general: it may describe physical systems from superconductors to neutron stars. We thus need to be more specific and consider a particular case of the model, the most accessible case being a mean-field scaling regime: by considering a high density of particles we expect the leading order of the theory to be approximately described by an effective one-particle theory. We thus consider the limit of large particle number on the fixed-size torus. However, kinetic energy and interaction energy in typical states scale differently: the kinetic energy like \(N^{5/3}\) due to the Pauli exclusion principle, the interaction energy like \(N^2\) since there are \(N(N-1)/2\) interacting pairs. To have a chance of obtaining a non-trivial limit we choose to scale the parameters byFootnote 3
With this choice the kinetic energy and the interaction energy in typical states close to the ground state have the same order of magnitude (order N). This scaling regime couples a semiclassical scaling (\(\hbar = N^{-\frac{1}{3}}\rightarrow 0\)) and a mean-field scaling (coupling constant \(\lambda =N^{-1}\)).
If the interaction vanishes, \(V=0\), then the ground state of the system is exactly given by the Slater determinant (i. e., antisymmetrized tensor product) of plane waves
Here the momenta k of the plane waves are chosen such that the expectation value of the kinetic energy operator is minimized. The set of the corresponding momenta \(B_\text{ F }\) is called the Fermi ball. For simplicity we assume that the ball is completely filled, namely we set
and then define the particle number accordingly as \(N := |B_\text{ F }|\). The limit of large particle number is then realized by considering \(k_\text{ F }\rightarrow \infty \). According to Gauss’ classic counting argument we have
If the system is interacting, \(V\not = 0\), the ground state becomes a complicated superposition of Slater determinants. Nevertheless, in Hartree–Fock theory one minimizes only over the set of all Slater determinants. In our setting, the Hartree–Fock energy
is attained by the plane waves as in (1.5) and (1.6); see Appendix A for a proofFootnote 4. Thus in order to gain non-trivial information about the interacting system one must go beyond the Hartree–Fock theory.
Note that by the variational principle, the Hartree–Fock energy \(E_N^\text{ HF }\) is an upper bound to the ground state energy \(E_N\). It follows from the analysis of [5, 63] that Hartree–Fock theory also provides a good lower bound to the ground state energy. In our setting, the approach of [5, 63] shows that
In particular, both \(E_N\) and \(E_N^\text{ HF }\) contain the Thomas–Fermi energy (in our scaling of order N) and the Dirac correction, also know as the exchange term (in our scaling of order 1).
From the physical point of view, Slater determinants are as uncorrelated as fermionic states (which have to satisfy the Pauli principle) can be, in the sense that they are just antisymmetrized tensor products. Due to the presence of the interaction, the true ground state will contain non-trivial correlations (i. e., it will be a superposition of Slater determinants). Therefore Wigner [104] called the difference
the correlation energy. According to (1.7) we know that the correlation energy in our scaling is of size o(1) as \(N \rightarrow \infty \). In the present paper, we are going to determine the leading order of the correlation energy. It is of order \(\hbar = N^{-\frac{1}{3}}\) and given by the explicit formula predicted by the random-phase approximation, as obtained by [59, 90] based on a partial resummation of the perturbation series. We believe that our result is of importance as a rigorous step beyond mean-field theory into the world of interacting quantum systems. Our proof shows that the leading order of the correlation energy can be understood as the ground state energy of an effective quadratic Hamiltonian describing approximately bosonic collective excitations.
1.2 Main result
We write the interaction potential via its Fourier coefficients
Theorem 1.1
(Main Result) There exists a \(v_0 > 0\) such that the following holds true. Assume that \(\hat{V}: {\mathbb {Z}}^3 \rightarrow {\mathbb {R}}\) is compactly supported, non-negative, satisfies \({\hat{V}}(k)={\hat{V}}(-k)\) for all \(k\in {\mathbb {Z}}^3\), and \(\Vert {\hat{V}}\Vert _{\ell ^1}< v_0\). For every \(k_\text{ F }>0\) let the particle number be \(N := |\{ k\in {\mathbb {Z}}^3: |k|\le k_\text{ F }\}|\). Then as \(k_\text{ F }\rightarrow \infty \), the ground state energy of the Hamiltonian \(H_N\) in (1.2) with \(\hbar =N^{-1/3}\) and \(\lambda =N^{-1}\) is
Here the correlation energy \(E_N^\text{ RPA }\) is of order \(\hbar \) and, with \(\kappa _0 = \left( \frac{3}{4\pi }\right) ^{\frac{1}{3}}\), given by
The upper bound, \(E_N \le E_N^\text{ HF } + E_N^\text{ RPA } + {\mathcal {O}}(\hbar ^{1+\frac{1}{9}})\), was proved in [23], even without smallness condition on the potential. In the present paper we prove the lower bound. The smallness condition is technical, and we expect that the lower bound is also true without this condition.
As already explained in [23], by expanding (1.9) for small \({{\hat{V}}}\), we obtain
Thus we recover the result for the weak-coupling limit of [74]. Moreover, the leading order of the correlation energy of the jellium model as given by Gell-Mann and Brueckner [59, Eq. (19)] (see also [99, Eq. (37)] and [90]) when applied to the case of bounded compactly supported \({\hat{V}}\) agrees with (1.9).
Although some tools from the earlier papers [23, 74] will be useful for us, the proof of Theorem 1.1 requires several important new ingredients. Conceptually, our justification of the random phase approximation is based on the main input that at the energy scale of the correlation energy there are rather few excitations around the Fermi ball. For the upper bound in [23], we consider a trial state whose number of excited particles is of order 1, allowing to control most of error terms easily. However, for the lower bound, the best available estimate for the number of excited particles in a ground state is \({\mathcal {O}}(N^{\frac{1}{3}})\), thanks to a kinetic inequality from [74]. This weaker input breaks most of the error estimates in the upper bound analysis [23], and this is also the reason why a less precise lower bound was obtained in [74]. In fact, using only similar bounds to [23], we can at best show that the error terms are of the same order as the correlation energy. In the present paper, we go beyond that and complete the bosonization approach for the first time.
Let us quickly mention the most important new ingredients of the proof; a more detailed explanation will be given in Sect. 1.3.
-
A refined estimate for the number of bosonic particles. In [23], after removing the Fermi ball by a particle-hole transformation, we control the number of bosonic particles by the fermionic number operator \({\mathcal {N}}\). This is insufficient here, since the bound \(\langle {\mathcal {N}}\rangle \le CN^{\frac{1}{3}}\) mentioned above is too weak. It is natural to try to bound all error terms using the kinetic operator \({\mathbb {H}}_0\), but a serious problem is that \({\mathbb {H}}_0\) is not stable under the Bogoliubov transformation introduced later. Instead, we introduce the gapped number operator \({\mathcal {N}}_\delta \) in (5.6), which takes into account only the fermionic particles far from the Fermi surface and has a much better bound \(\langle {\mathcal {N}}_\delta \rangle \le CN^\delta \) with \(\delta >0\) small. Thus in practice, using \({\mathcal {N}}_\delta \) is as good as using the kinetic operator \({\mathbb {H}}_0\) in many estimates, with the advantage that \({\mathcal {N}}_\delta \) is stable under the Bogoliubov transformation (see Lemma 7.2). Since \({\mathcal {N}}_\delta \) involves the fermionic particles far from the Fermi surface, we have to control separately the contribution from particles close to the Fermi surface, using an improvement of the kinetic inequality in [74] (see Lemma 4.2). The latter issue does not appear in [23] since for an upper bound we can simply take a trial state without any contribution from particles close to the Fermi surface.
-
A refined linearization of the kinetic energy. Similarly to [23], the bosonization approach in the present paper is based on the construction of patches, which allows to linearize the fermionic kinetic operator \({\mathbb {H}}_0\) and relates it to a bosonic operator \({\mathbb {D}}_\text{ B }\). In [23], we prove that the expectation value of \({\mathbb {H}}_0-{\mathbb {D}}_\text{ B }\) against a well-chosen trial state is small, which requires that the number of patches is \(M\gg N^{\frac{1}{3}}\). In the present paper, we only control the commutator of \({\mathbb {H}}_0-{\mathbb {D}}_\text{ B }\) with bosonic pairs operators (see Lemma 8.2). This weaker bound is sufficient to ensure that \({\mathbb {H}}_0-{\mathbb {D}}_\text{ B }\) is essentially invariant under the Bogoliubov transformation (see Lemma 8.1), and importantly it requires only \(M\gg N^{2\delta }\) with \(\delta >0\) small. The possibility of taking a much smaller M is crucial to bound all error terms caused by the Bogoliubov transformation.
-
A refined control on the Bogoliubov kernel. Similarly to [23], we will diagonalize the bosonizable part of the Hamiltonian by a Bogoliubov transformation. In [23] we prove that the kernel of the Bogoliubov transformation is bounded uniformly in the Hilbert–Schmidt topology. This information is sufficient to estimate the error terms when \(\langle {\mathcal {N}}\rangle \sim 1\) (as in the trial state used for the upper bound), but it is insufficient now that there are potentially many excitations. In the present paper, we will derive an optimal bound for the matrix elements of the Bogoliubov kernel (see Lemma 6.1). The new estimate encodes that due to the geometry of the Fermi surface, the interaction energy vanishes at the same rate as the kinetic gap closes. This bound is crucial for improving error estimates involving the Bogoliubov transformation (see Lemma 7.1), especially for controlling the non-bosonizable terms.
-
A subtle analysis of the non-bosonizable terms. As explained in [23], the contribution of the non-bosonizable terms can be controlled by \(N^{-1}\langle {\mathcal {N}}^{2}\rangle \). The trial state in [23] satisfies \(\langle {\mathcal {N}}^{2}\rangle \sim 1\), and hence the non-bosonizable terms are much smaller than the correlation energy. In the present paper, we only know that \(\langle {\mathcal {N}}\rangle \le CN^{\frac{1}{3}}\), which is not enough to rule out the possibility that the non-bosonizable terms are comparable to the correlation energy. It turns out that controlling the non-bosonizable terms is highly nontrivial since these terms couple the bosonic degrees of freedom with the uncontrolled low-energy fermions. Our idea is to bound these terms from below by the kinetic operator. Technically, it is easy to establish the lower bound \(-C \Vert {{\hat{V}}}\Vert _{\ell ^1} {\mathbb {H}}_0\) by completing a square. However, the difficulty here is that we have to validate this bound after implementing the Bogoliubov transformation (see Lemma 9.1). Handling the non-bosonizable terms requires a subtle analysis, using the refined estimate on the Bogoliubov kernel and the smallness assumption on the interaction potential.
-
Analysis of the diagonalized effective Hamiltonian. After implementing the Bogoliubov transformation, we obtain the desired correlation energy plus \({\mathbb {H}}_0-{\mathbb {D}}_\text{ B } + {\mathbb {K}}\) where \({\mathbb {K}} = \sum _{k\in \Gamma ^{\text{ nor }}} \sum _{\alpha ,\beta \in {\mathcal {I}}_{k}} 2\hbar \kappa |k|{\mathfrak {K}}(k)_{\alpha ,\beta } c_\alpha ^*(k) c_\beta (k)\) is the diagonalized effective Hamiltonian. Here \({\mathbb {H}}_0-{\mathbb {D}}_\text{ B }\) remained since it is essentially invariant under the Bogoliubov transformation. For the upper bound in [23], the term \({\mathbb {K}}\) does not cause any problem since its expectation value in the vacuum state is 0. In the present paper, however, we have to bound it from below as an operator (see (10.16)). This task is nontrivial and we have to use again the refined estimate on the Bogoliubov kernel and the smallness assumption on the innteraction potential.
In summary, in the present paper we provide a complete and unified bosonization approach which can handle the states with a lot of low-energy excitations. We believe that our approach is of general interest and could be useful in other contexts.
We also see our result as a possible starting point for further investigations. For example, our bosonization method is general enough to derive a norm approximation on the many-body dynamics [24]. Many questions remain; given the historical context of the problem, maybe most importantly the extension to Coulomb interaction, i. e., the electron gas, at least in some coupled mean-field/large-volume limit, requiring to optimize our bounds for extensivity. Of course, to reach this goal, we would first need to remove the small-potential condition, which at the moment plays a central role. The next key task is to deal with the divergence at small k which appears in the higher orders of perturbation theory. As the small-k singularity is improved to a logarithmic singularity in (1.9), we believe that the bosonization method contains intrinsically the necessary “resummation” that is responsible for this screening of the potential. Of course, hard technical refinements, e. g., in optimizing the k-dependence of our estimates will be necessary. Another question concerns the low-energy spectrum of the Hamiltonian: it is believed that a collective plasmon mode can be isolated from the bosonized excitation spectrum, realizing a theory of electrons dressed by a cloud of excitations and supporting the screening concept. Within the bosonic approximation, the emergence of the plasmon mode has been discussed in [13]. We expect that through a detailed analysis of the spectrum, the screening of the Coulomb potential, and the properties of the approximate ground state, the bosonization method may support the future development of a rigorous, non-perturbative Fermi liquid theory.
Beyond the mean-field scaling regime and the electron gas, there are other systems of physical interest: for example the helium isotope \({}^3\text{ He }\) is fermionic and has short-range isotropic interactions. Furthermore, a high-density limit is particularly important in the description of atomic nuclei; the short-range interactions there are however spin- and isospin-dependent and anisotropic and furthermore have attractive parts. We conjecture that even with attractive potentials the RPA formula for the correlation energy applies as long as the logarithm in \(E^\text{ RPA}_N\) does not become ill-defined. In our scaling, we do not see any contribution from the pairing density related to superconductivity, but one may expect that even if it was non-vanishing, its effect on the energy may be exponentially small. One may speculate that in an appropriate scaling limit the state of a superconductor might be described using a product of a particle-hole pair Bogoliubov transformation as we construct it for the normal phase, times a BCS-type fermionic Bogoliubov transformation.
1.3 Sketch of the proof
We will use the Fock space formalism. Recall the fermionic Fock space
The vector
is called the vacuum. For \(\psi = (\psi ^{(0)}, \psi ^{(1)}, \psi ^{(2)}, \ldots ) \in {\mathcal {F}}\) and \(f\in L^2({\mathbb {T}}^3)\) we define the creation operators \(a^*(f)\) and the annihilation operators a(f) by their actions
Since we will work in the discrete momentum space (Fourier space) \({\mathbb {Z}}^3\), it is convenient to write
These operators satisfy the canonical anticommutator relations (CAR)
The Hamiltonian \(H_N\) in (1.2), originally defined on the N-particle sector \(L^2_\text{ a }(({\mathbb {T}}^{3})^N) \subset {\mathcal {F}}\), can be lifted to an operator on the fermionic Fock space as
Restricted to \(L^2_\text{ a }(({\mathbb {T}}^{3})^N) \subset {\mathcal {F}}\), \({\mathcal {H}}_N\) agrees with the Hamiltonian as given in (1.2).
Correlation Hamiltonian. Now we separate the degrees of freedom described by the Slater determinant of plane waves in (1.5) from non-trivial quantum correlations. Recall the Fermi ball and its complement
We define the particle-hole transformation \(R: {\mathcal {F}}\rightarrow {\mathcal {F}}\) by
This map is well-defined since vectors of the form \(\prod _{j}a^*_{k_j} \Omega \) constitute a basis of \({\mathcal {F}}\). Moreover, it is easy to verify that \(R = R^* = R^{-1}\); in particular R is a unitary transformation. (In fact, R is an example of a fermionic Bogoliubov transformation.)
In practice, the action of R on an operator on Fock space is easily computed using the rules (1.14) and the CAR (1.12). For example, consider the particle number operator
For \(\psi = (\psi ^{(0)}, \psi ^{(1)}, \ldots ) \in {\mathcal {F}}\) we have \({\mathcal {N}}\psi = (0, \psi ^{(1)}, 2\psi ^{(2)}, 3 \psi ^{(3)}, \ldots )\); in particular \({\mathcal {N}}\psi = N \psi \) is equivalent to the vector belonging to the N-particle sector of Fock space, \(\psi \in L^2_\text{ a }(({\mathbb {T}}^{3})^N) \subset {\mathcal {F}}\). Now
This identity implies that if \(R\psi \) is a N-particle state, then
namely after the transformation R the number of particles is equal to the number of holes.
The transformed Hamiltonian \(R^* {\mathcal {H}}_N R \) has been computed in [27,28,29,30, 32], in a slightly different way for mixed states in [19], and in the context of the correlation energy in [23, 74]. Let us therefore just give a short sketch of the transformation of the interaction term; the transformation of the kinetic term uses (1.16) but is otherwise very similar to (1.15). We start by using the CAR once to write
where we introduced
The second summand of (1.17) equals \(-\frac{1}{2}\sum _{k\in {\mathbb {Z}}^3}{\hat{V}}(k)\), which contributes to the Hartree–Fock energy. For the transformation of the first summand one computes
where we have introduced for any \(k\in {\mathbb {Z}}^3\) the particle-hole pair creation operatorFootnote 5
and the non-bosonizable operator
Note that \({\mathfrak {D}}(k)^* = {\mathfrak {D}}(-k)\) and \({\mathfrak {D}}(0)^* = {\mathcal {N}}^\text{ p } - {\mathcal {N}}^\text{ h }\). Observing that the constant terms (i. e., not containing any creation or annihilation operator) contribute to the Hartree–Fock energy \(E^\text{ HF}_N\) and collecting all quadratic terms in the operator \({\mathbb {X}}\), we arrive at the result
where the summands are given by
Note that we have introduced the set \(\Gamma ^{\text{ nor }}\) of all momenta \(k=(k_1,k_2,k_3)\) in \({\mathbb {Z}}^3 \cap {\text {supp}}{{\hat{V}}}\) satisfying
This set is chosen such that
The term \(Q_\text{ B }\) is the bosonizable part of the interaction and contains only the pair operators. The term \({\mathcal {E}}_1\) is purely non-bosonizable and \({\mathcal {E}}_2\) couples bosonizable and non-bosonizable excitations. Note that unlike the other terms \({\mathcal {E}}_1\) is not normal-ordered (this choice is made so that we have \({\mathcal {E}}_1 \ge 0\)); for this reason \({\mathbb {X}}\) and \({\mathcal {E}}_1\) differ slightly from the expressions given in [23].
Since \({\mathbb {X}}\) is quadratic in fermionic operators, it can be easily bounded using \({\mathcal {N}}/N\), which will be seen to have expectation value much smaller than the order \(\hbar \) of \(E_N^\text{ RPA }\).
In [23], it was proved that \({\mathbb {H}}_0+Q_\text{ B }\) evaluated in a trial state of quasi-free particle-hole pairs gives rise to \(E_N^\text{ RPA }\) as an upper bound to the correlation energy. Accordingly, an important part of our task will be to prove that the contribution from \({\mathcal {E}}_1+{\mathcal {E}}_2\) is negligible. (Whereas this was easily achieved for the upper bound using the explicit form of the trial state, for the lower bound it actually turns out to be a major challenge.)
The rest of the paper is devoted to the proof of the inequality
Thanks to (1.20) it directly implies the main result, the lower bound in Theorem 1.1.
In the following we explain the key estimates in our proof. We use the symbol C for positive constants that may change from line to line, but are independent of N, \(\hbar \), and M (the number of patches, to be introduced in (1.32)). The constants C may depend on the momentum k, which does not play a role ultimately since we only consider the finitely many \(k \in {\text {supp}}{\hat{V}}\), i. e., we can always take the maximum and so treat all constants as independent of k. We generally absorb any dependence on \({\hat{V}}\) in the constants C; we only write the \({\hat{V}}\)-dependence of estimates explicitly where the smallness condition on \(\Vert {\hat{V}}\Vert _{\ell ^1}\) plays a role.
A priori estimates. Similarly to [23, 74], many approximations used in our approach are based on the idea that the relevant quantum states have only few excitations. For the upper bound in [23], this fact is easily justified by the strong bound \(\langle \Psi _\text{ trial }, {\mathcal {N}}^m \Psi _\text{ trial }\rangle \le C_m\) (for all \(m \in {\mathbb {N}}\)) for the trial state used to compute the expectation value of \({\mathcal {H}}_\text{ corr }\). Compared to that bound, for the ground state we can only derive weaker estimates. In Lemma 2.4 we prove that the particle number operator can be controlled by the kinetic energy (i. e., the kinetic energy operator has a tiny gap, of order \(\hbar ^2\)) by
To avoid the particle number operator, where possible we bound pair operators directly by the kinetic energy, using an inequality from [74],
(The idea of directly using the kinetic energy for bounds has appeared already in [65, 70] in the context of rigorous second order perturbation theory.) The bounds (1.24) and (1.23) imply the rough estimates in Lemma 2.1, as in [74]:
Together with an upper bound of order \(\hbar \) such as the trivial variational one obtained using the trial state \(\Omega \) (corresponding to the Slater determinant of plane waves before the particle-hole transformation), this implies that the ground state \(\psi _\text{ gs }\) of \({\mathcal {H}}_\text{ corr }\), the minimizer of the expectation value on the left hand side of (1.22), satisfies
For technical reasons, we will also need to control the expectation of higher powers of \({\mathcal {N}}\), which does not follow from (1.24) and (1.23). To overcome this difficulty, in Lemma 3.1 we replace the ground state \(\psi _\text{ gs }\) by an approximate ground state \(\Psi \) satisfying
while its energy is still close to the ground state energy, i. e.,
This is achieved by using the technique of localizing particle number on Fock space, which goes back to Lieb and Solovej [87]. In the proof we will use the formulation from [86, Proposition 6.1]. It is the state \(\Psi \) that most of our subsequent analysis will be applied to.
Approximately bosonic creation operators. When applied to states with few excitations, the pair creation operators behave approximately as bosonic creation operators, namely we have to leading order the canonical commutator relations (CCR)
Unfortunately there is no expression for the kinetic energy \({\mathbb {H}}_0\) in terms of the \(b^\natural (k)\)-operatorsFootnote 6. We take inspiration from the solution of the Luttinger model [92]: if the dispersion relation were linear, the \(b^*(k)\) would create eigenvectors of \({\mathbb {H}}_0\). Since the dispersion relation \(\hbar ^2 |k|^2\) is not linear, we will linearize it locally. This is achieved by localizing the creation operators to patches on the Fermi surface. More precisely, we cut the shell of width \(R_{{\hat{V}}} := {\text {diam}}{\text {supp}}{\hat{V}}\) around the Fermi surface into patches \(\{B_\alpha \}_{\alpha =1}^M\). The construction of the patches is recalled in Sect. 4. As discussed in the introduction, under the name of “sectors”, this idea has already been employed in the rigorous renormalization group context.
We consider the pair excitations supported in each patchFootnote 7
To normalize the constant in the approximate CCR, the normalization constant \(m_\alpha (k)\) should be chosen such that \(\Vert b^*_\alpha (k)\Omega \Vert =1\), namely
This has the meaning of the number of particle-hole pairs \((p,h) \in B_\text{ F}^c\times B_\text{ F }\) inside the patch \(B_\alpha \) with relative momentum \(p-h=k\). However, this number may be zero! In fact, if \(k\cdot {\hat{\omega }}_\alpha <0\) with \({\hat{\omega }}_\alpha \) the unit vector pointing in the direction of the patch \(B_\alpha \), then a simple geometric consideration shows that the summation domain in (1.30) and (1.29) is empty (the condition \(k\cdot {\hat{\omega }}_\alpha <0\) is incompatible with \(p \in B_\text{ F}^c\) and \(p-k \in B_\text{ F }\)). The same problem occurs for \(m_\alpha ^2(-k)=0\) if \(k\cdot {\hat{\omega }}_\alpha >0\).
Furthermore, as suggested by [97, Chapters 8, 9.2.3, and 9.2.4] and [38], bosonization is expected to be a good approximation only if \(m_\alpha (k)\) is large. This cannot be ensured for patches where \(k \cdot {\hat{\omega }}_\alpha \approx 0\) (if we think of the direction of k as defining the north pole of the Fermi ball, these are the patches near the equator). However, the momentum k of such excitations is almost tangential to the Fermi surface and thus their energy is very low. In fact, we will be able to show that their contribution to the ground state energy is small and exclude them from the bosonization. To do so, we introduce a cut-off near the equator by defining the index subset \({\mathcal {I}}_{k}= {\mathcal {I}}_{k}^{+}\cup {\mathcal {I}}_{k}^{-}\) where
We will choose the cut-off parameter \(\delta \) and the number of the patches M such that
(Eventually we will choose \(M = N^{4\delta }\) and \(\delta =\frac{1}{24}\).) Note that unlike [23] where we require \(M\gg N^{\frac{1}{3}}\), here we allow a much smaller value of M, which is important to control the error terms due to the Bogoliubov transformation introduced later.
Then by [23, Proposition 3.1], the constant
can be computed to be given by
(Heuristically, the reader may think of the number of particle-hole pairs as given by the surface area of the patch, \(4\pi k_\text{ F}^2/M\), times the depth inside the Fermi ball that can be reached by h, namely \(|k \cdot {{\hat{\omega }}}_\alpha |\). For this counting argument to be justifiable, the diameter of a patch on the Fermi surface may not become too large, requiring \(M \gg N^{2\delta }\).) Consequently, the operators
are well-defined and behave like bosonic creation operators, namely
This is proven in Lemma 5.2, which is a slight extension of [23, Lemma 4.1].
Gapped Number Operator. As we have seen in (1.26) we do not have strong control on the particle number operator, due to the possibility of having many small-energy excitations near the Fermi surface; a problem which in the beginning is avoided by directly using \({\mathbb {H}}_0\) for bounds. However, a serious problem of using \({\mathbb {H}}_0\) is that it is not stable under the Bogoliubov transformation that we will later introduce to approximately diagonalize the effective Hamiltonian. A way of overcoming this problem, and a key improvement compared to [23] is that instead of using the full fermionic number operator \({\mathcal {N}}\) to control error terms, wherever possible we use only the gapped number operator
which does not count low-energy excitations. Here we have used the dispersion relation \(e(i)=|\hbar ^2 |i|^2 - \kappa ^2 |\) introduced in (1.21), and due to the artificial gap we obtain
Therefore, (1.27) implies that \(\langle \Psi , {\mathcal {N}}_\delta \Psi \rangle \le CN^\delta \) which is much better than \(\langle \Psi , {\mathcal {N}}\Psi \rangle \le CN^{\frac{1}{3}}\) in (1.26). Thus in practice, controlling error terms by using \({\mathcal {N}}_\delta \) is as good as using the kinetic operator \({\mathbb {H}}_0\). Furthermore, unlike \({\mathbb {H}}_0\), the gapped number operator \({\mathcal {N}}_\delta \) is stable under the Bogoliubov transformation (see Lemma 7.2).
The main instance where \({\mathcal {N}}_\delta \) finds use is Lemma 5.3, where we bound the approximately bosonic number operator by the fermionic gapped number operator,
This improves [23, Lemma 4.2], where \({\mathcal {N}}\) was used as the bound. The key insight leading to this improvement is that only bosonic pair operators with \(\alpha \in {\mathcal {I}}_{k}\) are needed in the effective Hamiltonian (1.48) and the diagonalizing Bogoliubov transformation (1.49) to obtain the RPA energy (1.53). Since \(\alpha \in {\mathcal {I}}_{k}\) means \(|k\cdot {\hat{\omega }}_\alpha |\ge N^{-\delta }\), the relative momentum k between particles p and holes \(h = p-k\) cannot be tangential to the Fermi surface; i. e., p or h (or both) has to lie above the gap \(e(i) \ge \frac{1}{4}N^{-\frac{1}{3}-\delta }\). This is the reason for the same parameter \(\delta > 0\) appearing both in the gapped number operator and in the equator cut-off (1.31). The new bound allows us to work with the bosonic pairs at the energy scale relevant for the result, while keeping them as much as possible separate from the low-energy excitations on whose number we do not have strong control.
In the next steps, we will write the correlation Hamiltonian \({\mathcal {H}}_\text{ corr }\) as a quadratic Hamiltonian in terms of the approximately bosonic operators \(c_\alpha ^*(k)\) and \(c_\alpha (k)\).
Bosonization of the interaction energy. By decomposing
we can write the main interaction term as
In the approximation (1.39) we have ignored all excitations outside the patches. It is justified in Lemma 4.1, where we prove that
where \(Q_\text{ B}^{{\mathcal {R}}}+{\mathcal {E}}_2^{{\mathcal {R}}}\) is similar to \(Q_\text{ B }+ {\mathcal {E}}_2\) but contains only pair excitations in the patches. The proof of (1.40) requires an improved version of the kinetic inequality (1.24) (see Lemma 4.2). Thanks to (1.27), the error term in (1.40) does not contribute to the leading order of the correlation energy.
Note that the bound (1.40) is not necessary for the upper bound in [23] because the trial state there is constructed to contain only pair excitations inside the patches, so that the expectation value of a pair not belonging completely to relevant patches is identically zero.
Bosonization of the kinetic energy. The bosonization of the fermionic kinetic energy is more complicated. A key observation is that if \(\alpha \in {\mathcal {I}}_{k}^{+}\), then using the CAR (1.12) and linearizing the dispersion relation around \(k_\text{ F }{\hat{\omega }}_\alpha \), we find
For linearizing the dispersion relation we used the fact that for any \(p\in B_\text{ F}^c\cap (B_\text{ F }+k)\cap B_\alpha \), since \({\text {diam}}(B_\alpha )\ll k_\text{ F }/\sqrt{M}\) we have
Obviously the same holds if \(\alpha \in {\mathcal {I}}_{k}^{-}\). Therefore, within commutators with pair operators, \({\mathbb {H}}_0\) can be approximated as in the Luttinger model [92] by independent modes (i. e., harmonic oscillators) of energies \(\hbar \kappa 2 k\cdot {\hat{\omega }}_\alpha \), namely
A key idea of our analysis is to justify (1.43) not by estimating the difference \({\mathbb {H}}_0-{\mathbb {D}}_\text{ B }\) directly, but rather by proving that it is essentially invariant under the approximate Bogoliubov transformation T which we will introduce below to diagonalize the quadratic bosonized Hamiltonian. More precisely, in Lemma 8.1 we show that with \(\psi := T^* \Psi \) we have
where
Note that only here, in the first error summand, due to the linearization of \({\mathbb {H}}_0\), does M enter in the denominator. With \(\langle \psi , {\mathcal {N}}_\delta \psi \rangle \le C N^\delta \) (this bound is stable under the Bogoliubov transformation), we need to take \(M\gg N^{2\delta }\). We will eventually choose \(M=N^{4\delta }\).
The bound (1.44) is a crucial improvement over the linearization technique in [23] which requires \(M\gg N^{\frac{1}{3}}\), a condition that we cannot fulfill due to the second error summand in (1.45) (recall that in our approximate ground state we only know \({\mathcal {N}}\le C N^{\frac{1}{3}}\)). This improvement is achieved because in [23] we unnecessarily linearized the expectation value of \({\mathbb {H}}_0\), whereas in the present paper we only linearize the necessary commutator with a pair operator \(c^*_\alpha (k)\). In general, this new possibility of choosing a rather small M means that we gain flexibility in the technical steps because we can afford arbitrarily high powers of M as long as there is a negative power of N.
To apply (1.44), prior to using the Bogoliubov transformation, we will decompose
Diagonalization of the bosonized Hamiltonian. By combining the approximation (1.39) and the operator \(+{\mathbb {D}}_B\) from (1.46), we find the effective quadratic bosonic Hamiltonian
with
We have arrived at an effective quadratic Hamiltonian in terms of the approximately bosonic creation and annihilation operators. If the effective Hamiltonian were exactly bosonic, it could be diagonalized by a Bogoliubov transformation [25]. While we do not have this tool available since our operators are not exactly bosonic, we can still use the explicit formula as for a true Bogoliubov transformation and define the unitary map
where the real symmetric matrices K(k) are computed as in the exactly bosonic case. The choice of K(k) is the same as in [23], following the abstract formulation given in [64]. We will quickly recall it in Sect. 6.
Another key aspect of our proof is the observation that the Bogoliubov kernel K(k) satisfies a refined entry-wise bound,
This is proved in Lemma 6.1. An important role in the proof is played by the fact that due to the geometry of the Fermi surface the normalization factor \(n_\alpha (k)^2\) is proportional to \(|k\cdot {\hat{\omega }}_\alpha |\) (see (1.33)) which is also the linearization of the dispersion relation (see (1.43)), leading to cancellations. This means that as the gap of the kinetic energy closes when we consider particle-hole pairs that are almost tangential to the Fermi surface, the energy gain due to the interaction of such an excitation vanishes at the same rate. While the proof is essentially a detailed computation, it is crucial in controlling the non-bosonizable terms \({\mathcal {E}}_2\), see (9.5).
In Lemma 7.1, we show that T acts approximately as a bosonic Bogoliubov transformation, namely
where the error operators satisfy
This is an improvement of [23, Prop 4.4] in that we replaced some \({\mathcal {N}}\) by \({\mathcal {N}}_\delta \). In order to put the error estimate (1.52) in good use, we need also that the particle number operators be stable under the approximate Bogoliubov transformation; this is the content of Lemma 7.2, based on a refinement of the Grönwall argument in [23, 27].
To diagonalize the bosonizable part of the Hamiltonian, we insert (1.51) in \(T^*({\mathbb {D}}_0+Q_\text{ B}^{\mathcal {R}})T\) and write the transformed expression in Wick-normal order (with respect to the approximately bosonic operators). Up to a small error, this produces the ground state energy as desired,
Additionally we obtain the (up to a one-particle unitary) diagonalized quadratic Hamiltonian which in exact Bogoliubov theory would be the excitation spectrum; for some explicit matrix \({\mathfrak {K}}(k)_{\alpha ,\beta }\) it has the form
As mentioned before, a further new idea of our proof is that we sacrifice the positive contribution of the excitation spectrum to control the negative term \(-{\mathbb {D}}_\text{ B }\) left from the comparison of the fermionic and bosonic kinetic energy (1.44). In fact, we will prove that (see (10.16))
The proof of (1.54) is based on an explicit computation of the operator \({\mathfrak {K}}(k)\) and the nice property (1.50) of the Bogoliubov kernel.
When \(\Vert {{\hat{V}}}\Vert _{\ell ^1}\) is small, the error term \(-\Vert {{\hat{V}}}\Vert _{\ell ^1} {\mathbb {H}}_0\) in (1.54) is controlled by the positive term \({\mathbb {H}}_0\) left from the comparison of the fermionic and bosonic kinetic energy (1.44).
Controlling non-bosonizable parts of the Hamiltonian. We still have to show that the non-bosonizable terms \({\mathcal {E}}_1+{\mathcal {E}}_2^{{\mathcal {R}}}\) have only a small effect on the ground state energy. As explained in [23] these error terms can be easily controlled by \({\mathcal {N}}^2/N\). In the trial state in [23], the expectation value of \({\mathcal {N}}^2\) is of order 1, so that \({\mathcal {N}}^2/N\) is a small error. For the lower bound however, in the (approximate) ground state we only know that \({\mathcal {N}}\) is of order \({\mathcal {O}}(N^{\frac{1}{3}})\), so that \({\mathcal {N}}^2/N\) could be of the same order \(\hbar =N^{-\frac{1}{3}}\) as the correlation energy. Another way to see the difficulty in dealing with these terms is to observe that \({\mathcal {E}}_2^{\mathcal {R}}\) couples the “good” bosonic degrees of freedom with “bad” uncontrolled fermions near the Fermi surface (the latter were, by construction, absent in the trial state used for the upper bound).
Thus the non-bosonizable parts require a subtle analysis. The following argument relies on the fact that \({\mathcal {E}}_1\) is non-negative (as \({\hat{V}}(k) \ge 0\)), which helps us in obtaining a lower bound for \({\mathcal {E}}_2^{\mathcal {R}}\). By the Cauchy–Schwarz inequality and the kinetic energy estimate (1.24), it is easy to see that
Of course, this bound is useless because \({\mathbb {H}}_0\) is of the same order as \({\mathcal {H}}_\text{ corr }\). However, we are able to rescue this idea by proving a similar lower bound for the transformed operator \(T^*({\mathcal {E}}_1+{\mathcal {E}}_2^{\mathcal {R}})T\). In fact, in Lemma 9.1 we prove that, with \(\psi =T^*\Psi \),
The bound (1.55) is one of the most subtle estimates of our analysis and does not have any counterpart in the proof of the upper bound. Note that on the right hand side, once and only once the vector \(\Psi \) appears. The proof of this bound relies on the nice property (1.50) of the Bogoliubov kernel.
The second and third summand on the right hand side of (1.55) are simply bounded by the a-priori estimates (1.27). Unlike \(CN^{-\frac{1}{2}} \Vert \psi \Vert \Vert {\mathbb {H}}_0^{1/2} \Psi \Vert \) with its small pre-factor \(N^{-\frac{1}{2}}\) the expectation value \(-\Vert {{\hat{V}}}\Vert _{\ell ^1} \langle \psi ,{\mathbb {H}}_0 \psi \rangle \) has to be controlled differently: since \(\Vert {{\hat{V}}}\Vert _{\ell ^1}\) is assumed to be small, we can control it using the positive term \({\mathbb {H}}_0\) left after the Bogoliubov transformation of the difference of fermionic and bosonic kinetic energy, see the right hand side of (1.44).
Eventually we will take the parameters \(M=N^{4\delta }\) and \(\delta =\frac{1}{24}\), resulting in the total error \({\mathcal {O}}(\hbar ^{1+\frac{1}{16} })\) to the correlation energy. This completes the sketch of the proof.
2 Kinetic estimates
Our goal is to derive some rough estimates on the correlation Hamiltonian \({\mathcal {H}}_\text{ corr }\) in (1.20) using the kinetic energy
The main result of this section is the following estimate for \({\mathcal {H}}_\text{ corr }\). The proof is based on the estimates of [74, Section 4.1], which we shall review for the convenience of the reader.
Lemma 2.1
(A-Priori Estimates for the Correlation Hamiltonian) There exists a \(v_0 > 0\) such that for \({\hat{V}}: {\mathbb {Z}}^3 \rightarrow {\mathbb {R}}\) compactly supported, non-negative, with \({\hat{V}}(k)={\hat{V}}(-k)\) for all \(k\in {\mathbb {Z}}^3\), and \(\Vert {\hat{V}}\Vert _{\ell ^1}< v_0\), the following holds true:
Before coming to the proof of this lemma at the end of the section we need a couple of auxiliary lemmas. We start by recalling [74, Lemma 4.7], which allows us to control the pair operators b(k) using the kinetic energy \({\mathbb {H}}_0\).
Lemma 2.2
(Kinetic Bound for Pair Operators) For every \(k\in {\mathbb {Z}}^3\) and \(\psi \in {\mathcal {F}}\) we have
Since we will use this bound several times and we also need a modified version in Sect. 4, a simplified proof of (2.2) is provided in Appendix B for the reader’s convenience.
As a consequence of Lemma 2.2 we can bound the pair operators by the kinetic energy.
Lemma 2.3
(Kinetic Bound for \(b(k)^\natural \)) For all \(k\in {\mathbb {Z}}^3\) we have
Proof
For every \(\psi \in {\mathcal {F}}\), from Lemma 2.2 and the triangle inequality we have
This is equivalent to \(b^*(k)b(k) \le C N {\mathbb {H}}_0\). To estimate \(b(k) b^*(k)\), we use the CAR (1.12)
The last estimate follows from a simple counting argument: the set \(B_\text{ F}^c\cap (B_\text{ F }+k)\) (sketched in grey in Fig. 1) is contained in the volume obtained by extending an area of size \({\mathcal {O}}(N^{\frac{2}{3}})\) on the Fermi surface to a shell of thickness of order \({\mathcal {O}}(1)\). Therefore, this set contains no more than \(C N^{\frac{2}{3}}\) points of \({\mathbb {Z}}^3\). Thus
\(\square \)
The following new bound is our main tool to control the number of excited fermions in the ground state. It is based on the observation that because of the discreteness of momentum space the kinetic energy operator has a tiny (i. e., order \(\hbar ^2\)) gap.
Lemma 2.4
(Kinetic Bound for Number of Fermions) Let \(\psi \in {\mathcal {F}}\) satisfy \(\left( {\mathcal {N}}^\text{ p } - {\mathcal {N}}^\text{ h } \right) \psi = 0\). Then we have
Proof
Consider any \(k_2 \in B_\text{ F}^c\) and \(k_1 \in B_\text{ F }\). By definition of the Fermi ball \(|k_2 |> |k_1 |\), and since \(k_1, k_2 \in {\mathbb {Z}}^3\) we have \(|k_2 |^2 - |k_1 |^2 \in {\mathbb {N}}\); thus
Define
Obviously \(\sup _{k \in B_\text{ F }} |k|^2 \le c_0 \le \inf _{k \in B_\text{ F}^c} |k|^2\), and since \(B_\text{ F }\cup B_\text{ F}^c = {\mathbb {Z}}^3\) we get
Moreover, using \(\left( {\mathcal {N}}^\text{ p } - {\mathcal {N}}^\text{ h }\right) \psi = 0\) we find
Thus we conclude
\(\square \)
As a consequence of Lemma 2.4 we can easily prove that the exchange term has a very small contribution to the ground state energy, namely bounded as in the following lemma.
Lemma 2.5
(Bound for \({\mathbb {X}}\)) Let \(\psi \in {\mathcal {F}}\) satisfy \(\left( {\mathcal {N}}^\text{ p } - {\mathcal {N}}^\text{ h } \right) \psi = 0\). Then
Proof
This follows from the simple estimateFootnote 8\(\pm {\mathbb {X}}\le C\Vert {{\hat{V}}}\Vert _{\ell ^1} {\mathcal {N}}/N\) and Lemma 2.4. \(\square \)
Now we are ready to prove the main result of the section.
Proof of Lemma 2.1
By the Cauchy–Schwarz inequality and Lemma 2.3
Combining this with Lemma 2.5 we obtain
The desired result follows from the smallness condition on \({{\hat{V}}}\). \(\square \)
3 Localization of particle number
From the previous kinetic energy estimates it is possible to derive a-priori bounds for the ground states of \({\mathcal {H}}_\text{ corr }\) (see Lemma 3.2 below). For example, we can control the expectation value of the particle number \({\mathcal {N}}\) in a ground state using Lemma 2.4. To estimate also the expectation values of higher powers of \({\mathcal {N}}\), inspired by [87] we use IMS localization with respect to particle number to construct an approximate ground state which has energy close to the ground state energy and at the same time fulfills the desired bounds for powers of the number operator. This is the main outcome of the section, given in the following lemma.
Lemma 3.1
(Localization in Particle Number) Let \(\psi _\text{ gs }\) be a ground state vector for \({\mathcal {H}}_{corr}\) satisfying \(({\mathcal {N}}^\text{ p } - {\mathcal {N}}^\text{ h})\psi _\text{ gs } = 0\), i. e., a minimizer of the left hand side of (1.22). Then there exists a normalized vector \(\Psi \in {\mathcal {F}}\) such that
(i.,e., \(\Psi \) lives in the Fock space sectors with particle number less or equal to \(N^{\frac{1}{3}}\)) and
Furthermore \(({\mathcal {N}}^\text{ p } - {\mathcal {N}}^\text{ h})\Psi = 0\).
As a first ingredient for the proof of Lemma 3.1, we have a-priori estimates based on Sect. 2. Note that for \(\psi = \Omega \) we have \(\langle \Omega , {\mathcal {H}}_\text{ corr }\Omega \rangle =0 \le C\hbar \), and thus for any ground state \(\psi _\text{ gs }\) of \({\mathcal {H}}_\text{ corr }\) we have \(\langle \psi _\text{ gs },{\mathcal {H}}_\text{ corr }\psi _\text{ gs } \rangle \le 0\) by the variational principle. Thus we can apply the following lemma to \(\psi _\text{ gs }\).
Lemma 3.2
(A-Priori Estimates) Let \(\psi \in {\mathcal {F}}\) such that \(\langle \psi , {\mathcal {H}}_\text{ corr } \psi \rangle \le C\hbar \). Then we have
Proof
From the lower bound in Lemma 2.1 and the assumption of Lemma 3.2 we have
This implies \(\langle \psi , ({\mathbb {H}}_0 +{\mathcal {E}}_1) \psi \rangle \le C \hbar \). The bound on \({\mathcal {N}}\) follows from Lemma 2.4. \(\square \)
Next, we localize the particle number using a suitable localization formula on Fock space. This technique goes back to [87, Theorem A.1]. The following general statement is taken from [86, Proposition 6.1].
Lemma 3.3
(Localization on Fock Space) Let \({\mathcal {A}}\) be a non-negative operator on \({\mathcal {F}}\) such that \(P_i D({\mathcal {A}})\subset D({\mathcal {A}})\) and \(P_i {\mathcal {A}}P_j=0\) if \(|i-j|>\ell \), where \(P_i=\mathbb {1}({\mathcal {N}}=i)\). Let \(f,g: [0,\infty ) \rightarrow [0,1]\) be smooth functions such that \(f^2 + g^2=1\), \(f(x) = 1\) for \( x \le 1/2\) and \(f(x)=0\) for \( x \ge 1\). For any \(L \ge 1\) define the operators
Then
where \({\mathcal {A}}_\text{ diag }=\sum _{i=0}^{\infty } P_i {\mathcal {A}}P_i\) and \(C_f= 2 (\Vert f'\Vert _{L^\infty }^2 + \Vert g'\Vert _{L^\infty }^2)\).
The proof of Lemma 3.3 in [86] is based on the double commutator identity
This is an analogue of the standard IMS localization formula in position space [101]
Now we are ready to prove the main result of the section.
Proof of Lemma 3.1
We will apply Lemma 3.3 for \({\mathcal {A}}= {\mathcal {H}}_\text{ corr } + \hbar \). We can take \(\ell =4\) as the Hamiltonian \( {\mathcal {H}}_\text{ corr }\) changes particle number by at most \(\pm 4\). By Lemma 2.1, we have
which also implies that
because \({\mathcal {N}}\) commutes with \({\mathbb {H}}_0\) and \( {\mathcal {E}}_1\). Thus by Lemma 3.2 we get (with the constant \(C_0 > 0\) fixed for reference in the further proof)
Now applying Lemma 3.3, for all \(L\ge 1\) we can bound
Combining with the variational principle (and using that \(\psi _\text{ gs }\) is a ground state for \({\mathcal {A}}\))
and then together with \(f_L^2+g_L^2=1\) we obtain
Choosing \(L:=4 C_0 N^{\frac{1}{3}}\), with \(C_0\) the constant fixed in (3.2), we get
Consequently, (3.3) implies
with
Since \({\mathcal {A}}={\mathcal {H}}_\text{ corr }+\hbar \) and \(\Psi \) and \(\psi _\text{ gs }\) are normalized, the previous inequality is equivalent to
Finally, from the definition of \(\Psi \) and \(0\le f_L \le \mathbb {1} ({\mathcal {N}}\le L)\) we get \(\Psi = \mathbb {1} ({\mathcal {N}}\le L) \Psi \). Since \({\mathcal {N}}\) and \({\mathcal {N}}^\text{ p } - {\mathcal {N}}^\text{ h }\) commute, it follows also that \(({\mathcal {N}}^\text{ p } - {\mathcal {N}}^\text{ h})\Psi = 0\).
\(\square \)
4 Reduction to pair excitations on patches
Our bosonization method is based on decomposing the pair excitations \(b^*(k)\) into smaller pieces localized in disjoint patches on the Fermi surface. This procedure has been introduced in the context of the renormalization group [16, 34, 68, 72, 73]. In this section we will define the patches precisely. Moreover, we prove that the correlation Hamiltonian \(H_\text{ corr }\) can be properly represented by the pair excitations in patches up to an explicitly estimated error.
First, we will decompose the Fermi surface into patches; a partition of the Fermi surface is sketched in Fig. 2. Then we thicken the patches on the Fermi surface by allowing a relative momentum of order \({\mathcal {O}}(1)\). With a parameter \(\delta \in (0,1/6)\) that will eventually be optimized, the number M of patches will be chosen in the range
(We will eventually take \(\delta =1/24\) and \(M=N^{4\delta }\).) The parameter \(\delta \) is the same as will appear in the gapped number operator (5.6) and in the equator cut-off (4.6). The details of the construction are given in the following paragraphs, leading to the patch definition (4.5).
Patches on the unit sphere. We start our construction on the unit sphere (following, e. g., [83]) and later scale up to the Fermi sphere of radius \(k_\text{ F }= \kappa N^{\frac{1}{3}}\). We use standard spherical coordinates: for \({{\hat{\omega }}} \in {\mathbb {S}}^2\), denote by \(\theta \) the inclination (measured between \({{\hat{\omega }}}\) and \(e_3 = (0,0,1)\)) and by \(\varphi \) the azimuth (measured between \(e_1 = (1 , 0 , 0)\) and the projection of \({\hat{\omega }}\) onto the plane perpendicular to \(e_3\)). We write \({\hat{\omega }}(\theta ,\varphi )\) to specify a vector on the unit sphere by its inclination and azimuth.
We place a spherical cap centered at \(e_3\) with opening angle \(\Delta \theta _0 := D /\sqrt{M}\), \(D > 0\) chosen such that the area of the cap is \(4\pi / M\). Then we decompose the remaining part of the northern half sphere into \(\sqrt{M}/2\) (rounded to the next integer) collars; the i-th collar consists of all \({\hat{\omega }}(\theta ,\varphi )\) with \(\theta \in [\theta _i - \Delta \theta _i,\theta _i+\Delta \theta _i)\) and arbitrary \(\varphi \). The inclination of every collar extends over \(\Delta \theta _i \sim 1/{\sqrt{M}}\); the proportionality constant is adjusted so that the number of collars is integer.
Observe that the circle \(\left\{ {{\hat{\omega }}}(\theta _i,\varphi ): \varphi \in [0,2\pi ) \right\} \) has circumference proportional to \(\sin (\theta _i)\); therefore we split the i-th collar into \(\sqrt{M}\sin (\theta _i)\) (rounded to the next integer) patches. This implies that the j-th patch in the i-th collar covers an azimuth \(\varphi \in [\varphi _{i,j}-\Delta \varphi _{i,j},\varphi _{i,j} + \Delta \varphi _{i,j})\), where
We fix the proportionality constants demanding that all patches have area \(4\pi /M\).
Since \({\hat{V}}\) is compactly supported, we can set
Next we introduce corridors between the patches by redefining
The constant \({\tilde{D}}\) is chosen such that when scaled up to the Fermi sphere adjacent patches are separated by corridors of width strictly larger than \(2 R_{{\hat{V}}}\).
We then define \(p_1\) as the spherical cap with opening angle \(\Delta {\widetilde{\theta }}_0\) centered at \(e_3\) and the other \(\frac{M}{2} - 1\) patches as
Patches on the southern half sphere are defined through reflection by the origin. Finally we enumerate the patches by \(\alpha \in \{1, \ldots , M\}\) and obtain the collection \(\{p_\alpha \}_{\alpha =1}^M\) from (4.4). This completes the construction of patches \(\{p_\alpha \}_{\alpha =1}^M\) for the unit sphere.
Patches on the Fermi sphere. Next, using the Fermi momentum
we scale the patches \(\{p_\alpha \}_{\alpha =1}^M\) from the unit sphere up to the Fermi sphere by defining
By construction, we have the following properties for \(\{P_\alpha \}_{\alpha =1}^M\):
-
(i)
Reflection symmetry: \(-P_\alpha = P_{\alpha +\frac{M}{2}}\) for all \(\alpha = 1,\ldots ,\frac{M}{2}\).
-
(ii)
The area of every patch is \( 4\pi k_\text{ F}^2{M^{-1}} \Big ( 1 + {\mathcal {O}}\big (M^{\frac{1}{2}}N^{-\frac{1}{3}}\big )\Big ). \) Moreover, the diameter of each patch is bounded by \( {\text {diam}}(P_\alpha ) \le C N^{\frac{1}{3}}M^{-\frac{1}{2}}. \) In words: patches do not degenerate into elongated thin strips.
-
(iii)
The patches are separated by corridors of width at least \(2 R_{{\hat{V}}}\). The area of the union of all corridors is bounded by \(CN^{\frac{1}{3}}M^{\frac{1}{2}}\).
Extended patches around the Fermi sphere. Next we extend the patch decomposition radially, over the shell around the Fermi surface that is affected by the interaction with momenta \(k \in {\text {supp}}{\hat{V}}\). For any \(\alpha \in \{1,2,\ldots ,M\}\) we introduce the extended patch
Thus we have the following properties for the patch decomposition \(\{B_\alpha \}_{\alpha =1}^M\):
-
(i)
Reflection property: \(-B_\alpha = B_{\alpha +\frac{M}{2}}\) for all \(\alpha = 1,\ldots ,\frac{M}{2}\).
-
(ii)
The diameter of each patch is bounded by \(C N^{\frac{1}{3}}/\sqrt{M}\).
-
(iii)
The patches \(\{B_\alpha \}_{\alpha =1}^M\) are pairwise disjoint and separated by corridors of width \(2 R_{{\hat{V}}}\). (If the separation of patches on the Fermi surface is S, then below the Fermi surface, at distance \(k_\text{ F }- R_{{\hat{V}}}\) from the origin, their separation is \(S-{\mathcal {O}}(N^{-\frac{1}{3}})\). Since by construction (4.3) S is strictly larger than \(2R_{{\hat{V}}}\), also \(S-{\mathcal {O}}(N^{-\frac{1}{3}}) > 2 R_{{\hat{V}}}\) for large enough N.)
Removing patches near the equator. Now we assign a unit vector \({\hat{\omega }}_\alpha \) to every patch on the northern half such that \(k_\text{ F }{\hat{\omega }}_\alpha \in P_\alpha \subset B_\alpha \). Reflecting the construction to the southern half sphere, the vectors \({\hat{\omega }}_{\alpha }\) inherit the reflection symmetry
For any momentum \(k\in {\mathbb {Z}}^3\setminus \{0\}\), we are only interested in a subset of the constructed patches, as labeled by the index set (the parameter \(\delta \in (0,1/6)\) is the same as in (4.1))
Pair excitations near the equator \(k \cdot {\hat{\omega }}_\alpha \approx 0\) are almost tangential to the Fermi surface and cannot be treated with the bosonization technique. Fortunately their contribution to the energy turns out to be small.
For any \(k\in {\mathbb {Z}}^3\setminus \{0\}\) we define the operators without the corridors and the excitations near the equator as
The main result of this section is the following lemma, which takes care of estimating the difference to the original \({\mathcal {H}}_\text{ corr }\).
Lemma 4.1
(Reduction to Pair Excitations on Patches) We have
In order to prove Lemma 4.1 we will need the following modified version of the kinetic energy estimate in Lemma 2.2.
Lemma 4.2
(Kinetic Bound for Pairs near the Equator) Let \(\delta \in (0,77/624)\). Then for all \(k\in {\mathbb {Z}}^3\) we have
The condition \(e(p)+e(p-k) \le 4 N^{-\frac{1}{3} -\delta }\) implies that the momentum p is located near the equator of the Fermi surface (if we think of k as defining the direction of the north pole). This is easily seen expanding \(e(p) + e(p-k) = \hbar ^2 2 p\cdot k - \hbar ^2 k^2\); because \(|p |\sim N^{\frac{1}{3}}\)we then have \(k\cdot {\hat{p}} < C N^{-\delta }\). The idea of Lemma 4.2 is that the estimate in Lemma 2.2 can be improved since here we sum only over a ribbon parallel to the equator on the Fermi surface. The ribbon covers a fraction of order \(N^{-\delta }\) of the Fermi surface, explaining the improvement from N to \(N^{1-\delta }\) in (4.10). The proof of Lemma 4.2 can be found in Appendix B.
Proof of Lemma 4.1
For every \(k\in {\mathbb {Z}}^3\), recall from (4.7) that
i. e., the summation is over all p in the set
Thus the error term compared to the full pair operator becomes
where, with \(A_1 := B_\text{ F}^c\cap (B_\text{ F }+k)\) and \(A_2 := \bigcup _{\alpha \in {\mathcal {I}}_{k}} \Big ( B_\alpha \cap (B_\alpha +k) \Big )\), we define the set
In words: U consists of all those particle momenta \(p \in B_\text{ F}^c\) that correspond to a kinematically permitted particle-hole pair (i. e., \(h := p-k\) is inside the Fermi ball) but do not belong to any included patch. Thus in (4.11) we sum over pairs belonging to a corridor between patches or to the cut-off equator region (i. e., they belong to a patch \(B_\alpha \) but \(\alpha \not \in {\mathcal {I}}_{k}\)). To estimate \({\mathfrak {r}}^{\mathcal {R}}(k)\), let \(\psi \in {\mathcal {F}}\). By the triangle inequality we can bound
where
To make contact with our earlier heuristic explanations, note that Y corresponds to the region (both patches and corridors between patches) of the Fermi surface near the equator, i. e., where \(k \cdot {\hat{p}} \approx 0\), and \(U \setminus Y\) to the corridors between the patches on the rest of the Fermi surface. This may be seen by expanding \(e(p) + e(p-k) = \hbar ^2 2 p\cdot k - \hbar ^2 k^2\); because p is close to the Fermi surface we have \(|p |\sim N^{\frac{1}{3}}\), so that \(k\cdot {\hat{p}} < C N^{-\delta }\).
On the set Y, by Lemma 4.2 we get
We turn to the set \(U \setminus Y\). By the Cauchy–Schwarz inequality we have
To estimate the first factor, note that \(p \not \in Y\) implies \(e(p)+e(p-k) \ge 4N^{-\frac{1}{3}-\delta }\), so that we have \(e(p) \ge 2N^{-\frac{1}{3}-\delta }\) or \(e(p-k)\ge 2N^{-\frac{1}{3}-\delta }\). Consequently
To estimate the second factor, note that the number of lattice points of \({\mathbb {Z}}^3\) in \(U\setminus Y\) can be bounded by the number of lattice points in the corridors between all patches
(Here we used that the length of a corridor surrounding a patch is of order \(N^{\frac{1}{3}} M^{-\frac{1}{2}}\), its width of order one, and the number of patches is M.) Having estimated both factors, we get
Putting (4.12) and (4.16) together we arrive at
Now we turn to the Hamiltonian. Expanding \(b(k)= b^{\mathcal {R}}(k) + {\mathfrak {r}}^{\mathcal {R}}(k)\) in the formula for \({\mathcal {H}}_\text{ corr }\) in (1.20), we get
It is easy to see that the kinetic bounds in Lemma 2.3 hold also with b(k) replaced by \(b^{\mathcal {R}}(k)\). Therefore, in combination with (4.17), by the Cauchy–Schwarz inequality we get
\(\square \)
5 Approximately bosonic creation operators
Let \(\{B_\alpha \}_{\alpha =1}^M\) be the patches constructed as in the previous section and let \(k_\text{ F }{\hat{\omega }}_\alpha \in B_\alpha \). Recall that for every \(k\in {\mathbb {Z}}^3 \setminus \{0\}\) we have defined
By the reflection symmetry we decompose further \({\mathcal {I}}_{k}:= {\mathcal {I}}_{k}^{-}\cup {\mathcal {I}}_{k}^{+}\) where
Then we define the local pair excitations \(\{c_\alpha ^*(k)\}_{\alpha \in {\mathcal {I}}_{k}}\) by
and
Thus, for all \(k \in \Gamma ^{\text{ nor }}\), the operator \(b^{\mathcal {R}}(k)\) in Lemma 4.1 can be decomposed as
The quantity \(n_\alpha (k)^2\) counts the number of particle–hole pairs of relative momentum k belonging to patch \(B_\alpha \). We cite the result from [23, Proposition 3.1]. The condition \(M\gg N^{\frac{1}{3}}\) in [23] is not necessary, \(M \gg N^{2\delta }\) is sufficient.
Lemma 5.1
(Normalization Constant) Assume that \(N^{2\delta } \ll M \ll N^{\frac{2}{3}-2\delta }\). Then for all \(k \in \Gamma ^{\text{ nor }}\) and \(\alpha \in {\mathcal {I}}_{k}\), we have
(This lemma may heuristically be understood as follows: the surface area covered by the patch is \(4\pi k_\text{ F}^2/M\). We think of the particle–hole pairs \((p,h) \in B_\text{ F}^c \times B_\text{ F }\) with \(p-h =k\) as organized on lines through the lattice \({\mathbb {Z}}^3\) parallel to k. To count how many lines intersect the Fermi surface we project the patch onto a plane, picking up the factor \(|k \cdot {{\hat{\omega }}}_\alpha |\). The condition \(M \gg N^{2\delta }\) ensures that patches remain so small that even near their boundaries the assumption \(|k\cdot {\hat{\omega }}_\alpha |\ge N^{-\delta }\) implies that k points from inside the Fermi ball to outside. The error term arises since we may miscount a pair when one of its components falls into the surrounding corridor, so it is proportional to the circumference of the patch. We write only \(1+o(1)\) because the precise estimate is not important for us.)
A crucial idea of our analysis is that the local pair excitation operators \(\{c_\alpha ^*(k)\}_{\alpha \in {\mathcal {I}}_{k}}\) behave similarly to bosonic creation operators. More precisely, we have approximate canonical commutator relations as given by the following lemma. The lemma is a simple consequence of [23, Lemma 4.1], but since it is a key idea of the collective bosonization concept we provide a self-contained proof again.
Lemma 5.2
(Approximate CCR) Let \(k\in \Gamma ^{\text{ nor }}\) and \(l\in \Gamma ^{\text{ nor }}\). Let \(\alpha \in {\mathcal {I}}_{k}\) and \(\beta \in {\mathcal {I}}_{l}\). The operators \(c_\alpha (k)\) and \(c_\beta ^*(l)\) satisfy the following commutator relations:
The operator \({\mathcal {E}}_\alpha (k,l)={\mathcal {E}}_\alpha (l,k)^*\) commutes with \({\mathcal {N}}\) and, for any \(\gamma \in {\mathcal {I}}_{k}\cap {\mathcal {I}}_{l}\), satisfies the operator inequalities
Furthermore, for all \(\psi \in {\mathcal {F}}\) we have
Proof
By the CAR (1.12) it is easy to see that
Moreover, if \(\alpha \ne \beta \), then \(B_\alpha \cap B_\beta = \emptyset \), and hence \([c_\alpha (k),c^*_\beta (l)] =0\).
Now let us focus on the case \(\beta =\alpha \) and compute \([c_\alpha (k),c^*_\alpha (l)]\). We only consider the case \(\alpha \in {\mathcal {I}}_{k}^{+}\cap {\mathcal {I}}_{l}^{+}\) (the other cases are simple variations). By the CAR (1.12) it is straightforward to compute that
Let us focus on \(|{\mathcal {E}}^{(1)}_\alpha (k,l)|^2\); the term \(|{\mathcal {E}}^{(2)}_\alpha (k,l)|^2\) can be bounded similarly, and the mixed terms are controlled by the Cauchy–Schwarz inequality. Symmetrizing, we find
By Lemma 5.1 we have \(n_{\alpha }(k) n_{\alpha }(l) \ge C^{-1} N^{\frac{2}{3}-\delta }/M\). Moreover, by the Cauchy–Schwarz inequality,
Therefore
The first bound in (5.3) (without the summation) is a trivial consequence. The bound (5.4) follows from (5.3) and the Cauchy–Schwarz inequality. \(\square \)
In the following, we show that the approximately bosonic number operator can be controlled by a fermionic number operator. One of our main technical improvements compared to [23, Lemma 4.2] is that instead of using the full \({\mathcal {N}}\) we use only the gapped number operator
The parameter \(\delta >0\) is the same as that in the cut-off parameter \(N^{-\delta }\) defining the index set \({\mathcal {I}}_{k}\) of relevant patches. Compared to \({\mathcal {N}}\) as in Lemma 2.4, the gain in using the gapped number operator \({\mathcal {N}}_\delta \) is that it can be controlled by \(\langle \Psi , {\mathcal {N}}_\delta \Psi \rangle \le C N^{\delta }\) in an approximate ground state \(\Psi \).
Lemma 5.3
(Bosonic Number Operator) For all \(k \in \Gamma ^{\text{ nor }}\) we have
Consequently, for all \(\psi \in {\mathcal {F}}\),
and for \(f \in \ell ^2({\mathcal {I}}_{k})\) also
Proof
First we take \(\alpha \in {\mathcal {I}}_{k}^{+}\) (the case \(\alpha \in {\mathcal {I}}_{k}^{-}\) is similar). For any \(\psi \in {\mathcal {F}}\), by the triangle and Cauchy–Schwarz inequalities,
Using the definition of \(n_\alpha (k)\) and the fermionic property \(\Vert a_i\Vert _\text{ op } \le 1\) we deduce that
For all \(p\in B_\text{ F}^c\cap B_\alpha \cap (B_\text{ F }+k)\) we have
and the condition \(\alpha \in {\mathcal {I}}_{k}^{+}\) ensures that \(k\cdot {\hat{\omega }}_\alpha \ge N^{-\delta }\); hence
Consequently, we have \(e(p) \ge \frac{1}{4} N^{-\frac{1}{3}-\delta }\) or \(e(p-k)\ge \frac{1}{4} N^{-\frac{1}{3}-\delta }\). Thus
By the same method we obtain the same bound when \(\alpha \in {\mathcal {I}}_{k}^{-}\). Thus by the definition of the gapped number operator we can bound
which is equivalent to (5.7). Moreover, it can be seen from \({\mathcal {E}}_\alpha (k,k) \le 0\) in (5.5) that
Thus
By the Cauchy–Schwarz inequality we obtain
Using that \([c_\alpha (k), c^*_\beta (k)]\) vanishes for \(\alpha \ne \beta \), by (5.10) we obtain
This concludes the proof. \(\square \)
For further application, it is useful to extend the definition of \(c_\alpha (k)\) to include a weight function. Given \(g: {\mathbb {Z}}^3\times {\mathbb {Z}}^3 \rightarrow {\mathbb {R}}\), we define weighted pair operators
The weighted pair operators satisfy similar bounds as the simple pair operators.
Lemma 5.4
(Weighted Pair Operators) For all \(k \in \Gamma ^{\text{ nor }}\) and \(\psi \in {\mathcal {F}}\) we have
and for all \(f \in \ell ^2({\mathcal {I}}_{k})\) also
Proof
Compared to Lemma 5.3 the only non-trivial modification is that we now use
where before we used \([c_\alpha (k),c^*_\alpha (k)] \le 1\). We omit the further details. \(\square \)
6 Bogoliubov kernel
In this section we study the Hamiltonian \(h_\text{ eff }(k)\) introduced in (1.48). Let us use \({\hat{k}} := k/|k|\). It is convenient to write
where \(D(k)\), \(W(k)\), and \({\widetilde{W}}(k)\) are \({\mathcal {I}}_{k}\times {\mathcal {I}}_{k}\) real symmetric matrices with elements
If \(c_\alpha ^*(k)\) were exactly bosonic creation operators, then the quadratic Hamiltonian \(h_\text{ eff }(k)\) could be diagonalized by a Bogoliubov transformation of the form
The matrix K(k) (also called the Bogoliubov kernel) achieving this can be computed from \(D(k)\), \(W(k)\), \({\widetilde{W}}(k)\); we refer to [23, Appendix A.1] for a detailed derivation. Here let us just state the result. We introduce the \({\mathcal {I}}_{k}\times {\mathcal {I}}_{k}\) matrices
and
(Formulas (6.9) and (6.10) below show that the square roots here involve only positive matrices, so that E(k) and \(S_1(k)\) are well-defined.) Then the Bogoliubov kernel K(k) is
The following lemma provides strong estimates for the matrix elements of K(k). While in most parts the simpler bound \(|K(k)_{\alpha ,\beta }|\le C {\hat{V}}(k)/M\) is sufficient for our analysis, the sharp bound of the lemma is crucial for controlling the non-bosonizable terms \({\mathcal {E}}_2\); see (9.5). The simpler bound can be proved without smallness assumption on the potential; the sharp bound requires the smallness because we prove it using a power series expansion in (6.14).
The proof of the lemma is a lengthy but mostly straightforward computation. A key role in the proof is played by the fact that the factor \(k\cdot {\hat{\omega }}_\alpha \) arising from the linearized kinetic energy (through the matrix D(k)) appears also for the independent geometric reason of Lemma 5.1 in the normalization factor \(n^2_\alpha (k)\) in the bosonized interaction, i. e., in W(k) and \({\tilde{W}}(k)\). The geometry of the Fermi surface implies that the excitation-creating operators of the interaction vanish at the same rate \(u_\alpha = \sqrt{|{\hat{k}} \cdot {\hat{\omega }}_\alpha |}\) as the leading order of their kinetic energy when we move toward “tangential” excitations.
Lemma 6.1
(Bogoliubov Kernel) Let K(k) be defined in (6.6). If \(\Vert {\hat{V}}\Vert _{\ell ^\infty }\) is sufficiently small, then for any \(k\in \Gamma ^{\text{ nor }}\), K(k) is a real symmetric matrix satisfying
Proof
In the following, we frequently drop the k-dependence from the notation for simplicity. Let us introduce
Recall that \(\kappa =(\frac{3}{4\pi })^{\frac{1}{3}}+ {\mathcal {O}}(N^{-1/3})\). By definition of the index set \({\mathcal {I}}_{k}\), we have \(1\ge u_\alpha ^2 \ge CN^{-\delta }\) for all \(\alpha \in {\mathcal {I}}_{k}\). Moreover, Lemma 5.1 implies the important relation \(v_\alpha \simeq u_\alpha \sqrt{ 4\pi /M}\) (up to a lower order error term) between the normalization factor \(n_\alpha (k)\) and the linearization of the kinetic energy, which will be used repeatedly for cancellations.
Due to the reflection symmetry
we can denote \(I:= |{\mathcal {I}}_{k}^{+}|= |{\mathcal {I}}_{k}^{-}|\le M/2\) and map the indices \({\mathcal {I}}_{k}^{+}\) to \(\{1,\ldots , I\}\), and the indices \({\mathcal {I}}_{k}^{-}\) to \(\{I+1,\ldots 2I\}\). Obviously
Therefore, the matrices in (6.2) can be written in the \(2I\times 2I\) block form
where \(d={\text {diag}}(u_\alpha ^2, \alpha = 1,\ldots ,I)\) and \(b=g|v\rangle \langle v|\) (i. e., a rank-one operator) with \(v = (v_1, \cdots , v_{I})\).
As in [64], denoting by \({\mathbb {I}}\) the \(I\times I\)-identity matrix, we define
Obviously \(U^\intercal = U = U^{-1}\) and it simultaneously block-diagonalizes \(D+W+ {\widetilde{W}}\) and \(D+W- {\widetilde{W}}\), namely
Recall the matrix
applying the block-diagonalization we find
Now defining the matrix \(L := S_1 S_1^\intercal - {\mathbb {I}}\) with \(S_1\) as in (6.5), we find
We are going to prove
In particular, thanks to the assumption of \(\Vert {\hat{V}}\Vert _{\ell ^\infty }\) being small, also, e. g., the Hilbert–Schmidt norm of L can be assumed to be uniformly smaller than 1, which is sufficient to ensure convergence of the matrix power series (6.11).
From L to K. Let us deduce (6.7) by assuming (6.12). Spelling out the matrix product
we obtain (recall that \(|{\mathcal {I}}_{k}|=2I\le M\))
The same holds with exchanged roles of \(u_\alpha \) and \(u_\beta \). From (6.11) we obtain
The convergence of the series of the logarithm follows from the assumption that \(|{\hat{V}}(k)|\) is small. This implies (6.7), thanks to Lemma 5.1.
Bound for L. We now prove (6.12). The \(2I\times 2I\)-matrix L can be block-diagonalized using the orthogonal matrix U from (6.8), i. e.,
with \(I\times I\)-blocks
Inverting (6.15) we obtain
Thus, with the matrix indices on \(L_1\) and \(L_2\) to be read as \(\alpha \mod I\) and \(\beta \mod I\), we have
Estimating \(L_1\). In the square brackets in the definition (6.16) of \(L_1\) we have a rank-one perturbation of a diagonal matrix, namely defining the vector \({\tilde{v}} := d^{1/2} v\) we have
Using the Sherman–Morrison formula for the resolvent of any invertible matrix A with rank-one perturbation given by vectors x, y such that \(1+ \langle y, A^{-1} x \rangle \not = 0\), i. e.,
we explicitly calculate the resolvent of (6.19) and then enter with it in the integral representation \(A^{-1/2} = \frac{2}{\pi }\int _0^\infty {\text{ d }}\lambda (A + \lambda ^2)^{-1}\) (for any positive matrix A), with the result that
The function \(f(\lambda )\) here is given by
Multiplying from both sides by \(d^{1/2}\), and subtracting the identity matrix, we obtain
Recall \(v_\alpha \le u_\alpha CM^{-\frac{1}{2}}\) and observe that \(1/f(\lambda ) \le 1\), so we get
Estimating \(L_2\). Recall that
Here \(-(d+2b)^{1/2} 2b (d+2b)^{1/2} = -2g |(d+2b)^{1/2} v\rangle \langle (d+2b)^{1/2} v |\) is a rank-one perturbation of \((d+2b)^2\), so by employing again the integral expansion as used for \(L_1\) we obtain
Now consider the function \({\tilde{f}}(\lambda ) := 1 - 2g\big \langle v, \frac{d+2b}{(d+2b)^2 + \lambda ^2} v\big \rangle \), the inverse of which is appearing in the integral. For \(\lambda = 0\), using the Sherman–Morrison formula (6.20), this time expanding \(d+2b\) around d, we find
Since
is uniformly bounded we have \({\tilde{f}}(0) > 0\), strictly and uniformly in k and M. Furthermore \(\lambda \rightarrow \left\langle v, \frac{d+2b}{(d+2b)^2 + \lambda ^2} v\right\rangle \) is monotone decreasing for all \(\lambda \ge 0\), thus \({\tilde{f}}(\lambda ) \ge {\tilde{f}}(0)\) for all \(\lambda \ge 0\). We expand \(\frac{d+2b}{(d+2b)^2+\lambda ^2}= (d+2b)(d+2b+i\lambda )^{-1} (d+2b -i\lambda )^{-1} \) and use the Sherman–Morrison formula separately for both the resolvents \((d+2b\pm i\lambda )^{-1}\). Using the Dirac ket notation, this results in
where for the last line we used \(b = g|v\rangle \langle v|\). Keeping the 1 from the big square bracket separate while combining the other terms, this simplifies to
The vector \(\big |\frac{d+2b}{(d+2b)^2+\lambda ^2} v\big \rangle \) has real elements since v is a real vector and \(\frac{d+2b}{(d+2b)^2+\lambda ^2}\) is a real matrix. However, (6.25) and (6.26) are not explicitly real (by choosing an order out of the two options \((d+2b)^2 + \lambda ^2 = (d+2b+i\lambda )^{-1} (d+2b -i\lambda )^{-1}\) and \((d+2b)^2 + \lambda ^2 = (d+2b-i\lambda )^{-1} (d+2b +i\lambda )^{-1}\) we have broken this symmetry). To make the expression explicitly real again, let us add the complex conjugate, yielding
Using \(\langle v, (d+i\lambda )^{-1} v\rangle + \langle v, (d-i\lambda )^{-1} v\rangle = \langle v, \frac{2d}{d^2+\lambda ^2} v\rangle \) (and dividing by 2) this becomes
In the denominator \(1+ 2g \langle v, \frac{2d}{d^2+\lambda ^2} v\rangle +4g^2 |\langle v, (d+i\lambda )^{-1} v\rangle |^2 \ge 1\); hence (6.29) can be dropped for an upper bound. Inserting (6.28) in (6.23) we obtain
For the following estimates, note that \(\langle v, \frac{d}{d^2+\lambda ^2} v\rangle \le \langle v, \frac{1}{d} v\rangle \le C\) according to (6.24).
First summand times first summand in (6.30). Consider the product of the first summands from inside each of the absolute values. This is of the same type as (6.21), so
Second summand times second summand in (6.30). We have \(\frac{1}{2 d \lambda } \ge \frac{1}{d^2+\lambda ^2}\), which implies \(\langle v, (d^2+\lambda ^2)^{-1} v\rangle \le (2\lambda )^{-1} \langle v,d^{-1} v\rangle \le C \lambda ^{-1}\). Thus
Assuming without loss of generality \(u_\alpha \ge u_\beta \), by dropping a non-negative \(u_\beta ^4\) from the numerator we find
Mixed term in (6.30). We turn to the remaining two terms obtained from the product in the integral, for which we have to estimate
The integral is
Without loss of generality \(u_\alpha \le u_\beta \); then
In the last step we used that both fractions in the sum are bounded by 1. This completes the proof of Lemma 6.1. \(\square \)
7 Approximate Bogoliubov transformation
Given the Bogoliubov kernel K(k) in (6.6), for any \(\lambda \in {\mathbb {R}}\) we define a unitary transformation \(T_\lambda : {\mathcal {F}}\rightarrow {\mathcal {F}}\) by
The following lemma is the main result of this section, showing that \(T_\lambda \) acts approximately as a bosonic Bogoliubov transformation.
Lemma 7.1
(Approximate Bogoliubov Transformation) For all \(\lambda \in [-1,1]\), \(l \in \Gamma ^{\text{ nor }}\) and \(\gamma \in {\mathcal {I}}_{l}\), we have
where the error operator \({\mathfrak {E}}_\gamma (\lambda ,l)\) satisfies (with a constant C independent of \(\lambda \) and l)
The same estimate holds for \({\mathfrak {E}}^*_\gamma (\lambda ,l)\).
Here the matrices \(\cosh (K(k))\) and \(\sinh (K(k))\) are defined by functional calculus, or more explicitly by the series
As a consequence of Lemma 6.1, by a calculation similar to that in (6.14) using the power series, we can verify that, for all \(\lambda \in [-1,1]\) and \(k\in \Gamma ^{\text{ nor }}\),
In order to prove Lemma 7.1, we need to show that the fermion number is stable under the approximate Bogoliubov transformation. This is the content of the next lemma.
Lemma 7.2
(Stability of Fermion Number) Let \(T_\lambda \) be the approximate Bogoliubov transformation defined in (7.1). For all \(\lambda \in [-1,1]\) and all \(\psi \in {\mathcal {F}}\) we have
Proof
Let \(\psi \in {\mathcal {F}}\). We use a variation of the Grönwall argument in [23, 27].
Proof of (7.4)
For any function f we have
Thus
Recall from Lemma 6.1 that \(|K(k)_{\alpha ,\beta } |\le C/M\), so that (5.9) from Lemma 5.3 implies
Furthermore, by (5.8) from Lemma 5.3 we have
Hence for all vectors \(X,Y \in {\mathcal {F}}\) we have
Using this bound we find
In the last estimate we used that \({\mathcal {N}}_\delta \) commutes with \({\mathcal {N}}\), that \(0\le {\mathcal {N}}_\delta \le {\mathcal {N}}\), and that
The estimate (7.6) closes a Grönwall bound for \(\langle T_\lambda \psi , ({\mathcal {N}}+4)^m T_\lambda \psi \rangle \) and (7.4) follows.
Proof of (7.5)
In view of definition (5.11) we can write
for some weight function g with \(g(p,k)\in \{0,1,2\}\). Similarly to the above calculation
Similar to the bounds used above, Lemma 5.4 provides us with
We then get
Thus (7.5) follows by Grönwall’s inequality. \(\square \)
Proof of Lemma 7.1
Let us denote the exponent of the Bogoliubov transformation by B, i. e., \(T_\lambda = \exp (\lambda B)\). As in the proof of [23, Prop. 4.4], we pick \(n_0 \in {\mathbb {N}}\) and iterate \(n_0\) times the Duhamel expansion
and use the commutator formula
which follows from Lemma 5.2. When iterating the expansion, the term \(K(l)_{\gamma ,\alpha }c_\alpha ^*\) in the commutator formula gives rise to the power series of the \(\sinh \) and \(\cosh \); the rest of (7.8) will be considered as an error term. The conclusion is that (7.2) holds true, with the error term summed over all iteration steps being
for any \(n_0 \ge 1\). (The two terms on the last line are the non-explicit integral term from the Duhamel formula and the powers missing to complete the series of the \(\sinh \) and \(\cosh \) from \(n=n_0\) to \(+\infty \).) Here in every summand \(X^\natural \) means X for n even and \(X^*\) for n odd.
Recall that as a consequence of Lemma 6.1 we have
Therefore, by the triangle inequality and summing over \(\gamma \),
Using Lemma 5.3 we can bound the operators in the last two lines; then taking \(n_0\rightarrow \infty \) we obtain
It remains to bound (7.10) term by term. For the first term, using Lemma 5.2, Lemma 5.3 and Lemma 7.2 we can estimate
For \(I_2\), if \(\beta \ne \alpha \), then \(c^*_\beta (k){\mathcal {E}}_\alpha (k,l)={\mathcal {E}}_\alpha (k,l) c^*_\beta (k)\) which is just what we bounded in (7.11). If \(\beta =\alpha \) we have the decomposition
(This decomposition can be verified by noticing that (7.12) consists of two commutators, recalling that \({\mathcal {E}}_\alpha (k,l) = [c_\alpha (k),c^*_\alpha (l)] - \delta _{k,l}\), and using the Jacobi identity.) With this decomposition we proceed to
Here we have used the first bound from Lemma 5.2 (without the summation), Lemma 5.3 and then proceeded similarly to (7.11). The second term of (7.12) is treated similarly. For the last term of (7.12), we bound \(\Vert c^*_\alpha (l)\xi \Vert \le \Vert ({\mathcal {N}}_\delta +1)^{1/2}\xi \Vert \) (which follows from (5.9) with \(f=\delta _\alpha \)) and then use the fact that \({\mathcal {E}}_{\alpha }(k,k)\) commutes with \({\mathcal {N}}_\delta \) (this is clear from (5.5) with \(k =l\)) to get
In the last estimate we used Lemma 7.2 again. Therefore
By the same argument, we obtain similar bounds for \({\mathcal {E}}^*_\alpha (k,l) c_\beta (k)\) and \(c_\beta (k) {\mathcal {E}}^*_\alpha (k,l)\) (recall that \({\mathcal {E}}^*_\alpha (k,l) = {\mathcal {E}}_\alpha (l,k)\) to reduce to the previous estimates). Collecting the estimates for the terms of (7.10), this completes the proof of Lemma 7.1. \(\square \)
8 Linearization of the kinetic energy
In this section we prove that the fermionic kinetic energy \({\mathbb {H}}_0\) behaves similarly to the bosonized kinetic energy
Recall the definition of the Fermi momentum \(k_\text{ F }=\kappa \hbar ^{-1}=(\frac{3}{4\pi })^{\frac{1}{3}}N^{\frac{1}{3}}+{\mathcal {O}}(1)\). The main result of the section is that the difference of fermionic and approximately bosonic kinetic energy is almost invariant under the approximate Bogoliubov transformation.
Lemma 8.1
(Comparing Fermionic and Bosonized Kinetic Energy) Let \(T_\lambda \) be the approximate Bogoliubov transformation defined in (7.1). Then for all \(\psi \in {\mathcal {F}}\) we have
The first of the two error terms in the lemma is the main reason for taking M large. More precisely, the first error (involving \(M^{-\frac{1}{2}}\)) comes from linearizing the fermionic kinetic energy on each patch, for which the size of the patch must not be too large. The linearization error of the fermionic kinetic energy is estimated in the following lemma.
Lemma 8.2
(Linearization of Kinetic Energy) For all \(k \in \Gamma ^{\text{ nor }}\) and all \(\alpha \in {\mathcal {I}}_{k}\) we have
where the error term satisfies, for all \(\psi \in {\mathcal {F}}\),
and
All bounds here gain a factor \(M^{-\frac{1}{2}}\) compared to the bounds in Lemma 5.3.
Proof
Let us consider \(\alpha \in {\mathcal {I}}_{k}^{+}\), the other case is similar. By the CAR (1.12) we have \([a_i^* a_i, a^*_p] = \delta _{i,p} a^*_p\). Therefore
where, using definition (5.11), we can write \({\mathfrak {E}}^{\text{ lin }}_\alpha (k) = c^{g}_{\alpha }(k)\) with the weight function
Since \({\text {diam}}(B_\alpha )\le CN^{\frac{1}{3}}M^{-\frac{1}{2}}\) and \(|k|\le C\) we can bound
The claimed error estimates now follow from Lemma 5.4. \(\square \)
Proof of Lemma 8.1
Recall that \(T_0 = {\mathbb {I}}\). We will show that for all \(\lambda \in [0,1]\) we have
The claim then follows by integration over \(\lambda \in [0,1]\).
Consider the approximately bosonic operator \({\mathbb {D}}_\text{ B }\). We have
where, with an indicator function \(\chi (\alpha \in {\mathcal {I}}_{l})\), the error term is
For all \(\psi \in {\mathcal {F}}\), by the non-summed first bound from Lemma 5.2 and by Lemma 5.3
Now using
we can decompose and estimate
It remains to estimate the right side of (8.8), term by term. For \(I_1\), since \(|K(k)_{\alpha ,\beta }|\le CM^{-1}\), using Lemma 5.3, Lemma 8.2 and Lemma 7.2 we have
We can bound \(I_2\) similarly to \(I_1\), simply exchanging the roles of the \({\mathfrak {E}}^{\text{ lin }}\)-operator with the c-operator. For \(I_3\) using (8.6) instead of Lemma 8.2 we have
For \(I_4\), we split the sum over \(\alpha ,\beta \in {\mathcal {I}}_{k}\) into two parts. If \(\alpha \ne \beta \), then
and this part can be treated similarly to \(I_3\). When \(\alpha = \beta \), the corresponding contribution is
Here we have inserted the definition (8.5) and used (7.12) to obtain
The advantage of the last expression is that \({\mathcal {E}}_\alpha (l,l)\) commutes with \({\mathcal {N}}_{\delta }\). Therefore, using the Cauchy–Schwarz inequality together with Lemma 5.2, Lemma 5.3 and Lemma 7.2 we obtain
Adding up the contributions to (8.8) we arrive at the claimed bound (8.3). \(\square \)
9 Controlling Non-Bosonizable terms
In this section we consider the non-bosonizable terms \({\mathcal {E}}_1+{\mathcal {E}}_2\). Lemma 4.1 allows us to replace \({\mathcal {E}}_2\) by \({\mathcal {E}}_2^{{\mathcal {R}}}\), hence we estimate \({\mathcal {E}}_1 + {\mathcal {E}}_2^{{\mathcal {R}}}\). Recall that after the expansion into patches (5.1) we have
Now we prove that after the Bogoliubov transformation, the non-bosonizable terms can be bounded from below using the fermionic kinetic energy. This step relies on the assumption \({\hat{V}}(k) \ge 0\).
Lemma 9.1
(Non-Bosonizable Terms) Let \(T_\lambda \) be the approximate Bogoliubov transformation defined in (7.1). Then for all \(\lambda \in [-1,1]\) and for all \(\psi \in {\mathcal {F}}\) we have
Note that in the lemma \({\mathbb {H}}_0\) acts once on \(\psi \) and once on \(T_\lambda \psi \). The sharp bound on the matrix elements of the Bogoliubov kernel from Lemma 6.1 is crucial to the proof of this lemma.
The smallness assumption on \({\hat{V}}\) is important to control the term \(-C\Vert {\hat{V}}\Vert _{\ell ^1} {\mathbb {H}}_0\) appearing on the right hand side in this lemma.
Proof
Take \(k,l\in \Gamma ^{\text{ nor }}\). We denote \(\widetilde{{\mathfrak {D}}}(l):=T_\lambda ^* {\mathfrak {D}}(l) T_\lambda \). By Lemma 7.1 we can write
Therefore, by the triangle inequality, the contribution of \({\mathcal {E}}_2^{{\mathcal {R}}}\) to (9.2) can be bounded by
We proceed to bound the right side of (9.4) term by term. The first term can be bounded using \(n_\alpha (k)\le CN^{\frac{1}{3}}M^{-\frac{1}{2}}\) and Lemma 7.1:
Using (7.3) we have
Consequently, using Lemma 2.2 we get
The third term is more difficult. We have
The term \(I_4\) can be bounded again by Lemma 2.2 as
The commutator in \(I_5\) can be computed by undoing the approximate Bogoliubov transformation,
and using (7.2) with \(T_\lambda =T^*_{-\lambda }\), i. e.,
Therefore, by the triangle inequality and (9.5) we can decompose
For \(I_6\), we simply expand the commutator and use Lemma 7.1 similarly as done for \(I_1\) to get
Here we used \(\Vert {\mathfrak {D}}(l) T_\lambda \psi \Vert = \Vert \widetilde{{\mathfrak {D}}}(l) \psi \Vert \) as \(T_\lambda \) is unitary.
For \(I_7\) we compute the commutator explicitly. We decompose the operator \({\mathfrak {D}}(l)^*\) as
In the case \(\alpha \in {\mathcal {I}}_{k}^{+}\) we can then compute
By the Cauchy–Schwarz inequality and the kinetic energy estimate in Lemma 2.2 we obtain
For \(\alpha \in {\mathcal {I}}_{k}^{-}\) and \({\mathfrak {D}}^\text{ h }(l)\) we get similar estimates. The commutator in \(I_8\) can be bounded exactly the same way, using \({\mathfrak {D}}(l)^*={\mathfrak {D}}(-l)\). Thus
Collecting all estimates for \(I_1,\ldots , I_8\) we conclude from (9.4) that
The bound (9.6) holds true for all \(k,l\in \Gamma ^{\text{ nor }}\). In particular, by the Cauchy–Schwarz inequality we deduce that
This concludes the proof of (9.2). \(\square \)
10 Diagonalization of approximately bosonic Hamiltonian
We now focus on the approximately bosonic Hamiltonian \({\mathbb {D}}_\text{ B }+ Q_\text{ B}^{{\mathcal {R}}}\), with \({\mathbb {D}}_\text{ B }\) and \(Q_\text{ B}^{{\mathcal {R}}}\) as defined in (8.1) and (4.8). With \(h_\text{ eff }(k)\) being the effective Hamiltonian introduced in (6.1), we can write
The main result of this section is the following lemma in which we approximately diagonalize the effective Hamiltonian, extract its ground state energy \(E_N^\text{ RPA }\), and then bound the excitation spectrum below by \({\mathbb {D}}_\text{ B }- C \Vert {\hat{V}}\Vert _{\ell ^1} {\mathbb {H}}_0\).
Lemma 10.1
(Diagonalization of Bosonized Hamiltonian) Let \(T_1\) be the approximate Bogoliubov transformation defined in (7.1). For all normalized \(\psi \in {\mathcal {F}}\) we have
where \(E_N^\text{ RPA }\) is the RPA correlation energy defined in (1.9).
The smallness assumption on \({\hat{V}}\) is important to control the term \(-C\Vert {\hat{V}}\Vert _{\ell ^1} {\mathbb {H}}_0\) appearing on the right hand side in this lemma.
Proof
Error Terms. Recall that K(k) is a real symmetric matrix, and hence also \(\sinh (K(k))\) and \(\cosh (K(k)\) are real symmetric matrices. Defining
we have, according to Lemma 7.1,
In the first step we are going to control the contribution of \({\mathfrak {E}}_\alpha (1,k)\). By (7.3) we have
and thus using Lemma 5.3 (to treat the contribution of \(\delta _{\alpha ,\beta }\), recall that \(\Vert c_\alpha (k) \psi \Vert \le \Vert {\mathcal {N}}_\delta ^{1/2}\psi \Vert \) for all \(\psi \in {\mathcal {F}}\) follows from the first bound in the lemma) we get
Now we expand
The main contribution is
To bound the error terms in (10.5), first observe that
Now using the Cauchy–Schwarz inequality together with (10.4) and Lemma 7.1 we get
In this calculation we used the crude bound \(\sum _{\alpha \in {\mathcal {I}}_{k}} \Vert {\mathfrak {E}}_\alpha (1,k) \psi \Vert ^2 \le \big (\sum _{\alpha \in {\mathcal {I}}_{k}} \Vert {\mathfrak {E}}_\alpha (1,k) \psi \Vert \big )^2\).
Bosonic Terms. Now we compute \(h_\text{ eff}^\text{ diag }(k)\) by inserting the transformation (10.2). In this step let us suppress the k-dependence in the notation. We have
where we used the approximate CCR (5.2) to achieve bosonic Wick-normal order. Moreover, using \([c_\alpha ^*,c_\beta ^*]=[c_\alpha ,c_\beta ]=0\) we symmetrize the coefficients of \(c_\alpha ^* c_\beta ^*\) and \(c_\alpha c_\beta \); thus
Moreover
Adding both terms, we thus have
To simplify (10.7) further, recall from (6.5) that
and set
We use the polar decomposition to write \(S_1^\intercal = O |S_1^\intercal |\) with an orthogonal matrix O. Inserting our choice \(K = \log |S_1^\intercal |\) into the exponentials defining the \(\cosh \) and the \(\sinh \), and noting that each exponential by itself is a symmetric matrix, we obtain
Note that since K is symmetric, it cannot equal the non-symmetric matrix \(\frac{1}{2}(S_1+S_2)\). Thus the inclusion of the unitary operator O in (10.8) is inevitable. However, O corresponds to a change of basis in the one-boson Hilbert space, which, at least in the bosonic approximation, does not change the energy of the many-body state.
Ground state energy. As in [23, Proof of Theorem 2.1 and Appendix A.2], with the matrix E defined in (6.4), we have
Using the fact that \(\cosh (K)\) and \(\sinh (K)\) are symmetric matrices, the constant term of (10.7) thus simplifies to
This is the term giving rise to the ground state energy we aim to derive.
To estimate the error term proportional to \({\mathcal {E}}_\alpha (k,k)\) in (10.7), note first that
From (5.5) one easily derives the bound
This bound in combination with (10.9) and the fact that, due to the equator cut-off \(|k\cdot {\hat{\omega }}_\alpha |\ge N^{-\delta }\) in the definition of the index set \({\mathcal {I}}_{k}\),
implies
Vanishing of the off-diagonal terms. Next we show that the terms in \(h_\text{ eff}^\text{ diag }(k)\) proportional to \((c_\alpha ^* c^*_\beta + c_\beta c_\alpha )\) vanish, as intended with the Bogoliubov transformation. Indeed
Summarizing we get the lower bound
with the matrixFootnote 9
Bogoliubov-diagonalized effective Hamiltonian. Summing (10.12) over \(k\in \Gamma ^{\text{ nor }}\) and including the pre-factors as in (10.1) (in particular an \(\hbar \)) we conclude that
The error terms on the last two lines of (10.13) come from (10.6) and (10.10). In [23, Eq. (5.15)] we already showed that the constant term on the right hand side of (10.13) gives rise to the correlation energyFootnote 10\(E_N^\text{ RPA }\) we aimed to derive,
This is the correlation energy as given in the main theorem in (1.9).
Controlling \(-{\mathbb {D}}_\text{ B }\) by the diagonalized effective Hamiltonian. To make use of the positive contribution of \(\sum _{\alpha ,\beta \in {\mathcal {I}}_{k}}{\mathfrak {K}}(k)_{\alpha ,\beta } c_\alpha ^*(k) c_\beta (k)\), it is convenient to subtract D(k). We can then expand \(\cosh (K(k)) = (\cosh (K(k))-{\mathbb {I}}) + {\mathbb {I}}\); the term in \({\mathfrak {K}}(k)\) where D(k) is multiplied from both sides by the identity cancels with the explicitly subtracted D(k). For the remaining terms we can use (7.3) and \(|W(k)_{\alpha ,\beta }|\le \frac{C}{M} u_\alpha (k) u_\beta (k)\) to get
Thus, by Lemma 2.2 and recalling \(u_\alpha (k) \le C M^{\frac{1}{2}} N^{-\frac{1}{3}} n_\alpha (k)\), we conclude that for all \(\xi \in {\mathcal {F}}\)
Therefore
Summing over \(k\in \Gamma ^{\text{ nor }}\) we conclude that
Inserting the last bound together with (10.14) in (10.13), the proof is complete. \(\square \)
11 Proof of the main result
Proof of Theorem 1.1
Recall the definition (1.20) of the correlation Hamiltonian \({\mathcal {H}}_\text{ corr }\),
Let \(\Psi \in {\mathcal {F}}\) be the approximate ground state constructed by the particle number localization, Lemma 3.1, from some exact ground state \(\psi _\text{ gs }\) of \({\mathcal {H}}_\text{ corr }\) (i. e., from a minimizer of the expectation value on the left hand side of (1.22)). By the localization we have
and
for all \(m\ge 0\). Here we have used \({\mathcal {N}}_\delta \le CN^{\frac{1}{3}+\delta } {\mathbb {H}}_0\) to estimate the gapped number operator.
Let \(T_1\) be the approximate Bogoliubov transformation defined in (7.1) with the Bogoliubov kernel K(k) from (6.6). Since \(T_1\) is unitary, we can define \(\psi \) by setting
From (11.1) and Lemma 7.2 we also have
and because of \(M \gg N^{2\delta }\) we have
Now we collect the main bounds and estimate the error terms using (11.1), (11.2), and (11.3). First, by Lemma 2.5 we can bound the quadratic operator \({\mathbb {X}}\) below by
By Lemma 4.1, we have
Next we can use Lemma 8.1 to deduce that
and, because of the estimate for the matrix elements of the Bogoliubov kernel derived in Lemma 6.1, Lemma 9.1 shows that
(Note that here, in the second summand on the second line, in \(\Vert {\mathbb {H}}_0^{1/2}\Psi \Vert \) we have a single instance of the approximate ground state vector \(\Psi \) and not \(\psi \), so that this norm can be estimated by (11.1).) Finally, by Lemma 10.1, the bosonized effective Hamiltonian \({\mathbb {D}}_\text{ B }+ Q_\text{ B}^{{\mathcal {R}}}\) yields
To conclude, we sum all error bounds from (11.4)–(11.8). The quantities
from the error terms suggest us to take
With this choice, collecting all of (11.4)–(11.8), we conclude that
The contribution of \(\big ( 1 - C \Vert {{\hat{V}}}\Vert _{\ell ^1} \big )\langle \psi , {\mathbb {H}}_0 \psi \rangle \) is non-negative thanks to the smallness assumption for the potential. Thus it remains the error of order \(\hbar N^{-\frac{1}{48}} = \hbar ^{1+\frac{1}{16}}\), which completes the proof of the main result. \(\square \)
Notes
In this paper we focus on a setting where the pairing density is not relevant. If the pairing density becomes important, one is lead to Hartree–Fock–Bogoliubov theory or the Bardeen–Cooper–Schrieffer (BCS) theory of superconductivity. Already the study of these quasi-free theories is very challenging. Recently the mathematical properties of BCS theory were extensively analyzed [51, 71], and the Ginzburg–Landau theory of superconductivity was derived from BCS theory [50].
Compared to the mass \(m=1\) in [23], we now choose \(m=1/2\).
This fact is special for the completely filled Fermi ball of a homogeneous gas in finite volume. In general, the plane waves state is not even a local minimum of the Hartree–Fock functional [61].
In [23] this operator was denoted by \({\tilde{b}}^*_k\).
The symbol \(\natural \) may stand both for “\(*\)” (adjoint in Fock space \({\mathcal {F}}\)) and for absence of “\(*\)”; we use it whenever the choice does not play a role.
Where confusion may arise, we use the notation \(p:p \in B_\text{ F}^c\cap B_\alpha ,\,p-k\in B_\text{ F }\cap B_\alpha \) in specifying the range of summation: here it is over all \(p \in {\mathbb {Z}}^3\) (but not over k) satisfying \(p \in B_\text{ F}^c\cap B_\alpha \) and \(p-k\in B_\text{ F }\cap B_\alpha \).
We use the notation \(\pm A \le B\) for two self-adjoint operators A and B to indicate that both \(A \le B\) and \(-A \le B\) hold.
A computation shows that \({\mathfrak {K}}(k)=O(k)^\intercal E(k) O(k)\) but we are not going to use this formula.
In \(E^\text{ RPA}_N\), one also has to replace \(\kappa \) by \(\kappa _0\); the error estimate is simple and subleading, see [23].
References
Antinucci, G., Mastropietro, V., Porta, M.: Universal edge transport in interacting Hall systems. Commun. Math. Phys. 362, 295–359 (2018)
Afchain, S., Magnen, J., Rivasseau, V.: Renormalization of the 2-point function of the Hubbard model at half-filling. Annales Henri Poincaré 6(3), 399–448 (2005)
Afchain, S., Magnen, J., Rivasseau, V.: The Hubbard model at half-filling, Part III: the lower bound on the self-energy. Annales Henri Poincaré 6(3), 449–483 (2005)
Arponen, J., Pajanne, E.: Theory of electron gas as a system of interacting collective excitations I. Boson formalism. Ann. Phys. 91(2), 450–480 (1975)
Bach, V.: Error bound for the Hartree-Fock energy of atoms and molecules. Commun. Math. Phys. 147(3), 527–548 (1992)
Bauerschmidt, R., Webb, C.: The Coleman correspondence at the free Fermion point. arXiv:2010.07096 [math-ph] (2020)
Boccato, C., Brennecke, C., Cenatiempo, S., Schlein, B.: The excitation spectrum of Bose gases interacting through singular potentials. J. Eur. Math. Soc. 22(7), 2331–2403 (2020)
Boccato, C., Brennecke, C., Cenatiempo, S., Schlein, B.: Complete Bose–Einstein condensation in the Gross–Pitaevskii regime. Commun. Math. Phys. 359(3), 975–1026 (2018)
Boccato, C., Brennecke, C., Cenatiempo, S., Schlein, B.: Bogoliubov theory in the Gross–Pitaevskii limit. Acta Mathematica 222(2), 219–335 (2019)
Boccato, C., Brennecke, C., Cenatiempo, S., Schlein, B.: Optimal rate for Bose–Einstein condensation in the Gross–Pitaevskii regime. Commun. Math. Phys. 376, 1311–1395 (2020)
Benedikter, N., de Oliveira, G., Schlein, B.: Quantitative derivation of the Gross–Pitaevskii equation. Commun. Pure Appl. Math. 68(8), 1399–1482 (2015)
Benedikter, N.: Interaction corrections to spin-wave theory in the large-S limit of the quantum Heisenberg ferromagnet. Math. Phys. Anal. Geom. 20(2), 5 (2017)
Benedikter, N.: Bosonic collective excitations in Fermi gases. Rev. Math. Phys. 32, 2060009 (2020). https://doi.org/10.1142/S0129055X20600090
Benfatto, G., Falco, P., Mastropietro, V.: Extended scaling relations for planar lattice models. Commun. Math. Phys. 292(2), 569–605 (2009)
Benfatto, G., Falco, P., Mastropietro, V.: Massless Sine-Gordon and massive thirring models: proof of Coleman’s equivalence. Commun. Math. Phys. 285(2), 713–762 (2009)
Benfatto, G., Gallavotti, G.: Perturbation theory of the fermi surface in a quantum liquid. A general quasiparticle formalism and one-dimensional systems. J. Stat. Phys. 59(3–4), 541–664 (1990)
Benfatto, G., Gallavotti, G., Procacci, A., Scoppola, B.: Beta function and Schwinger functions for a many fermions system in one dimension. Anomaly of the Fermi surface. Commun. Math. Phys. 160(1), 93–171 (1994)
Benfatto, G., Giuliani, A., Mastropietro, V.: Fermi liquid behavior in the 2D Hubbard model at low temperatures. Annales Henri Poincaré 7(5), 809–898 (2006)
Benedikter, N., Jakšić, V., Porta, M., Saffirio, C., Schlein, B.: Mean-field evolution of fermionic mixed states. Commun. Pure Appl. Math. 69(12), 2250–2303 (2016)
Benfatto, G., Mastropietro, V.: Renormalization group, hidden symmetries and approximate ward identities in the XYZ model. Rev. Math. Phys. 13(11), 1323–1435 (2001)
Benfatto, G., Mastropietro, V.: Ward Identities and Vanishing of the Beta Function for d = 1 Interacting Fermi Systems. J. Stat. Phys. 115(1), 143–184 (2004)
Benfatto, G., Mastropietro, V.: Drude weight in non solvable quantum spin chains. J. Stat. Phys. 143(2), 251–260 (2011)
Benedikter, N., Nam, P.T., Porta, M., Schlein, B., Seiringer, R.: Optimal upper bound for the correlation energy of a Fermi gas in the mean-field regime. Commun. Math. Phys. 374, 2097–2150 (2020)
Benedikter, N., Nam, P.T., Porta, M., Schlein, B., Seiringer, R.: Bosonization of Fermionic Many-Body Dynamics. arXiv:2103.08224 [math-ph] (2021)
Bogolubov, N.N.: On the theory of superfluidity. J. Phys. (USSR), 11, 23–32, 1947. (Izv. Akad. Nauk Ser. Fiz. 11, p.77, 1947)
Bohm, D., Pines, D.: A collective description of electron interactions: III. Coulomb interactions in a degenerate electron gas. Phys. Rev. 92(3), 609–625 (1953)
Benedikter, N., Porta, M., Schlein, B.: Hartree-Fock dynamics for weakly interacting fermions. In: Mathematical Results in Quantum Mechanics (Proceedings of the QMath12 Conference). World Scientific Publishing Company (2014)
Benedikter, N., Porta, M., Schlein, B.: Mean-field dynamics of fermions with relativistic dispersion. J. Math. Phys. 55(2), 021901 (2014)
Benedikter, N., Porta, M., Schlein, B.: Mean-field evolution of fermionic systems. Commun. Math. Phys. 331(3), 1087–1131 (2014)
Benedikter, N., Porta, M., Schlein, B.: Effective Evolution Equations from Quantum Dynamics. SpringerBriefs in Mathematical Physics. Springer International Publishing, Berlin (2016)
Brennecke, C., Schlein, B.: Gross–Pitaevskii dynamics for Bose–Einstein condensates. Anal. PDE 12(6), 1513–1596 (2019)
Benedikter, N., Sok, J., Solovej, J.P.: The Dirac–Frenkel principle for reduced density matrices, and the Bogoliubov–de Gennes equations. Annales Henri Poincaré 19(4), 1167–1214 (2018)
Castro Neto, A.H., Fradkin, E.: Bosonization of Fermi liquids. Phys. Rev. B 49(16), 10877–10892 (1994)
Castro Neto, A.H., Fradkin, E.: Bosonization of the low energy excitations of Fermi liquids. Phys. Rev. Lett. 72(10), 1393–1397 (1994)
Castro Neto, A.H., Fradkin, E.H.: Exact solution of the Landau fixed point via bosonization. Phys. Rev. B 51(7), 4084–4104 (1995)
Correggi, M., Giuliani, A.: The free energy of the quantum Heisenberg ferromagnet at large spin. J. Stat. Phys. 149(2), 234–245 (2012)
Correggi, M., Giuliani, A., Seiringer, R.: Validity of the spin-wave approximation for the free energy of the Heisenberg ferromagnet. Commun. Math. Phys. 339(1), 279–307 (2015)
Combescot, M., Shiau, S.-Y.: Excitons and Cooper Pairs: Two Composite Bosons in Many-Body Physics. Oxford University Press, Oxford (2016)
de Woul, J., Langmann, E.: Partially gapped Fermions in 2D. J. Stat. Phys. 139(6), 1033–1065 (2010)
de Woul, J., Langmann, E.: Exact solution of a 2D interacting fermion model. Commun. Math. Phys. 314(1), 1–56 (2012)
de Woul, J., Langmann, E.: Fermions in two dimensions, Bosonization, and exactly solvable models. Int. J. Mod. Phys. B 26(22), 1244005 (2012)
de Woul, J., Langmann, E.: Gauge invariance, correlated fermions, and photon mass in 2+1 dimensions. J. Stat. Phys. 154(3), 877–894 (2014)
Disertori, M., Magnen, J., Rivasseau, V.: Interacting fermi liquid in three dimensions at finite temperature: part I: convergent contributions. Annales Henri Poincaré 2(4), 733–806 (2001)
Disertori, M., Rivasseau, V.: Interacting Fermi liquid in two dimensions at finite temperature. Part I: convergent attributions. Commun. Math. Phys. 215(2), 251–290 (2000)
Disertori, M., Rivasseau, V.: Interacting Fermi liquid in two dimensions at finite temperature. Part II: renormalization. Commun. Math. Phys. 215(2), 291–341 (2000)
Erdős, L., Schlein, B., Yau, H.-T.: Rigorous derivation of the Gross–Pitaevskii equation with a large interaction potential. J. Am. Math. Soc. 22(4), 1099–1156 (2009)
Erdős, L., Schlein, B., Yau, H.-T.: Derivation of the Gross–Pitaevskii equation for the dynamics of Bose–Einstein condensate. Ann. Math. Second Ser. 172(1), 291–370 (2010)
Fröhlich, J., Götschmann, R.: Bosonization of Fermi liquids. Phys. Rev. B 55(11), 6788–6815 (1997)
Fröhlich, J., Götschmann, R., Marchetti, P.A.: The effective gauge field action of a system of non-relativistic electrons. Commun. Math. Phys. 173(2), 417–452 (1995)
Frank, R., Hainzl, C., Seiringer, R., Solovej, J.: Microscopic derivation of Ginzburg–Landau theory. J. Am. Math. Soc. 25(3), 667–713 (2012)
Frank, R.L., Hainzl, C., Seiringer, R., Solovej, J.P.: The external field dependence of the BCS critical temperature. Commun. Math. Phys. 342(1), 189–216 (2016)
Feldman, J., Knörrer, H., Trubowitz, E.: A two dimensional fermi liquid. Part 1: overview. Commun. Math. Phys. 247(1), 1–47 (2004)
Feldman, J., Magnen, J., Rivasseau, V., Trubowitz, E.: An infinite volume expansion for many Fermion Green’s functions. Helv. Phys. Acta 65(5), 679–721 (1992)
Feldman, J., Magnen, J., Rivasseau, V., Trubowitz, E.: A Rigorous analysis of the superconducting phase of an electron–phonon system. arXiv:cond-mat/9503047 (1995)
Fröhlich, J., Knowles, A., Schlein, B., Sohinger, V.: The mean-field limit of quantum Bose gases at positive temperature. arXiv:2001.01546 (2020)
Fefferman, C.L., Seco, L.A.: On the energy of a large atom. Bull. (New Series) Am. Math. Soc. 23(2), 525–530 (1990)
Fournais, S., Solovej, J.P.: The energy of dilute Bose gases. Ann. Math. 192(3), 893–976 (2020)
Fjærestad, J.O., Sudbø, A., Luther, A.: Correlation functions for a two-dimensional electron system with Bosonic interactions and a square Fermi surface. Phys. Rev. B 60(19), 13361–13370 (1999)
Gell-Mann, M., Brueckner, K.A.: Correlation energy of an electron gas at high density. Phys. Rev. 106(2), 364–368 (1957)
Gontier, D., Hainzl, C., Lewin, M.: Lower bound on the Hartree–Fock energy of the electron gas. Phys. Rev. A 99(5), 052501 (2019)
Gontier, D., Lewin, M.: Spin symmetry breaking in the translation-invariant Hartree-Fock Uniform Electron Gas. SIAM J. Math. Anal. 51(4), 3388–3423 (2019)
Giuliani, A., Mastropietro, V., Toninelli, F.: Non-integrable dimers: universal fluctuations of tilted height profiles. Commun. Math. Phys. 377(3), 1883–1959 (2020)
Graf, G.M., Solovej, J.P.: A correlation estimate with applications to quantum systems with coulomb interactions. Rev. Math. Phys. 06(05a), 977–997 (1994)
Grech, P., Seiringer, R.: The excitation spectrum for weakly interacting bosons in a trap. Commun. Math. Phys. 322(2), 559–591 (2013)
Hainzl, C.: One non-relativistic particle coupled to a photon field. Annales Henri Poincaré 4(2), 217–237 (2003)
Haldane, F.D.M.: General relation of correlation exponents and spectral properties of one-dimensional fermi systems: application to the anisotropic \(S=\frac{1}{2}\) Heisenberg chain. Phys. Rev. Lett. 45(16), 1358–1362 (1980)
Haldane, F.D.M.: ‘Luttinger liquid theory’ of one-dimensional quantum fluids. I. Properties of the Luttinger model and their extension to the general 1D interacting spinless Fermi gas. J. Phys. C Solid State Phys. 14(19), 2585–2609 (1981)
Haldane, F.D.M.: Luttinger’s theorem and bosonization of the fermi surface. In: Proceedings of the International School of Physics “Enrico Fermi”, Course CXXI: “Perspectives in Many-Particle Physics”, pp. 5–30. North Holland, Amsterdam (1994)
Heisenberg, W.: Zur Theorie der Supraleitung. Zeitschrift für Naturforschung A 2(4), 185–201 (1947)
Hainzl, C., Hirokawa, M., Spohn, H.: Binding energy for hydrogen-like atoms in the Nelson model without cutoffs. J. Funct. Anal. 220(2), 424–459 (2005)
Hainzl, C., Hamza, E., Seiringer, R., Solovej, J.P.: The BCS functional for general pair interactions. Commun. Math. Phys. 281(2), 349–367 (2008)
Houghton, A., Kwon, H.-J., Marston, J.B., Shankar, R.: Coulomb interaction and the Fermi liquid state: solution by bosonization. J. Phys. Condens. Matter 6(26), 4909 (1994)
Houghton, A., Marston, J.B.: Bosonization and fermion liquids in dimensions greater than one. Phys. Rev. B 48(11), 7790–7808 (1993)
Hainzl, C., Porta, M., Rexze, F.: On the correlation energy of interacting fermionic systems in the mean-field regime. Commun. Math. Phys. 374, 485–524 (2020)
Huxley, M.N.: Exponential sums and lattice points. III. Proc. Lond. Math. Soc. 87, 591–609 (2003)
Kopietz, P., Castilla, G.E.: Higher-dimensional Bosonization with nonlinear energy dispersion. Phys. Rev. Lett. 76(25), 4777–4780 (1996)
Kopietz, P., Hermisson, J., Schönhammer, K.: Bosonization of interacting fermions in arbitrary dimension beyond the Gaussian approximation. Phys. Rev. B 52(15), 10877–10896 (1995)
Khveshchenko, D.V.: Geometrical approach to bosonization of D \(>\)1 dimensional (non)-Fermi liquids. Phys. Rev. B 52(7), 4833–4841 (1995)
Kopietz, P.: Bosonization of Interacting Fermions in Arbitrary Dimensions. Lecture Notes in Physics Monographs. Springer, Berlin (1997)
Kopietz, P., Schönhammer, K.: Functional bosonization of interacting fermions in arbitrary dimensions. Zeitschrift für Physik B Condensed Matter 100(2), 259–262 (1996)
Langmann, E.: A 2D Luttinger model. J. Stat. Phys. 141(1), 17–52 (2010)
Langmann, E.: A two-dimensional analogue of the Luttinger model. Lett. Math. Phys. 92(2), 109–124 (2010)
Leopardi, P.: A partition of the unit sphere into regions of equal area and small diameter. Electron. Trans. Numer. Anal. 25, 309–327 (2006)
Lewin, M., Nam, P.T., Rougerie, N.: Classical field theory limit of many-body quantum Gibbs states in 2D and 3D. Inventiones mathematicae (2021). https://doi.org/10.1007/s00222-020-01010-4
Lieb, E.H., Seiringer, R.: Proof of Bose-Einstein condensation for dilute trapped gases. Phys. Rev. Lett. 88, 170409 (2002)
Lewin, M., Nam, P.T., Serfaty, S., Solovej, J.P.: Bogoliubov spectrum of interacting Bose gases. Commun. Pure Appl. Math. 68(3), 413–471 (2015)
Lieb, E.H., Solovej, J.P.: Ground state energy of the one-component charged Bose gas. Commun. Math. Phys. 217(1), 127–163 (2001)
Lieb, E.H., Seiringer, R., Yngvason, J.: Bosons in a trap: a rigorous derivation of the Gross–Pitaevskii energy functional. Phys. Rev. A 61(4), 043602 (2000)
Luther, A.: Tomonaga fermions and the Dirac equation in three dimensions. Phys. Rev. B 19(1), 320–330 (1979)
Macke, W.: Über die Wechselwirkungen im Fermi-Gas. Polarisationserscheinungen, Correlationsenergie, Elektronenkondensation. Zeitschrift für Naturforschung A 5(4), 192–208 (1950)
Mattis, D.C.: Implications of infrared instability in a two-dimensional electron gas. Phys. Rev. B 36(1), 745–747 (1987)
Mattis, D.C., Lieb, E.H.: Exact solution of a many-fermion system and its associated boson field. J. Math. Phys. 6(2), 304–312 (1965)
Mastropietro, V., Porta, M.: Spin hall insulators beyond the Helical Luttinger model. Phys. Rev. B 96(24), 245135 (2017)
Marshalek, E.R., Weneser, J.: Nuclear rotation and the random-phase approximation. Ann. Phys. 53(3), 569–624 (1969)
Nam, P.T., Napiórkowski, M., Ricaud, J., Triay, A.: Optimal rate of condensation for trapped bosons in the Gross–Pitaevskii regime. arXiv:2001.04364 (2020) to appear in Analysis & PDE
Rivasseau, V.: The two dimensional hubbard model at half-filling. I. Convergent contributions. J. Stat. Phys. 106(3), 693–722 (2002)
Ring, P., Schuck, P.: The Nuclear Many-Body Problem, Study Edition. Springer, Berlin (2005)
Sawada, K.: Correlation energy of an electron gas at high density. Phys. Rev. 106(2), 372–383 (1957)
Sawada, K., Brueckner, K.A., Fukuda, N., Brout, R.: Correlation energy of an electron gas at high density: plasma oscillations. Phys. Rev. 108(3), 507–514 (1957)
Seiringer, R.: The excitation spectrum for weakly interacting bosons. Commun. Math. Phys. 306(2), 565–578 (2011)
Simon, B.: Semiclassical analysis of low lying eigenvalues. I. Non-degenerate minima: asymptotic expansions. Annales de l’I.H.P. Physique théorique 38(3), 295–308 (1983)
Syljuåsen, O.F., Luther, A.: Adjacent face scattering and stability of the square Fermi surface. Phys. Rev. B 72(16), 165105 (2005)
Van der Corput, J.G.: Over Roosterpunten in het Platte Vlak. Noordhof, Groningen (1919)
Wigner, E.: On the interaction of electrons in metals. Phys. Rev. 46(11), 1002–1011 (1934)
Acknowledgements
We thank Christian Hainzl for helpful discussions and a referee for very careful reading of the paper and many helpful suggestions. NB and RS were supported by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. 694227). Part of the research of NB was conducted on the RZD18 Nice–Milan–Vienna–Moscow. NB thanks Elliott H. Lieb and Peter Otte for explanations about the Luttinger model. PTN has received funding from the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy (EXC-2111-390814868). MP acknowledges financial support from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (ERC StG MaMBoQ, grant agreement No. 802901). BS gratefully acknowledges financial support from the NCCR SwissMAP, from the Swiss National Science Foundation through the Grant “Dynamical and energetic properties of Bose-Einstein condensates” and from the European Research Council through the ERC-AdG CLaQS (grant agreement No. 834782). All authors acknowledge support for workshop participation from Mathematisches Forschungsinstitut Oberwolfach (Leibniz Association). NB, PTN, BS, and RS acknowledge support for workshop participation from Fondation des Treilles.
Funding
Open access funding provided by Institute of Science and Technology (IST Austria).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
A Hartree–Fock theory
In this appendix we consider the Hartree–Fock energy
with the Hamiltonian \(H_N\) in (1.2). We assume that \(N=|B_\text{ F }|\), namely the Fermi ball is completely filled. It is clear that if the interaction vanishes, i. e., \(V=0\), then the Slater determinant of plane waves \(\psi _\text{ pw }\) as in (1.5) is the unique minimizer for \(E_N^\text{ HF }\). However, it is less trivial that the Hartree–Fock minimizer is unchanged if the interaction is sufficiently weak.
Theorem A.1
Consider the Hamiltonian \(H_N\) in (1.2) with
Then the Slater determinant of plane waves \(\psi _\text{ pw }\) in (1.5) is the unique minimizer (up to a phase) for \(E_N^\text{ HF }\).
Note that Theorem A.1 does not require any specific choice of parameters \(\lambda \) and \(\hbar \). In our semiclassical mean-field scaling, \(\hbar = N^{-1/3}\) and \(\lambda =N^{-1}\), the condition \(0\le \lambda \Vert {\widehat{V}} \Vert _{\ell ^1} < \hbar ^2/2\) holds for N large provided that \({\widehat{V}} \in \ell ^1\).
We follow the argument in [60] where the Hartree–Fock energy of the electron gas is studied. The main difference is that in our finite volume setting, the spectral gap of the Laplacian is strong enough to dominate the interaction, ensuring the exact equality \(E_N^\text{ HF }=\langle \psi _\text{ pw }, H_N \psi _\text{ pw }\rangle \) instead of just an exponential closeness as in [60].
Proof
Let \(\Psi =\bigwedge _{j=1}^N u_j\) be a Slater determinant. Then a straightforward computation shows that
where \(\gamma =\sum _{j=1}^N |u_j\rangle \langle u_j|\) is the one-body density matrix of \(\Psi \), \(\gamma (x,y)=\sum _{j=1}^Nu_j(x) \overline{u_j(y)}\) and \(\rho _\gamma (x)=\gamma (x,x)\). Note that the one-body density matrix \(\gamma _\text{ pw }\) of the plane waves has the integral kernel
In the following we estimate the right side of (A.1) term by term.
Direct term. Since \(\rho _{\gamma _\text{ pw }}\) is constant, we have \({\widehat{\rho }}_{\gamma _\text{ pw }}(k)=0\) for all \(0\ne k \in {\mathbb {Z}}^3\). Moreover, \({\widehat{\rho }}_{\gamma }(0)= {\widehat{\rho }}_{\gamma _\text{ pw }}(0)=(2\pi )^{-3}N\). Therefore, using \({\widehat{V}} \ge 0\) we get
Exchange term. We decompose
The first part can be estimated as
For the second part of the exchange term, we write
where G is an operator on \(L^2({\mathbb {T}}^3)\) with kernel \(\gamma _\text{ pw }(y-x) V(y-x)\). Equivalently, G is the multiplication operator in Fourier space with
In particular \(G\ge 0\) and hence \({\text {tr}}( G (\gamma - \gamma _\text{ pw}))\in {\mathbb {R}}\). Thus the exchange term is bounded as
Inserting (A.2) and (A.3) in (A.1) we obtain
Kinetic term. Finally, we prove that if \(0\le \lambda \Vert {\widehat{V}}\Vert _{\ell ^1} \le \hbar ^2\), then
To see that, we proceed similarly to [60, Eq. (5)]. More precisely, let us find a multiplication operator \(A(k) \ge (\hbar ^2 -\lambda \Vert {\widehat{V}}\Vert _{\ell ^1} )/2\) on Fourier space such that
Since \(\gamma \) and \(\gamma _\text{ pw }\) are projections we can decompose
Hence, for any constant \(C_0\in {\mathbb {R}}\),
Here in the last equality we have used \({\text {tr}}(\gamma )= {\text {tr}}(\gamma _\text{ pw})=N\). Thus the desired equality (A.6) holds true if
which is equivalent to
The latter equality holds true when
provided that the constant \(C_0\) satisfies
Note that since the Fermi ball is completely filled we have the gap \(|k_2|^2- |k_1|^2 \ge 1\) for all \(k_1\in B_\text{ F }\) and \(k_2\in B_\text{ F}^c\) (since \(|k_2|^2 - |k_1|^2\) is positive and integer). Furthermore,
When \(0\le \lambda \Vert {\widehat{V}}\Vert _{\ell ^1} \le \hbar ^2\), we can choose
and obtain
The desired estimate (A.5) follows immediately.
Conclusion. Inserting (A.5) in (A.4) we find that
Hence, under the condition \(\lambda \Vert {\widehat{V}}\Vert _{\ell ^1}<\hbar ^2/2\), we conclude that \(\psi _\text{ pw }\) is the unique Hartree–Fock minimizer. \(\square \)
B Kinetic energy estimates
In this appendix we provide a simplified proof of Lemma 2.2, which was first established in [74, Lemma 4.7]. Like [74] we use the following special case of a result by [75].
Theorem B.1
(Integer points in ellipses) Let \(d_0 \in {\mathbb {N}}\). For every \(R>0\) consider the ellipse
Then the number of points in \(E(R) \cap {\mathbb {Z}}^3\) is \(|E(R)| + {\mathcal {O}}(R^{\gamma })\) for \(R\rightarrow \infty \), with any \(\gamma >131/208\). Here \(|E(R)|= \pi d_0^{-1/2} R^2\) is the area of E(R).
We do not need the full power of this theorem; for our purpose, any \(\gamma <1\) is sufficient. With exponent \(\gamma = 2/3\), it is a classic result due to Van der Corput’s thesis [103] from 1919.
Proof of Lemma 2.2
We start by proceeding as in [74, Lemma 4.7]. Using the Cauchy–Schwarz inequality we get
The second factor is bounded by the kinetic energy as claimed,
The hard part is to show that there exists some constant \(C > 0\) such that
which is equivalent to
Heuristically, one may understand this bound as follows: the sum is over lattice points in the grey area of Fig. 1; since \(|k |= {\mathcal {O}}(1)\) and the area of the Fermi surface is of order \(N^{\frac{2}{3}}\), the number of these points is also of order \(N^{\frac{2}{3}}\). Since \(|p|\sim N^{\frac{1}{3}}\) (p has to be close to the Fermi surface) and \(|k|\sim 1\), we expect that in average \(|p|^2 - |p-k|^2 = 2 p \cdot k - k^2 \sim N^{\frac{1}{3}}\), leading us to the order \(N^{\frac{1}{3}}\) in (B.1).
Strictly speaking, \(|p|^2 - |p-k|^2\) can be much smaller than the average size. Actually \(|p|^2 - |p-k|^2\) can be of order O(1) if both p and \(p-k\) are very close to the Fermi surface. Fortunately, integer points very close to the surface of the sphere are quite rare and the contribution from this part can be controlled by number theoretic results.
This argument was formulated rigorously in [74] but the detailed proof is rather technical. Here we present a simplified proof of (B.1) for the reader’s convenience.
Note that for every \(p\in B_\text{ F}^c\cap (B_\text{ F }+k)\) we have
The lower bound 1 follows from the fact that \(|p| >|p-k|\) and that \(p,p-k\in {\mathbb {Z}}^3\). Hence,
Moreover, when \(p\in B_\text{ F}^c\cap (B_\text{ F }+k)\) we have \(|p|\le CN^{\frac{1}{3}}\), and hence \(|p\cdot k|\le CN^{\frac{1}{3}}\). Recall also that \(p,k \in {\mathbb {Z}}^3\), so that \(p\cdot k\in {\mathbb {Z}}\). Thus we get
where
To count \(|B_s|\) we use Theorem B.1; we are going to show that for any \(1>\gamma > 131/208\) we have
Easy case. Assume that \(k=(k_1,0,0)\) with \(k_1\ne 0\). Consider \(p=(p_1,p_2,p_3)\in B_s\). Then \(p_1=(p\cdot k)/k_1 = s/k_1\) and
Thus, considered in the plane \({\mathbb {R}}^2\), we have \((p_2,p_3)\in B(0,R_2)\backslash B(0,R_1)\) with the radii of the two centered balls being
Note that
Thus \(|B_s|\) is bounded by the number of integer points in the annulus \(B(0,R_2)\backslash B(0,R_1)\), which according to Theorem B.1 is bounded by
General case. If k is aligned with one of the basis vectors \((1,0,0), (0,1,0), (0,0,1) \in {\mathbb {R}}^3\), then we can proceed as above. Otherwise, we can assume \(k=(k_1,k_2,k_3)\) with \(k_1,k_2\ne 0\). Consider the set of orthogonal vectors in \({\mathbb {R}}^3\) given by
Every \(p\in {\mathbb {Z}}^3\) is determined uniquely by \((n_1,n_2,n_3)\in {\mathbb {Z}}^3\) with
in fact
Then, using
and the identity \(|k_\bot '|=|k| |k_\bot |\) we find that
Consequently, if \(p\in B_s\), then \(n_1=k\cdot p = s\) and
Thus \((n_2,n_3)\in E(R_2)\backslash E(R_1)\) where E(R) is the ellipse
and
Note that \(R_1<R_2 \le C N^{\frac{1}{3}}\) and \(R_2^2 - R_1^2 \le C s\). Thus \(|B_s|\) is bounded by the number of integer points in \(E(R_2)\backslash E(R_1)\), which is bounded by Theorem B.1 (with \(d_0= |k|^2\)) by:
Thus in conclusion, we have proved that (B.3) holds for every \(0\ne k\in {\mathbb {Z}}^3\). Inserting (B.3) in (B.2) we conclude that
Since \(\gamma <1\), this implies (B.1). \(\square \)
The above proof can be adapted easily to give Lemma 4.2.
Proof of Lemma 4.2
As in the proof of [74, Lemma 4.7], by Cauchy–Schwarz we get
In the second factor we simply drop the constraint \(e(p)+e(p-k) \le 4 N^{-\frac{1}{3} -\delta }\) and bound it by the kinetic energy \({\mathbb {H}}_0\) exactly as in the proof of Lemma 2.2. It remains to prove
We proceed exactly as in the proof of (B.1). The only difference is that the condition \(4 N^{\frac{1}{3} -\delta } \ge p^2 - (p-k)^2 = 2 p\cdot k -k^2 \ge 1\) implies that \(|p\cdot k|\le C N^{\frac{1}{3} -\delta }\). Hence, using (B.3) we have
Since the latter bound holds true for every \(\gamma >131/208\), we conclude that (B.4) holds true for every \(\delta < 77/624\). This completes the proof of Lemma 4.2. \(\square \)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Benedikter, N., Nam, P.T., Porta, M. et al. Correlation energy of a weakly interacting Fermi gas. Invent. math. 225, 885–979 (2021). https://doi.org/10.1007/s00222-021-01041-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00222-021-01041-5