Abstract
In a posteriori error analysis, the relationship between error and estimator is usually spoiled by so-called oscillation terms, which cannot be bounded by the error. In order to remedy, we devise a new approach where the oscillation has the following two properties. First, it is dominated by the error, irrespective of mesh fineness and the regularity of data and the exact solution. Second, it captures in terms of data the part of the residual that, in general, cannot be quantified with finite information. The new twist in our approach is a locally stable projection onto discretized residuals.
Similar content being viewed by others
1 Introduction
Finite element methods are a successful and well-established technique for the solution of partial differential equations. A key tool for the quality assessment of a given finite element approximation and the application of adaptive techniques are so-called a posteriori error estimators. These are functionals that are computable in terms of data and the finite element approximation and aim at quantifying the approximation error. For all known estimators, their actual relationship to the error is spoiled by oscillation, i.e., by some additive terms measuring distances between non-discrete and discrete data. Remarkably, oscillation may be even greater than the error. This flaw directly interferes with the quality assessment and, on top of that, it weakens results on adaptive methods and complicates their proofs.
In this article we introduce a new approach to a posteriori error estimation, where oscillation is error-dominated, i.e. it is bounded by the error of the finite element approximation, up to a multiplicative constant depending on the shape-regularity of the underlying mesh.
We illustrate this new approach in the simplest case, where the weak solution \(u\in H^1_0(\Omega )\) of the Dirichlet-Poisson problem
is approximated by the Galerkin approximation U that is continuous and piecewise affine over some simplicial mesh \({\mathcal {M}}\) of \(\Omega \). It is instructive to start by recalling the a posteriori error bounds in terms of the standard residual estimator
see, e.g., Ainsworth and Oden [2] or Verfürth [26]. If \(f\in L^2(\Omega )\), then the energy norm error \(\Vert u-U\Vert _{H^1_0(\Omega )}\) and the estimator are almost equivalent. More precisely, we have
where the interfering oscillation is given by
Let us discuss the relationship of this classical \(L^2\)-oscillation and the energy norm error; for the proofs of the nontrival statements, see Sect. 3.8. Customarily, oscillation is associated with higher order. This idea is supported by the following observation: if f is actually in \(H^1(\Omega )\), then \({{\,\mathrm{osc}\,}}_0(f,{\mathcal {M}})=O(h_{\mathcal {M}}^2)\) as \(h_{\mathcal {M}}\mathrel {:=}\max _{K \in {\mathcal {M}}} h_K \searrow 0\).
On any fixed mesh however, the oscillation \({{\,\mathrm{osc}\,}}_0(f,{\mathcal {M}})\) may be arbitrarily greater than the energy norm error \(\Vert u-U\Vert _{H^1_0(\Omega )}\). This is a consequence of the fact that the \(L^2\)-norm is strictly stronger than the \(H^{-1}\)-norm. The use of the \(L^2\)-norm in (1.4) can be traced back to its use in the element residual \(h_K \Vert f\Vert _{L^2(K)}\) in (1.2) and so it can be motivated by the request for the computability of the estimator. In fact, in contrast to an element residual based upon some local \(H^{-1}\)-norm of f, this form reduces to the (approximate) computation of an integral.
One may think that the use of the \(L^2\)-norm is the only reason for the possible relative largeness of oscillations like \({{\,\mathrm{osc}\,}}_0(f,{\mathcal {M}})\). Yet, Cohen, DeVore and Nochetto present in [10] a striking example which entails that even the \(H^{-1}\)-oscillation
from Stevenson [23] may converge slower than the error; see Lemma 21 below. Notice that this contradicts the aforementioned idea that \({{\,\mathrm{osc}\,}}_0(f,{\mathcal {M}})\) is always of higher order and, moreover, in view of \({{\,\mathrm{osc}\,}}_0(f,{\mathcal {M}})\lesssim {\mathcal {E}}_{\mathrm {R}}(U,f,{\mathcal {M}})\), it entails that also the estimator \({\mathcal {E}}_{\mathrm {R}}(U,f,{\mathcal {M}})\) may decrease slightly slower than the error.
The key tool to overcome the shortcomings of the above oscillations is a new projection operator \({\mathcal {P}}_{{\mathcal {M}}}\) enjoying the following properties; see Sects. 3.3–3.6:
-
\({\mathcal {P}}_{{\mathcal {M}}}f\) is discrete for any functional \(f\in H^{-1}(\Omega )\). In comparison to \(P_{0,{\mathcal {M}}}\), the image of \({\mathcal {P}}_{{\mathcal {M}}}\) is enriched by the span of the face-supported Dirac distributions and so contains true functionals.
-
\({\mathcal {P}}_{{\mathcal {M}}}f\) is computable in a local manner. Here computable means that it can be determined from the information available in the linear systems for finite element approximations.
-
The local dual norms of the new oscillation \(f-{\mathcal {P}}_{{\mathcal {M}}}f\) are dominated by corresponding local errors. This property hinges on the face-supported Dirac distributions and on local \(H^{-1}\)-stability of \({\mathcal {P}}_{{\mathcal {M}}}f\).
-
In contrast to the local dual norms of the residual \(f+\Delta U\), the local dual norms of the discretized residual \({\mathcal {P}}_{{\mathcal {M}}}{f}+\Delta U\) can be estimated from below and above in a computable manner.
Thanks to these properties, we derive in Sect. 3.7 abstract a posteriori bounds such that the oscillation is bounded by the error. In Sect. 4 we provide several realizations leading to hierarchical estimators and estimators based on local problems or based on equilibrated fluxes. Furthermore, in Sect. 4.2 we show that an extension of the standard residual estimator (1.2) onto the image of \({\mathcal {P}}_{{\mathcal {M}}}\) satisfies
where \({\mathcal {V}}\) stands for the set of vertices of \({\mathcal {M}}\) and \(\omega _z\) is the star around the vertex z. A comparison with (1.3) immediately yields:
-
Both \({\mathcal {E}}_{\mathrm {R}}(U,f,{\mathcal {M}})\) and the right-hand side of (1.6) bound the energy norm error in terms of U, f, and \({\mathcal {M}}\). However, while the latter one is free of overestimation, the first one may overestimate, even asymptotically.
-
Since \({\mathcal {P}}_{{\mathcal {M}}}{f}\) is discrete and computable in the aforementioned sense, we have that \({\mathcal {E}}_{\mathrm {R}}(U,{\mathcal {P}}_{{\mathcal {M}}}{f},{\mathcal {M}})\) is also computable, while \({\mathcal {E}}_{\mathrm {R}}(U,f,{\mathcal {M}})\) is not.
-
Equivalence (1.6) thus splits the estimation of the error in two parts, reflecting the spirit of Verfürth [26, Remark 1.8] and Ainsworth [1, Section 3.1]: One part is computable and related to the underlying differential operator. The other one depends solely on data; its computation, or rather estimation, hinges on a priori knowledge.
We remark that our new approach also has consequences in the convergence analysis of adaptive methods. In particular, it allows to generalize and sharpen the basic convergence for adaptive methods from [19]; see [15].
2 Model problem and discretization
In order to exemplify our new approach to a posteriori error estimation, we consider the homogeneous Dirichlet problem for Poisson’s equation and the energy norm error of the associated linear finite element solution. The purpose of this section is to recall the relevant properties of this boundary value problem and discretization.
We shall use the following notation associated with a (Lebesgue) measurable set \(\omega \) of \({\mathbb {R}}^d\), \(d\in {\mathbb {N}}\). Given \(m\in {\mathbb {N}}\), we let \(L^2(\omega ;{\mathbb {R}}^m)\) denote the Lebesgue space of square integrable functions over \(\omega \) with values in \({\mathbb {R}}^m\). We write \(\left\langle v,\,w\right\rangle _{\omega }\) and \(\Vert \cdot \Vert _{\omega }^2\) for its scalar product and its induced norm. For \(m=1\), we abbreviate \(L^2(\omega ;{\mathbb {R}})\) to \(L^2(\omega )\).
If \(\omega \subset {\mathbb {R}}^d\) is non-empty and open, \(H^1(\omega )\) stands for the Sobolev space of all functions in \(L^2(\omega )\) whose distributional gradient is also in \(L^2(\omega ;{\mathbb {R}}^d)\). Moreover, we let \(H^1_0(\omega )\) be the closure in \(H^1(\omega )\) of all infinitely differentiable function with compact support in \(\omega \). If the boundary \(\partial \omega \) of \(\omega \) is sufficiently regular (e.g., Lipschitz), this are all functions in \(H^1(\omega )\) with vanishing trace on the boundary \(\partial \omega \). Thanks to Friedrichs’ inequality, \(H^1_0(\omega )\) is a Hilbert space with scalar product \(\left\langle \nabla \cdot ,\,\nabla \cdot \right\rangle _{\omega }\) and norm \(\Vert \nabla \cdot \Vert _{\omega }\). As usual, \(H^{-1}(\omega )\) indicates the dual space of \(H^1_0(\omega )\), i.e. the space of linear and continuous functionals on \(H^1_0(\omega )\). We identify \(L^2(\omega )\) with its dual space and thus have
The norm of \(H^{-1}(\omega )\) is given by
where the dual brackets \(\left\langle \ell ,\,w\right\rangle _\omega := \ell (w)\), \(w\in H^1_0(\omega )\), extend-restrict the scalar product in \(L^2(\omega )\). If \(D\subset {\mathbb {R}}^d\) is a set such that \(\mathring{D}\) is suitable for one of the preceding notations, we also use D instead of the more cumbersome \(\mathring{D}\), e.g. we write also \(H^1(D)\) instead of \(H^1(\mathring{D})\).
Let \(\Omega \) be an open, bounded and connected subset of \({\mathbb {R}}^d\) with Lipschitz boundary and whose closure can be subdivided into simplices. We shall omit \(\Omega \) in the notation of dual pairings and norms. The weak formulation of (1.1) reads as follows:
In other words: we are looking for the Riesz representation of \(f\) in \(H^1_0(\Omega )\). Notice that the Riesz representation theorem establishes an isomorphism between the space \(H^1_0(\Omega )\) of solutions and the space \(H^{-1}(\Omega )\) of loads. In particular, a unique solution exists not only for \(f\in L^2(\Omega )\) but for all \(f\in H^{-1}(\Omega )\). This fact suggests that, at least conceptually, an approximation method for (2.2), along with its a posteriori analysis, should cover also loads in \(H^{-1}(\Omega )\).
In order to approximate the solution of (2.2), we use a Galerkin approximation based upon finite elements. For the sake of simplicity, we restrict ourselves to simplicial meshes and lowest order.
Let \({\mathcal {M}}\) be a simplicial, face-to-face (conforming) mesh of the domain \(\Omega \). Given an element \(K \in {\mathcal {M}}\), we denote by \(h_K:= {{\,\mathrm{diam}\,}}K:= \sup _{x,y\in K} |x-y|\) its diameter and by \(\rho _K:= \sup \{ {{\,\mathrm{diam}\,}}B \mid B \text { ball in }K \}\) the maximal diameter of inscribed balls. In what follows, ‘\(\lesssim \)’ stands for ‘\(\leqslant C\)’, where the generic constant C may depend on d and the shape coefficient
In the case of both inequalities ‘\(\lesssim \)’ and ‘\( > rsim \)’, we shall use ‘\(\simeq \)’ as shorthand.
An interelement face of \({\mathcal {M}}\) is a simplex \(F \) with d vertices arising as the intersection \(F =K _1\cap K _2\) of two uniquely determined elements \(K _1,K _2\in {\mathcal {M}}\). Its associated patch is
We let \({\mathcal {F}}={\mathcal {F}}({\mathcal {M}})\) denote the set of all \((d-1)\)-dimensional interelement faces of \({\mathcal {M}}\). Given \(F \in {\mathcal {F}}\) and \(K \in {\mathcal {M}}\) with \(F \subset K \), we write
for the height of \(K \) over \(F \).
Furthermore, \({\mathcal {V}}={\mathcal {V}}({\mathcal {M}})\) stands for the set of all vertices of \({\mathcal {M}}\). To any vertex \(z\in {\mathcal {V}}\), we associate the sets
for which we have
If \(K \in {\mathcal {M}}\) with \(K \subset \omega _z\) for some \(z\in {\mathcal {V}}\), then the diameter \(h_z\) of \(\omega _z\) verifies
Moreover, if e is a direction, i.e. \(e\in {\mathbb {R}}^d\) with \(|e|=1\), we write \(h_{z;e}\) for the maximal length of a line segment in \(\omega _z\) with direction e. Then
verifies
whenever \(K \in {\mathcal {M}}\) with \(K \subset \omega _z\).
Let \({\mathbb {P}}_k\) be the space of polynomials of degree at most \(k\in {\mathbb {N}}\) over \({\mathbb {R}}^d\) and let
be its piecewise counterpart over \({\mathcal {M}}\). The space of continuous, piecewise affine functions over \({\mathcal {M}}\) is then
Its nodal basis \(\{\phi _z\}_{z\in {\mathcal {V}}}\) is defined by
This basis provides the nodal value representation
for any \(V\in {\mathbb {V}}({\mathcal {M}})\) and the partition of unity
where, for each vertex \(z\in {\mathcal {V}}\), we have \({{\,\mathrm{supp}\,}}\phi _z = \omega _z\), with skeleton \(\sigma _z\). Finally, we recall that, for any element \(K \in {\mathcal {M}}\) and any powers \(\alpha _z\in {\mathbb {N}}_0\), \(z\in {\mathcal {V}}\cap K \), we have
The finite element functions satisfying the boundary condition in (2.2) form the space
The associated Galerkin approximation \(U=U_{f;{\mathcal {M}}}\) is characterized by
Notice that the right-hand side and so U are well-defined, also for \(f\in H^{-1}(\Omega )\), thanks to the conformity of \({\mathbb {V}}_0({\mathcal {M}})\). Céa’s lemma states that the Galerkin approximation is the best approximation with respect to the energy norm error, i.e.,
In order to determine the Galerkin approximation U, one usually obtains its values at the interior vertices \({\mathcal {V}}_0 := {\mathcal {V}}\cap \Omega \) by solving the symmetric positive definite linear system
where
We thus see that the Galerkin approximation U is computable whenever the load evaluations
Strictly speaking, these evaluations are in general not computable. In fact, even if \(f\in L^2(\Omega )\) is a function, the evaluation of \(\left\langle f,\,\phi _y\right\rangle = \int _{\Omega } f\phi _y\) requires the computation of an integral, which in general can be done only approximately by means of numerical integration. Notwithstanding, error analyses of approximations like (2.11) have proved very useful for the theoretical understanding and underpinning of finite element methods and are therefore very common. Accordingly, we shall suppose that the evaluations (2.14) are known to us. In Sect. 3.6 below, we will discuss which kind of additional information is used in our a posteriori analysis.
3 A posteriori analysis with error-dominated oscillation
We present our new approach to a posteriori error analysis by deriving bounds for the energy norm error of the Galerkin approximation (2.11). The new idea for achieving error-dominated oscillation is described in Sect. 3.3.
3.1 Residual norms
Given some load \(f\in H^{-1}(\Omega )\) and a Galerkin approximation \(U_{f;{\mathcal {M}}}\), we want to quantify the energy norm error \(\Vert \nabla (u_f-U_{f;{\mathcal {M}}})\Vert _{}\), where the exact solution \(u_f\) of (2.2) is typically unknown to us.
Our starting point is the so-called residual \({{\,\mathrm{Res}\,}}(f;{\mathcal {M}})\in H^{-1}(\Omega )\) given by
It is defined in terms of data and the computable Galerkin approximation and vanishes if and only if the latter equals the exact solution. The following lemma shows that appropriately measuring the size of the residual relates to the error.
Lemma 1
(Error, residual and load) We have
Proof
Thanks to the differential equation in (2.2), we have, for all \(v\in H^1_0(\Omega )\),
where \(-\Delta \) indicates the distributional Laplacian. Consequently, the claimed equality follows from the fact that \(-\Delta :H^1_0(\Omega ) \rightarrow H^{-1}(\Omega )\) is an isometry (which follows from the Cauchy–Schwarz inequality in \(L^2(\Omega )\) and from testing with \(v=u_f-U_{f;{\mathcal {M}}}\)). The claimed inequality follows by invoking also (2.12):
\(\square \)
Thus, we aim now at quantifying the dual norm \(\Vert {{\,\mathrm{Res}\,}}(f;{\mathcal {M}})\Vert _{H^{-1}(\Omega )}\). The following simple observation shows that this task requires much more information than computing the Galerkin approximation.
Lemma 2
(Bounding residual norms) Without any a priori information on the load \(f\in H^{-1}(\Omega )\), the residual norm \(\Vert {{\,\mathrm{Res}\,}}(f;{\mathcal {M}})\Vert _{H^{-1}(\Omega )}\) cannot be bounded in terms of a finite number of adaptive evaluations of the form: \(\left\langle f,\,v\right\rangle \) with \(v\in H^1_0(\Omega )\).
Proof
Suppose that the claim is false. Then, for each \(f\in H^{-1}(\Omega )\), there is a number \(B(f) \geqslant \Vert {{\,\mathrm{Res}\,}}(f;{\mathcal {M}})\Vert _{H^{-1}(\Omega )}\) which is given in terms of evaluations \(\left\langle f,\,v_i\right\rangle \), \(i=1,\dots ,n_f\), where the choice of \(v_i\) may depend deterministically on the previous evalutations \(\left\langle f,\,v_1\right\rangle , \dots , \left\langle f,\,v_{i-1}\right\rangle \). Fix some functional \(0\ne \ell \in H^{-1}(\Omega )\). Since \(H^1_0(\Omega )\) is infinite-dimensional, we can choose a normalized \(w \in H^1_0(\Omega )\) that is perpendicular to \({\mathbb {V}}_0({\mathcal {M}})\) and all test functions \(v_i\), \(i=1,\dots ,n_{\ell }\) associated with \(\ell \). Set \(\delta := 3B(\ell ) (-\Delta )w\) and observe that \(U_{\delta ;{\mathcal {M}}}=0\) and \(\left\langle \delta ,\,v_i\right\rangle =0\) for all \(i=1,\dots ,n_{\ell }\). Therefore \(\left\langle \ell +\delta ,\,v_i\right\rangle =\left\langle \ell ,\,v_i\right\rangle \) and we obtain the contradiction
\(\square \)
Remark 3
(Load evaluations vs exact integrals) A similar yet simpler argument shows that, without any a priori information on \(f \in L^2(\Omega )\), also \(\Vert f\Vert _{}\) cannot be bounded in terms of adaptive evaluations \(\int _{\Omega } fv\) with \(v\in L^2(\Omega )\).
Before discussing in Sect. 3.3 repercussions of Lemma 2, it is useful to take into account a further requirement for a posteriori bounds.
3.2 Localized residual norm
Adaptive mesh refinement is an important application of a posteriori bounds. It is usually based upon the comparison of ‘local’ quantities. Therefore, it is of interest to split a posteriori bounds, or the residual norm itself, into local contributions.
Such a localization appears implicitly, e.g., in the a posteriori error analysis of Babuška and Miller [3]. It is based upon the \(W^{1,\infty }\)-partition of unity (2.9) and the orthogonality property:
We thus introduce the subclass
of residuals associated with Galerkin approximations. Recall that \({{\,\mathrm{supp}\,}}\phi _z = \omega _z\) and that \(H^{-1}(\omega _z)\) is a shorthand for \(H^{-1}(\mathring{\omega }_z)\).
Lemma 4
(Localization)
Let \(\ell \in H^{-1}(\Omega )\) be any functional.
-
(i)
If \(\ell \in {\mathcal {R}}_{\mathcal {M}}\), then
$$\begin{aligned} \Vert \ell \Vert _{H^{-1}(\Omega )}^2 \lesssim \sum _{z\in {\mathcal {V}}} \Vert \ell \Vert _{H^{-1}(\omega _z)}^2, \end{aligned}$$where the hidden constant depends only on d and the shape coefficient \(\sigma ({\mathcal {M}})\).
-
(ii)
We have
$$\begin{aligned} \sum _{z\in {\mathcal {V}}} \Vert \ell \Vert _{H^{-1}(\omega _z)}^2 \leqslant (d+1) \Vert \ell \Vert _{H^{-1}(\Omega )}^2. \end{aligned}$$
Proof
See also Cohen, DeVore, and Nochetto [10, §3.2 and §3.4], Ern and Guermond [11, Proposition 31.7] or Blechta, Málek, and Vohralík [5, Theorem 3.7]. For the sake of completeness, we provide details. In order to show (i), we fix an arbitrary \(v \in H^1_0(\Omega )\). In view of the partition of unity (2.9) and \(\ell \in {\mathcal {R}}_{\mathcal {M}}\), we can write
where the reals \(c_z\in {\mathbb {R}}\) are given by
Thanks to \(0\leqslant \phi _z\leqslant 1\), the inverse estimate \(\Vert \nabla \phi _z\Vert _{L^\infty (\omega _z)} \leqslant \max _{K \subset \omega _z} \rho _{K}^{-1} \lesssim h_z^{-1}\) and the Poincaré-Friedrichs inequality \(\Vert v-c_z\Vert _{\omega _z} \lesssim h_z \Vert \nabla v\Vert _{\omega _z}\) (see, e.g., Nochetto and Veeser [21, Lemma 4]), we have, for any \(z\in {\mathcal {V}}\),
where the constant \(C_{\sigma ({\mathcal {M}})}\) depends only on \(\sigma ({\mathcal {M}})\). Thus, (3.2) leads to
and the proof of (i) is finished.
To prove (ii), we let \(v_z \in H^1_0(\omega _z)\) with \(\Vert \nabla v_z\Vert _{\omega _z}\leqslant 1\) for any node \(z \in {\mathcal {V}}\) and set \(v=\sum _{z\in {\mathcal {V}}} \left\langle \ell ,\,v_z\right\rangle v_z \in H^1_0(\Omega )\). Then
and, with the help of two Cauchy–Schwarz inequalities,
Consequently, we conclude (ii) by taking the suprema over all \(v_z\) for all \(z\in {\mathcal {V}}\). \(\square \)
Thus, in the context of adaptive mesh refinement, we are also interested in quantifying the single terms of the localized residual norm
Of course, we face the same problem for the local residual norms as for the global one.
Corollary 5
(Bounding local residual norms) Without any a priori information on \(f\in H^{-1}(\Omega )\), each local residual norm \(\Vert {{\,\mathrm{Res}\,}}(f,{\mathcal {M}})\Vert _{H^{-1}(\omega _z)}\), \(z\in {\mathcal {V}}\), cannot be bounded in terms of a finite number of adaptive evaluations of \(f\).
Proof
Replace the domain \(\Omega \) by \(\omega _z\) in the proof of Lemma 2 and extend functionals in \(H^{-1}(\omega _z)\) by 0 on the orthogonal complement of \(H^1_0(\omega _z)\) in \(H^1_0(\Omega )\). \(\square \)
3.3 Towards error-dominated oscillation
Our approach to error-dominated oscillation relies on a projection operator. This section motivates and formulates its key properties. They are summarized in (3.14) and will guide the actual construction in the following sections.
In view of Lemma 2 and Corollary 5, a posteriori bounds for the residual norm or its localized variant require knowledge on the load \(f\) beyond a finite number of evaluations. The actual knowledge of \(f\) can be of different nature and, accordingly, may require different techniques. Here we want to address only aspects of a posteriori error estimation that are independent of the nature of this knowledge. Correspondingly, we split the residual into a discretized residual and data approximation:
where \({\mathcal {P}}_{{\mathcal {M}}}\) maps onto a subspace \(\underline{{\mathbb {D}}}({\mathcal {M}})\) of \(H^{-1}(\Omega )\) such that
-
\(\Vert {\mathcal {P}}_{{\mathcal {M}}}f+\Delta U_{f;{\mathcal {M}}}\Vert _{H^{-1}({\mathcal {M}})}\) can be bounded with the help of a finite number of evaluations of \(f\) and
-
the task of bounding \(\Vert f- {\mathcal {P}}_{{\mathcal {M}}}f\Vert _{H^{-1}({\mathcal {M}})}\) hinges only on knowledge of the load \(f\); this task may be viewed as a matter of approximation theory since, apart from the choice of the norm, it is independent of the boundary value problem (2.2).
Here we have used the localized dual norm \(\Vert \cdot \Vert _{H^{-1}({\mathcal {M}})}\) in order to allow for applications in mesh adaptivity. It is then desirable that both parts are dominated by the error, i. e., we have
In view of Lemmas 1 and 4, the two conditions are equivalent.
The construction of a suitable mapping \({\mathcal {P}}_{{\mathcal {M}}}\) is the new twist in our approach. In order to get first hints on this, let us test out several candidates with necessary conditions arising from (3.6b).
The proof of Corollary 5 suggests that the problem lies in the fact that f is taken from an infinite-dimensional space. The projection \({\mathcal {P}}_{0,{\mathcal {M}}}\) into discrete data from (1.4) is thus a candidate for \({\mathcal {P}}_{{\mathcal {M}}}\). This choice, however, does not verify (3.6). In fact, Lemma 1, Lemma 4 (ii), and (3.6b) imply the stability estimate
while \({\mathcal {P}}_{0,{\mathcal {M}}}f\) is not even defined for a general \(f\in H^{-1}(\Omega )\) (and cannot be continuously extended; cf. Lemma 20).
This flaw is easily remedied. For any element \(K \in {\mathcal {M}}\), we replace in (1.4) the characteristic function \(\chi _K \) of \(K \) by the weighted mean
thanks to (2.10) and consider
Since \(\psi _K \in H^1_0(K)\subset H^1_0(\Omega )\) is an admissible test function, the operator \(\tilde{{\mathcal {P}}}_{0,{\mathcal {M}}}\) is defined for all functionals in \(H^{-1}(\Omega )\) and satisfies the stability estimate (3.7); see Remark 11 below.
But still, the new operator \(\tilde{{\mathcal {P}}}_{0,{\mathcal {M}}}\) does not verify (3.6). To see this, consider \(f=-\Delta V\) with \(V\in {\mathbb {V}}_0({\mathcal {M}})\) arbitrary. We then have
and therefore \({{{\,\mathrm{Res}\,}}(f;{\mathcal {M}})}=0\) and property (3.6b) entails
In addition, integration by parts yields that, for all \(v\in H^1_0(\Omega )\),
where \(\,\mathrm {d}s \) indicates the \((d-1)\)-dimensional Hausdorff measure in \({\mathbb {R}}^d\) and J(V) is the jump in the normal flux \(\nabla V\cdot n\) across interelement sides. More precisely, if \(F = K _1 \cap K _2\) is the intersection of the elements \(K _1,K _2\in {\mathcal {M}}\) with respective outer normals \(\varvec{n}_1\), \(\varvec{n}_2\), then \(J(V)|_{F} \mathrel {:=}\nabla V|_{K _1} \cdot \varvec{n}_1 + \nabla V|_{K _2} \cdot \varvec{n}_2\in {\mathbb {R}}\). If \(V \ne 0\), then we have also \(\Delta V \ne 0\), while (3.11) yields \(\tilde{{\mathcal {P}}}_{0,{\mathcal {M}}}(\Delta V)=0\), in contradiction with (3.10). Hence (3.6) does not hold for \(\tilde{{\mathcal {P}}}_{0,{\mathcal {M}}}\).
The two conditions (3.7) and (3.10) are central to our goals. Although they can be checked without involving the Galerkin approximation (2.11), they are also sufficient for (3.6). Incidentally, they imply that \({\mathcal {P}}_{{\mathcal {M}}}\) has to be a near best ‘interpolation’ operator in light of the Lebesgue lemma.
The failure of (3.10) for \(\tilde{{\mathcal {P}}}_{0,{\mathcal {M}}}\) is not related to the choice of the test functions \(\psi _K \), \(K \in {\mathcal {M}}\), but to its range. In fact, (3.11) and the fundamental lemma of calculus of variation show that \(\Delta V \not \in L^2(\Omega )\) whenever \(V \ne 0\), while \(\tilde{{\mathcal {P}}}_{0,{\mathcal {M}}}({\mathbb {V}}_0({\mathcal {M}})) \subset L^2(\Omega )\). In other words: to remedy, we have to change the range.
Finally, it is desirable that \({\mathcal {P}}_{{\mathcal {M}}}\) is a local operator for two reasons. First, this comes in useful when evaluating \({\mathcal {P}}_{{\mathcal {M}}}\). Second, since \(-\Delta \) is a local operator, we have the following lower bound for the local error:
which follows from testing (3.1) with all v from \(H^1_0(\omega _z)\). This bound can be exploited if we strengthen (3.6) to the local conditions
for all \(z\in {\mathcal {V}}\). We shall therefore demand the stability (3.7) and invariance (3.10) in a suitable local manner.
In order to formulate local invariance, let us introduce the following notations associated with an open subset \(\omega \subset \Omega \). Whenever two functionals \(\ell _1,\ell _2\in H^{-1}(\Omega )\) satisfy \(\ell _1(v) = \ell _2(v)\) for all \(v \in H^1_0(\omega )\), we say \(\ell _1=\ell _2\) on \(\omega \). Moreover, we write \(\ell _1 \in \underline{{\mathbb {D}}}({\mathcal {M}})\) on \(\omega \) when additionally \(\ell _2\) can be chosen such that \(\ell _2\in \underline{{\mathbb {D}}}({\mathcal {M}})\). Notice that, thanks to the fundamental lemma of the calculus of variations, these notions reduce to the usual ones if \(\ell \in L^2(\Omega )\), i.e. \(\ell (v) = \int _\Omega gv\) for all \(v \in H^1_0(\Omega )\).
Let us summarize our discussion by a list of desired properties for the operator \({\mathcal {P}}_{{\mathcal {M}}}\) and its range \(\underline{{\mathbb {D}}}({\mathcal {M}})\subset H^{-1}(\Omega )\), which corresponds to the set of all possible discretized residuals. This list provides the guidelines for our approach and choices. Denoting by \(\Delta ({\mathbb {V}}_0({\mathcal {M}})) = \{ \Delta V \mid V\in {\mathbb {V}}_0({\mathcal {M}})\}\) the image of \({\mathbb {V}}_0({\mathcal {M}})\) under the distributional Laplacian, we aim for the following properties:
Regarding the above discussion, we have that conditions (3.14f), (3.14e) and (3.14a) are equivalent to (3.13); cf. Sect. 3.7. Conditions (3.14d) and (3.14b) allow to quantify the local dual norms of the approximate residual \({\mathcal {P}}_{{\mathcal {M}}}f+ \Delta U_{f;{\mathcal {M}}} \in \underline{{\mathbb {D}}}({\mathcal {M}})\) in a computable manner; compare also with Sect. 3.6 below.
In the next three sections we construct two operators \({\mathcal {P}}_{{\mathcal {M}}}\) fulfilling (3.14).
3.4 Discretized residuals and a locally stable biorthogonal system
We present a possible choice of the set \(\underline{{\mathbb {D}}}({\mathcal {M}})\) of discretized residuals and introduce an associated biorthogonal system, which is instrumental in constructing a suitable operator \({\mathcal {P}}_{{\mathcal {M}}}\) with range \(\underline{{\mathbb {D}}}({\mathcal {M}})\).
We set
Every functional \(\ell \in \underline{{\mathbb {D}}}({\mathcal {M}})\) is thus constant on each element and on each face. Obviously, condition (3.14a) is verified. More precisely, \(\underline{{\mathbb {D}}}({\mathcal {M}})\) is in general a strict superset of \(\Delta ({\mathbb {V}}_0({\mathcal {M}}))\), since in \(\Delta ({\mathbb {V}}_0({\mathcal {M}}))\) only certain linear combinations of the constants \(c_F \), \(F \in {\mathcal {F}}\) are allowed. The fact that these constants are independent in \(\underline{{\mathbb {D}}}({\mathcal {M}})\) facilitates the definition of \({\mathcal {P}}_{{\mathcal {M}}}\). Moreover, we have added the contributions given by the constants \(c_K \), \(K \in {\mathcal {M}}\), for comparability with the classical oscillations and a posteriori error estimators and because similar contributions will appear for higher order elements; cf. Kreuzer and Veeser [14]. In spite of these enlargements, we still have \(\dim \underline{{\mathbb {D}}}({\mathcal {M}})< \infty \). Consequently, an argument as in the proof of Lemma 2, which hinges on infinite dimension, is ruled out.
Let us associate a biorthogonal system with \(\underline{{\mathbb {D}}}({\mathcal {M}})\). To this end, we introduce the surface Dirac distributions
and we identify the characteristic functions \(\chi _K \), \(K \in {\mathcal {M}}\), with their associated distributions
Notice that the definitions of \(\chi _F \) and \(\chi _K \) involve different measures for integration: the \((d-1)\)-dimensional Hausdorff measure for \(\chi _F \) and the d-dimensional Lebesgue measure for \(\chi _K \). Correspondingly, each \(\chi _K \) is absolutely continuous and each \(\chi _F \) is singular with respect to the d-dimensional Lebesgue measure.
We collect all elements and interelement faces in the index set \({\mathcal {I}}={\mathcal {I}}({\mathcal {M}}) \mathrel {:=}{\mathcal {M}}\cup {\mathcal {F}}\) and derive in the next lemma the properties of the functionals \(\chi _i\), \(i\in {\mathcal {I}}\), that are of interest to us.
Lemma 6
(Basis and scaling) The functionals \(\chi _i\), \(i\in {\mathcal {I}}\), are a basis of \(\underline{{\mathbb {D}}}({\mathcal {M}})\). For any element \(K \in {\mathcal {M}}\) and any face \(F \in {\mathcal {F}}\) containing a vertex \(z \in {\mathcal {V}}\), we have
with \({\tilde{\rho }}_z\) from (2.7).
Proof
We will use the Friedrichs inequality
and the following trace theorem: if \(F \in {\mathcal {F}}\) with \(F \ni z\) and \(\varvec{n}\) denotes a normal of \(F \), then
Given \(K \in {\mathcal {M}}\) with \(K \ni z\) and any \(v\in H^1_0(\omega _z)\), the Cauchy–Schwarz inequality and (3.17) yield
which verifies the first claimed inequality. To show the second one, fix \(F \in {\mathcal {F}}\) with \(F \ni z\) and let again \(v \in H^1_0(\omega _z)\). Using (3.18) with \(w=v^2\) and then again (3.17), we derive
and also the second claimed inequality is proved. \(\square \)
In order to complete the basis of Lemma 6 to a biorthogonal system, we use the following test functions: Given any element \(K \in {\mathcal {M}}\), take
Given any interelement face \(F \in {\mathcal {F}}\), let \(z_i\), \(i=1,2\), be the vertices in the patch \(\omega _F \), see (2.3), that are opposite to \(F \) and set
Let us verify that the basis \(\chi _i\), \(i\in {\mathcal {I}}\) and the test functions \(\psi _i\), \(i \in {\mathcal {I}}\), actually form a biorthogonal system with a crucial stability condition.
Lemma 7
(Locally stable biorthogonal system) Together with the basis \(\chi _i\), \(i\in {\mathcal {I}}\), the test functions \(\psi _i\), \(i \in {\mathcal {I}}\), form a locally stable biorthogonal system:
-
(i)
We have
$$\begin{aligned} \forall i,j \in {\mathcal {I}} \quad \left\langle \chi _i,\,\psi _j\right\rangle = \delta _{ij}. \end{aligned}$$ -
(ii)
Let \({\mathcal {I}}_z \mathrel {:=}\{i\in {\mathcal {I}} \mid i \ni z \}\) denote the elements and faces containing a vertex \(z\in {\mathcal {V}}\). Then
$$\begin{aligned} \forall i \in {\mathcal {I}}_z \quad \Vert \chi _i\Vert _{H^{-1}(\omega _z)} \Vert \nabla \psi _i\Vert _{\omega _z} \leqslant C_\psi , \end{aligned}$$where the stability constant \(C_\psi \) only depends on d and the shape coefficient \(\sigma ({\mathcal {M}})\).
Proof
To show (i), we consider the cases of elements \(j\in {\mathcal {M}}\) and faces \(j\in {\mathcal {F}}\) separately. First, let \(K \in {\mathcal {M}}\) be an element. As already seen in (3.8), we have \(\left\langle \chi _K,\,\psi _K \right\rangle = \int _{K} \psi _K = 1\). Moreover, since \(\psi _K = 0\) in \(\Omega \setminus \mathring{K}\), we infer \(\left\langle \chi _{K '},\,\psi _K \right\rangle = 0\) for any \(K '\in {\mathcal {M}}\setminus \{K \}\) and \(\left\langle \chi _F,\,\psi _K \right\rangle = 0\) for any \(F \in {\mathcal {F}}\).
Second, fix a face \(F \in {\mathcal {F}}\). Using (2.10), we obtain
From \(\psi _F = 0\) in \(\Omega \setminus \mathring{\omega }_F \), where \(\omega _F \) is the patch of the two elements containing the face \(F \), we infer \(\left\langle \chi _{F '},\,\psi _F \right\rangle =0\) for any \(F '\in {\mathcal {F}}\setminus \{F \}\) and \(\left\langle \chi _K,\,\psi _F \right\rangle =0\) for any \(K \in {\mathcal {M}}\) with \(K \not \supset F \). Last, let \(K \in {\mathcal {M}}\) such that \(K \supset F \). Using again (2.10), we deduce
For (ii), we again treat elements and faces separately. Let \(K \in {\mathcal {M}}\) be an element containing z. The well-known inverse estimate \(\Vert \nabla \psi _K \Vert _{K} \leqslant C_d \rho _K ^{-1} \Vert \psi _K \Vert _{K}\), \(K \subset \omega _z\) and (2.10) imply
Combining this with the first inequality in Lemma 6 and (2.8), we obtain the claimed inequality for elements:
Let \(F \in {\mathcal {F}}\) be an interelement face containing z and write \(F = K _1 \cap K _2\), where \(K _1, K _2 \in {\mathcal {M}}\) are the two elements containing \(F \). Proceeding as before, we deduce
and
\(\square \)
In what follows, we shall rely only on the properties of the test functions \(\psi _i\), \(i\in {\mathcal {I}}\), expressed in Lemma 7. In other words: what counts is not their special form, but the fact that they form a stable biorthogonal system with the basis \(\chi _i\), \(i\in {\mathcal {I}}\), of \(\underline{{\mathbb {D}}}({\mathcal {M}})\).
3.5 Construction and properties of \({\mathcal {P}}_{{\mathcal {M}}}\)
We now propose a possible choice for the projection operator \({\mathcal {P}}_{{\mathcal {M}}}\) and verify the desired properties (3.14). Set
where the functionals \(\chi _i\), \(i\in {\mathcal {I}}\), are given by (3.16) and the test functions \(\psi _i\), \(i\in {\mathcal {I}}\), by (3.19). Clearly, \({\mathcal {P}}_{{\mathcal {M}}}\) is linear and \({\mathcal {P}}_{{\mathcal {M}}}f\) is locally computable in terms of a finite number of evaluations of \(f\), i. e., we have (3.14c) and (3.14d).
The biorthogonality of these functionals and test functions implies the following local counterparts of the algebraic condition (3.10).
Theorem 8
(Local invariance) For any functional \(\ell \in H^{-1}(\Omega )\), element \(K \in {\mathcal {M}}\), and side \(F \in {\mathcal {F}}\), the operator \({\mathcal {P}}_{{\mathcal {M}}}\) does not change the following discrete restrictions:
-
(i)
If \(\ell \in \underline{{\mathbb {D}}}({\mathcal {M}})\) on \(\mathring{K}\), then \({\mathcal {P}}_{{\mathcal {M}}}\ell = \ell \) on \(\mathring{K}\).
-
(ii)
If \(\ell \in \underline{{\mathbb {D}}}({\mathcal {M}})\) on \(\mathring{\omega }_F \), then \({\mathcal {P}}_{{\mathcal {M}}}\ell = \ell \) on \(\mathring{\omega }_F \).
Proof
Let \(\ell = c\chi _K\) on \(\mathring{K}\) with \(c\in {\mathbb {R}}\). For any \(i\in {\mathcal {I}}\), we have \(\left\langle \ell ,\,\psi _i\right\rangle = c \int _K \psi _i = c \delta _{K,i}\) by means of Lemma 7 (i). Consequently, \({\mathcal {P}}_{{\mathcal {M}}}\ell = c \chi _K \) on \(\mathring{K}\), which proves (i).
To show (ii), let \(K _1,K _2\in {\mathcal {M}}\) be the two elements containing \(F \) and let \(\ell = c \chi _F + \sum _{i=1,2} c_i \chi _{K _i}\) on \(\mathring{\omega }_F \) with \(c, c_1, c_2 \in {\mathbb {R}}\). Using again Lemma 7 (i), we observe
and \(\left\langle \ell ,\,\psi _i\right\rangle = 0\) for all \(i\in {\mathcal {I}}\setminus \{F,K_1,K_2\}\). Consequently,
and also (ii) is verified. \(\square \)
Theorem 8 implies in particular (3.14e). Moreover, it has the following global consequences.
Corollary 9
(Global invariance) The operator \({\mathcal {P}}_{{\mathcal {M}}}\) is a linear projection onto the discretized residuals \(\underline{{\mathbb {D}}}({\mathcal {M}})\) from (3.15). In particular, we have
for any \(V\in {\mathbb {V}}_0({\mathcal {M}})\) and any \({\mathcal {M}}\)-piecewise constant function \(f \in {\mathbb {P}}_0({\mathcal {M}})\).
Next, we verify the local stability (3.14f) of \({\mathcal {P}}_{{\mathcal {M}}}\). As a side product, we also obtain the local stabilty of the operator \(\tilde{{\mathcal {P}}}_{0,{\mathcal {M}}}\), which was left open in Sect. 3.3.
Theorem 10
(Local stability) The linear projection \({\mathcal {P}}_{{\mathcal {M}}}\) is locally \(H^{-1}\)-stable: for any functional \(\ell \in H^{-1}(\Omega )\) and any vertex \(z\in {\mathcal {V}}\), we have
where the hidden constant depends only on d and \(\sigma ({\mathcal {M}})\).
Proof
Given \(v \in H^1_0(\omega _z)\), we derive
where we used Lemma 7 (ii) and \(\#{\mathcal {I}}_z \lesssim 1\). \(\square \)
Remark 11
(Stability of \(\tilde{{\mathcal {P}}}_{0,{\mathcal {M}}}\)) The argument in the proof of Theorem 10 also shows that \(\tilde{{\mathcal {P}}}_{0,{\mathcal {M}}}\) is locally \(H^{-1}\)-stable. In fact, one simply replaces \({\mathcal {P}}_{{\mathcal {M}}}\) by \(\tilde{{\mathcal {P}}}_{0,{\mathcal {M}}}\) and the index set \({\mathcal {I}}_z\) by \({\mathcal {I}}_z \cap {\mathcal {M}}\).
Let us conclude this section with the following further remarks on the linear projection \({\mathcal {P}}_{{\mathcal {M}}}\).
Remark 12
(Orthogonality) For any \(\ell \in H^{-1}(\Omega )\), the error \(\ell - {\mathcal {P}}_{{\mathcal {M}}}\ell \) is orthogonal to \(\text {span}\,\{\psi _i \mid i \in {\mathcal {I}}\}\). This a immediate consequence of Lemma 7 (i).
Remark 13
(Adjoint of \({\mathcal {P}}_{{\mathcal {M}}}\)) Formally, the adjoint of \({\mathcal {P}}_{{\mathcal {M}}}\) is given by
Here Lemma 7 (i) implies
for all elements \(K \in {\mathcal {M}}\) and interelement faces \(F \in {\mathcal {F}}\). The operator \({\mathcal {P}}_{{\mathcal {M}}}^*\) and these conditions, which characterize it, were used in Veeser [24] to derive an a posteriori error upper bound in terms of a hierarchical estimator. That argument, as well as Morin, Nochetto, and Siebert [18, Theorem 3.6] and Verfürth [25, (3.14)], is closely related to Theorem 15 below.
3.6 Required a priori information, an alternative to \({\mathcal {P}}_{{\mathcal {M}}}\), and quantification of the discretized residual
The purpose of this section is twofold. First, we illustrate which type of a priori information on \(f\) in (2.2) is needed to carry out our approach, presenting also a possible alternative to \({\mathcal {P}}_{{\mathcal {M}}}\). Second, we show that a stable biorthogonal system is not only useful to construct \({\mathcal {P}}_{{\mathcal {M}}}\), but also to quantify the local dual norms of discretized residuals.
Clearly, the operator \({\mathcal {P}}_{{\mathcal {M}}}\) of §3.5 can be applied to the right-hand side \(f\) of (2.2) whenever
In order to ensure a meaningful discretized residual, this information goes beyond (2.14), the information necessary for the Galerkin approximation (2.11) on the mesh \({\mathcal {M}}\); it is available, e.g., when one is able to compute the counterpart of (2.11) of order \(d+1\) over \({\mathcal {M}}\).
There are other possibilities to obtain a meaningful discretized residual. The following one fits particularly well to (2.14) in the context of mesh adaptivity. Suppose that we are given an initial mesh and a refinement procedure such that the set \({\mathbb {M}}\) of all refined meshes form a shape-regular family. Furthermore, suppose that, for any mesh \({\mathcal {M}}\in {\mathbb {M}}\), there is a refinement \({{\widetilde{{\mathcal {M}}}}} \in {\mathbb {M}}\) with vertices \({\mathcal {V}}({{\widetilde{{\mathcal {M}}}}})\) that satisfies the following properties:
Let us now fix a mesh \({\mathcal {M}}\in {\mathbb {M}}\) and a refinement \({\widetilde{{\mathcal {M}}}}\in {\mathbb {M}}\) satisfying (3.24). For any \(i \in {\mathcal {I}}({\mathcal {M}})\), using (3.24b), we fix a vertex \({\widetilde{z}} \in {\mathcal {V}}({\widetilde{{\mathcal {M}}}})\) interior to i and denote by \({\widetilde{\phi }}_{{\widetilde{z}}}\) its associated hat function in \({\mathbb {V}}(\widetilde{{\mathcal {M}}})\). We then obtain counterparts \({\widetilde{\psi }}_i\), \(i \in {\mathcal {I}}\), of the test functions \(\psi _i\), \(i \in {\mathcal {I}}\), by using these hat functions with a suitable scaling in place of the element and faces bubble functions in (3.19) such that the following lemma holds. We skip the technical details, referring to Morin, Nochetto and Siebert [17] and Veeser [24].
Lemma 14
(Another locally stable biorthogonal system) Together with the basis \(\chi _i\), \(i\in {\mathcal {I}}\), the test functions \({\widetilde{\psi }}_i\), \(i \in {\mathcal {I}}\), form a locally stable biorthogonal system:
-
(i)
We have
$$\begin{aligned} \forall i,j \in {\mathcal {I}} \quad \left\langle \chi _i,\,{\widetilde{\psi }}_j\right\rangle = \delta _{ij}. \end{aligned}$$ -
(ii)
Let \({\mathcal {I}}_z = \{i\in {\mathcal {I}} \mid i \ni z \}\) denote the elements and faces containing a vertex \(z\in {\mathcal {V}}\). Then
$$\begin{aligned} \forall i \in {\mathcal {I}}_z \quad \Vert \chi _i\Vert _{H^{-1}(\omega _z)} \Vert \nabla {\widetilde{\psi }}_i\Vert _{\omega _z} \leqslant C_{{{\tilde{\psi }}}}, \end{aligned}$$where the stability constant \(C_{{{\tilde{\psi }}}}\) only depends on d and the shape coefficient \(\sigma ({\mathcal {M}})\).
Thus, the operator
defines an alternative to \({\mathcal {P}}_{{\mathcal {M}}}\) and the properties (3.14) without (3.14b) can be established as for \({\mathcal {P}}_{{\mathcal {M}}}\). The operator \(\widetilde{{\mathcal {P}}}_{\mathcal {M}}\) can be evaluated on any mesh \({\mathcal {M}}\in {\mathbb {M}}\) whenever
where \(\{{\widetilde{\phi }}_z\}_{z\in {\mathcal {V}}_0({\widetilde{{\mathcal {M}}}})}\) denotes the nodal basis of \({\mathbb {V}}_0({\widetilde{{\mathcal {M}}}})\). This is exactly (2.14) for all meshes in \({\mathbb {M}}\). Consequently, it is also needed to ensure that an adaptive algorithm with the above refinement procedure can always compute the Galerkin approximation (2.11).
Let us now turn to the quantification of the discretized residual and verify (3.14b), considering a general locally stable biorthogonal system.
Theorem 15
(Quantifying local dual norms) Let \(\psi _i\), \(i\in {\mathcal {I}}\), be the test functions from Lemma 7 or Lemma 14. If \(\ell \in \underline{{\mathbb {D}}}({\mathcal {M}})\) on a star \(\omega _z\), then the corresponding local dual norm can be quantified by a finite number of evaluations:
where the hidden constant depends on d, \(\sigma ({\mathcal {M}})\), and \(C_\psi \).
Proof
Let us first prove the lower bound, which holds for any arbitrary functional \(\ell \in H^{-1}(\Omega )\). In fact, the definition of the dual norm readily yields
for any \(i\in {\mathcal {I}}_z\). Notice that the essential supremum of \(x \mapsto \#\{i\in {\mathcal {I}}_z \mid {{\,\mathrm{supp}\,}}\psi _i \ni x \}\) is bounded by \(d+1\). Arguing as in the proof of Lemma 4 (ii), we therefore obtain
and the proof of the lower bound is finished.
To show the upper bound, we (need to) assume that \(\ell \in \underline{{\mathbb {D}}}({\mathcal {M}})\) on \(\omega _z\). Given \(v \in H^1_0(\omega _z)\), we can then write
In light of the biorthogonality, we have \(c_i = \left\langle \ell ,\,\psi _i\right\rangle \). Using also the local stability of the biorthogonal system, we infer
Since the solid angle of every simplex containing z is bounded away from 0 in terms of d and the shape coefficient \(\sigma ({\mathcal {M}})\), we have \(\#{\mathcal {I}}_z \leqslant C_{\sigma ({\mathcal {M}})}\). Consequently, the Cauchy–Schwarz inequality on the sum implies the desired upper bound. \(\square \)
Theorem 15 implies the missing (3.14b) for both operators \({\mathcal {P}}_{{\mathcal {M}}}\) and \(\widetilde{{\mathcal {P}}}_{\mathcal {M}}\) and, in accordance with Sect. 3.3, we have splittings of the local residual norms with the desired properties. Notice that, in view of the discussion of this section and Corollary 5, bounding the terms
cannot be done in general with a finite number of evaluations of the load \(f\). Notably, these terms involve only the load, and the discretized residuals
can be quantified with finite information, which, in light of Remark 3, is less than the information required for evaluating local \(L^2\)-norms of the load \(f\).
3.7 A posteriori error bounds
We now summarize our preceding results by deriving a posteriori error bounds. The resulting bounds are defined for any load \(f\in H^{-1}(\Omega )\) and the oscillation is dominated by the error.
The following statements remain correct if \({\mathcal {P}}_{{\mathcal {M}}}\) is replaced by \(\widetilde{{\mathcal {P}}}_{\mathcal {M}}\) from (3.25).
Theorem 16
(Abstract upper bound) For any functional \(f\in H^{-1}(\Omega )\) and any conforming mesh \({\mathcal {M}}\), we have
Each local dual norm \(\Vert {\mathcal {P}}_{{\mathcal {M}}}f+\Delta U_{f;{\mathcal {M}}}\Vert _{H^{-1}(\omega _z)}\) of the discretized residual can be quantified with a finite number of evaluations of \(f\), while the quantification of the local dual norms \(\Vert {\mathcal {P}}_{{\mathcal {M}}}f-f\Vert _{H^{-1}(\omega _z)}\) of the oscillation requires additional a priori information on \(f\).
Proof
Lemma 1, Lemma 4 and a triangle inequality imply the claimed bound. Recalling that
Theorem 15 and Corollary 5 ensure the statements about the quantification of the two parts of the bound. \(\square \)
In contrast to previous results available in literature, the complete upper bound in Theorem 16 is also a lower bound, even locally.
Theorem 17
(Abstract local lower bounds) For any functional \(f\in H^{-1}(\Omega )\) and any conforming mesh \({\mathcal {M}}\), the discretized residual and the oscillation are locally dominated by the error: for every vertex \(z\in {\mathcal {V}}\), we have
and
Proof
In light of (3.12), the first claimed inequality follows from the triangle inequality and the second one. The latter is a consequence of Theorems 8 and 10 and (3.12):
\(\square \)
Squaring and summing, we readily get global lower bounds.
Corollary 18
(Abstract global lower bounds) For any functional \(f\in H^{-1}(\Omega )\) and any conforming mesh \({\mathcal {M}}\), the discretized residual and the oscillation are globally dominated by the error in that
and
To summarize: if we are able to quantify the oscillation terms \(\Vert {\mathcal {P}}_{{\mathcal {M}}}f- f\Vert _{H^{-1}(\omega _z)}\), \(z\in {\mathcal {V}}\), then the right-hand side in Theorem 16 is a truly equivalent a posteriori error estimator.
Remark 19
(Surrogate oscillation) The quantification of the local dual norms \(\Vert {\mathcal {P}}_{{\mathcal {M}}}f- f\Vert _{H^{-1}(\omega _z)}\), \(z\in {\mathcal {V}}\), of the oscillation appears to be a difficult matter. In [10, Section 7], Cohen, DeVore, and Nochetto consider similar terms for special \(f\) and resort to surrogates that can be approximated with the help of numerical integration. Those surrogates hinge on additional regularity of \(f\), which entails the risk of overestimation; cf. Lemma 20 below.
3.8 Classical versus error-dominated oscillation
In this section we compare the error-dominated oscillation with the classical
\(L^2\)- and \(H^{-1}\)-oscillation,
from (1.4) and (1.5) in the introduction. Doing so, we verify statements of the introduction and substantiate the advantages of the stability and invariance properties of the operator \({\mathcal {P}}_{{\mathcal {M}}}\).
Let us first show that the error-dominated oscillation is always smaller, up to a multiplicative constant, than both classical oscillations. To this end, let \(f\in H^{-1}(\Omega )\) and let \(g\in {\mathbb {P}}_0({\mathcal {M}})\) be an arbitrary piecewise constant approximation over \({\mathcal {M}}\). The local invariance and stability properties of \({\mathcal {P}}_{{\mathcal {M}}}\) in Theorems 8 and 10 imply that, for all \(z\in {\mathcal {V}}\),
Combining this with Lemma 4 (ii) and minimizing over g, we obtain the bound in terms of the classical \(H^{-1}\)-oscillation:
To show the other bound, suppose \(f\in L^2(\Omega )\). Making use of the orthogonality of \(P_{0,{\mathcal {M}}}\) and Poincaré inequalities in the elements of \(\omega _z\), we deduce
which together with (3.29) gives the bound in terms of the \(L^2\)-oscillation:
The converse bounds of (3.30) do not hold. For the classical \(L^2\)-oscillation, this applies even on a fixed mesh and is in particular due to stability issues. The following lemma provides an illustration, relating directly to the error instead of the error-dominated oscillation.
Lemma 20
(Overestimation of classical \(L^2\)-oscillation) For any conforming mesh \({\mathcal {M}}\), there exists a sequence \((f_k)_{k}\subset L^2(\Omega )\) such that
Proof
Choose \(f\in H^{-1}(\Omega )\setminus L^2(\Omega )\). Since \(L^2(\Omega )\) is dense in \(H^{-1}(\Omega )\), there exists a sequence \((f_k)_k\subset L^2(\Omega )\) such that \(f_k\rightarrow f\) in \(H^{-1}(\Omega )\). On the one hand, the energy norm errors \(\Vert \nabla (u_{f_k} - U_{{f_k};{\mathcal {M}}})\Vert _{}\) are uniformly bounded with respect to k. On the other hand, in view of \(\lim _{k\rightarrow \infty } \Vert f_k\Vert _{L^2(\Omega )}=\infty \), the oscillation \({{\,\mathrm{osc}\,}}_0(f_k,{\mathcal {M}})\) becomes arbitrarily large for \(k\rightarrow \infty \). \(\square \)
In the case of the classical \(H^{-1}\)-oscillation, (3.30a) cannot be inverted because of invariance issues. Let us illustrate this again by the relationship to the Galerkin error. Consider
where \({\mathcal {M}}^\dagger \) is some conforming simplicial mesh of \(\Omega \). For any conforming refinement \({\mathcal {M}}\) of \({\mathcal {M}}^\dagger \), we then have \(u_f= V = U_{f;{\mathcal {M}}}\) and \(f\not \in {\mathbb {P}}_0({\mathcal {M}})\). Hence
where the classical \(H^{-1}\)-oscillation can be made arbitrarily large for a given \({\mathcal {M}}\) but decreases to 0 under suitable refinement. One could argue that the (neighborhoods of the) loads (3.31) are very special, in particular because the optimal convergence rate of (3.31) is formally \(\infty \). Here is another example based upon Cohen, DeVore, and Nochetto [10, Section 6.4], where the optimal nonlinear convergence rate for the error is finite and often encountered in practice.
Lemma 21
(Another overestimation of classical \(H^{-1}\)-oscillation) Let \(\Omega =(0,1)^2\). There is a functional \(f\in H^{-1}(\Omega )\) and a sequence \((L_n)_n\) with \(\log n > rsim L_n \rightarrow \infty \) as \(n\rightarrow \infty \) such that
and
where \({\mathcal {M}}\) varies in all meshes created by recursive or iterative newest vertex bisection of some conforming initial mesh \({\mathcal {M}}_0\) of \(\Omega \).
Proof
In [10, Section 6.4] Cohen, DeVore and Nochetto construct some function \(u_f\in H_0^1(\Omega )\) and a sequence \(L_n\) as claimed for which (3.32a) and
hold. It thus remains to establish (3.32b). To this end, we fix temporarily an arbitrary vertex \(z\in {\mathcal {V}}\) of a conforming mesh \({\mathcal {M}}\) and let \(g\in {\mathbb {P}}_0({\mathcal {M}})\). The inverse triangle and (3.12) yield
By Lemma 7, we have, for all \(K \in {\mathcal {M}}\),
and, for all \(F \in {\mathcal {F}}\) and \(K _1,K _2\in {\mathcal {M}}\) with \(K _1\cap K _2=F \),
Theorem 15 therefore implies
Exploiting also Lemma 4, we arrive at
Consequently, (3.32a) and (3.33) lead to
which, upon redefining \((L_n)_n\), implies (3.32b) and the proof is finished. \(\square \)
Remark 22
(Overestimation of \(H^{-1}\)-variant of standard residual estimator) As pointed out by Cohen, DeVore, and Nochetto [10], the example of Lemma 21 entails that the right-hand side of
a variant of the standard residual estimator defined for all loads \(f \in H^{-1}(\Omega )\), is overestimating. In Sect. 4.2 below, we propose through our new approach another variant that is free of overestimation.
4 Realizations with classical techniques
The a posteriori error bounds in Sect. 3.7 are abstract in that they are given in terms of the local dual norms \(\Vert \cdot \Vert _{H^{-1}(\omega _z)}\), \(z \in {\mathcal {V}}\), of the discretized residual and the oscillation. For the norms \(\Vert {\mathcal {P}}_{{\mathcal {M}}}f+\Delta U_{f;{\mathcal {M}}}\Vert _{H^{-1}(\omega _z)}\), \(z\in {\mathcal {V}}\), of the discretized residual, we required a quantification in terms of finite information on the load and provided a possible realization in Theorem 15. In this section we discuss a selection of alternative realizations. All realizations are motivated by classical approaches to a posteriori analysis and cover two explicit and two implicit techniques. It is worth making the following observations:
-
Hierarchical estimators and estimators based upon local problems implicitly introduce a splitting of the residual like the one proposed in Sect. 3.3.
-
The overestimation of the standard residual estimator in Remark 22 can be cured with the help of the splitting of the residual in Sect. 3.3.
-
Employing different local dual norms, the approach of Sect. 3 can be extended to estimators based on flux equilibration.
-
Each realization quantifies a local dual norm of the discretized residual by a computable, equivalent norm. Both equivalence and computability hinge on the finite-dimensional nature of the discretized residual.
4.1 A hierarchical estimator
Hierarchical estimators investigate the residual on an extension of the given finite element space. While higher order extensions were used originally, Bornemann, Erdmann, and Kornhuber show in [6] that an extension containing the functions
already ensures reliability for piecewise constant loads \(f\in {\mathbb {P}}_0({\mathcal {M}})\). The indicators of a corresponding, ‘minimal’ hierarchical estimator are given by
and computable in terms of \(U_{f;{\mathcal {M}}}\) and the evaluations \(\left\langle f,\,\lambda _i\right\rangle \), \(i \in {\mathcal {I}}\). This definition implies the constant-free local lower bounds
and therefore, cf. (3.28), we have that, for every \(z\in {\mathcal {V}}\) and \({I}_z= \{i\in {\mathcal {I}} \mid i \ni z \}\),
which is a local counterpart of the global lower bound in Veeser [24, Lemma 3.3].
This estimator is very closely related to the discretized residuals of Sect. 3.4 and Theorem 15. Indeed, if \(K \in {\mathcal {M}}\) and \(F \in {\mathcal {F}}\), \(K _1,K _2\in {\mathcal {M}}\) such that \(F = K _1 \cap K _2\), we have
in view of (3.19). Hence \({\text {span}}\{\psi _i \mid i\in {\mathcal {I}}\} = {\text {span}}\{\lambda _i \mid i\in {\mathcal {I}}\}\) and Remark 12 yields \(\left\langle f,\,\lambda _i\right\rangle =\left\langle {\mathcal {P}}_{{\mathcal {M}}}f,\,\lambda _i\right\rangle \), \(i\in {\mathcal {I}}\), and the indicators may be viewed also as evaluations of the discretized residual: for \(i \in {\mathcal {I}}\),
As a consequence, we also have the following counterpart of (4.2):
In order to prove the converse bound, we may proceed with the help of \({\mathcal {P}}_{{\mathcal {M}}}^*\) as in [24]. However, having Theorem 15 at our disposal, it is simpler to exploit (4.3). We immediately see
Moreover, given \(F \in {\mathcal {F}}\), \(K _1,K _2\in {\mathcal {M}}\) with \(F = K _1 \cap K _2\), we deduce
with \(h_F \mathrel {:=}{{\,\mathrm{diam}\,}}F \) and, for \(i \in \{F, K _1, K _2 \}\)
We therefore obtain \(\Vert \nabla \psi _F \Vert _{}^{-1} \Vert \nabla \lambda _i\Vert _{} \lesssim |F |\) and
Summing up, the hierarchical estimator quantifies the local discretized residual,
and we have the following a posteriori bounds.
Theorem 23
(Hierarchical estimator with error-dominated oscillation) For any functional \(f\in H^{-1}(\Omega )\) and any conforming mesh \({\mathcal {M}}\), we have the global equivalence
as well as the following local lower bounds: for every \(z \in {\mathcal {V}}\),
The hidden constants depend only on d and \(\sigma ({\mathcal {M}})\).
Proof
Combine Theorem 16, Theorem 17, Corollary 18, (3.12), (4.5), and (4.2). \(\square \)
4.2 An improved standard residual estimator
The standard residual estimator applies suitably scaled norms to the jump and element residual; see, e.g., Verfürth [26, Section 1.4]. In the case of the discretized residual
this leads to the following indicators:
where \(h_F \) and \(h_K \) denote, respectively, the diameters of \(F \) and \(K \) and computability is given in terms of \(U_{f;{\mathcal {M}}}\) and (3.23).
These indicators actually quantify the discretized residual and in a way that is very tight to Theorem 15: for any interelement face \(F \in {\mathcal {F}}\),
and, for any element \(K \in {\mathcal {M}}\),
where the hidden constants depend only on d and \(\sigma ({\mathcal {M}})\). To see (4.6a), let \(F \in {\mathcal {F}}\) be any interelement face. Lemma 7 (i), the trace inequality (3.18) for \(w=\psi _F ^2\) and the Friedrichs inequality (3.17) for \(v=\psi _F \), both with \(\omega _F \) in place of \(\omega _z\), give
while (3.20) yields \(\Vert \nabla \psi _F \Vert _{\Omega } \lesssim (h_F |F |)^{-1/2}\) and so
Similarly, we obtain (4.6b).
Inserting the combination of Theorem 15 and (4.6) in the abstract a posteriori analysis of Sect. 3.7, we obtain the following result.
Theorem 24
(Standard residual estimator with error-dominated oscillation) For any functional \(f\in H^{-1}(\Omega )\) and any conforming mesh \({\mathcal {M}}\), we have the global equivalence
as well as the following local lower bounds: for \(z\in {\mathcal {V}}\),
The hidden constants depend only on d and \(\sigma ({\mathcal {M}})\).
Theorem 24 relies on key features of the approach in §3, which the following remark elaborates on.
Remark 25
(Classical vs new standard residual estimator) In contrast to the classical standard residual estimator (1.2) and its \(H^{-1}\)-variant in Remark 22, the variant of Theorem 24 is completely equivalent to the error. The reason for this improvement lies in a suitable correction of the original jump residual. To elucidate this, remember that both the classical standard residual estimator and its \(H^{-1}\)-variant in Remark 22 do not discretize the residual and therefore compare them to
which also does not split off an infinite-dimensional part of the load \(f\). The corrections \(\left\langle f,\,\psi _F \right\rangle \), \(F \in {\mathcal {F}}\), of the jump residual make sure that the new jump residual has the invariance properties necessary for avoiding overestimation, i. e., it vanishes whenever the exact solution happens to be discrete. Corrections with this property have been used previously. For example, Nochetto [20] considers the special case \(f= f_1 +{{\,\mathrm{div}\,}}\varvec{f_2}\), where \(f_1,\varvec{f_2}\) are suitable functions, and assigns \(({{\,\mathrm{div}\,}}\varvec{f_2})|_K \), \(K \in {\mathcal {M}}\), to the element residual and the jumps in the normal trace of \(\varvec{f_2}\) across interelement sides correct the jump residual. Similarly, in standard residual estimators for the Stokes problem, pressure jumps correct the jump residual associated with the velocity. The novelty is that the corrections \(\left\langle f,\,\psi _F \right\rangle \), \(F \in {\mathcal {F}}\), are defined for an arbitrary \(f\in H^{-1}(\Omega )\) and also locally \(H^{-1}\)-stable and so fulfill the second necessary condition to avoid local overestimation. Notably, the latter entails that, even if \(f\) is a smooth function, the jump residual will be corrected significantly in certain cases.
4.3 An estimator based on local problems
A local problem lifts the residual to a local extension of the given finite element space and so provides a local correction, the norm of which is used as an error indicator; cf. Babuška and Rheinboldt [4]. While computability requires finite-dimensional extensions, the higher cost with respect to the previous explicit estimators is tied up with the hope of improved accuracy.
The following instance from Verfürth [26, Section 1.7.1 and Remark 1.21] is vertex-based and uses the local extensions
where the functions \(\psi _i\) and \(\lambda _i\) are defined, respectively, in (3.19) and (4.1). Given a vertex \(z\in {\mathcal {V}}\), the indicator is then
where
Thus, \(\nu _z\) is computable in terms of \(U_{f;{\mathcal {M}}}\) and, e.g., (3.23). The indicator \({\mathcal {E}}_{\mathrm {L}}(f,{\mathcal {M}},z)\) may be viewed as an implicit counterpart of \((\sum _{i \in {\mathcal {I}}_z} {\mathcal {E}}_{\mathrm {H}}(f,{\mathcal {M}},i)^2)^{1/2}\) from §4.1. Taking \(\lambda =\nu _z\), we immediately obtain the constant-free lower bound
which slightly improves upon (4.2).
Notice that, in light of Remark 12, the solution \(\nu _z\) can be interpreted also as a lift of the discretized residual \({\mathcal {P}}_{{\mathcal {M}}}f+ \Delta U_{f;{\mathcal {M}}}\). Consequently, the first inequality in
is correct. The second one follows from Remark 13 and Theorem 10 in the spirit of Morin, Nochetto and Siebert [18]. In fact, for \(v\in H^1_0(\omega _z)\), we have
Theorem 26
(Estimator based on local problems with error-dominated oscillation) For any functional \(f\in H^{-1}(\Omega )\) and any conforming mesh \({\mathcal {M}}\), we have the global equivalence
as well as the following local lower bounds: for every \(z\in {\mathcal {V}}\),
The hidden constants depend only on d and \(\sigma ({\mathcal {M}})\).
Proof
Combine Theorem 16, Theorem 17, Corollary 18, (3.12), (4.7) and (4.8). \(\square \)
4.4 An estimator based on flux equilibration
While indicators based on local problems provide constant-free local lower bounds, estimators based on flux equilibration aim for a constant-free, or at least explicit, global upper bound. This is achieved with the help of other, more sophisticated liftings within the framework of the fundamental theorem of Prager and Synge [22], which, for the homogeneous Dirichlet problem (1.1), can be formulated as follows: For any \(v\in H_0^1(\Omega )\), we have
Realizations of this idea in Ainsworth [1], Braess and Schöberl [8], Ern, Smears and Vohralik [12, 13], and Luce and Wohlmuth [16] make use of some classical oscillation. Its replacement by an error-dominated oscillation requires some adjustment to the approach of Sect. 3.
The upper bound in the localization of Lemma 4 involves a non-explicit multiplicative constant. In order to improve on this, we replace the local spaces \(H^1_0(\omega _z)\), \(z\in {\mathcal {V}}\), with
equip them with the norm \(\Vert \nabla \cdot \Vert _{\omega _z}\), and denote the respective dual spaces by \(H_z^*\).
Lemma 27
(Alternative localization) Let \(\ell \in H^{-1}(\Omega )\) be any functional.
-
(i)
If \(\ell \in {\mathcal {R}}_{\mathcal {M}}\), then
$$\begin{aligned} \Vert \ell \Vert _{H^{-1}(\Omega )}^2 \le (d+1)\sum _{z\in {\mathcal {V}}} \Vert \phi _z\ell \Vert _{H^{*}_z}^2. \end{aligned}$$ -
(ii)
We have
$$\begin{aligned} \sum _{z\in {\mathcal {V}}} \Vert \phi _z\ell \Vert _{H^{*}_z}^2 \lesssim \Vert \ell \Vert _{H^{-1}(\Omega )}^2, \end{aligned}$$where the hidden constant depends only on d and the shape coefficient \(\sigma ({\mathcal {M}})\).
Proof
The proof is essentially a regrouping of the arguments proving Lemma 4, where (3.3) slips into the proof of (ii); cf. Canuto et al. [9, Proposition 3.1]. \(\square \)
Splitting the residual up in discretized residual and oscillation, we then obtain the following abstract error bounds; we do not state the global lower bound as it is immediate consequence of the local one.
Lemma 28
(Alternative abstract error bounds) For any functional \(f\in H^{-1}(\Omega )\) and any conforming mesh \({\mathcal {M}}\), we have the global upper bound
as well as the following local lower bounds: for every vertex \(z\in {\mathcal {V}}\),
The hidden constants depend only on d and \(\sigma ({\mathcal {M}})\).
Proof
The global upper bound follows from Lemma 27 (i) and the triangle inequality. To prove the local lower bounds, we recall Theorem 17 and take \(\ell ={\mathcal {P}}_{{\mathcal {M}}}f+\Delta U_{f;{\mathcal {M}}}\) and \(\ell = {\mathcal {P}}_{{\mathcal {M}}}f-f\) in
which exploits (3.3) for \(v_z\in H_z\) and \(z\in {\mathcal {V}}\). \(\square \)
In order to quantify the local discretized residual, we construct local equilibrated fluxes following the ideas of Braess, Pillwein, and Schöberl [7] and Ern, Smears, and Vohralík [12]. To this end, fix any vertex \(z\in {\mathcal {V}}\) and define the operator \(\pi _z:\{ \pi _z: H_z^* \rightarrow H_z^*\} \rightarrow H_z^*\) by
We emphasize that \(\pi _z\big (\phi _z({\mathcal {P}}_{{\mathcal {M}}}f+\Delta U_{f;{\mathcal {M}}})\big )\) can be computed in terms of \(U_{f;{\mathcal {M}}}\) and (3.23). Thanks to the definition of the spaces \(H_z\), \(z\in {\mathcal {V}}\), and the general form of the theorem of Prager and Synge (see, e.g., Verfürth [26, Proposition 1.40]), we have
with the affine space
and the equalities in the definition of \({\mathbb {W}}_z\) have to be understood in the sense of distributions; the space \({\mathbb {W}}_z\) is not empty since \(\left\langle \pi _z\big (\phi _z({\mathcal {P}}_{{\mathcal {M}}}f+\Delta U_{f;{\mathcal {M}}})\big ),\,1\right\rangle =0\) for every \(z\in {\mathcal {V}}_0\).
In order to introduce a discrete counterpart of \({\mathbb {W}}_z\) in (4.12), we employ the Raviart-Thomas-Nédélec spaces
and define
which satisfies
and the hidden constant depends only on d and \(\sigma ({\mathcal {M}})\). Indeed, the right inequality is obvious because of \({\mathbb {W}}_z({\mathcal {M}})\subset {\mathbb {W}}_z\). The left inequality can be proved by an explicit construction; see, e.g., [7, 12]. For the ease of presentation, however, we shall assume
and note
in view of (4.12) and (4.13). Inserting this in the abstract bounds of Lemma 28, we readily obtain the following a posteriori bounds; as before, we suppress the global lower bound.
Theorem 29
(Equilibrated flux estimator with error-dominated oscillation) For any functional \(f\in H^{-1}(\Omega )\) and any conforming mesh \({\mathcal {M}}\), we have the global upper bound
as well as the following local lower bounds: for every vertex \(z\in {\mathcal {V}}\),
The hidden constant depends only on d and \(\sigma ({\mathcal {M}})\).
In contrast to the cited previous bounds, the upper bound in Theorem 29 contains the multiplicative constant \(d+1\). This constant arises from the localization in Lemma 27. As an alternative to this localization, one may use the constant-free upper bound in the following remark and split the estimator part \(\Vert \varvec{\Xi }\Vert _{}\) therein into local \(L^2\)-contributions.
Remark 30
(Alternative upper bound) Observing that
we set \(\varvec{\Xi }\mathrel {:=}\sum _{z\in {\mathcal {V}}}\varvec{\Xi }_z\) and apply the theorem of Prager and Synge (4.9) globally and Lemma 27 to obtain
References
Ainsworth, M.: A framework for obtaining guaranteed error bounds for finite element approximations. J. Comput. Appl. Math. 234, 2618–2632 (2010)
Ainsworth, M., Oden, J.T.: A Posteriori Error Estimation in Finite Element Analysis. Pure and Applied Mathematics. Wiley, New York (2000)
Babuška, I., Miller, A.: A feedback finite element method with a posteriori error estimation: I. The finite element method and some basic properties of the a posteriori error estimator. Comput. Methods Appl. Mech. Eng. 61, 1–40 (1987)
Babuška, I., Rheinboldt, W.: Error estimates for adaptive finite element computations. SIAM J. Numer. Anal. 15, 736–754 (1978)
Blechta, J., Màlek, J., Vohralìk, M., Localization of the \(W^{-1,q}\)-norm for local a posteriori efficiency. IMA J. Numer. Anal. 40, 914–950 (2020)
Bornemann, F.A., Erdmann, B., Kornhuber, R.: A posteriori error estimates for elliptic problems in two and three space dimensions. SIAM J. Numer. Anal. 33, 1188–1204 (1996)
Braess, D., Pillwein, V., Schöberl, J.: Equilibrated residual error estimates are \(p\)-robust. Comput. Methods Appl. Mech. Eng. 198, 1189–1197 (2009)
Braess, D., Schöberl, J.: Equilibrated residual error estimator for edge elements. Math. Comput. 77, 651–672 (2008)
Canuto, C., Nochetto, R.H., Stevenson, R., Verani, M.: On \(p\)-robust saturation for \(hp\)-afem. Comput. Math. Appl. 73, 2004–2022 (2017)
Cohen, A., DeVore, R., Nochetto, R.H.: Convergence rates of AFEM with \(H^{-1}\) data. Found. Comput. Math. 12, 671–718 (2012)
Ern, A., Guermond, J.-L.: Finite elements: II. Applications to PDEs (in preparation)
Ern, A., Smears, I., Vohralík, M.: Discrete \(p\)-robust \({H}(\rm div)\)-liftings and a posteriori estimates for elliptic problems with \({H}^{-1}\) source terms. Calcolo 54, 1009–1025 (2017)
Ern, A., Vohralík, M.: Polynomial-degree-robust a posteriori estimates in a unified setting for conforming, nonconforming, discontinuous Galerkin, and mixed discretizations. SIAM J. Numer. Anal. 53, 1058–1081 (2015)
Kreuzer, C., Veeser, A.: A posteriori analysis with error-dominated oscillation for higher order methods (in preparation)
Kreuzer, C., Veeser, A.: Convergence of adaptive finite element methods with error-dominated oscillation, Numerical mathematics and advanced applications—ENUMATH 2017. Lect. Notes Comput. Sci. Eng., vol. 126, Springer, Cham, pp. 471–479 (2019)
Luce, R., Wohlmuth, B.I.: A local a posteriori error estimator based on equilibrated fluxes. SIAM J. Numer. Anal. 42, 1394–1414 (2004)
Morin, P., Nochetto, R.H., Siebert, K.G.: Convergence of adaptive finite element methods. SIAM Rev. 44, 631–658 (2002)
Morin, P., Nochetto, R.H., Siebert, K.G.: Local problems on stars: a posteriori error estimators, convergence, and performance. Math. Comput. 72, 1067–1097 (2003)
Morin, P., Siebert, K.G., Veeser, A.: A basic convergence result for conforming adaptive finite elements. Math. Models Methods Appl. Sci. 18, 707–737 (2008)
Nochetto, R.H.: Pointwise a posteriori error estimates for elliptic problems on highly graded meshes. Math. Comput. 64, 1–22 (1995)
Nochetto, R.H., Veeser, A.: Primer of adaptive finite element methods, in Multiscale and adaptivity: modeling, numerics and applications, vol. 2040 of Lecture Notes in Math. Springer, Heidelberg, pp. 125–225 (2012)
Prager, W., Synge, J.L.: Approximations in elasticity based on the concept of function space. Quart. Appl. Math. 5, 241–269 (1947)
Stevenson, R.: Optimality of a standard adaptive finite element method. Found. Comput. Math. 7, 245–269 (2007)
Veeser, A.: Convergent adaptive finite elements for the nonlinear Laplacian. Numer. Math. 92, 743–770 (2002)
Verfürth, R.: A Review of A Posteriori Error Estimation and Adaptive Mesh-Refinement Techniques. Numerical Mathematics and Advanced. Wiley, John Chichester (1996)
Verfürth, R.: A Posteriori Error Estimation Techniques for Finite Element Methods, Numerical Mathematics and Scientific Computation. Oxford University Press, Oxford (2013)
Acknowledgements
The authors would like to thank Rüdiger Verfürth for the proposal to define a precursor of the operator \({\mathcal {P}}_{{\mathcal {M}}}\) in terms of a biorthogonal system.
Funding
Open access funding provided by Universitá degli Studi di Milano within the CRUI-CARE Agreement. Christian Kreuzer gratefully acknowledges partial support by the DFG research grant KR 3984/5–1 “Analysis of adaptive nonconforming Galerkin finite element methods”. Andreas Veeser gratefully acknowledges partial support by the GNCS and by Italian PRIN “Numerical Analysis for Full and Reduced Order Methods for the efficient and accurate solution of complex systems governed by Partial Differential Equations (NA–FROM–PDEs)”.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Kreuzer, C., Veeser, A. Oscillation in a posteriori error estimation. Numer. Math. 148, 43–78 (2021). https://doi.org/10.1007/s00211-021-01194-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00211-021-01194-8