A convexity enforcing $${C}^{{0}}$$ interior penalty method for the Monge–Ampère equation on convex polygonal domains

Brenner, Susanne C.; Sung, Li-yeng; Tan, Zhiyu; Zhang, Hongchao

doi:10.1007/s00211-021-01210-x

A convexity enforcing ${C}^{{0}}$ interior penalty method for the Monge–Ampère equation on convex polygonal domains

Open access
Published: 26 June 2021

Volume 148, pages 497–524, (2021)
Cite this article

Download PDF

You have full access to this open access article

Numerische Mathematik Aims and scope Submit manuscript

A convexity enforcing ${C}^{{0}}$ interior penalty method for the Monge–Ampère equation on convex polygonal domains

Download PDF

Susanne C. Brenner¹,
Li-yeng Sung¹,
Zhiyu Tan² &
…
Hongchao Zhang¹

2377 Accesses
1 Citation
Explore all metrics

Abstract

We design and analyze a $C^0$ interior penalty method for the approximation of classical solutions of the Dirichlet boundary value problem of the Monge–Ampère equation on convex polygonal domains. The method is based on an enhanced cubic Lagrange finite element that enables the enforcement of the convexity of the approximate solutions. Numerical results that corroborate the a priori and a posteriori error estimates are presented. It is also observed from numerical experiments that this method can capture certain weak solutions.

An interior penalty coupling strategy for isogeometric non-conformal Kirchhoff–Love shell patches

Article Open access 27 March 2024

Mixed virtual element method for integro-differential equations of parabolic type

Article 16 April 2024

Element-free Galerkin analysis of MHD duct flow problems at arbitrary and high Hartmann numbers

Article 15 April 2024

1 Introduction

The Monge–Ampère equation is a fully nonlinear partial differential equation that appears in geometric analysis and related applications. Various aspects of this important equation can be found in the monographs [6, 20, 38, 42, 44, 49, 72].

The Dirichlet boundary value problem for the Monge–Ampère equation is given by

$$\begin{aligned} \det D^2 u&=\psi&\qquad&\hbox { in}\ \Omega , \end{aligned}$$

(1.1a)

$$\begin{aligned} u&=\phi&\qquad&\hbox { on}\ \partial \Omega . \end{aligned}$$

(1.1b)

If $\Omega $ is a smooth and strictly convex domain, $\psi \in C^3({\bar{\Omega }})$ is strictly positive on ${\bar{\Omega }}$ and $\phi \in C^{4,\delta }({\bar{\Omega }})$ for some $\delta \in (0,1)$, then (1.1) has a unique strictly convex solution $u\in C^{4,\delta }({\bar{\Omega }})$ (cf. [21, p. 371, Remark 2]). Our goal is to develop finite element methods that can capture such smooth convex solutions of (1.1).

Remark 1.1

Throughout this paper we will follow the standard notation for differential operators, function spaces and norms that can be found for example in [1, 17, 23].

As a first step, we consider a finite element method for (1.1) on polygonal domains. Accordingly we assume that $\Omega \subset \mathbb {R}^2$ is a bounded convex polygon,

$$\begin{aligned} \phi&\in {H^4(\Omega )}, \end{aligned}$$

(1.2)

$$\begin{aligned} \psi&\in {H^2(\Omega )}\;\hbox { is strictly positive on}\ {\bar{\Omega }}, \end{aligned}$$

(1.3)

and

$$\begin{aligned} \text {the boundary value problem } (1.1) \text { has a strictly convex solution } u\in {H^4(\Omega )}, \end{aligned}$$

(1.4)

i.e., there exists a positive constant $\alpha _\sharp $ such that the Hessian $D^2 u$ satisfies

$$\begin{aligned} \xi ^t (D^2u)(x)\xi \ge \alpha _\sharp |\xi |^2 \qquad \forall \,x\in \Omega , \;\xi \in \mathbb {R}^2. \end{aligned}$$

(1.5)

Remark 1.2

The extension of our method to strictly convex smooth domains, where the regularity (1.4) follows from appropriate regularity of the data, will be carried out in a forthcoming paper.

Remark 1.3

Since a $2\times 2$ symmetric matrix and its cofactor matrix have identical eigenvalues, the estimate (1.5) is equivalent to

$$\begin{aligned} \xi ^t \mathrm {Cof}(D^2u)(x)\xi \ge \alpha _\sharp |\xi |^2 \qquad \forall \,x\in \Omega , \;\xi \in \mathbb {R}^2. \end{aligned}$$

(1.6)

Remark 1.4

Note that under assumption (1.3) a sufficiently smooth solution of (1.1) is strictly convex if and only if $\Delta u\ge 0$ on $\Omega $. This is the key motivation for the finite element method in this paper.

There are many numerical approaches to the Dirichlet boundary value problem of the Monge–Ampère equation (and related equations) in 2 and 3 spatial dimensions, with respect to different solution classes (classical solutions, Aleksandrov solutions [2] and viscosity solutions [54]). They include (i) geometric finite difference methods [63, 66, 68, 69], (ii) monotone finite difference methods [7,8,9, 39,40,41, 48, 50, 67], (iii) augmented Lagrangian and least-squares finite element methods [19, 28,29,30,31], (iv) finite element methods based on the vanishing moment approach [3, 35,36,37, 57], (v) finite element methods based on $L_2$ projection [4, 5, 10, 11, 13, 15, 27, 51, 58,59,60], (vi) finite element methods based on a reformulation of the Monge–Ampère equation as a Hamilton–Jacobi–Bellman equation [14, 34], and (v) two-scale methods [53, 64, 65]. Comprehensive reviews of the literature can be found in [33, 61].

The method in this paper is also based on a nonlinear least-squares approach. It is different from the least-squares method of Dean and Glowinski [19, 29, 31] in that our least-squares problem is posed only on the finite element spaces and the discrete problems are solved purely as optimization problems.

The key ingredient in our method is an enhanced cubic Lagrange element with exotic degrees of freedom (dofs) that enables us to enforce the convexity of the finite element solutions, which then allows us to develop a simple error analysis based on existing results for second order elliptic problems in non-divergence form.

The rest of the paper is organized as follows. We introduce the enhanced cubic Lagrange element in Sect. 2 together with the discrete nonlinear least-squares problem. We then present a priori and a posteriori error analyses in Sect. 3 and numerical results in Sect. 4. We end with some concluding remarks in Sect. 5. We also put some of the details in three appendices so that the main flow of the presentation is not distracted. Appendix A contains the derivation of a stability result for elliptic problems in non-divergence form needed for the error analysis in Sect. 3. Details of the optimization algorithm that we use to solve the discrete problems are given in Appendix B. An algorithm that we use to check the elementwise convexity of the approximate solutions is outlined in Appendix C.

Throughout the paper we will use C to denote a generic positive constant independent of the mesh size.

2 The discrete problem

The discrete problem is a nonlinear least-squares problem with box constraints. It is based on an exotic finite element space whose degrees of freedom (dofs) can enforce the convexity of the solutions.

2.1 An enhanced cubic Lagrange finite element

We begin by introducing a new finite element where some of the degrees of freedom (dofs) are associated with nodes outside the element domain. Consequently the construction of the local basis requires information from outside an element. Below we will treat a polynomial on an element as the restriction of a polynomial on $\mathbb {R}^2$ and use the same notation to denote both. In other words, we will identify $P_k(\mathbb {R}^2)$ with the space $P_k(T)$ of polynomials of (total) degree $\le k$ on a triangle T.

The construction of the finite element is based on the following lemma, where ${\hat{T}}$ is the reference simplex with vertices (0, 0), (1, 0) and (0, 1), and $\varphi _{{\hat{T}}}=x_1x_2(1-x_1-x_2)$ is the cubic bubble function that vanishes on the boundary of ${\hat{T}}$.

Lemma 2.1

A function $v\in P_3({\hat{T}})\oplus \varphi _{{\hat{T}}}^2 P_1({\hat{T}})$ is uniquely determined by the 10 dofs of the standard cubic Lagrange finite element together with the values of $\Delta v$ at the three points (1, 1), $(-1,1)$ and $(1,-1)$ (cf. Fig. 1).

Proof

Suppose v vanishes at the 9 vertex and edge nodes, then v belongs to $\langle { \varphi _{{\hat{T}}}}\rangle \oplus \varphi _{{\hat{T}}}^2 P_1({\hat{T}})$. A direct calculation shows that 0 is the only polynomial in $\langle { \varphi _{{\hat{T}}}}\rangle \oplus \varphi _{{\hat{T}}}^2 P_1({\hat{T}})$ that vanishes at the center of ${\hat{T}}$ and whose Laplacian also vanishes at the three points (1, 1), $(-1,1)$ and $(1,-1)$. $\square $

Remark 2.2

The space $P_3({\hat{T}})\oplus \varphi _{{\hat{T}}}^2 P_1({\hat{T}})$ and the 13 dofs in Lemma 2.1 do not define a finite element on ${\hat{T}}$ in the classical sense of Ciarlet in [23, page 78] because the shape functions are treated as functions defined globally on $\mathbb {R}^2$, and not as functions defined just on the element domain.

Remark 2.3

The vertices of the reference simplex are the midpoints of the edges of the triangle with vertices (1, 1), $(-1,1)$ and $(1,-1)$. For a general triangle T, the triangle whose midpoints are the vertices of T will be denoted by ${T}_\dag $.

On an arbitrary triangle T, the space of shape functions of the enhanced cubic Lagrange element is $P_3(T)\oplus \varphi _{\scriptscriptstyle T}^2P_1(T)$, where $\varphi _{\scriptscriptstyle T}$ is the cubic bubble function that vanishes on the boundary of T. The dofs of $v\in P_3(T)\oplus \varphi _{\scriptscriptstyle T}^2P_1(T)$ are (i) the values of v at the three vertices, (ii) the values of v at the two points that trisect each edge, (iii) the value of v at the center of T, and (iv) the values of $\mathrm {tr}(J_{\scriptscriptstyle T}^t D^2 vJ_{\scriptscriptstyle T})$ at the three vertices of ${T}_\dag $, where $J_{\scriptscriptstyle T}\in \mathbb {R}^{2\times 2}$ is the Jacobian matrix of an affine map that maps the reference simplex to T.

Remark 2.4

The enhanced cubic Lagrange element is affine-equivalent (cf. [17, 23]) by construction. The exotic dofs at the vertices of ${T}_\dag $ are responsible for enforcing the elementwise convexity of the discrete solutions of (1.1).

2.2 The finite element space ${V}_{h}$

Let $\mathcal {T}_h$ be a quasi-uniform simplicial triangulation of $\Omega $. A function v belongs to the finite element space $V_h \subset H^1(\Omega )$ if and only if (i) v belongs to $C({\bar{\Omega }})$ and (ii) the restriction of v to $T\in \mathcal {T}_h$ belongs to $P_3(T)\oplus \varphi _{\scriptscriptstyle T}^2P_1(T)$.

The (global) dofs of $v\in V_h$ are (i) the values of v at the vertices of $\mathcal {T}_h$, (ii) the values of v at the points that trisect the edges of $\mathcal {T}_h$, (iii) the values of v at the centers of the triangles in $\mathcal {T}_h$, and (iv) the values of $\mathrm {tr}(J_{\scriptscriptstyle T}^t D^2 v_{\scriptscriptstyle T}J_{\scriptscriptstyle T})$ at the three vertices of ${T}_\dag $ for each $T\in \mathcal {T}_h$, where $v_{\scriptscriptstyle T}$ is the restriction of v to T and $J_{\scriptscriptstyle T}$ is the Jacobian of an affine map that maps the reference simplex to T.

Remark 2.5

The dofs in (iii) and (iv) define the bubble functions in $\langle \varphi _{\scriptscriptstyle T}\rangle \oplus \varphi _{\scriptscriptstyle T}^2P_1(T)$.

It follows from the extension theorems for Sobolev spaces (cf. [1, Chapter 5]) that the solution $u\in H^4(\Omega )$ of (1.1) can be extended to a strictly convex function in $H^4({\tilde{\Omega }})$ where ${\tilde{\Omega }}$ is an open set that contains ${\bar{\Omega }}$ in its interior. We will denote this extension again by u. We assume that h is sufficiently small so that

$$\begin{aligned} {T}_\dag \subset {\tilde{\Omega }}\qquad \forall \,T\in \mathcal {T}_h. \end{aligned}$$

(2.1)

The nodal interpolant $\Pi _hu\in V_h$ is then defined by the condition that u and $\Pi _hu$ share the same global dofs mentioned above.

We will denote the piecewise Hessian operator by $D_h^2$, the set of the interior edges of $\mathcal {T}_h$ by $\mathcal {E}_h^i$, the length of an edge e by |e|, and the jump of the normal derivative of v across an (interior) edge by $[[\partial v/\partial n]]$.

Lemma 2.6

The following estimates are valid for $\Pi _hu$:

$$\begin{aligned}&\Vert u-\Pi _h u\Vert _{L_2(\Omega )}+h|u-\Pi _h u|_{H^1(\Omega )}+h\Vert u-\Pi _hu\Vert _{L_\infty (\Omega )} \nonumber \\&\quad +h^2\Vert D_h^2(u-\Pi _hu)\Vert _{L_2(\Omega )}+ h^4\Big (\sum _{T\in \mathcal {T}_h}|D_h^2(\Pi _h u)|_{H^2(T)}^2\Big )^\frac{1}{2} \le C h^4|u|_{H^4({\tilde{\Omega }})}, \end{aligned}$$

(2.2)

$$\begin{aligned}&|u-\Pi _hu|_{W^{2,\infty }(T)}\le Ch|u|_{H^4({\tilde{\Omega }})} \qquad \forall \,T\in \mathcal {T}_h, \end{aligned}$$

(2.3)

$$\begin{aligned}&\sum _{e\in \mathcal {E}_h^i}|e|^{-1}\Vert [[\partial (\Pi _hu)/\partial n]]\Vert _{L_2(e)}^2=\sum _{e\in \mathcal {E}_h^i}|e|^{-1}\Vert [[\partial (u-\Pi _hu)/\partial n]]\Vert _{L_2(e)}^2\nonumber \\&\quad \le Ch^4|u|_{H^4({\tilde{\Omega }})}^2, \end{aligned}$$

(2.4)

$$\begin{aligned}&\Vert D_h^2(\Pi _h u)\Vert _{L_2(\Omega )}^2+\sum _{T\in \mathcal {T}_h}|D_h^2(\Pi _h u)|_{H^2(T)}^2 +\max _{T\in \mathcal {T}_h}|\Pi _h u|_{W^{2,\infty }(T)}^2\le C\Vert u\Vert _{H^4({\tilde{\Omega }})}^2. \end{aligned}$$

(2.5)

Proof

The estimates (2.2) and (2.3) follow from the invariance of cubic polynomials under the local nodal interpolation operator, the Bramble-Hilbert lemma [12] and scaling. The estimate (2.2) then implies the estimate (2.4) through the trace theorem with scaling, and the estimate (2.5) follows from (2.2) and (2.3) through the triangle inequality and the Sobolev Embedding Theorem [1, Theorem 4.12]. $\square $

In view of the estimate for $\Vert D_h^2(u-\Pi _hu)\Vert _{L_2(\Omega )}$ in (2.2) and the bound for $\max _{T\in \mathcal {T}_h}|\Pi _h u|_{W^{2,\infty }(T)}$ in (2.5), we immediately arrive at

$$\begin{aligned} \Vert \det D_h^2(\Pi _hu)-\det D^2u\Vert _{L_2(\Omega )}\le Ch^2, \end{aligned}$$

(2.6)

where the positive constant C is independent of h.

Remark 2.7

All the estimates for $\Pi _h u$ are also valid for $\Pi _h\phi $. In particular we have

$$\begin{aligned} \Vert D_h^2(\phi -\Pi _h\phi )\Vert _{L_2(\Omega )}+{ \Big (}\sum _{e\in \mathcal {E}_h^i}|e|^{-1}\Vert [[\partial (\Pi _h\phi )/\partial n]]\Vert _{L_2(e)}^2{ \Big )^\frac{1}{2}}\le Ch^2. \end{aligned}$$

2.3 A nonlinear least-squares problem with box constraints

Let $\phi _h$ be the one dimensional cubic Lagrange interpolant of $\phi $ along $\partial \Omega $ and the (convex) subset $L_h$ of $V_h$ be defined by

$$\begin{aligned} L_h&=\{v\in V_h:\, v=\phi _h\;\text {on}\;\partial \Omega \;\text {and}\; \mathrm {tr}(J_{\scriptscriptstyle T}^tD^2v_{\scriptscriptstyle T}J_{\scriptscriptstyle T}) \ge 0\; \hbox { at the vertices of}\ {T}_\dag \nonumber \\&\quad \hbox { for every}\ T\in \mathcal {T}_h\}. \end{aligned}$$

(2.7)

Remark 2.8

The inequality constraints in the definition of $L_h$ are motivated by the observation in Remark 1.4 and they are box constraints for the dofs of $V_h$ introduced at the beginning of Sect. 2.2.

Remark 2.9

Note that $\phi _h=\Pi _h\phi =\Pi _h u$ on $\partial \Omega $ and hence $v=\Pi _h u$ on $\partial \Omega $ for all $v\in L_h$.

The discrete problem is to find

$$\begin{aligned} u_h=\mathop {\mathrm{argmin}}_{v\in L_h}\mathcal {J}_h(v), \end{aligned}$$

(2.8)

where the cost function $\mathcal {J}_h$ is defined by

$$\begin{aligned} \mathcal {J}_h(v)&=\frac{h^4}{2}\Vert D_h^2 v\Vert _{L_2(\Omega )}^2+\frac{1}{2}\sum _{T\in \mathcal {T}_h}|\mathrm {tr}(J_{\scriptscriptstyle T}^tD^2v_{\scriptscriptstyle T}J_{\scriptscriptstyle T})|_{H^2(T)}^2\nonumber \\&\quad +\frac{1}{2}\sum _{e\in \mathcal {E}_h^i}|e|^{-1}\Vert [[\partial v/\partial n]]\Vert _{L_2(e)}^2\nonumber \\&\quad +\frac{1}{2}\Vert \det D_h^2 v-\psi \Vert _{L_2(\Omega )}^2, \end{aligned}$$

(2.9)

and $v_{\scriptscriptstyle T}$ is the restriction of v to T. Note that the Frobenius norm of $J_{\scriptscriptstyle T}$ satisfies

$$\begin{aligned} |J_{\scriptscriptstyle T}|\approx h. \end{aligned}$$

(2.10)

We will use $\Vert \cdot \Vert _h$ to denote the mesh-dependent norm defined by

$$\begin{aligned} \Vert v\Vert _h^2=\Vert D_h^2v\Vert _{L_2(\Omega )}^2+\sum _{e\in \mathcal {E}_h^i}|e|^{-1}\Vert [[\partial v/\partial n]]\Vert _{L_2(e)}^2. \end{aligned}$$

(2.11)

Remark 2.10

The first two terms in the definition of $\mathcal {J}_h$ are regularization terms that are crucial for the well-posedness of the discrete problem and for enforcing the elementwise convexity of the discrete solutions. The third term is a penalty term that compensates for the fact that $V_h\not \subset H^2(\Omega )$. The last term is the least-squares term for (1.1a).

The solvability of (2.8) is justified by the following result.

Lemma 2.11

The cost function $\mathcal {J}_h:L_h\longrightarrow [0,\infty )$ has a global minimizer.

Proof

According to the Poincaré-Friedrichs inequality for piecewise $H^2$ functions in [18], we have

$$\begin{aligned} \Vert v\Vert _{L_2(\Omega )}\le C\Vert v\Vert _h \qquad \forall \,v\in V_h\cap H^1_0(\Omega ), \end{aligned}$$

which implies (cf. Remark 2.9)

$$\begin{aligned} \Vert v-\Pi _hu\Vert _{L_2(\Omega )}\le C\Vert v-\Pi _hu\Vert _h \qquad \forall \,v\in L_h. \end{aligned}$$

(2.12)

It follows from (2.2), (2.4), (2.5), (2.11) and (2.12) that $\Vert v\Vert _h$ (and hence $\mathcal {J}_h(v)$) approaches $\infty $ if v belongs to $L_h$ and $\Vert v\Vert _{L_2(\Omega )}$ approaches $\infty $. $\square $

3 Error analysis

We will show that any $u_h$ satisfying (2.8) will converge to the solution u of (1.1) as $h\downarrow 0$ and the order of convergence is 2. Since our optimization algorithm does not guarantee that a global minimizer of $\mathcal {J}_h$ can be found, it is also useful to have an a posteriori error estimate that can demonstrate the convergence of our method numerically.

3.1 Some a priori bounds for ${u_h}$

Since $\Pi _h u$ belongs to $L_h$, it follows from (1.1a), (2.4)–(2.6) and (2.10) that

$$\begin{aligned}&h^4\Vert D_h^2u_h\Vert _{L_2(\Omega )}^2+\sum _{T\in \mathcal {T}_h}|\mathrm {tr}(J_{\scriptscriptstyle T}^tD^2(u_h)_{\scriptscriptstyle T}J_{\scriptscriptstyle T})|_{H^2(T)}^2\nonumber \\&\qquad +\sum _{e\in \mathcal {E}_h^i}|e|^{-1}\Vert [[\partial u_h/\partial n]]\Vert _{L_2(e)}^2 +\Vert \det D_h^2 u_h-\psi \Vert _{L_2(\Omega )}^2\nonumber \\&\quad = 2J_h(u_h)\nonumber \\&\quad \le 2J_h(\Pi _h u)\nonumber \\&\quad =h^4\Vert D_h^2(\Pi _hu)\Vert _{L_2(\Omega )}^2 +\sum _{T\in \mathcal {T}_h}|\mathrm {tr}(J_{\scriptscriptstyle T}^tD^2(\Pi _hu)_{\scriptscriptstyle T}J_{\scriptscriptstyle T})|_{H^2(T)}^2\nonumber \\&\qquad +\sum _{e\in \mathcal {E}_h^i}|e|^{-1}\Vert [[\partial (\Pi _hu)/\partial n]]\Vert _{L_2(e)}^2+\Vert \det D_h^2(\Pi _hu)-\det D^2 u\Vert _{L_2(\Omega )}^2\nonumber \\&\quad \le Ch^4. \end{aligned}$$

(3.1)

Consequently we have

$$\begin{aligned} \Vert D_h^2u_h\Vert _{L_2(\Omega )}&\le C, \end{aligned}$$

(3.2)

$$\begin{aligned} \Big (\sum _{T\in \mathcal {T}_h}|\mathrm {tr}(J_{\scriptscriptstyle T}^tD^2(u_h)_{\scriptscriptstyle T}J_{\scriptscriptstyle T})|_{H^2(T)}^2\Big )^\frac{1}{2}&\le Ch^2, \end{aligned}$$

(3.3)

$$\begin{aligned} \Big (\sum _{e\in \mathcal {E}_h^i}|e|^{-1}\Vert [[\partial u_h/\partial n]]\Vert _{L_2(e)}^2\Big )^\frac{1}{2}&\le Ch^2, \end{aligned}$$

(3.4)

$$\begin{aligned} \Vert \det D_h^2 u_h-\psi \Vert _{L_2(\Omega )}&\le Ch^2. \end{aligned}$$

(3.5)

Let $T\in \mathcal {T}_h$ be arbitrary and $\psi _{\scriptscriptstyle T}$ be the $P_1$ Lagrange interpolant of $\psi $ on T. We have, by a standard inverse estimate (cf. [17, 23]),

$$\begin{aligned} \Vert \det D_h^2 u_h-\psi \Vert _{L_\infty (T)}&\le \Vert \det D_h^2 u_h-\psi _{\scriptscriptstyle T}\Vert _{L_\infty (T)} +\Vert \psi -\psi _{\scriptscriptstyle T}\Vert _{L_\infty (T)}\nonumber \\&\le Ch^{-1}\Vert \det D_h^2 u_h-\psi _{\scriptscriptstyle T}\Vert _{L_2(T)}+\Vert \psi -\psi _{\scriptscriptstyle T}\Vert _{L_\infty (T)}\nonumber \\&\le Ch^{-1}\big (\Vert \det D_h^2 u_h-\psi \Vert _{L_2(T)}+\Vert \psi -\psi _{\scriptscriptstyle T}\Vert _{L_2(T)}\big ) \nonumber \\&\qquad +\Vert \psi -\psi _{\scriptscriptstyle T}\Vert _{L_\infty (T)}, \end{aligned}$$

which together with (3.5) and the assumption that $\psi \in { H^2(\Omega )}$ implies

$$\begin{aligned} \Vert \det D_h^2 u_h-\psi \Vert _{L_\infty (T)}\le Ch \qquad \forall \,T\in \mathcal {T}_h. \end{aligned}$$

(3.6)

3.2 The elementwise convexity of ${u_h}$

Let $q_{\scriptscriptstyle T}$ be a polynomial defined on $T\in \mathcal {T}_h$. Recall that $q_{\scriptscriptstyle T}$ is the restriction of a polynomial defined on $\mathbb {R}^2$ which is also denoted by $q_{\scriptscriptstyle T}$. We define $I_Tq_{\scriptscriptstyle T}$ to be the restriction of $I_{{T}_\dag }q_{\scriptscriptstyle T}$ on T, where $I_{{T}_\dag }$ is the $P_1$ nodal interpolation operator associated with ${T}_\dag $. Note that any linear polynomial on T is invariant under $I_T$, and according to (2.7),

$$\begin{aligned} I_T \mathrm {tr}(J_{\scriptscriptstyle T}^t D^2v_{\scriptscriptstyle T}J_{\scriptscriptstyle T})\ge 0 \quad \hbox { on}\ T \text { for all } v\in L_h. \end{aligned}$$

(3.7)

We have, by (3.3), a standard inverse estimate (cf. [17, 23]) and the Bramble-Hilbert lemma,

$$\begin{aligned}&\Vert \mathrm {tr}(J_{\scriptscriptstyle T}^tD^2 (u_h)_{\scriptscriptstyle T} J_{\scriptscriptstyle T})-I_T \mathrm {tr}(J_{\scriptscriptstyle T}^t D^2 (u_h)_{\scriptscriptstyle T} J_{\scriptscriptstyle T}) \Vert _{L_\infty (T)}\nonumber \\&\quad \le C h^{-1}\Vert \mathrm {tr}(J_{\scriptscriptstyle T}^tD^2(u_h)_{\scriptscriptstyle T} J_{\scriptscriptstyle T})-I_T \mathrm {tr}(J_{\scriptscriptstyle T}^t D^2(u_h)_{\scriptscriptstyle T} J_{\scriptscriptstyle T})\Vert _{L_2(T)}\nonumber \\&\quad \le C h|\mathrm {tr}(J_{\scriptscriptstyle T}^tD^2(u_h)_{\scriptscriptstyle T} J_{\scriptscriptstyle T})|_{H^2(T)}\nonumber \\&\quad \le Ch^3. \end{aligned}$$

(3.8)

Lemma 3.1

There exists a positive constant $\alpha _\flat $ independent of h such that, for h sufficiently small, we have

$$\begin{aligned} \xi ^t D_h^2u_h\xi \ge \alpha _\flat |\xi |^2\qquad \hbox {on all}\ T\in \mathcal {T}_h \text { and for all } \xi \in \mathbb {R}^2, \end{aligned}$$

(3.9)

or equivalently the minimum eigenvalue of $D_h^2u_h$ is bounded below by a positive constant independent of h.

Proof

Let $T\in \mathcal {T}_h$ be arbitrary. From (3.6), we have

$$\begin{aligned} \det D^2(u_h)_{\scriptscriptstyle T} \ge \frac{1}{2} \min _{x\in {\bar{\Omega }}}\psi (x)>0 \qquad \hbox { on}\ T \end{aligned}$$

if h is sufficiently small. Consequently, in view of (2.10), we also have

$$\begin{aligned} h^{-4}\det (J_{\scriptscriptstyle T}^t D^2(u_h)_{\scriptscriptstyle T}J_{\scriptscriptstyle T})\ge \frac{ \delta ^2}{2} \min _{x\in {\bar{\Omega }}}\psi (x) \qquad \forall \,T\in \mathcal {T}_h, \end{aligned}$$

(3.10)

where the positive constant $\delta \le \min \{h^{-2}|\det J_{\scriptscriptstyle T}|:\, T\in \mathcal {T}_h\}$ is independent of h.

On the other hand, on each $T\in \mathcal {T}_h$, we have

$$\begin{aligned} h^{-2}\mathrm {tr}(J_{\scriptscriptstyle T}^tD^2(u_h)_{\scriptscriptstyle T} J_{\scriptscriptstyle T})&\ge h^{-2}\big [\mathrm {tr}(J_{\scriptscriptstyle T}^t D^2(u_h)_{\scriptscriptstyle T} J_{\scriptscriptstyle T})-I_T\mathrm {tr}(J_{\scriptscriptstyle T}^tD^2(u_h)_{\scriptscriptstyle T} J_{\scriptscriptstyle T})\big ]\nonumber \\&\ge -h^{-2}\Vert \mathrm {tr}(J_{\scriptscriptstyle T}^t D^2(u_h)_{\scriptscriptstyle T} J_{\scriptscriptstyle T})-I_T \mathrm {tr}(J_{\scriptscriptstyle T}^t D^2(u_h)_{\scriptscriptstyle T} J_{\scriptscriptstyle T}) \Vert _{L_\infty (T)}\nonumber \\&\ge -Ch \end{aligned}$$

(3.11)

by (3.7) and (3.8), where the positive constant C is independent of h.

We conclude from (3.10) and (3.11) that, for h sufficiently small, the minimum eigenvalue of $h^{-2}J_{\scriptscriptstyle T}^tD^2(u_h)_{\scriptscriptstyle T}J_{\scriptscriptstyle T}$ on the triangle T is bounded below by a positive constant independent of T and h, which implies that the same is true for $D_h^2u_h$ because $\mathcal {T}_h$ is a quasi-uniform triangulation. $\square $

Therefore, for h sufficiently small, $u_h$ is a strictly convex polynomial on each $T\in \mathcal {T}_h$. Note that (3.9) is equivalent to

$$\begin{aligned} \xi ^t \mathrm {Cof}(D_h^2u_h)\xi \ge \alpha _\flat |\xi |^2\qquad \text { on all } T\in \mathcal {T}_h \text { and for all } \xi \in \mathbb {R}^2. \end{aligned}$$

(3.12)

3.3 A priori error estimates

We have, by the fundamental theorem of calculus (cf. [38, Lemma A.1]),

$$\begin{aligned} \det D^2 u-\det D_h^2 u_h=\Big [\int _0^1 \mathrm {Cof}D_h^2(tu+(1-t)u_h) dt\Big ]:D_h^2(u-u_h), \end{aligned}$$

(3.13)

which is valid for all points in $\Omega $ except those on the edges of $\mathcal {T}_h$. Here and below we use the colon to denote the Frobenius inner product between matrices.

Let $A\in [L_\infty (\Omega )]^{2\times 2}$ be defined by the integral on the right-hand side of (3.13). For h sufficiently small, we have, by (1.6), (3.6) and (3.12),

$$\begin{aligned} \alpha |\xi |^2\le \xi ^tA(x)\xi \le \beta |\xi |^2 \qquad \forall \,\xi \in \mathbb {R}^2 \quad \text {and almost all } x\in \Omega , \end{aligned}$$

(3.14)

where $0<\alpha \le \beta $ are constants independent of h.

The proof of the following lemma is given in Appendix A.

Lemma 3.2

Under the condition (3.14) we have

$$\begin{aligned} \Vert D_h^2(\zeta - v)\Vert _{L_2(\Omega )}&\le \frac{1}{1-\delta }\Big [\alpha ^{-1}\Vert A:D_h^2 (\zeta -v)\Vert _{L_2(\Omega )}\nonumber \\&\quad +2C_\dag \Big (\sum _{e\in \mathcal {E}_h^i}|e|^{-1}\Vert [[\partial v/\partial n]]\Vert _{L_2(e)}^2\Big )^\frac{1}{2}\Big ] \end{aligned}$$

(3.15)

for all $\zeta \in H^2(\Omega )\cap H^1_0(\Omega )$ and $v\in V_h\cap H^1_0(\Omega )$, where

$$\begin{aligned} \delta =\frac{\beta -\alpha }{(\alpha ^2+\beta ^2)^\frac{1}{2}} \end{aligned}$$

(3.16)

and the positive constant $C_\dag $ only depends on the shape regularity of $\mathcal {T}_h$.

We can now establish an a priori error estimate for $u_h$.

Theorem 3.3

There exists a positive constant C independent of h such that

$$\begin{aligned} \Vert u-u_h\Vert _h\le Ch^2. \end{aligned}$$

(3.17)

Proof

We can assume h is sufficiently small so that (3.14) is satisfied. We begin with the triangle inequality

$$\begin{aligned} \Vert D_h^2(u-u_h)\Vert _{L_2(\Omega )}\le & {} \Vert D_h^2\big ((u-\phi )-(u_h-\Pi _h\phi )\big )\Vert _{L_2(\Omega )}\nonumber \\&+\Vert D_h^2(\phi -\Pi _h\phi )\Vert _{L_2(\Omega )}. \end{aligned}$$

(3.18)

Note that $u-\phi \in H^2(\Omega )\cap H^1_0(\Omega )$ by (1.1b) and $u_h-\Pi _h\phi \in V_h\cap H^1_0(\Omega )$ by (2.7) and Remark 2.9. Hence it follows from (3.15) that

$$\begin{aligned}&\Vert D_h^2\big ((u-\phi )-(u_h-\Pi _h\phi )\big )\Vert _{L_2(\Omega )}\nonumber \\&\quad \le C\Big [\Vert A:D_h^2\big ((u-\phi )-(u_h-\Pi _h\phi )\big )\Vert _{L_2(\Omega )}\nonumber \\&\qquad +\Big (\sum _{e\in \mathcal {E}_h^i}|e|^{-1}\Vert [[\partial (u_h-\Pi _h\phi )/\partial n]]\Vert _{L_2(e)}^2\Big )^\frac{1}{2}\Big ]\nonumber \\&\quad \le C\Big [\Vert A:D_h^2(u-u_h)\Vert _{L_2(\Omega )}+\Big (\sum _{e\in \mathcal {E}_h^i}|e|^{-1}\Vert [[\partial u_h/\partial n]]\Vert _{L_2(e)}^2\Big )^\frac{1}{2}\nonumber \\&\qquad +\Vert D_h^2(\phi -\Pi _h\phi )\Vert _{L_2(\Omega )}+\Big (\sum _{e\in \mathcal {E}_h^i}|e|^{-1}\Vert [[\partial \Pi _h\phi /\partial n]]\Vert _{L_2(e)}^2\Big )^\frac{1}{2}\Big ]. \end{aligned}$$

(3.19)

Putting (1.1a), (3.13), (3.18) and (3.19) together, we have

$$\begin{aligned} \Vert D_h^2(u-u_h)\Vert _{L_2(\Omega )}&\le C\Big [\Vert \psi -\det D_h^2u_h\Vert _{L_2(\Omega )}+\Big (\sum _{e\in \mathcal {E}_h^i}|e|^{-1}\Vert [[\partial u_h/\partial n]]\Vert _{L_2(e)}^2\Big )^\frac{1}{2}\nonumber \\&\quad +\mathrm {Osc}(\phi )\Big ], \end{aligned}$$

(3.20)

where

$$\begin{aligned} \mathrm {Osc}(\phi )=\Vert D_h^2(\phi -\Pi _h\phi )\Vert _{L_2(\Omega )}+ \Big (\sum _{e\in \mathcal {E}_h^i}|e|^{-1}\Vert [[\partial \Pi _h\phi /\partial n]]\Vert _{L_2(e)}^2\Big )^\frac{1}{2}. \end{aligned}$$

(3.21)

It follows from Remark 2.7, (3.4), (3.5), (3.20) and (3.21) that

$$\begin{aligned} \Vert D_h^2(u-u_h)\Vert _{L_2(\Omega )}\le Ch^2, \end{aligned}$$

(3.22)

and then the estimate (3.17) follows from (2.11), (3.4) and (3.22). $\square $

Remark 3.4

A careful tracking of the constant C in (3.17) shows that it takes the form of $C_* (\beta /\alpha )[(1/\alpha )+1]$, where $\alpha $ and $\beta $ are the constants in (3.14) and the positive constant $C_*$ depends only on u, $\phi $ and the shape regularity of $\mathcal {T}_h$.

According to the Poincaré-Friedrichs and Sobolev inequalities for piecewise $H^2$ functions (cf. [16, 18]), we have

$$\begin{aligned} \Vert \zeta \Vert _{L_2(\Omega )}+|\zeta |_{H^1(\Omega )}+\Vert \zeta \Vert _{L_\infty (\Omega )}\le C\Vert \zeta \Vert _h \end{aligned}$$

(3.23)

for all $\zeta \in H^1_0(\Omega )$ that is piecewise $H^2$ with respect to $\mathcal {T}_h$. It follows from (2.2), (2.4), Remark 2.9, (2.11), (3.17), (3.23) and the triangle inequality that

$$\begin{aligned}&\Vert u-u_h\Vert _{L_2(\Omega )}+|u-u_h|_{H^1(\Omega )}+\Vert u-u_h\Vert _{L_\infty (\Omega )}\nonumber \\&\quad \le \Vert u-\Pi _hu\Vert _{L_2(\Omega )}+|u-{ \Pi _h} u_h|_{H^1(\Omega )}+\Vert u-{ \Pi _h}u_h\Vert _{L_\infty (\Omega )}\nonumber \\&\qquad +C(\Vert u-\Pi _hu\Vert _h+\Vert u-u_h\Vert _h)\nonumber \\&\quad \le Ch^2. \end{aligned}$$

(3.24)

Remark 3.5

Numerical results in Example 4.1 indicate that the convergence in $\Vert \cdot \Vert _{L_2(\Omega )}$, $|\cdot |_{H^1(\Omega )}$ and $\Vert \cdot \Vert _{L_\infty (\Omega )}$ is better than $O(h^2)$.

3.4 An a posteriori error estimate

Since finding a global minimizer of a nonconvex function is in general NP-hard, an optimization algorithm usually only produces an approximate stationary point ${\tilde{u}}_h$ of the cost function $\mathcal {J}_h$. Therefore we need more than the a priori error estimate (3.17) to ensure the convergence of the approximate solutions to the solution u of (1.1).

In the following discussion, it suffices to assume that u belonging to $W^{2,\infty }(\Omega )$ is strictly convex, i.e., (1.5) is satisfied almost everywhere in $\Omega $.

Let ${\tilde{u}}_h\in L_h$ be elementwise strictly convex. Then the relation (3.13) is valid with $u_h$ replaced by ${\tilde{u}}_h$, and we have, by Lemma 3.2 and the arguments in the proof of Theorem 3.3,

$$\begin{aligned} \Vert D_h^2(u-{\tilde{u}}_h)\Vert _{L_2(\Omega )}&\le C\Big [\Vert \det D_h^2{\tilde{u}}_h-\psi \Vert _{L_2(\Omega )}+\Big (\sum _{e\in \mathcal {E}_h^i}|e|^{-1}\Vert [[\partial {\tilde{u}}_h/\partial n]]\Vert _{L_2(e)}^2\Big )^\frac{1}{2}\nonumber \\&\quad +\mathrm {Osc}(\phi )\Big ], \end{aligned}$$

(3.25)

where the positive constant C depends only on the minimum and maximum eigenvalues of $D^2u$ and $D_h^2{\tilde{u}}_h$ over $\Omega $ and the shape regularity of $\mathcal {T}_h$.

Therefore, after verifying the elementwise strict convexity of an approximate solution ${{\tilde{u}}}_h$ for (2.8), we can monitor the convergence of ${{\tilde{u}}}_h$ by evaluating the residual-based error estimator

$$\begin{aligned} \eta _h({{\tilde{u}}}_h)=\Vert \det D_h^2{{\tilde{u}}}_h-\psi \Vert _{L_2(\Omega )}+\Big (\sum _{e\in \mathcal {E}_h^i}|e|^{-1}\Vert [[\partial {\tilde{u}}_h/\partial n]]\Vert _{L_2(e)}^2\Big )^\frac{1}{2}. \end{aligned}$$

(3.26)

According to (2.11) and (3.25), the estimator $\eta _h$ is reliable for the error measured by the norm $\Vert \cdot \Vert _h$:

$$\begin{aligned} \Vert u-{\tilde{u}}_h\Vert _h\le C\big (\eta _h({\tilde{u}}_h)+\mathrm {Osc}(\phi )\big ). \end{aligned}$$

(3.27)

Moreover $\mathrm {Osc}(\phi )$ is $O(h^2)$ (cf. Remark 2.7).

In the other direction the obvious relations

$$\begin{aligned} |e|^{-1}\Vert [[\partial {\tilde{u}}_h/\partial n]]\Vert _{L_2(e)}^2&= |e|^{-1}\Vert [[\partial (u- {\tilde{u}}_h)/\partial n]]\Vert _{L_2(e)}^2 \end{aligned}$$

(3.28)

and

$$\begin{aligned} \Vert \det D_h^2{\tilde{u}}_h-\psi \Vert _{L_2(T)}&=\Vert \det D^2_h{\tilde{u}}_h-\det D^2u\Vert _{L_2(T)} \end{aligned}$$

(3.29)

imply that $\eta _h({\tilde{u}}_h)$ is also locally efficient. Therefore we can use $\eta _h$ to generate adaptive meshes when the solution of (1.1) is less smooth.

4 Numerical results

We have tested our method on three examples with known solutions. For each example we solve (2.7)–(2.9) by an active set algorithm (cf. Appendix B) that produces an approximate stationary point of the cost function in (2.9). The elementwise convexity of the approximate solutions are checked numerically by Algorithm C.1 in Appendix C.

For Example 4.2 and Example 4.3, where the known solutions do not belong to $H^4(\Omega )$, we have also solved (2.7)–(2.9) on adaptive meshes generated by the error estimator in (3.26) and a Dörfler marking strategy [32].

The relative errors of the approximate solution ${\tilde{u}}_h$ in various norms are defined by

$$\begin{aligned} e^r_{2,h} = \frac{\Vert u - {\tilde{u}}_h\Vert _{h}}{\Vert u\Vert _{H^2(\Omega )}},\quad e^r_{1,h} = \frac{\Vert u - {\tilde{u}}_h\Vert _{H^1(\Omega )}}{\Vert u\Vert _{H^1(\Omega )}},\quad e^r_{0,h} = \frac{\Vert u - {\tilde{u}}_h\Vert _{L^2(\Omega )}}{\Vert u\Vert _{L^2(\Omega )}} \end{aligned}$$

and

$$\begin{aligned} e^r_{\infty ,h} = \frac{\max \limits _{p\in \mathcal {V}_h}|u(p) - {\tilde{u}}_h(p)|}{\Vert u\Vert _{L^\infty (\Omega )}}, \end{aligned}$$

where $\mathcal {V}_h$ is the set of the vertices of the triangulation $\mathcal {T}_h$.

All the numerical experiments were carried out on a MacBook Pro laptop computer with a 2.8GHz Quad-Core Intel Core i7 processor and with 16GB 2133 MHz LPDDR3 memory. We use MATLAB (R2018b v.9.5.0) in our computations.

Example 4.1

This example is from [28], where $\Omega $ is the unit square $(0,1)^2$,

$$\begin{aligned} \psi (x) = (1+|x|^2)e^{|x|^2/2} \quad \text {and}\quad \phi (x) = e^{x^2/2}. \end{aligned}$$

The exact solution is $u = e^{x^2/2}$. The assumptions (1.2)–(1.4) are satisfied.

The errors of the approximate solutions ${\tilde{u}}_h$ obtained by the optimization algorithm on uniform meshes are reported in Table 1. The order of convergence for $e_{2,h}^r$ is 2, which agrees with the estimate in Theorem 3.3 for the solutions $u_h$ of (2.8). The orders of convergence for $e_{1,h}^r$, $e_{0,h}^r$ and $e_{\infty ,h}^r$ are higher.

Table 1 Relative errors versus mesh size h and orders of convergence (Example 4.1)

Full size table

The residual $\eta _h({\tilde{u}}_h)$ and the cost $\mathcal {J}_h({\tilde{u}}_h)$ are reported in Table 2, their behaviors agree with the estimates (3.1), (3.4) and (3.5) for the minimizer $u_h$. It is observed from the CPU times in Table 2 that a good approximate solution at $h=2^{-2}$ was computed in 3 seconds.

We have verified that all the approximate solutions are elementwise strictly convex, and the reliability of the error estimator $\eta _h$ can be observed by comparing $e_{2,h}^r$ in Table 1 and $\eta _h({\tilde{u}}_h)$ in Table 2.

We have also solved the same problem on four other regular polygons (cf. Fig. 2), where the diameters of these polygons are 2.3660 (triangle), 1.6420 (pentagon), 1.5774 (hexagon) and 1.5307 (octagon).

It is observed from the convergence histories of $e_{2,h}^r$ and $e_{\infty ,h}^r$ in Fig. 3 that the performance of our method is similar for all five polygons.

Table 2 Residual, Cost and CPU Time (Example 4.1)

Full size table

Example 4.2

This example is from [63], where $\Omega = (-1,1)^2$,

$$\begin{aligned} \psi (x) = \left\{ \begin{aligned}&16\quad&\ \text{ in }\ \ |x|\le 1/2\\&64 - 16|x|^{-1} \quad&\text{ in }\ \ 1/2\le |x| \end{aligned} \right. , \end{aligned}$$

the exact solution is

$$\begin{aligned} u(x) = \left\{ \begin{aligned}&2|x|^2\quad&\ \text{ in }\ \ |x|\le 1/2\\&2(|x| - 1/2)^2 + 2|x|^2\quad&\text{ in }\ \ 1/2\le |x| \end{aligned} \right. , \end{aligned}$$

and $\phi \in H^4(\Omega )$ equals u in a neighborhood of $\partial \Omega $. For this example, the function $\psi $ is piecewise smooth and discontinuous along the circle defined by $|x|=1/2$, and the Aleksandrov solution u is a piecewise smooth $C^1$ function.

The computational meshes are generated by a bisection procedure to fit the circle where $\psi $ is discontinuous. The first two meshes and the final mesh are presented in Fig. 4.

We have verified that all the approximate solutions are elementwise strictly convex. The relative errors are reported in Table 3. The convergence of $e_{2,h}^r$ is of a reduced order $\approx 0.5$, and the orders of convergence are higher for the lower order norms.

The residual $\eta _h({\tilde{u}}_h)$, the cost $\mathcal {J}_h({\tilde{u}}_h)$ and the CPU time are provided in Table 4. It is observed that a satisfactory approximate solution with 1009 dofs was computed in less than 3 seconds. The reliability estimate (3.27) is confirmed by comparing the values of $e_{2,h}^r$ and $\eta _h({\tilde{u}}_h)$.

Table 3 Relative errors versus mesh size h and orders of convergence (Example 4.2)

Full size table

Table 4 Residual, Cost and CPU Time (Example 4.2)

Full size table

We also tested the performance of the a posteriori error estimator in (3.26) for this example. The convergence histories of $e_{2,h}^r$ and $e_{\infty ,h}^r$ on bisection and adaptive meshes are shown in Fig. 5. The advantages of the adaptive meshes can be observed until round-off errors interfere at finer meshes.

The discrete solution on the final bisection mesh and the adaptive mesh with 3385 dofs are displayed in Fig. 6.

Example 4.3

This example is from [64], which is a modification of an example in [48]. The domain $\Omega $ is the unit square $ = (0,1)^2$,

$$\begin{aligned} \psi (x) = \max \left( 1 - \frac{0.2}{|x - (1/2,1/2)|},0\right) , \end{aligned}$$

the exact solution of this example is

$$\begin{aligned} u(x) = \frac{1}{2}(\max (|x - (1/2,1/2)| - 0.2),0)^2, \end{aligned}$$

and $\phi \in H^4(\Omega )$ equals u in a neighborhood of $\partial \Omega $. For this example the function $\psi $ vanishes on the disc defined by $|x-(1/2,1/2)|\le 0.2$ and the Aleksandrov solution u is a piecewise smooth $C^1$ function.

We solved this problem by the nonlinear least-squares method on uniform meshes. The approximate solutions are elementwise strictly convex outside the disc where $\psi =0$. The relative errors of ${\tilde{u}}_h$ are reported in Table 5. The order of convergence for $e_{2,h}^r$ is roughly 0.5 and the orders of convergence for $e_{1,h}^r$, $e_{0,h}^r$ and $e_{\infty ,h}^r$ are better than 1. The discrete solution at $h=2^{-5}$ can be found in Fig. 8.

The residual $\eta _h({\tilde{u}}_h)$, the cost $\mathcal {J}_h({\tilde{u}}_h)$ and the CPU time are provided in Table 6. Comparing $e_{2,h}^r$ in Table 5 and $\eta _h({\tilde{u}}_h)$ in Table 6, one can see that the reliability estimate (3.27) is no longer valid because of the lack of strict convexity for both the discrete solutions and the continuous solution inside the disc where $\psi =0$. It can also be seen that a satisfactory approximate solution at $h=2^{-3}$ was computed in less than 8 seconds.

Table 5 Relative errors versus mesh size h and orders of convergence (Example 4.3)

Full size table

Table 6 Residual, Cost and CPU Time (Example 4.3)

Full size table

We also tested the performance of the a posteriori residual error estimator in (3.26) for this example. The convergence histories of $e_{2,h}^r$ and $e_{\infty ,h}^r$ on uniform and adaptive meshes are shown in Fig. 7. The advantage of adaptive meshes is observed. The adaptive mesh with 30187 dofs in Fig. 8 clearly captures the singularities of the exact solution along the circle defined by $|x-(1/2,1/2)|=0.2$.

5 Concluding remarks

By going beyond the classical definition of a finite element, we are able to construct a $C^0$ interior penalty method for the Dirichlet boundary value problem of the Monge–Ampère equation, where the elementwise convexity of the approximate solutions can be enforced. This in turn enables us to use existing results for second order elliptic equations in nondivergence form to obtain both a priori and a posteriori error estimates. The a posteriori error estimate is a significant part of our method since it allows us to access the convergence of the approximate solutions generated by optimization algorithms that are not necessarily global minimizers.

The approach in this paper can be extended to smooth domains. We also note that convexity enforcing is useful for the problem of prescribed Gaussian curvature (cf. [38, 42]) and the nonlinear least-squares approach can be applied to the Pucci equations (cf. [20, 42]). These are some of our ongoing projects.

References

Adams, R.A., Fournier, J.J.F.: Sobolev Spaces, 2nd edn. Academic Press, Amsterdam (2003)
MATH Google Scholar
Aleksandrov, A.D.: Dirichlet’s problem for the equation ${\rm Det}\,||z_{ij}|| =\varphi (z_{1},\cdots ,z_{n},z, x_{1},\cdots , x_{n})$. I. Vestnik Leningrad. Univ. Ser. Mat. Meh. Astr. 13, 5–24 (1958)
MathSciNet Google Scholar
Awanou, G.: Spline element method for Monge-Ampère equations. BIT 55, 625–646 (2015)
Article MathSciNet MATH Google Scholar
Awanou, G.: Standard finite elements for the numerical resolution of the elliptic Monge-Ampère equations: classical solutions. IMA J. Numer. Anal. 35, 1150–1166 (2015)
Article MathSciNet MATH Google Scholar
Awanou, G.: Standard finite elements for the numerical resolution of the elliptic Monge-Ampère equation: Aleksandrov solutions. ESAIM Math. Model. Numer. Anal. 51, 707–725 (2017)
Article MathSciNet MATH Google Scholar
Bakelman, I.J.: Convex Analysis and Nonlinear Geometric Elliptic Equations. Springer, Berlin (1994)
Book MATH Google Scholar
Barles, G., Souganidis, P.E.: Convergence of approximation schemes for fully nonlinear second order equations. Asymptotic Anal. 4, 271–283 (1991)
Article MathSciNet MATH Google Scholar
Benamou, J.-D., Collino, F., Mirebeau, J.-M.: Monotone and consistent discretization of the Monge-Ampère operator. Math. Comp. 85, 2743–2775 (2016)
Article MathSciNet MATH Google Scholar
Benamou, J.-D., Froese, B.D., Oberman, A.M.: Two numerical methods for the elliptic Monge-Ampère equation. M2AN Math. Model. Numer. Anal. 44, 737–758 (2010)
Article MathSciNet MATH Google Scholar
Böhmer, K.: On finite element methods for fully nonlinear elliptic equations of second order. SIAM J. Numer. Anal. 46, 1212–1249 (2008)
Article MathSciNet MATH Google Scholar
Böhmer, K., Schaback, R.: A meshfree method for solving the Monge-Ampère equation. Numer. Algorithms, pp 539–551, (2019)
Bramble, J.H., Hilbert, S.R.: Estimation of linear functionals on Sobolev spaces with applications to Fourier transforms and spline interpolation. SIAM J. Numer. Anal. 7, 113–124 (1970)
Article MathSciNet MATH Google Scholar
Brenner, S.C., Gudi, T., Neilan, M., Sung, L.-Y.: $C^0$ penalty methods for the fully nonlinear Monge-Ampère equation. Math. Comp. 80, 1979–1995 (2011)
Article MathSciNet MATH Google Scholar
Brenner, S.C., Kawecki, E.L.: Adaptive $C^0$ interior penalty methods for Hamilton-Jacobi-Bellman equations with Cordes coefficients. J. Comp. Appl. Math. 1, 1 (2020). https://doi.org/10.1016/j.cam.2020.11324
Article Google Scholar
Brenner, S.C., Neilan, M.: Finite element approximations of the three dimensional Monge-Ampère equation. ESAIM Math. Model. Numer. Anal. 46, 979–1001 (2012)
Article MathSciNet MATH Google Scholar
Brenner, S.C., Neilan, M., Reiser, A., Sung, L.-Y.: A $C^0$ interior penalty method for a von Kármán plate. Numer. Math. 135, 803–832 (2017)
Article MathSciNet MATH Google Scholar
Brenner, S.C., Scott, L.R.: The Mathematical Theory of Finite Element Methods $($Third Edition$)$. Springer, New York (2008)
Book Google Scholar
Brenner, S.C., Wang, K., Zhao, J.: Poincaré-Friedrichs inequalities for piecewise $H^2$ functions. Numer. Funct. Anal. Optim. 25, 463–478 (2004)
Article MathSciNet MATH Google Scholar
Caboussat, A., Glowinski, R., Sorensen, D.C.: A least-squares method for the numerical solution of the Dirichlet problem for the elliptic Monge-Ampère equation in dimension two. ESAIM Control Optim. Calc. Var. 19, 780–810 (2013)
Article MathSciNet MATH Google Scholar
Caffarelli, L.A., Cabré, X.: Fully Nonlinear Elliptic Equations. American Mathematical Society, Providence (1995)
Book MATH Google Scholar
Caffarelli, L.A., Nirenberg, L., Spruck, J.: The Dirichlet problem for nonlinear second-order elliptic equations. I. Monge-Ampère equation. Comm. Pure Appl. Math. 37, 369–402 (1984)
Article MathSciNet MATH Google Scholar
Campanato, S.: A Cordes type condition for nonlinear nonvariational systems. Rend. Accad. Naz. Sci. XL Mem. Mat. (5) 13, 307–321 (1989)
MathSciNet MATH Google Scholar
Ciarlet, P.G.: The Finite Element Method for Elliptic Problems. North-Holland, Amsterdam (1978)
MATH Google Scholar
Cordes, H.O.: Über die erste Randwertaufgabe bei quasilinearen Differentialgleichungen zweiter Ordnung in mehr als zwei Variablen. Math. Ann. 131, 278–312 (1956)
Article MathSciNet MATH Google Scholar
Dai, Y.-H., Hager, W.W., Schittkowski, K., Zhang, H.: The cyclic Barzilai-Borwein method for unconstrained optimization. IMA J. Numer. Anal. 26, 604–627 (2006)
Article MathSciNet MATH Google Scholar
Dai, Y.-H., Zhang, H.: Adaptive two-point stepsize gradient algorithm. Numer. Algorithms 27, 377–385 (2001)
Article MathSciNet MATH Google Scholar
Davydov, O., Saeed, A.: Numerical solution of fully nonlinear elliptic equations by Böhmer’s method. J. Comput. Appl. Math. 254, 43–54 (2013)
Article MathSciNet MATH Google Scholar
Dean, E.J., Glowinski, R.: Numerical solution of the two-dimensional elliptic Monge-Ampère equation with Dirichlet boundary conditions: an augmented Lagrangian approach. C. R. Math. Acad. Sci. Paris 336, 779–784 (2003)
Article MathSciNet MATH Google Scholar
Dean, E.J., Glowinski, R.: Numerical solution of the two-dimensional elliptic Monge-Ampère equation with Dirichlet boundary conditions: a least-squares approach. C. R. Math. Acad. Sci. Paris 339, 887–892 (2004)
Article MathSciNet MATH Google Scholar
Dean, E.J., Glowinski, R.: An augmented Lagrangian approach to the numerical solution of the Dirichlet problem for the elliptic Monge-Ampère equation in two dimensions. Electron. Trans. Numer. Anal. 22, 71–96 (2006). ((electronic))
MathSciNet MATH Google Scholar
Dean, E.J., Glowinski, R.: Numerical methods for fully nonlinear elliptic equations of the Monge-Ampère type. Comput. Methods Appl. Mech. Engrg. 195, 1344–1386 (2006)
Article MathSciNet MATH Google Scholar
Dörfler, W.: A convergent adaptive algorithm for Poisson’s equation. SIAM J. Numer. Anal. 33, 1106–1124 (1996)
Article MathSciNet MATH Google Scholar
Feng, X., Glowinski, R., Neilan, M.: Recent developments in numerical methods for fully nonlinear second order partial differential equations. SIAM Rev. 55, 205–267 (2013)
Article MathSciNet MATH Google Scholar
Feng, X., Jensen, M.: Convergent semi-Lagrangian methods for the Monge-Ampère equation on unstructured grids. SIAM J. Numer. Anal. 55, 691–712 (2017)
Article MathSciNet MATH Google Scholar
Feng, X., Neilan, M.: Mixed finite element methods for the fully nonlinear Monge-Ampère equation based on the vanishing moment method. SIAM J. Numer. Anal. 47, 1226–1250 (2009)
Article MathSciNet MATH Google Scholar
Feng, X., Neilan, M.: Vanishing moment method and moment solutions for fully nonlinear second order partial differential equations. J. Sci. Comput. 38, 74–98 (2009)
Article MathSciNet MATH Google Scholar
Feng, X., Neilan, M.: Analysis of Galerkin methods for the fully nonlinear Monge-Ampère equation. J. Sci. Comput. 47, 303–327 (2011)
Article MathSciNet MATH Google Scholar
Figalli, A.: The Monge-Ampère equation and its applications. European Mathematical Society (EMS), Zürich (2017)
Book MATH Google Scholar
Froese, B.D., Oberman, A.M.: Convergent finite difference solvers for viscosity solutions of the elliptic Monge-Ampère equation in dimensions two and higher. SIAM J. Numer. Anal. 49, 1692–1714 (2011)
Article MathSciNet MATH Google Scholar
Froese, B.D., Oberman, A.M.: Fast finite difference solvers for singular solutions of the elliptic Monge-Ampère equation. J. Comput. Phys. 230, 818–834 (2011)
Article MathSciNet MATH Google Scholar
Froese, B.D., Oberman, A.M.: Convergent filtered schemes for the Monge-Ampère partial differential equation. SIAM J. Numer. Anal. 51, 423–444 (2013)
Article MathSciNet MATH Google Scholar
Gilbarg, D., Trudinger, N.S.: Elliptic Partial Differential Equations of Second Order. Classics in Mathematics. Springer, Berlin (2001)
Book MATH Google Scholar
Grisvard, P.: Elliptic Problems in Non Smooth Domains. Pitman, Boston (1985)
MATH Google Scholar
Gutiérrez, C.E.: The Monge-Ampère Equation. Birkhäuser Boston Inc., Boston (2001)
Book MATH Google Scholar
Hager, W.W., Zhang, H.: A new active set algorithm for box constrained optimization. SIAM J. Optim. 17, 526–557 (2006)
Article MathSciNet MATH Google Scholar
Hager, W.W., Zhang, H.: The limited memory conjugate gradient method. SIAM J. Optim. 23, 2150–2168 (2013)
Article MathSciNet MATH Google Scholar
Hager, W.W., Zhang, H.: An active set algorithm for nonlinear optimization with polyhedral constraints. Sci. China Math. 59, 1525–1542 (2016)
Article MathSciNet MATH Google Scholar
Hamfeldt, B.F., Salvador, T.: Higher-order adaptive finite difference methods for fully nonlinear elliptic equations. J. Sci. Comput. 75, 1282–1306 (2018)
Article MathSciNet MATH Google Scholar
Krylov, N.V.: Nonlinear Elliptic and Parabolic Equations of the Second Order. D. Reidel Publishing Co., Dordrecht (1987)
Book MATH Google Scholar
Kuo, H.J., Trudinger, N.S.: Discrete methods for fully nonlinear elliptic equations. SIAM J. Numer. Anal. 29, 123–135 (1992)
Article MathSciNet MATH Google Scholar
Lakkis, O., Pryer, T.: A finite element method for nonlinear elliptic problems. SIAM J. Sci. Comput. 35, A2025–A2045 (2013)
Article MathSciNet MATH Google Scholar
Lebedev, N.N.: Special Functions and Their Applications. Dover, New York (1972)
MATH Google Scholar
Li, W., Nochetto, R.H.: Optimal pointwise error estimates for two-scale methods for the Monge-Ampère equation. SIAM J. Numer. Anal. 56, 1915–1941 (2018)
Article MathSciNet MATH Google Scholar
Lions, P.-L.: Fully nonlinear elliptic equations and applications. In Nonlinear analysis, function spaces and applications, Vol. 2 (Písek, 1982), vol. 49, Teubner-Texte zur Math. pp 126–149. Teubner, Leipzig, (1982)
Maugeri, A., Palagachev, D.K., Softova, L.G.: Elliptic and Parabolic Equations with Discontinuous Coefficients. Wiley-VCH Verlag Berlin GmbH, Berlin (2000)
Book MATH Google Scholar
Miranda, C.: Su di una particolare equazione ellittica del secondo ordine a coefficienti discontinui. An. Şti. Univ. “Al. I. Cuza" Iaşi Secţ. I a Mat. (N.S.) 11B, 209–215 (1965)
MathSciNet MATH Google Scholar
Neilan, M.: A nonconforming Morley finite element method for the fully nonlinear Monge-Ampère equation. Numer. Math. 115, 371–394 (2010)
Article MathSciNet MATH Google Scholar
Neilan, M.: Quadratic finite element approximations of the Monge-Ampère equation. J. Sci. Comput. 54, 200–226 (2013)
Article MathSciNet MATH Google Scholar
Neilan, M.: Finite element methods for fully nonlinear second order PDEs based on a discrete Hessian with applications to the Monge-Ampère equation. J. Comput. Appl. Math. 263, 351–369 (2014)
Article MathSciNet MATH Google Scholar
Neilan, M.: A unified analysis of three finite element methods for the Monge-Ampère equation. Electron. Trans. Numer. Anal. 41, 262–288 (2014)
MathSciNet MATH Google Scholar
Neilan, M., Salgado, A.J., Zhang, W.: Numerical analysis of strongly nonlinear PDEs. Acta Numer. 26, 137–303 (2017)
Article MathSciNet MATH Google Scholar
Neilan, M., Wu, M.: Discrete Miranda-Talenti estimates and applications to linear and nonlinear PDEs. J. Comput. Appl. Math. 356, 358–376 (2019)
Article MathSciNet MATH Google Scholar
Neilan, M., Zhang, W.: Rates of convergence in $W^2_p$-norm for the Monge-Ampère equation. SIAM J. Numer. Anal. 56, 3099–3120 (2018)
Article MathSciNet MATH Google Scholar
Nochetto, R.H., Ntogkas, D., Zhang, W.: Two-scale method for the Monge-Ampère equation: convergence to the viscosity solution. Math. Comp. 88, 637–664 (2019)
Article MathSciNet MATH Google Scholar
Nochetto, R.H., Ntogkas, D., Zhang, W.: Two-scale method for the Monge-Ampère equation: pointwise error estimates. IMA J. Numer. Anal. 39, 1085–1109 (2019)
Article MathSciNet MATH Google Scholar
Nochetto, R.H., Zhang, W.: Pointwise rates of convergence for the Oliker-Prussner method for the Monge-Ampère equation. Numer. Math. 141, 253–288 (2019)
Article MathSciNet MATH Google Scholar
Oberman, A.M.: Wide stencil finite difference schemes for the elliptic Monge-Ampère equation and functions of the eigenvalues of the Hessian. Dis. Contin. Dyn. Syst. Ser. B 10, 221–238 (2008)
MATH Google Scholar
Oliker, V.I., Prussner, L.D.: On the numerical solution of the equation $(\partial ^2z/\partial x^2)(\partial ^2z/\partial y^2)-((\partial ^2z/\partial x\partial y))^2=f$ and its discretizations. I. Numer. Math. 54, 271–293 (1988)
Article MATH Google Scholar
Qiu, W., Tang, L.: A note on the Monge-Ampère type equations with general source terms. Math. Comp. 89, 2675–2706 (2020)
Article MathSciNet MATH Google Scholar
Smears, I., Süli, E.: Discontinuous Galerkin finite element approximation of nondivergence form elliptic equations with Cordès coefficients. SIAM J. Numer. Anal. 51, 2088–2106 (2013)
Article MathSciNet MATH Google Scholar
Talenti, G.: Sopra una classe di equazioni ellittiche a coefficienti misurabili. Ann. Mat. Pura Appl. 4(69), 285–304 (1965)
Article MathSciNet MATH Google Scholar
Villani, C.: Topics in Optimal Transportation. American Mathematical Society, Providence (2003)
Book MATH Google Scholar

Download references

Author information

Authors and Affiliations

Department of Mathematics and Center for Computation and Technology, Louisiana State University, Baton Rouge, LA, 70803, USA
Susanne C. Brenner, Li-yeng Sung & Hongchao Zhang
Center for Computation and Technology, Louisiana State University, Baton Rouge, LA, 70803, USA
Zhiyu Tan

Authors

Susanne C. Brenner
View author publications
You can also search for this author in PubMed Google Scholar
Li-yeng Sung
View author publications
You can also search for this author in PubMed Google Scholar
Zhiyu Tan
View author publications
You can also search for this author in PubMed Google Scholar
Hongchao Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Susanne C. Brenner.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This work of the first two authors was supported in part by the National Science Foundation under Grant No. DMS-19-13035. The work of the fourth author was supported in part by the National Science Foundation under Grant Nos. DMS-18-19161 and DMS-21-10722.

Appendices

Appendix A. The Proof of Lemma 3.2

We note that stability estimates similar to 3.2 and in more general settings can be found for example in [62, 70]. We include a proof here for self-containedness.

The first ingredient is the Miranda–Talenti estimate

$$\begin{aligned} \Vert D^2 \zeta \Vert _{L_2(\Omega )}\le \Vert \Delta \zeta \Vert _{L_2(\Omega )}\qquad \forall \,\zeta \in H^2(\Omega )\cap H^1_0(\Omega )\end{aligned}$$

(A.1)

that is valid for convex domains (cf. [56, 71]). In the case of a polygon, the two sides of (A.1) are actually equal (cf. [43, Sect. 4.3]).

The second ingredient is the existence of an operator $E_h:V_h\cap H^1_0(\Omega )\longrightarrow H^2(\Omega )\cap H^1_0(\Omega )$ that satisfies

$$\begin{aligned} \Vert D_h^2 (v-E_hv)\Vert _{L_2(\Omega )}\le C_\dag \Big (\sum _{e\in \mathcal {E}_h^i}|e|^{-1}\Vert [[\partial v/\partial n]]\Vert _{L_2(e)}^2\Big )^\frac{1}{2} \qquad \forall \,v\in V_h\cap H^1_0(\Omega ),\nonumber \\ \end{aligned}$$

(A.2)

where $C_\dag $ only depends on the shape regularity of $\mathcal {T}_h$ (cf. [62, Lemma 3]).

Remark A.1

The operator $E_h$ in [62] is for the standard cubic Lagrange finite element space. The extension of $E_h$ and (A.2) to the enhanced cubic Lagrange finite element space in Sect. 2.1 is straightforward since the additional bubble functions are already in $H^2(\Omega )\cap H^1_0(\Omega )$.

It follows from (A.1) and (A.2) that

$$\begin{aligned} \Vert D_h^2(\zeta -v)\Vert _{L_2(\Omega )}&\le \Vert D_h^2(\zeta -E_hv)\Vert _{L_2(\Omega )}+\Vert D_h^2(v-E_hv)\Vert _{L_2(\Omega )}\nonumber \\&\le \Vert \Delta (\zeta -E_hv)\Vert _{L_2(\Omega )}+\Vert D_h^2(v-E_hv)\Vert _{L_2(\Omega )}\nonumber \\&\le \Vert { \Delta _h}(\zeta -v)\Vert _{L_2(\Omega )}+2\Vert D_h^2(v-E_hv)\Vert _{L_2(\Omega )}\nonumber \\&\le \Vert { \Delta _h}(\zeta -v)\Vert _{L_2(\Omega )}+2C_\dag \Big (\sum _{e\in \mathcal {E}_h^i}|e|^{-1}\Vert [[\partial v/\partial n]]\Vert _{L_2(e)}^2\Big )^\frac{1}{2} \end{aligned}$$

(A.3)

for all $\zeta \in H^2(\Omega )\cap H^1_0(\Omega )$ and $v\in V_h\cap H^1_0(\Omega )$.

Following the treatment of second order linear elliptic equations in nondivergence form in [22, 55, 70], we introduce the function

$$\begin{aligned} \gamma (x)=\frac{A(x):I}{A(x):A(x)} \end{aligned}$$

where I is the $2\times 2$ identity matrix.

Note that A(x) : I is the sum of the eigenvalues of A(x) and A(x) : A(x) is the sum of the squares of the eigenvalues of A(x). Therefore we have, by (3.14), the following upper bound of $\gamma (x)$:

$$\begin{aligned} \gamma (x)\le \max _{\alpha \le \lambda _1,\lambda _2\le \beta }\frac{\lambda _1+\lambda _2}{\lambda _1^2+\lambda _2^2}= \frac{1}{\alpha }, \end{aligned}$$

(A.4)

and also the following Cordes condition (cf. [24]):

$$\begin{aligned} |\gamma (x) A(x)-I|^2&=\gamma (x)^2A(x):A(x)-2\gamma (x)(A(x):I)+2\nonumber \\&=2-\frac{(A(x):I)^2}{A(x):A(x)}\nonumber \\&\le \max _{\alpha \le \lambda _1,\lambda _2\le \beta } \frac{(\lambda _1-\lambda _2)^2}{\lambda _1^2+\lambda _2^2} =\frac{(\beta -\alpha )^2}{\alpha ^2+\beta ^2}=\delta ^2, \end{aligned}$$

(A.5)

where $\delta \, { (<1)}$ is given by (3.16).

It follows from (A.3) and (A.5) that

$$\begin{aligned}&\int _\Omega [\gamma A:D_h^2(\zeta -v)]\Delta _h(\zeta -v)\,dx\nonumber \\&\quad =\Vert \Delta _h(\zeta -v)\Vert _{L_2(\Omega )}^2 +\int _\Omega [\gamma A:D_h^2(\zeta -v)-\Delta _h(\zeta -v)]\Delta _h(\zeta -v)dx\nonumber \\&\quad =\Vert \Delta _h(\zeta -v)\Vert _{L_2(\Omega )}^2+\int _\Omega [(\gamma A-I):D_h^2(\zeta -v)]\Delta _h (\zeta -v)dx\nonumber \\&\quad \ge \Vert \Delta _h(\zeta -v)\Vert _{L_2(\Omega )}^2-\delta \Vert D_h^2(\zeta - v)\Vert _{L_2(\Omega )}\Vert \Delta _h(\zeta -v)\Vert _{L_2(\Omega )}\nonumber \\&\quad \ge (1-\delta )\Vert \Delta _h(\zeta -v)\Vert _{L_2(\Omega )}^2\\&\qquad -2C_\dag \delta \Vert \Delta _h(\zeta -v)\Vert _{L_2(\Omega )}\Big (\sum _{e\in \mathcal {E}_h^i}|e|^{-1}\Vert [[\partial v/\partial n]]\Vert _{L_2(e)}^2\Big )^\frac{1}{2}, \end{aligned}$$

which together with (A.4) implies

$$\begin{aligned} \Vert \Delta _h(\zeta - v)\Vert _{L_2(\Omega )}&\le \frac{\alpha ^{-1}}{1-\delta }\Vert A:D_h^2(\zeta - v)\Vert _{L_2(\Omega )}\nonumber \\&\quad +2 C_\dag \frac{\delta }{1-\delta } \Big (\sum _{e\in \mathcal {E}_h^i}|e|^{-1}\Vert [[\partial v/\partial n]]\Vert _{L_2(e)}^2\Big )^\frac{1}{2}. \end{aligned}$$

(A.6)

Finally we arrive at (3.15) through (A.3) and (A.6).

Appendix B. An optimization algorithm

An active set algorithm is implemented to solve the bound constrained optimization

$$\begin{aligned} \min \; \{f(\mathbf{{x}}) : \mathbf{{x}} \in {\mathcal {B}} \}, \end{aligned}$$

(B.1)

where $f: R^n \rightarrow R$ is twice continuously differentiable on the the set ${\mathcal {B}} = \{ \mathbf{{x}} \in R^n : \mathbf{{l}} \le \mathbf{{x}} \le \mathbf{{u}} \}$. Our algorithm is based on the active set approach proposed in [45] for solving nonlinear optimization with bound constraints, which was further developed in [47] for handling more general polyhedral constrained optimization.

Here, we just very briefly explain the structure and convergence results of our algorithm. For more details on the theory of the algorithm, one may refer to references [45, 47]. Our active set algorithm consists of two phases: a nonmonotone gradient projection phase and an unconstrained optimization phase, and a set of rules for switching between these two phases for achieving both global and fast local convergence. In particular, a projected cyclic Barzilai-Borwein (PCBB) algorithm is used in the gradient projection phase, where the line search direction at iteration $\mathbf{{x}}_k$ is generated by

$$\begin{aligned} \mathbf{{d}}_k = \text{ P}_{{\mathcal {B}}} (\mathbf{{x}}_k - \alpha _k \mathbf{{g}}_k) - \mathbf{{x}}_k. \end{aligned}$$

(B.2)

Here, $\text{ P}_{{\mathcal {B}}} (\cdot )$ is the projection on the feasible region ${\mathcal {B}}$, $\alpha _k$ is the cyclic Barzilai-Borwein stepsize [25] and $\mathbf{{g}}_k = \nabla f(\mathbf{{x}}_k)$. Along the search direction $\mathbf{{d}}_k$, an adaptive nonmonotone line search proposed in [26] is used to ensure global convergence. This PCBB algorithm of phase one is not only robust in the sense that it converges to a stationary point under mild assumptions, but also very effective for identifying the optimal active constraints where the components of the solution are on the boundary of ${\mathcal {B}}$.

However, the convergence rate of PCBB is often at best linear. Hence, to accelerate the convergence, a more efficient unconstrained optimization algorithm is used in phase two to optimize the objective function by fixing some components of variable $\mathbf{{x}}$ on the boundary of ${\mathcal {B}}$, that is

$$\begin{aligned} \min \; \{f(\mathbf{{x}}) : \mathbf{{x}}_{{\mathcal {A}}} = \mathbf{{b}}_{{\mathcal {A}}} \}. \end{aligned}$$

(B.3)

Here, ${\mathcal {A}}$ is the active index set given by phase one and $\mathbf{{b}}_{{\mathcal {A}}}$ indicates the partial boundary of ${\mathcal {B}}$ where the components of $\mathbf{{x}}$ with index ${\mathcal {A}}$ are fixed. When one iteration of the phase two algorithm lies out of the feasible region ${\mathcal {B}}$, the set rules developed in the algorithm determine whether the algorithm will switch to the gradient projection phase or restart the unconstrained optimization phase by projecting the iterate back to ${\mathcal {B}}$. A limited memory nonlinear conjugate gradient method (L-CG_DESCENT) [46] is used to solve the subspace optimization (B.3) in phase two.

L-CG_DESCENT is a very efficient first-order method which has much more rapid convergence than most gradient descent methods, and maintains cheap iterations since only up to first-order information is used. However, when the optimization problem gets very ill-conditioned, which is often the case for a discrete optimization problem resulting from a finite difference method or a finite element method (such as the $C^0$ interior penalty method studied in this paper), slow convergence is often detected near the solution. Under this situation, in phase two we would switch L-CG_DESCENT to the second-order Newton’s method, which is generally more expensive but often quickly leads to more accurate solutions. The convergence theories developed in [45, 47] guarantee our active set algorithm converges at least to a stationary point of problem (B.1). Furthermore, the active set algorithm would asymptotically reduce the bound constrained optimization (B.1) to an unconstrained optimization (B.3) even when the problem is degenerate. Hence, fast local convergence would be expected by combining the more rapid convergence algorithms such as L-CG_DESCENT and Newton’s method in the phase two optimization.

Appendix C. Elementwise convexity

Since the enhanced cubic Lagrange finite element is affine-equivalent, we can focus on the reference simplex. It is convenient to first consider the convexity of a tensor product polynomial on the unit square $(0,1)\times (0,1)$, for which we will need some explicit inverse estimates.

1.1 C.1 Explicit inverse estimates

Let $\mathbb {Q}_{m,n}$ be the space of tensor product polynomials spanned by $x_1^jx_2^k$ for $0\le j\le m$ and $0\le k\le n$. Given any $q\in \mathbb {Q}_{m,n}$, we can write

$$\begin{aligned} q(x)=\sum _{j=0}^m\sum _{k=0}^n a_{j,k}p_j(x_1)p_k(x_2) \end{aligned}$$

where $p_0,p_1,\ldots $ are the Legendre polynomials.

Let $I=(-1,1)$. It follows from the properties of the Legendre polynomials [52, (4.4.2), (4.5.1) and (4.5.2)] that

$$\begin{aligned} \Vert q\Vert _{L_\infty (I\times I)}&\le \sum _{j=0}^m\sum _{k=0}^n |a_{j,k}|\\&\le \Big (\sum _{j=0}^m\sum _{k=0}^n(j+\frac{1}{2})(k+\frac{1}{2})\Big )^\frac{1}{2} \Big (\sum _{j=0}^m\sum _{k=0}^n a_{j,k}^2(j+\frac{1}{2})^{-1}(k+\frac{1}{2})^{-1}\Big )^\frac{1}{2}\\&=\frac{(m+1)(n+1)}{2}\Vert q\Vert _{L_2(I\times I)}, \end{aligned}$$

which, through scaling, implies

$$\begin{aligned} \Vert q\Vert _{L_\infty ((0,1)\times (0,1))}\le (m+1)(n+1)\Vert q\Vert _{L_2((0,1)\times (0,1))} \qquad \forall \,q\in \mathbb {Q}_{m,n}. \end{aligned}$$

(C.1)

1.2 C.2 Convexity on the unit square

Let ${\hat{K}}$ be the (closed) unit square $[0,1]\times [0,1]$ with vertices ${\hat{p}}_1,{\hat{p}}_2,{\hat{p}}_3,{\hat{p}}_4$, and $Q_{{\hat{K}}}$ be the nodal interpolation operator for the $Q_1$ finite element on ${\hat{K}}$.

For any $q\in \mathbb {Q}_{m,n}$ and $x\in {\hat{K}}$, we have $Q_{{\hat{K}}}\Delta q-\Delta q\in \mathbb {Q}_{m,n}$ and therefore

$$\begin{aligned} (Q_{{\hat{K}}}\Delta q)(x)&= [(Q_{{\hat{K}}}\Delta q)(x)-\Delta q(x)]+\Delta q(x)\\&\le (m+1)(n+1)\Vert Q_{{\hat{K}}}\Delta q-\Delta q\Vert _{L_2({\hat{K}})}+\Delta q(x) \end{aligned}$$

by (C.1). It follows that

$$\begin{aligned} \Delta q(x)&\ge \min _{x\in {\hat{K}}}(Q_{{\hat{K}}}\Delta q)(x) - (m+1)(n+1)\Vert \Delta q - Q_{{\hat{K}}}\Delta q\Vert _{L_2({\hat{K}})}\nonumber \\&= \min _{1\le i\le 4}\Delta q({\hat{p}}_i) - (m+1)(n+1)\Vert \Delta q - Q_{{\hat{K}}}(\Delta q)\Vert _{L_2({\hat{K}})} \qquad \quad \forall \,x\in {\hat{K}}. \end{aligned}$$

(C.2)

Similarly, since $\det D^2 q-Q_{{\hat{K}}}(\det D^2 q)\in \mathbb {Q}_{2m-2,2n-2}$, we have

$$\begin{aligned} (\det D^2 q)(x)&\ge \min _{x\in \hat{K}}[Q_{{\hat{K}}}(\det D^2 q)](x)\nonumber \\&\quad - (2m-1)(2n-1)\Vert \det D^2 q -Q_{{\hat{K}}}(\det D^2 q)\Vert _{L_2({\hat{K}})}\nonumber \\&= \min _{1\le i\le 4}(\det D^2 q)({\hat{p}}_i) \nonumber \\&\quad - (2m-1)(2n-1)\Vert \det D^2 q - Q_{{\hat{K}}}(\det D^2 q)\Vert _{L_2({\hat{K}})}\qquad \forall \,x\in {\hat{K}}. \end{aligned}$$

(C.3)

According to (C.2) and (C.3), if

$$\begin{aligned} \min _{1\le i\le 4}(\Delta q)({\hat{p}}_i) - (m+1)(n+1)\Vert \Delta q - Q_{{\hat{K}}}(\Delta q)\Vert _{L_2({\hat{K}})}>0 \end{aligned}$$

and

$$\begin{aligned} \min _{1\le i\le 4}(\det D^2 q)({\hat{p}}_i) - (2m-1)(2n-1)\Vert \det D^2 q - Q_{{\hat{K}}}(\det D^2 p)\Vert _{L_2({\hat{K}})} >0, \end{aligned}$$

then

$$\begin{aligned} \Delta p>0 \quad \text {and}\quad \det D^2p>0 \qquad \forall \,x\in {\hat{K}}, \end{aligned}$$

which implies that q is strictly convex on ${\hat{K}}$.

1.3 C.3 Convexity on the reference simplex

Let $q\in P_3({\hat{T}})\oplus \varphi _{{\hat{T}}}^2 P_1({\hat{T}})$. Then $q\in \mathbb {Q}_{5,5}$ and we begin by computing (cf. (C.2) and (C.3))

$$\begin{aligned} L_{\Delta ,{\hat{K}}} = \min _{1\le i\le 4}(\Delta q)({\hat{p}}_i)- 36\Vert \Delta q - Q_{{\hat{K}}}(\Delta q)\Vert _{L_2({\hat{K}})} \end{aligned}$$

(C.4)

and

$$\begin{aligned} L_{\det ,{\hat{K}}} = \min _{1\le i\le 4}(\det D^2 q)({\hat{p}}_i)- 81\Vert \det D^2 q - Q_{{\hat{K}}}(\det D^2 q)\Vert _{L_2({\hat{K}})}. \end{aligned}$$

(C.5)

If $L_{\Delta ,{\hat{K}}}> 0$ and $L_{\det , {\hat{K}}}> 0$, then q is strictly convex on ${\hat{K}}$ (and therefore also on ${\hat{T}}$). If this is not the case, then we divide ${\hat{K}}$ into four sub-squares and use scaled versions of (C.4) and (C.5) to check the convexity of q on the sub-squares whose intersection with ${\hat{T}}$ has a positive area. By repeating this procedure we arrive at the following algorithm for checking the convexity of $q\in P_3({\hat{T}})\oplus \varphi _{{\hat{T}}}^2 P_1({\hat{T}})$ on ${\hat{T}}$.

Algorithm C.1

Let $\mathcal {R}_0=\{{\hat{K}}\}$ and L be the maximum refinement level.

1.
If $\mathcal {R}_l\ne \emptyset $ and $l\le L$, compute for $K\in \mathcal {R}_l$ the quantities
$$\begin{aligned} L_{\Delta ,K} = \min _{1\le i\le 4}(\Delta q)(p_{K,i}) - \frac{36}{h_l}\Vert \Delta q - Q_K(\Delta q)\Vert _{L_2(K)} \end{aligned}$$
and
$$\begin{aligned} L_{\det ,K} = \min _{1\le i\le 4} (\det D^2 q)(p_{K,i}) - \frac{81}{h_l}\Vert \det D^2 q - Q_K(\det D^2 q)\Vert _{L_2(K)}, \end{aligned}$$
where $p_{K,i}\ (i=1,2,3,4)$ are the vertices of K, $h_l$ is the width/height of the squares in $\mathcal {R}_l$, and $Q_K$ is the nodal interpolation operator for the $Q_1$ element associated with K. Set
$$\begin{aligned} \mathcal {R}^{nc}_{l} = \{K\in \mathcal {R}_l:\ L_{\Delta ,K}\le 0 \ \text{ or }\ L_{\det ,K} \le 0 \}. \end{aligned}$$
Stop if $\mathcal {R}^{nc}_{l}=\emptyset $. The polynomial is strictly convex on ${\hat{T}}$.
2.
If $\mathcal {R}^{nc}_{l} \ne \emptyset $, divide each ${K}\in \mathcal {R}^{nc}_{l}$ into four sub-squares ${K}_{j}\ (j=1,2,3,4)$ and define
$$\begin{aligned} \mathcal {R}^{nc,d}_{l} = \{{K}_{j}:\ {K}\in \mathcal {R}^{nc}_{l},\ j=1,2,3,4\}. \end{aligned}$$
Set
$$\begin{aligned} \mathcal {R}_{l+1} = \{K\in \mathcal {R}^{nc,d}_{l} :\,|K\cap {\hat{T}}|>0\}, \end{aligned}$$
$h_l = \frac{1}{2}h_l$, $l=l+1$ and go to 1.

Remark C.2

In our numerical experiments we were able to verify the elementwise strict convexity by observing that the Algorithm C.1 terminated before the refinement reached level 6.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Brenner, S.C., Sung, Ly., Tan, Z. et al. A convexity enforcing ${C}^{{0}}$ interior penalty method for the Monge–Ampère equation on convex polygonal domains. Numer. Math. 148, 497–524 (2021). https://doi.org/10.1007/s00211-021-01210-x

Download citation

Received: 14 December 2020
Revised: 19 April 2021
Accepted: 23 May 2021
Published: 26 June 2021
Issue Date: July 2021
DOI: https://doi.org/10.1007/s00211-021-01210-x

Mathematics Subject Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

A convexity enforcing \({C}^{{0}}\) interior penalty method for the Monge–Ampère equation on convex polygonal domains

Abstract

Similar content being viewed by others

An interior penalty coupling strategy for isogeometric non-conformal Kirchhoff–Love shell patches

Mixed virtual element method for integro-differential equations of parabolic type

Element-free Galerkin analysis of MHD duct flow problems at arbitrary and high Hartmann numbers

1 Introduction

Remark 1.1

Remark 1.2

Remark 1.3

Remark 1.4

2 The discrete problem

2.1 An enhanced cubic Lagrange finite element

Lemma 2.1

Proof

Remark 2.2

Remark 2.3

Remark 2.4

2.2 The finite element space \({V}_{h}\)

Remark 2.5

Lemma 2.6

Proof

Remark 2.7

2.3 A nonlinear least-squares problem with box constraints

Remark 2.8

Remark 2.9

Remark 2.10

Lemma 2.11

Proof

3 Error analysis

3.1 Some a priori bounds for \({u_h}\)

3.2 The elementwise convexity of \({u_h}\)

Lemma 3.1

Proof

3.3 A priori error estimates

Lemma 3.2

Theorem 3.3

Proof

Remark 3.4

Remark 3.5

3.4 An a posteriori error estimate

4 Numerical results

Example 4.1

Example 4.2

Example 4.3

5 Concluding remarks

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendices

Appendix A. The Proof of Lemma 3.2

Remark A.1

Appendix B. An optimization algorithm

Appendix C. Elementwise convexity

1.1 C.1 Explicit inverse estimates

1.2 C.2 Convexity on the unit square

1.3 C.3 Convexity on the reference simplex

Algorithm C.1

Remark C.2

Rights and permissions

About this article

Cite this article

Share this article

Mathematics Subject Classification

Search

Navigation