A superconvergent LDG-hybridizable Galerkin method for second-order elliptic problems

By Bernardo Cockburn, Bo Dong, and Johnny Guzmán

Abstract

We identify and study an LDG-hybridizable Galerkin method, which is not an LDG method, for second-order elliptic problems in several space dimensions with remarkable convergence properties. Unlike all other known discontinuous Galerkin methods using polynomials of degree $k\ge 0$ for both the potential as well as the flux, the order of convergence in $L^2$ of both unknowns is $k+1$. Moreover, both the approximate potential as well as its numerical trace superconverge in $L^2$-like norms, to suitably chosen projections of the potential, with order $k+2$. This allows the application of element-by-element postprocessing of the approximate solution which provides an approximation of the potential converging with order $k+2$ in $L^2$. The method can be thought to be in between the hybridized version of the Raviart-Thomas and that of the Brezzi-Douglas-Marini mixed methods.

1. Introduction

In this paper, we consider the LDG-hybridizable (LDG-H) Galerkin methods recently introduced in Reference 11 and show how to define their numerical traces in order to achieve the optimal order of convergence for the approximation to the flux, and to obtain superconvergence properties similar to those of the hybridized mixed methods of Raviart-Thomas (RT) Reference 16 and the Brezzi-Douglas-Marini (BDM) Reference 4 methods; see also Reference 10.

For the sake of simplicity of the exposition, we carry this out in the setting of the model second-order elliptic problem

$$\begin{alignat}{2} \boldsymbol{c}\,\boldsymbol{q}+\nabla u &=0 && \quad \text{ in } \Omega ,\cssId{texmlid2}{\tag{1.1a}}\\ \nabla \cdot \boldsymbol{q}& = f && \quad \text{ in } \Omega ,\cssId{texmlid3}{\tag{1.1b}}\\ u & = g && \quad \text{ on } {\partial \Omega _D}, \cssId{texmlid4}{\tag{1.1c}}\\ \boldsymbol{q}\cdot \boldsymbol{n}& = \mathsf{q_N} && \quad \text{ on } {\partial \Omega _N}, \cssId{texmlid1}{\tag{1.1d}} \end{alignat}$$

where $\Omega \subset \mathbb{R}^d$ is a polyhedral domain ($d\ge 2$), $f \in L^2(\Omega )$, and $\boldsymbol{c}=\boldsymbol{c}(\boldsymbol{x})$ is a symmetric $d\times d$ matrix function that is uniformly positive definite on $\Omega$ with components in $L^\infty (\Omega )$. As usual, we assume that the $(d-1)$-Lebesgue measure of ${\partial \Omega _D}$ is not zero, that $\partial \Omega = \overline{{\partial \Omega _D}}\cup \overline{{\partial \Omega _N}}$ and that ${\partial \Omega _D}\cap {\partial \Omega _N}=\emptyset$.

To describe our results, we need to introduce what we will call hybridized Galerkin methods; they are one of the methods studied in Reference 11. To do that, let us introduce some notation. We denote by ${\Omega _h}=\lbrace K \rbrace$ a triangulation of the domain $\Omega$ of shape-regular simplexes $K$ and set ${\partial \Omega _h}:=\{{\partial K}: K\in {\Omega _h}\}$. We associate to this triangulation the set of interior faces $\mathscr{E}^i_h$ and the set of boundary faces $\mathscr{E}^\partial _h$. We say that $e\in \mathscr{E}^i_h$ if there are two simplexes $K^+$ and $K^-$ in ${\Omega _h}$ such that $e={\partial K}^+\cap {\partial K}^-$, and we say that $e\in \mathscr{E}^\partial _h$ if there is a simplex in ${\Omega _h}$ such that $e={\partial K}\cap {\partial \Omega }$. We set $\mathscr{E}_h:=\mathscr{E}^i_h\cup \mathscr{E}^\partial _h$.

The hybridized Galerkin methods Reference 11 are dual-mixed hybrid methods (see the definition in Reference 8 and an early example in Reference 15) which seek an approximation to the exact solution $(\boldsymbol{q}|_\Omega ,u|_\Omega ,u|_{\mathscr{E}_h\setminus {\partial \Omega _N}})$, $(\boldsymbol{q}_h,u_h,\lambda _h)$, in a finite-dimensional space $\boldsymbol{V}_h\times W_h\times M_h$ of the form

$$\begin{alignat}{1} \boldsymbol{V}_h:=&\lbrace \boldsymbol{v}\in \boldsymbol{L}^2(\Omega ) \,: \, \boldsymbol{v}|_K \in \boldsymbol{V}(K)\quad \forall K\in {\Omega _h}\rbrace ,\cssId{space-Vh}{\tag{1.2a}}\\ W_h:=&\lbrace \omega \in L^2(\Omega )\,:\, \omega |_K \in W(K)\quad \forall K\in {\Omega _h}\rbrace ,\cssId{space-Wh}{\tag{1.2b}}\\ M_h:=&\lbrace \mathsf{m}\in L^2({\partial \Omega _h})\,:\, \mathsf{m}|_e \in M(e)\quad \forall e\in \mathscr{E}_h,\quad \mathsf{m}|_{{\partial \Omega _D}}=0\rbrace , \cssId{space-Mh}{\tag{1.2c}} \end{alignat}$$

and determines it by requiring that

$$\begin{align} &(\boldsymbol{c}\;\boldsymbol{q}_h, \boldsymbol{v})_{{\Omega _h}}-(u_h,\nabla \cdot \boldsymbol{v})_{{\Omega _h}}+\langle \widehat{u}_h,\boldsymbol{v}\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}}=0, \cssId{weakapp1}{\tag{1.3a}}\\ &-(\boldsymbol{q}_h, \nabla \omega )_{{\Omega _h}}+\langle \widehat{\boldsymbol{q}}_h\cdot \boldsymbol{n},\omega \rangle _{{\partial \Omega _h}}=(f,\omega )_{{\Omega _h}}, \cssId{weakapp2}{\tag{1.3b}}\\ &\langle \widehat{\boldsymbol{q}}_h\cdot \boldsymbol{n},\mu \rangle _{{\partial \Omega _h}}=\langle \mathsf{q_N},\mu \rangle _{{\partial \Omega _N}}, \cssId{weakapp3}{\tag{1.3c}} \end{align}$$

for all $(\boldsymbol{v},\omega ,\mu )\in \boldsymbol{V}_{h} \times W_{h}\times M_h$. Here, we have used the notation

$$\begin{alignat*}{1} (\boldsymbol{\sigma }, \boldsymbol{v})_{{\Omega _h}}:=&\sum _{K\in {\Omega _h}} \int _{K} \boldsymbol{\sigma }(x)\cdot \boldsymbol{v}(x)\,dx,\\ (\zeta , \omega )_{{\Omega _h}}:=&\sum _{K\in {\Omega _h}} \int _{K} \zeta (x)\;\omega (x)\,dx,\\ \langle \zeta ,\boldsymbol{v}\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}}:=&\sum _{K\in {\Omega _h}}\int _{{\partial K}} \zeta (\gamma )\,\boldsymbol{v}(\gamma )\cdot \boldsymbol{n}\,d\gamma , \end{alignat*}$$

for any functions $\boldsymbol{\sigma }, \boldsymbol{v}$ in $\boldsymbol{H}^1({\Omega _h}):=[H^1({\Omega _h})]^d$ and $\zeta , \omega$ in the space $H^1({\Omega _h})=\{v\in L^2(\Omega ):v|_{K}\in H^1(K)\quad \forall K\in {\Omega _h}\}$. The outward normal unit vector to ${\partial K}$ is denoted by $\boldsymbol{n}$.

To complete the description of the hybridized Galerkin methods, the definition of numerical traces $(\widehat{\boldsymbol{q}}_h,\widehat{u}_h)$ on the faces of the triangulation $\mathscr{E}_h$ has to be provided. The choice which is relevant here is

$$\begin{alignat}{1} \widehat{u}_h=&\begin{cases} \mathsf{P}_{\partial }\,g&\quad \text{ on }\mathscr{E}_h\cap {\partial \Omega _D},\\ \lambda _h&\quad \text{ on }\mathscr{E}_h\setminus {\partial \Omega _D}, \end{cases}\cssId{numerical_trace-a}{\tag{1.4a}}\\ \widehat{\boldsymbol{q}}_h=&\boldsymbol{q}_h+\tau \,(u_h-\widehat{u}_h)\boldsymbol{n}\quad \text{ on }\mathscr{E}_h, \cssId{numerical_trace-b}{\tag{1.4b}} \end{alignat}$$

where $\mathsf{P}_{\partial }$ denotes an $L^2$-projection defined as follows. Given any function $\zeta \in L^2(\mathscr{E}_h)$ and an arbitrary face $e\in \mathscr{E}_h$, the restriction of $\mathsf{P}_{\partial }\zeta$ to $e$ is defined as the element of $\mathscr{P}^k(e)$ that satisfies

$$\begin{alignat}{2} \langle \mathsf{P}_{\partial }\zeta -\zeta , \omega \rangle _{e}&=0,&&\qquad \forall \,\omega \in \mathscr{P}^{k}(e). \cssId{ltwoEh-projection}{\tag{1.5}} \end{alignat}$$

Note that by suitably choosing the local spaces $\boldsymbol{V}(K)$, $W(K)$, and $M(e)$, and the values of the local stabilization parameters $\tau$, we can obtain the hybridized RT$_k$, the hybridized BDM$_k$ and the LDG-H$_k$ methods; see Tables 1 and 2. In Table 1 and in the remainder of this paper, we denote the space of polynomials of degree at most $k\ge 0$ defined on $D$ by $\mathscr{P}^k(D)$, and set $\boldsymbol{\mathscr{P}}^k(D):=[\mathscr{P}^k(D)]^d$. Since all these methods can be implemented in the same way and can be used in different elements while being automatically coupled, what is relevant, as argued in Reference 11, is to find out which method should be used in what element in order to optimize the computational effort. It is thus important from this perspective to develop DG methods as accurate and efficient as mixed methods so that they can be used in situations in which mixed method cannot. The LDG-H methods we uncover in this paper are the first example of those methods.

It is well known that the RT$_k$ and BDM$_k$ methods provide an approximation $\boldsymbol{q}_h$ to the flux which converges in $L^2$ with order $k+1$, that $u_h$ and $\lambda _h$ superconverge in $L^2$-like norms to suitably chosen projections of the potential $u$ with order $k+2$, and that, as a consequence, it is possible to postprocess the approximate solution to obtain another approximation $u_h^*$ converging in $L^2$ with order $k+2$; see Reference 1 and Reference 4, and also Reference 10. In this paper, we use an extension of the postprocessing proposed in Reference 17Reference 18 and Reference 14. Given the similarities between these two mixed methods and the LDG-H$_k$ method, it is natural to ask if it is possible to choose the local penalization parameters $\tau$ as to obtain similar convergence and superconvergence results. The main contribution of this paper is to show that this is actually possible.

Indeed, we show that this happens if we take, on each simplex $K\in {\Omega _h}$,

$$\begin{alignat}{1} \tau =&\begin{cases} 0, &\qquad \text{ on } {\partial K}\setminus e_{K}^{\tau },\\ \tau _K , &\qquad \text{ on } e_{K}^{\tau }, \end{cases} \cssId{tau}{\tag{1.6}} \end{alignat}$$

where $e_{K}^{\tau }$ is an arbitrary but fixed face of $K$ and $\tau _K$ is a strictly positive real number. Since the local penalization parameter $\tau$ is nonzero only on a single face of each simplex, we call this LDG-H method the single-face hybridizable DG method; for simplicity, we are going to refer to the method under consideration by the $\text{SF-H}_k$ method. It is interesting to note two of the minimal dissipation DG methods considered in Reference 6, in the framework of a study of superconvergence properties of DG methods for one-dimensional steady-state convection-diffusion problems, happen to be an SF-H method. The first is called the md-DG method (see Table 1 in Reference 6) and is obtained, in our notation, by taking on each interior node $x_i$,

$$\begin{equation*} \tau (x_i^+)=0 \qquad \text{ and } \qquad \tau (x_i^-)=k/h_i, \end{equation*}$$

where $h_i$ is the size of the interval to the left of the node $x_i$. The second is called the md-LDG method, and is obtained by taking the above choice of parameters $\tau$ formally letting $\tau (x_i^-)$ go to infinity. The authors are not aware of any other instance of SF-H methods. In particular, let us emphasize that SF-H methods are not LDG methods whenever the stabilization parameters $\tau$ are finite; see the discussion about LDG-H methods in Reference 11.

In Table 3, we compare the orders of convergence for the flux of this method and the above-mentioned mixed methods. We have also included the order of convergence for the general LDG-H methods; it can be deduced from their characterization Reference 11 and the study of DG methods carried out in Reference 5. Finally, in Table 4, we display the orders of convergence of the postprocessed approximation $u_h^*$ to the potential.

We also uncover new relations between these three methods. One of the main features of the hybridizable Galerkin methods proposed in Reference 11 is that the only degrees of freedom that turn out to be globally coupled are those of the so-called Lagrange multiplier $\lambda _h$. This implies, in particular, that the LDG-H methods can be more efficiently implemented than the LDG methods introduced in Reference 12. In fact, see the discussion in Reference 11, they can be implemented as efficiently as the hybridized RT$_k$; see Reference 7 for the case $k=0$ in two dimensions, and BDM$_k$ mixed methods, see Reference 10 for the case $k\ge 0$ in multi-dimensions for both of these methods. Here we show that the stiffness matrix of the Lagrange multiplier for the RT$_k$, BDM$_k$ and $\text{SF-H}_k$ methods is actually identical and that, when $f|_K\in \mathscr{P}^{k-1}(K)$ for all $K\in {\Omega _h}$, these methods provide the same approximation $(\boldsymbol{q}_h,\lambda _h)$.

Next, let us briefly comment on the approach taken to carry out the a priori error analysis of the $\text{SF-H}_k$ methods. We did not take the approach used in Reference 5 to analyze DG and LDG methods, or that used in the unified analysis of DG methods Reference 2. Instead, we exploited the unifying framework introduced in Reference 11 to render the analysis of the $\text{SF-H}_k$ methods as close as possible to those of the hybridized RT$_k$ and BDM$_k$ methods. Since a key ingredient in those analyzes is the existence of a projection $(\boldsymbol{\Pi },\mathbb{P})$ satisfying the so-called commutativity property

$$\begin{equation*} \nabla \cdot \boldsymbol{\Pi }\boldsymbol{\sigma }= \mathbb{P}^t\,\nabla \cdot \boldsymbol{\sigma }, \end{equation*}$$

for all $\boldsymbol{\sigma }\in \boldsymbol{H}(div,\Omega )$, the crucial step in the analysis was to find a similar projection. Unlike the above-mentioned mixed methods, the space of fluxes $\boldsymbol{V}_h$ of the $\mathop{\text{SF-H}}_k$ methods is not included in $\boldsymbol{H}(div,\Omega )$ and, as a consequence, the above commutativity property can only be satisfied in a weak sense. We found a new projection satisfying the following weak version of the commutativity property:

$$\begin{equation*} -(\nabla \zeta ,\boldsymbol{\Pi }\boldsymbol{\sigma })_{\Omega _h}=(\mathbb{P}\zeta , \nabla \cdot \boldsymbol{\sigma })_{\Omega _h} \end{equation*}$$

for all $(\boldsymbol{\sigma },\zeta )\in \boldsymbol{H}^1({\Omega _h})\times H^1({\Omega _h})$ such that $\zeta |_{{\partial \Omega }}=0$. Just as the local spaces of the $\mathop{\text{SF-H}}_k$ methods are, roughly speaking, “in between” the local spaces of the RT$_k$ and BDM$_k$ methods, this projection can also be considered to be “in between” the corresponding projections of those mixed methods. The construction of this projection, which is intimately linked to the definition of the numerical traces of the method $\widehat{u}_h$ and $\widehat{\boldsymbol{q}}_h$ and to the choice of the local spaces, is certainly the most interesting aspect of the analysis of the $\mathop{\text{SF-H}}_k$ methods. The first component of the projection, $\boldsymbol{\Pi }$, was used in the error analysis of the minimal dissipation LDG method in Reference 9.

The paper is organized as follows. In Section 2, we state and discuss our main results and then prove them in Section 3. In Section 4, we display numerical experiments validating the theoretical results. Finally, in Section 5, we end with some concluding remarks.

2. The main results

2.1. The projection $(\boldsymbol{\Pi },\mathbb{P})$

In this subsection, we define the projection

$$\begin{equation*} (\boldsymbol{\Pi },\mathbb{P}): \boldsymbol{H}^1({\Omega _h})\times H^1({\Omega _h})\rightarrow \boldsymbol{V}_h\times W_h, \end{equation*}$$

and gather its main properties.

Given a function $\boldsymbol{\sigma }\in \boldsymbol{H}^1({\Omega _h})$ and an arbitrary simplex $K\in {\Omega _h}$, the restriction of $\boldsymbol{\Pi }\boldsymbol{\sigma }$ to $K$ is defined as the element of $\boldsymbol{\mathscr{P}}^k(K)$ that satisfies

$$\begin{alignat}{2} (\boldsymbol{\Pi }\boldsymbol{\sigma }-\boldsymbol{\sigma },\boldsymbol{v})_K=&0,&&\qquad \forall \, \boldsymbol{v}\in \boldsymbol{\mathscr{P}}^{k-1}(K),\text{ if } k\ge 1,\cssId{pi-projection-a}{\tag{2.1a}}\\ \langle (\boldsymbol{\Pi }\boldsymbol{\sigma }-\boldsymbol{\sigma })\cdot \boldsymbol{n}, \omega \rangle _{e}=&0,&&\qquad \forall \, \omega \in \mathscr{P}^{k}(e)\text{ and }\forall e\in {\partial K},e\neq e_{K}^{\tau }. \cssId{pi-projection-b}{\tag{2.1b}} \end{alignat}$$

Similarly, given a function $\zeta \in H^1({\Omega _h})$ and an arbitrary simplex $K\in {\Omega _h}$, the restriction of $\mathbb{P}\zeta$ to $K$ is defined as the element of $\mathscr{P}^k(K)$ that satisfies

$$\begin{alignat}{2} (\mathbb{P}\zeta -\zeta , \mathsf{w})_K=&0,&&\qquad \forall \, \mathsf{w}\in \mathscr{P}^{k-1}(K),\text{ if } k\ge 1,\cssId{bp-projection-a}{\tag{2.2a}}\\ \langle \mathbb{P}\zeta -\zeta , \omega \rangle _{e_{K}^{\tau }}=&0,&&\qquad \forall \,\omega \in \mathscr{P}^{k}(e_{K}^{\tau }). \cssId{bp-projection-b}{\tag{2.2b}} \end{alignat}$$

We gather the main properties of this projection in the following result. To state it, we need to recall the definition of some classical projections. Given a function $\boldsymbol{\sigma }\in \boldsymbol{H}^1({\Omega _h})$ and an arbitrary simplex $K\in {\Omega _h}$, the restriction of $\boldsymbol{\Pi }^{{\scriptscriptstyle {\mathrm{RT}}}}\boldsymbol{\sigma }$ to $K$ is defined as the element of $\boldsymbol{\mathscr{P}}^k(K)\oplus \boldsymbol{x}\,\mathscr{P}(K)^k$ that satisfies

$$\begin{alignat}{2} (\boldsymbol{\Pi }^{{\scriptscriptstyle {\mathrm{RT}}}}\boldsymbol{\sigma }-\boldsymbol{\sigma },\boldsymbol{v})_K=&0,&&\qquad \forall \, \boldsymbol{v}\in \boldsymbol{\mathscr{P}}^{k-1}(K),\text{ if } k\ge 1,\cssId{RT-projection-a}{\tag{2.3a}}\\ \langle (\boldsymbol{\Pi }^{{\scriptscriptstyle {\mathrm{RT}}}}\boldsymbol{\sigma }-\boldsymbol{\sigma })\cdot \boldsymbol{n}, \omega \rangle _{e}=&0,&&\qquad \forall \, \omega \in \mathscr{P}^{k}(e), \;\text{for all faces of } K. \cssId{RT-projection-b}{\tag{2.3b}} \end{alignat}$$

Given a function $\zeta \in H^1({\Omega _h})$ and an arbitrary simplex $K\in {\Omega _h}$, the restriction of $\mathsf{P}^\ell \zeta$ to $K$ is defined as the element of $\mathscr{P}^\ell (K)$ that satisfies

$$\begin{alignat}{2} (\mathsf{P}^\ell \zeta -\zeta , \omega )_K=&0,&&\qquad \forall \, \omega \in \mathscr{P}^\ell (K). \cssId{ltwo-projection}{\tag{2.4}} \end{alignat}$$

To simplify the notation, we are going to write $\mathsf{P}$ instead of $\mathsf{P}^k$. Note that $(\boldsymbol{\Pi }^{{\scriptscriptstyle {\mathrm{RT}}}},\mathsf{P})$ is nothing but the projection for the RT$_k$ method. We are now ready to state our result.

Proposition 2.1.

The projection $(\boldsymbol{\Pi },\mathbb{P})$ given by Equation 2.1 and Equation 2.2 is well defined. Moreover, on each simplex $K\in {\Omega _h}$, it satisfies the orthogonality properties

$$\begin{alignat*}{2} (\text{i})&\quad (\zeta -\mathbb{P}\zeta ,\nabla \cdot \boldsymbol{v})_K=0,&& \\ (\text{ii})&\quad (\boldsymbol{\sigma }-\boldsymbol{\Pi }\boldsymbol{\sigma },\nabla \omega )_K=0,&& \\ (\text{iii})&\quad \langle \mathbb{P}\zeta -\mathsf{P}_{\partial }\zeta , \boldsymbol{\Pi }\boldsymbol{\sigma }\cdot \boldsymbol{n}-\mathsf{P}_{\partial }\boldsymbol{\sigma }\cdot \boldsymbol{n}\rangle _{e}=0\quad \text{ for all faces $e$ of $K$},\\ \end{alignat*}$$

for all $(\boldsymbol{v},\omega )\in \boldsymbol{\mathscr{P}}^k(K)\times \mathscr{P}^k(K)$, and the weak commutativity property

$$\begin{alignat*}{2} (\text{iv})&\quad -(\nabla \zeta , \boldsymbol{\Pi }\boldsymbol{\sigma })_K= (\mathbb{P}\zeta , \nabla \cdot \boldsymbol{\sigma })_K -\langle \mathsf{P}_{\partial }\zeta , \mathsf{P}_{\partial }\boldsymbol{\sigma }\cdot \boldsymbol{n}\rangle _{{\partial K}}, \end{alignat*}$$

for all $(\boldsymbol{\sigma },\zeta )\in \boldsymbol{H}^1({\Omega _h})\times H^1({\Omega _h})$. Finally, we have the following approximation estimates

$$\begin{alignat*}{2} (\text{v})& \quad \|\,\boldsymbol{\Pi }\boldsymbol{\sigma }\cdot \boldsymbol{n}-\mathsf{P}_{\partial }\boldsymbol{\sigma }\cdot \boldsymbol{n}\,\|_{L^2(e_{K}^{\tau })} \le \;C\,h_K^{r+1/2}\,|\,\mathsf{P}\nabla \cdot \boldsymbol{\sigma }\,|_{H^r(K)}, \\ (\text{vi})& \quad \|\,\boldsymbol{\Pi }\boldsymbol{\sigma }-\boldsymbol{\Pi }^{{\scriptscriptstyle {\mathrm{RT}}}}\boldsymbol{\sigma }\,\|_{L^2(K)} \le \;C\,h_K^{r+1}\,|\,\mathsf{P}\nabla \cdot \boldsymbol{\sigma }\,|_{H^r(K)}, \\ (\text{vii})& \quad \|\,\mathbb{P}\zeta -\mathsf{P}\zeta \,\|_{L^2(K)} \le \;C\,h_K^{s+1}\,|\,\nabla \zeta \,|_{\boldsymbol{H}^s(K)}, \end{alignat*}$$

where $r,s\in [0,k]$, $h_K$ is the diameter of $K$, and $C$ depends only on $k$ and the shape-regularity parameters of the simplex $K$, for any $(\boldsymbol{\sigma },\zeta )\in \boldsymbol{H}^1(K)\times H^1(K)$.

We are going to show that the three orthogonality properties imply all the others; they are thus the crucial properties for the analysis. Note also that, by simply adding the identity (iv) over all $K\in {\Omega _h}$, we obtain the weak commutativity property discussed in the introduction.

2.2. Characterization of the approximate solution

Next we give a characterization of the approximate solution provided by the $\mathop{\text{SF-H}}_k$ method. We begin by characterizing the difference between the numerical traces and the traces of the approximate solutions on each simplex.

Proposition 2.2.

For each simplex $K\in {\Omega _h}$, we have that,

$$\begin{alignat*}{1} (\widehat{\boldsymbol{q}}_h-\boldsymbol{q}_h)\cdot \boldsymbol{n}=\tau \,(u_h-\widehat{u}_h)=\mathsf{P}_{\partial }\boldsymbol{q}\cdot \boldsymbol{n}-\boldsymbol{\Pi }\boldsymbol{q}\cdot \boldsymbol{n}\qquad \text{ on }{\partial K}. \end{alignat*}$$

We see that the jump $(\widehat{\boldsymbol{q}}_h-\boldsymbol{q}_h)\cdot \boldsymbol{n}$ is independent of the value of $\tau$ whereas the jump $\widehat{u}_h-u_h$ is inversely proportional to $\tau$. Moreover, by the estimate ($v$) of Proposition 2.1, we have that,

$$\begin{equation*} \|\,(\widehat{\boldsymbol{q}}_h-\boldsymbol{q}_h)\cdot \boldsymbol{n}\,\|_{L^2(e_{K}^{\tau })}\le \;C\,h_K^{r+1/2}\,|\,\mathsf{P}f\,|_{H^r(K)}, \end{equation*}$$

for any $r\in [0,k]$, and we see that the size of jump under consideration depends solely on the smoothness of $f|_K$. For example, if $\mathsf{P}f|_K$ is a polynomial of degree $k-1$, then $(\widehat{\boldsymbol{q}}_h-\boldsymbol{q}_h)\cdot \boldsymbol{n}=0$ on $e_{K}^{\tau }$. This implies that $(\widehat{\boldsymbol{q}}_h-\boldsymbol{q}_h)\cdot \boldsymbol{n}=0$ on ${\partial K}$ for every $K\in {\Omega _h}$ and, as a consequence, that $\boldsymbol{q}_h\in \boldsymbol{H}(div,\Omega )$. Now, if $f\in H^r(K)$, for some $r\in [0,k]$, then we have that

$$\begin{equation*} \|\,(\widehat{\boldsymbol{q}}_h-\boldsymbol{q}_h)\cdot \boldsymbol{n}\,\|_{L^2(e_{K}^{\tau })}\le \;C\,h_K^{r+1/2}\,|\,f\,|_{H^r(K)}, \end{equation*}$$

by well-known approximation properties of the projection $\mathsf{P}$.

Next, we give a characterization of the approximate solution which follows from a similar result for more general methods obtained in Reference 11. To state it, we need to introduce the local solvers associated with the method. The first local solver is defined on the simplex $K\in {\Omega _h}$ as the mapping $\mathsf{m}\in L^2({\partial K})\rightarrow (\boldsymbol{\mathcal{Q}}{\mathsf{m}}, \,\mathcal{U}{\mathsf{m}}) \in \boldsymbol{\mathscr{P}}^k(K)\times \mathscr{P}^k(K)$ where

$$\begin{alignat}{1} &(\boldsymbol{c}\,\boldsymbol{\mathcal{Q}}{\mathsf{m}}, \boldsymbol{v})_K - (\,\mathcal{U}{\mathsf{m}}, \nabla \cdot \boldsymbol{v})_K =-\langle {\mathsf{m}},{\boldsymbol{v}\cdot \boldsymbol{n}}\rangle _{{\partial K}},\cssId{lifting-a}{\tag{2.5a}}\\ -&(\nabla w, \boldsymbol{\mathcal{Q}}{\mathsf{m}})_K +\langle {w},{\widehat{\boldsymbol{\mathcal{Q}}}^{}{\mathsf{m}}\cdot \boldsymbol{n}}\rangle _{{\partial K}} =0, \cssId{lifting-b}{\tag{2.5b}} \end{alignat}$$for all $(\boldsymbol{v},w)\in \boldsymbol{\mathscr{P}}^k(K)\times \mathscr{P}^k(K)$, where$$\begin{alignat}{1} & \widehat{\boldsymbol{\mathcal{Q}}}^{}{\mathsf{m}}=\boldsymbol{\mathcal{Q}}{\mathsf{m}}+\tau (\,\mathcal{U}{\mathsf{m}}- \mathsf{P}_{\partial }\mathsf{m}) \boldsymbol{n}. \tag{2.5c} \end{alignat}$$

The other local solver is defined on the simplex $K\in {\Omega _h}$ as the mapping $f\in L^2(K)\rightarrow (\boldsymbol{\mathcal{Q}}{f}, \,\mathcal{U}{f}) \in \boldsymbol{\mathscr{P}}^k(K)\times \mathscr{P}^k(K)$ where

$$\begin{alignat}{2} &(\boldsymbol{c}\,\boldsymbol{\mathcal{Q}}{f}, \boldsymbol{v})_K - (\,\mathcal{U}{f}, \nabla \cdot \boldsymbol{v})_K =0,\cssId{fmap-a}{\tag{2.6a}}\\ -&(\nabla w, \boldsymbol{\mathcal{Q}}{f})_K + \langle {w},{\widehat{\boldsymbol{\mathcal{Q}}}^{}{f}\cdot \boldsymbol{n}}\rangle _{{\partial K}} =(f ,w)_K, \cssId{fmap-b}{\tag{2.6b}} \end{alignat}$$for all $(\boldsymbol{v},w)\in \boldsymbol{\mathscr{P}}^k(K)\times \mathscr{P}^k(K)$, where$$\begin{alignat}{2} &\qquad \qquad \widehat{\boldsymbol{\mathcal{Q}}}^{}{f}=\boldsymbol{\mathcal{Q}}{f}+\tau \,\mathcal{U}{f} \boldsymbol{n}. \tag{2.6c} \end{alignat}$$

We can now state our characterization result.

Theorem 2.3.

The approximate solution $(\boldsymbol{q}_h,u_h,\lambda _h)\in \boldsymbol{V}_h\times W_h\times M_h$ given by the $\mathop{\text{SF-H}}_k$ method is well defined. Moreover, we have that

$$\begin{equation*} (\boldsymbol{q}_h,u_h)= (\boldsymbol{\mathcal{Q}}{\lambda _h},\,\mathcal{U}{\lambda _h}) +(\boldsymbol{\mathcal{Q}}{g},\,\mathcal{U}{g}) + (\boldsymbol{\mathcal{Q}}{f},\,\mathcal{U}{f}), \end{equation*}$$

where $\lambda _h$ can be characterized as the function in $M_h$ satisfying

$$\begin{equation*} a_h ( \lambda _h, \mu ) =b_h(\mu )\qquad \forall \mu \in M_h, \end{equation*}$$

where

$$\begin{alignat*}{1} a_h(\eta ,\mu ) :=& ( \boldsymbol{c}\, \boldsymbol{\mathcal{Q}}{\eta }, \boldsymbol{\mathcal{Q}}{\mu })_{{\Omega _h}}, \\ b_h(\mu ) :=&\langle {g}, \boldsymbol{\mathcal{Q}}{\mu }\cdot \boldsymbol{n}\rangle _{{\partial \Omega _D}} +(f, \,\mathcal{U}{\mu })_{{\Omega _h}} -\langle \mu , \mathsf{q_N}\rangle _{{\partial \Omega _N}}, \end{alignat*}$$

for all $\eta$ and $\mu \in M_h$.

This result allows us to shed light on the effect of local stabilization parameters $\tau$ on the approximate solution. It will also allow us to compare the RT$_k$, the BDM$_k$ and the $\text{SF-H}_k$ methods, see Reference 10 for a comparison of the hybridized version of the RT$_k$ and the BDM$_k$ methods. These results are gathered in the following theorem. To state it, we use the projection $\mathsf{P}^{k-1}$ which is defined by Equation 2.4 for $k\ge 1$ and which we take to be identically zero when $k=0$. We keep this convention in the remainder of the paper.

Theorem 2.4.

We have that

(i): The function $(\boldsymbol{q}_h, \mathsf{P}^{k-1}u_h,\lambda _h)$, is independent of the values of the local stabilization parameters $\tau$. Moreover, changes in the local stabilization parameters $\tau _K$ only affect the function $\,\mathcal{U}{f}|_{e_{K}^{\tau }}$.
(ii): If $\mathsf{P}f|_K\in \mathscr{P}^{k-1}(K)$ for all simplexes $K\in {\Omega _h}$, then $(\boldsymbol{q}_h, \mathsf{P}^{k-1}u_h,\lambda _h)$ is the same for the RT$_k$, the BDM$_k$ (if $k\ge 1$) and the $\text{SF-H}_k$ methods. Moreover, $u_h|_{e_{K}^{\tau }}=\widehat{u}_h|_{e_{K}^{\tau }}$ for all $K\in {\Omega _h}$.
(iii): The bilinear form $a_h(\cdot ,\cdot )$ is always the same for the RT$_k$, the BDM$_k$ (if $k\ge 1$) and the $\text{SF-H}_k$ methods.

2.3. A priori error estimates

In this subsection, we obtain a priori error estimates for the error of the approximation $(\boldsymbol{q}_h,u_h,\lambda _h)\in \boldsymbol{V}_h\times W_h\times M_h$ given by the $\mathop{\text{SF-H}}_k$ and the numerical trace $\widehat{u}_h$ defined by Equation 1.4a. To state them, we need to introduce new notation.

For any real-valued function $\zeta$ in $H^{l}({\Omega _h})$, we set

$$|\,\zeta \,|_{H^l({\Omega _h})} :=\big (\sum _{K\in {\Omega _h}}|\,\zeta \,|^2_{H^l(K)}\big )^\frac{1}{2}.$$

For a vector-valued function $\boldsymbol{\sigma }=(\sigma _1,\dots ,\sigma _d)\in \boldsymbol{H}^l({\Omega _h})$ we set

$$|\,\boldsymbol{\sigma }\,|_{\boldsymbol{H}^l({\Omega _h})}:=\big (\sum _{i=1}^{d}|\,\sigma _i\,|_{H^l({\Omega _h})}^2 \big )^\frac{1}{2}.$$

We can now state our results.

We begin by measuring the error in the approximation of the flux $\boldsymbol{q}$ in the norm $\|\,\boldsymbol{\sigma }\,\|_{L^2({\Omega _h};\boldsymbol{c})}=(\boldsymbol{c}\,\boldsymbol{\sigma },\boldsymbol{\sigma })_{{\Omega _h}}^{1/2}$.

Theorem 2.5.

Suppose that the exact flux $\boldsymbol{q}$ belongs to $\boldsymbol{H}^{r+1}({\Omega _h})$ for some $r\in [0,k]$. Then

$$\begin{equation*} \|\boldsymbol{q}-\boldsymbol{q}_h\|_{\boldsymbol{L}^2({\Omega _h};\boldsymbol{c})} \le \|\boldsymbol{q}-\boldsymbol{\Pi }\boldsymbol{q}\|_{\boldsymbol{L}^2({\Omega _h};\boldsymbol{c})} \le C\,h^{r+1}\,|\, \boldsymbol{q}\,|_{\boldsymbol{H}^{r+1}({\Omega _h})}, \end{equation*}$$

for some constant $C$ independent of $h$ and the exact solution $(\boldsymbol{q},u)$.

Note that the upper bound of the error is independent of the local stabilization parameters $\tau$, in complete agreement with the characterization of the approximate solution given by Theorem 2.3. It is interesting to realize that the first estimate also holds for the RT$_k$ and BDM$_k$ methods when the projection $\boldsymbol{\Pi }$ is suitably chosen; see Reference 13 and Reference 4. Such an estimate is obtained by using the commutativity property and the fact that the image of their projections is in $\boldsymbol{H}(div,\Omega )$. Since our projection $(\boldsymbol{\Pi },\boldsymbol{p})$ only satisfies a weak version of the commutativity property, a much more delicate analysis has to be carried out to obtain it.

In Reference 5, it was shown that for general LDG methods with penalization parameters of order $1/h$, the order of convergence of the approximations for flux $\boldsymbol{q}$ using polynomials of degree $k$ is only $k$; this order is sharp because it is actually attained for some LDG methods. It was also shown that, for DG methods with both penalization parameters of order one, the order of convergence of the approximations for the flux using polynomials of degree $k$ is $(k+1/2)$. Here, we obtain an order of convergence of $(k+1)$. No other DG method has this property.

Next, we present several estimates for the error in the approximation of the potential $u$. The first is a superconvergence result. To state it, we need to introduce the adjoint equations

$$\begin{alignat}{1} \boldsymbol{c}\;\boldsymbol{\psi }+\nabla \varphi &=0\quad \text{in }\Omega ,\cssId{adjoint-a}{\tag{2.7a}}\\ \nabla \cdot \boldsymbol{\psi }&=\theta \quad \text{in }\Omega ,\cssId{adjoint-b}{\tag{2.7b}}\\ \varphi &=0 \quad \text{on }{\partial \Omega _D}\cssId{adjoint-c}{\tag{2.7c}}\\ \boldsymbol{\psi }\cdot \boldsymbol{n}&=0\quad \text{on }{\partial \Omega _N}. \cssId{adjoint-d}{\tag{2.7d}} \end{alignat}$$

We also need to assume that the following elliptic regularity result holds

$$\begin{equation} \|\,\boldsymbol{\psi }\,\|_{\boldsymbol{H}^{s+1}({\Omega _h})}+\|\,\nabla \varphi \,\|_{\boldsymbol{H}^{s+1}({\Omega _h})}\le \mathsf{C}_{er}\,\|\,\theta \,\|_{H^{s}(\Omega )} \cssId{elliptic_regularity}{\tag{2.8}} \end{equation}$$

for $s\in [0,k]$. Note that, since we are working with domains that can be triangulated by using straight-faced simplexes, the above result only holds if such a domain is convex and $s=0$. However, we want to write this assumption in such a generality since the method will be extended to domains with smooth curved boundaries in a forthcoming paper.

Theorem 2.6.

Suppose that the exact flux $\boldsymbol{q}$ belongs to $\boldsymbol{H}^{r+1}({\Omega _h})$ for $r\in [0,k]$. Set $\kappa :=\max _{K\in {\Omega _h}} \frac{1}{h_K\,\tau _K}$. Then,

$$\begin{equation*} \| \mathbb{P}u-u_h \|_{H^{-s}({\Omega _h})}\le C\,\mathfrak{C}_\kappa ^{r,s}(\boldsymbol{q})\;h^{r+s+2}, \end{equation*}$$

where

$$\begin{equation*} \mathfrak{C}_\kappa ^{r,s}(\boldsymbol{q})=\begin{cases} \mathsf{C}_{er}\,|\,\boldsymbol{q}\,|_{\boldsymbol{H}^{r+1}({\Omega _h})} +\kappa \,|\,\nabla \cdot \boldsymbol{q}\,|_{H^{r}({\Omega _h})}, &\text{for $s\in [0,k-1], k\ge 1$,}\\ \mathsf{C}_{er}|\,\boldsymbol{q}\,|_{\boldsymbol{H}^{1}({\Omega _h})} +\mathsf{C}_{er}|\,\mathsf{q_N}\,|_{H^{1}({\partial \Omega _N})}, &\text{for $r=s=k=0$ and $f=0$.} \end{cases} \end{equation*}$$

Moreover, for $k=0$ and general $f\in L^2({\Omega _h})$,

$$\begin{equation*} \| \mathbb{P}u-u_h \|_{L^2({\Omega _h})} \le C\;\mathcal{C}_\kappa (\boldsymbol{q})\;h, \end{equation*}$$

where $\mathcal{C}_\kappa (\boldsymbol{q})=(1+h)\mathsf{C}_{er}\,|\,\boldsymbol{q}\,|_{\boldsymbol{H}^{1}({\Omega _h})} +h\,\kappa \,|\,\nabla \cdot \boldsymbol{q}\,|_{L^2({\Omega _h})}$.

It is interesting to note that the above superconvergence result holds for any choice of local stabilization parameters $\tau _K$ such that $\kappa$ is uniformly bounded, that is, such that $1/(h_K\,\tau _K)$ is uniformly bounded with respect to $h$. This shows that $\tau _K$ cannot be too small for superconvergence to take place.

A straightforward consequence of this theorem is the following result.

Corollary 2.7.

Suppose that the exact flux $(\boldsymbol{q},u)$ belongs to $\boldsymbol{H}^{r+1}({\Omega _h})\times H^{r+1}({\Omega _h})$ for $r\in [0,k]$. Then

$$\begin{equation*} \| u-u_h \|_{L^2({\Omega _h})} \le C\;h^{r+1}\; \left( \mathsf{C}_\kappa (\boldsymbol{q})+|\, u \,|_{H^{r+1}({\Omega _h})}\right), \end{equation*}$$

where

$$\begin{equation*} \mathsf{C}_\kappa (\boldsymbol{q})=\begin{cases} \min \{\mathfrak{C}_\kappa ^{r-1,0}(\boldsymbol{q}),\;h\,\mathfrak{C}_\kappa ^{r,0}(\boldsymbol{q})\} &\text{ if }k\ge 1,\\ \mathcal{C}_\kappa (\boldsymbol{q}) &\text{ if }k=0. \end{cases} \end{equation*}$$

Note that the above result shows that if $1/\tau _K$ is uniformly bounded for quasi-uniform triangulations, the convergence of $u_h$ is still optimal, provided $\boldsymbol{q}$ is smoother than required, that is, provided $\boldsymbol{q}\in \boldsymbol{H}^{r+1}({\Omega _h})$ instead of just $\boldsymbol{q}\in \boldsymbol{H}^{r}({\Omega _h})$. Of course, in this case, the superconvergence of $u_h$ to $\mathbb{P}u$ is lost.

The next result is a superconvergence result for the Lagrange multiplier $\lambda _h$. To state it, we use the following norm:

$$\begin{equation*} \|\,\mathsf{P}_{\partial }u-\widehat{u}_h\,\|_{L^2(\mathscr{E}_h;h)} =(\sum _{K\in {\Omega _h}}\,h_K\|\,\mathsf{P}_{\partial }u-\widehat{u}_h\,\|^2_{L^2({\partial K})})^{1/2}. \end{equation*}$$

Theorem 2.8.

Suppose that the exact solution $(u,\boldsymbol{q})$ of Equation 1.1 belongs to $H^{r+1}({\Omega _h})\times \boldsymbol{H}^{r+1}({\Omega _h})$ for some $r\in [0,k]$. Then,

$$\begin{alignat*}{2} &\| \mathsf{P}_{\partial }u-\widehat{u}_h \|_{L^2(\mathscr{E}_h;h)} \le &&C\,\left(\mathfrak{C}_0^{r,0}(\boldsymbol{q})+|\,\boldsymbol{q}\,|_{\boldsymbol{H}^{r+1}({\Omega _h})}\,\right)\, h^{r+2}, \end{alignat*}$$

if $k\ge 1$, or if $k=0$ and $f=0$.

There are no results of this type for any other DG method. However, the RT$_k$ and the BDM$_k$ methods have both similar results. Here we exploited the similarity of the $\text{SF-H}_k$ methods with the RT$_k$ and BDM$_k$ methods to obtain these superconvergence results.

2.4. Postprocessing

We end this section by showing how to exploit the superconvergence results to postprocess $u_h$, $\boldsymbol{q}_h$ and $\hat{u}_h$ to get a better approximation to $u$ defined as follows.

On the simplex $K$, we define the new approximation of $u$, ${u}^\star _h$, as the function of $\mathscr{P}^{k+1}(K)$ given by

$$\begin{alignat}{1} {u}^\star _h= &\;\overline{u}_h+\tilde{u}_h, \cssId{postprocessing-a}{\tag{2.9a}} \end{alignat}$$where$$\begin{alignat}{1} \overline{u}_h=&\begin{cases} \frac{1}{d}\sum _{e\in {\partial K}} \widehat{u}_h|_e&\quad \text{ if } k=0,\\ \frac{1}{|K|} \int _K u_h \,dx&\quad \text{ if } k>0, \end{cases} \cssId{postprocessing-b}{\tag{2.9b}} \end{alignat}$$and $\tilde{u}_h$ is the polynomial in $\mathscr{P}^{k+1}_0(K)$ satisfying$$\begin{alignat}{1} (\boldsymbol{a}\,\nabla \tilde{u}_h, \nabla w)_K=& (f,w)_{K}- \langle w, \widehat{\boldsymbol{q}}_h \cdot \boldsymbol{n}\rangle _{{\partial K}} \;\;\;\; \forall w \in \mathscr{P}^{k+1}(K). \cssId{postprocessing-c}{\tag{2.9c}} \end{alignat}$$

Here $\boldsymbol{a}=\boldsymbol{c}^{-1}$ and $\mathscr{P}_0^{k+1}(K)$ is the collection of functions in $\mathscr{P}^{k+1}(K)$ with mean zero. The postprocessing technique just introduced is a slight modification of a postprocessing proposed in Reference 17Reference 18 and Reference 14; it consists of using the numerical trace $\widehat{\boldsymbol{q}}_h$ instead of $\boldsymbol{q}_h$.

It is easy to see that this postprocessing is associated to a locally conservative method. Indeed, the scheme satisfied by $u^\star _h$ on each simplex $K\in {\Omega _h}$ is

$$\begin{equation*} (\boldsymbol{a}\,\nabla {u}^\star _h, \nabla w)_K+ \langle w, \widehat{\boldsymbol{q}}_h \cdot \boldsymbol{n}\rangle _{{\partial K}}= (f,w)_{K} \;\;\;\; \forall w \in \mathscr{P}^{k+1}(K). \end{equation*}$$

As a consequence, if we take $D_h$ to be the union of an arbitrary set of simplexes $K\in {\Omega _h}$, we get that

$$\begin{equation*} \langle 1, \widehat{\boldsymbol{q}}_h \cdot \boldsymbol{n}\rangle _{\partial D_h} = (f,1)_{D_h}, \end{equation*}$$

which is nothing but the property of local conservativity.

Note that $\tilde{u}_h$ is well defined. Indeed, if we take $w=1$ in equation Equation 2.9c, the right-hand side is also equal to zero thanks to equation Equation 1.3b. The fact that it provides a better approximation to the potential $u$ than $u_h$ is contained in the following result.

Theorem 2.9.

Suppose that the exact solution $(u,\boldsymbol{q})$ belongs to $H^{r+2}({\Omega _h})\times \boldsymbol{H}^{r+1}({\Omega _h})$ for $r\in [0,k]$. Then, if $k\ge 1$,

$$\begin{alignat*}{2} &\| u-{u}^\star _h \|_{L^2({\Omega _h})} \le && C\,h^{r+2}\left(\mathfrak{C}_0^{r,0}(\boldsymbol{q})+\;|\, \boldsymbol{q}\,|_{\boldsymbol{H}^{r+1}(\Omega _{h})} +|\,u\,|_{H^{r+2}(\Omega _{h})}\right), \end{alignat*}$$

and if $k=0$ and $f\equiv 0$,

$$\begin{alignat*}{2} &\| u-{u}^\star _h \|_{L^2({\Omega _h})} \le &&C\,h^2\,\left(\mathcal{C}_0(\boldsymbol{q})+ |\, u \,|_{H^2({\Omega _h})}\right). \end{alignat*}$$

Note that when $\mathsf{P}f|_K\in \mathscr{P}^{k-1}(K)$ for all $K\in {\Omega _h}$, by Theorem 2.4 we have that the function $(\widehat{\boldsymbol{q}}_h\cdot \boldsymbol{n}, \mathsf{P}^{k-1} u_h, \lambda _h)$ is the same for the RT$_k$, BDM$_k$ ($k \ge 1$) and $\text{SF-H}_k$ methods. As a consequence, the postprocessed approximation $u^\star _h$ is also the same for all these methods.

Note also that in Reference 3, a general postprocessing which is solely based on approximation results was obtained. When applied to the $\text{SF-H}_k$ method for $k\ge 1$, it gives rise to an approximation of $u$ which converges with the same orders as ours. However, unlike such postprocessing, our postprocessed solution $u_h^\star$ is associated to a locally conservative scheme; it is also easier to compute.

Let us end this section by noting that all the error estimates for $k\ge 1$ hold if in the equation Equation 1.3b, we replace $f$ by any function $\mathcal{I}_h f$ such that $\mathcal{I}_h f|_K\in \mathscr{P}^{k-1}(K)$ for all $K\in {\Omega _h}$ and such that

$$\begin{equation*} \|\,f-\mathcal{I}_h f\,\|_{H^{-1}(\Omega )}\le C\,h^{r+1}\,|\,f\,|_{H^r({\Omega _h})}. \end{equation*}$$

Moreover, by statement (ii) of Theorem 2.4, the function $(\boldsymbol{q}_h,\mathsf{P}^{k-1}u_h,\lambda _h)$ provided by the RT$_k$, the BDM$_k$ and the $\text{SF-H}_k$ method is the same; in particular, we have that $\boldsymbol{q}_h\in \boldsymbol{H}(div,\Omega )$. The postprocessed approximation $u^\star _h$ is also the same for those three methods.

3. Proofs

In this section, we present detailed proofs of all our results.

3.1. Proof of Proposition 2.1: The properties of $(\boldsymbol{\Pi }, \mathbb{P})$

3.1.1. Two key auxiliary results about polynomials

To prove Proposition 2.1, we begin by stating and proving two lemmas whose use is crucial in our analysis.

Lemma 3.1.

Given the face $e$ of the simplex $K$ and a function $z\in \mathscr{P}^k(e)$, there is a unique function $Z\in \mathscr{P}^k(K)$ such that

$$\begin{alignat*}{1} (\text{i})&\quad \langle Z,\omega \rangle _e=\langle z,\omega \rangle _e \qquad \forall \;\omega \in \mathscr{P}^k(e), \\ (\text{ii})&\quad (Z,\mathsf{w})_K=0 \qquad \forall \;\mathsf{w}\in \mathscr{P}^{k-1}(K). \end{alignat*}$$

Moreover,

$$\begin{alignat*}{1} (\text{iii})&\quad \|\,Z\,\|_{L^2(K)}\le \,C\,h^{1/2}_K\|\,z\,\|_{L^2(e)}, \end{alignat*}$$

where $h_K$ is the diameter of the simplex $K$ and $C$ depends solely on $k$ and the shape-regularity parameters of the simplex $K$.

Lemma 3.2.

Given the face $e$ of the simplex $K$ and the function $z$ such that for all faces $e'$ of $K$ different from $e$, $z|_{e'}\in \mathscr{P}^k(e')$, there is a unique function $\boldsymbol{Z}\in \boldsymbol{\mathscr{P}}^k(K)$ such that

$$\begin{alignat*}{3} (\text{i})&\quad \langle \boldsymbol{Z}\cdot \boldsymbol{n},\omega \rangle _{e'}=\langle z,\omega \rangle _{e'}\qquad \forall \;\omega \in \mathscr{P}^k(e'), \;e'\neq e, \\ (\text{ii})&\quad (\boldsymbol{Z},\boldsymbol{v})_K=0 \qquad \forall \;\boldsymbol{v}\in \boldsymbol{\mathscr{P}}^{k-1}(K). \end{alignat*}$$

Moreover,

$$\begin{alignat*}{3} (\text{iii})&\quad \|\,\boldsymbol{Z}\,\|_{\boldsymbol{L}^2(K)} \le \,C\,h^{1/2}_K\|\,z\,\|_{L^2({\partial K}\setminus e)}, \end{alignat*}$$

where $h_K$ is the diameter of the simplex $K$ and $C$ depends solely on $k$ and the shape-regularity parameters of the simplex $K$.

We are only going to give a detailed proof of Lemma 3.2 since the proof of Lemma 3.1 is similar and simpler.

Proof of Lemma 3.2.

Let us begin by proving that the function $\boldsymbol{\sigma }\in \boldsymbol{\mathscr{P}}^k(K)$ satisfying (i) and (ii) exists and is unique. Since the linear system determined by equations (i) and (ii) is square, indeed,

$$\begin{alignat*}{1} \operatorname {dim}(\boldsymbol{\mathscr{P}}^{k-1}(K))&=\binom{k-1+d}{d}\times d,\\ \sum _{e'\in {\partial K},e'\neq e}\operatorname {dim}(\mathscr{P}^k(e'))&=\binom{k+d-1}{d-1}\times d,\\ \operatorname {dim}(\boldsymbol{\mathscr{P}}^k(K))&=\binom{k+d}{d}\times d, \end{alignat*}$$

and $\binom{k-1+d}{d}+\binom{k+d-1}{d-1}=\binom{k+d}{d}$, we only need to show that if $\boldsymbol{\sigma }\in \boldsymbol{\mathscr{P}}^k(K)$ satisfies

$$\begin{alignat*}{2} &(\boldsymbol{Z}, \boldsymbol{v})_K=0&&\qquad \forall \, \boldsymbol{v}\in \boldsymbol{\mathscr{P}}^{k-1}(K), \\ &\langle \boldsymbol{Z}\cdot \boldsymbol{n}, \omega \rangle _{e'}=0 &&\qquad \forall \, \omega \in \mathscr{P}^{k}(e'), e'\in {\partial K},e'\neq e, \end{alignat*}$$

then $\boldsymbol{Z}= \boldsymbol{0}$ on $K$.

Let $T$ be the affine mapping that transforms the element $K$ to the reference simplex $\widehat{K}$. Moreover, let us denote by $e_i$, $i=1,\ldots ,d+1$, the faces of $K$ where $e:=e_{d+1}$. Assume that the mapping $T$ is such that $\widehat{e}_i:=T(e_i)$ is the face of $\widehat{K}$ lying on the plane $\widehat{x}_i=0$, and set $\widehat{\boldsymbol{Z}}(\widehat{x}):=\boldsymbol{Z}(T^{-1}(\widehat{x}))$. Then the above equations become

$$\begin{alignat*}{2} &(\widehat{\boldsymbol{Z}}, \widehat{\boldsymbol{v}})_{\widehat{K}}=0 &&\qquad \forall \, \widehat{\boldsymbol{v}}\in \boldsymbol{\mathscr{P}}^{k-1}(\widehat{K}), \\ &\langle \widehat{\boldsymbol{Z}}\cdot \boldsymbol{n}_i, \widehat{\omega }\rangle _{\widehat{e}_i}=0 && \qquad \forall \, \widehat{\omega }\in \mathscr{P}^{k}(\widehat{e}_i),\; i=1,...,d, \end{alignat*}$$

since spaces of polynomials of a given total degree are invariant under affine transformations. Now, let $\{\widehat{\boldsymbol{n}}_j\}_{j=1}^{d}$ be the basis of $\mathbb{R}^d$ dual to $\{{\boldsymbol{n}}_i\}_{i=1}^{d}$, that is, ${\boldsymbol{n}}_i\cdot \widehat{\boldsymbol{n}}_j=\delta _{ij}.$ Then we can write $\widehat{\boldsymbol{Z}}=\sum _{j=1}^d \widehat{p}_j\widehat{\boldsymbol{n}}_j$, where $\widehat{p}_j \in \mathscr{P}^k(\widehat{K})$, and obtain that

$$\begin{alignat*}{2} &\sum _{j=1}^d(\widehat{p}_j, \widehat{\boldsymbol{n}}_j\cdot \widehat{\boldsymbol{v}})_{\widehat{K}}=0 &&\qquad \forall \, \widehat{\boldsymbol{v}}\in \boldsymbol{\mathscr{P}}^{k-1}(\widehat{K}), \\ &\langle \widehat{p}_i, \widehat{\omega }\rangle _{\widehat{e}_i}=0 &&\qquad \forall \, \widehat{\omega }\in \mathscr{P}^{k}(\widehat{e}_i),\; i=1,...,d. \end{alignat*}$$

The last equation implies that, for any $i=1,...,d$, $\widehat{p}_i|_{\widehat{e}_i}=0$ and hence that $\widehat{p}_i= \widehat{x}_i \widehat{\mathsf{p}}_i$ for some polynomial $\widehat{\mathsf{p}}_i$ in $\mathscr{P}^{k-1}(\widehat{K})$. Taking $\widehat{\boldsymbol{v}}=\widehat{\mathsf{p}}_i\,\boldsymbol{n}_i$, we get

$$\begin{equation*} (\widehat{p}_i, \widehat{\mathsf{p}}_i)_{\widehat{K}}=(\widehat{x}_i, \widehat{\mathsf{p}}_i^2)_{\widehat{K}}=0, \end{equation*}$$

and, since $\widehat{x}_i>0$ on $\widehat{K}$, we conclude that $\widehat{\mathsf{p}}_i=0$. This implies that $\boldsymbol{Z}=\boldsymbol{0}$ on $K$. This proves the existence and uniqueness of $\boldsymbol{Z}$ satisfying the conditions (i) and (ii).

The estimate (iii) follows now from a simple scaling argument. This completes the proof.

■

3.1.2. Proof of the orthogonality properties

It is not difficult to see that the fact that $(\boldsymbol{\Pi },\mathbb{P})$ is well defined is a direct corollary of Lemmas 3.1 and 3.2.

Now, let us prove the orthogonality properties. The property (i) follows from the property Equation 2.2a defining $\mathbb{P}$ and the orthogonality property (ii) follows from the property Equation 2.1a defining $\boldsymbol{\Pi }$. The orthogonality property (iii) follows from the properties Equation 2.2b and Equation 2.1b defining $\mathbb{P}$ and $\boldsymbol{\Pi }$, and from the definition of the projection $\mathsf{P}_{\partial }$, Equation 1.5. In fact, it follows from the fact that on each face $e$ of any simplex $K$, we have that either $\mathbb{P}\zeta =\mathsf{P}_{\partial }\zeta$ or $\boldsymbol{\Pi }\boldsymbol{\sigma }\cdot \boldsymbol{n}=\mathsf{P}_{\partial }\boldsymbol{\sigma }\cdot \boldsymbol{n}$.

3.1.3. Proof of the weak commutativity property

The weak commutativity property (iv) is a direct consequence of the three orthogonality properties we just proved. Indeed, we have that

$$\begin{alignat*}{2} -(\nabla \zeta ,\boldsymbol{\Pi }\boldsymbol{\sigma })_K &=(\zeta ,\nabla \cdot \boldsymbol{\Pi }\boldsymbol{\sigma })_K -\langle \zeta ,\boldsymbol{\Pi }\boldsymbol{\sigma }\cdot \boldsymbol{n}\rangle _{{\partial K}} \\ &=(\mathbb{P}\zeta ,\nabla \cdot \boldsymbol{\Pi }\boldsymbol{\sigma })_K -\langle \zeta ,\boldsymbol{\Pi }\boldsymbol{\sigma }\cdot \boldsymbol{n}\rangle _{{\partial K}} &&\quad \text{by (i)}, \\ &=-(\nabla \mathbb{P}\zeta ,\boldsymbol{\Pi }\boldsymbol{\sigma })_K +\langle \mathbb{P}\zeta -\zeta ,\boldsymbol{\Pi }\boldsymbol{\sigma }\cdot \boldsymbol{n}\rangle _{{\partial K}} \\ &=-(\nabla \mathbb{P}\zeta ,\boldsymbol{\sigma })_K +\langle \mathbb{P}\zeta -\zeta ,\boldsymbol{\Pi }\boldsymbol{\sigma }\cdot \boldsymbol{n}\rangle _{{\partial K}} &&\quad \text{by (ii)}, \\ &=-(\nabla \mathbb{P}\zeta ,\boldsymbol{\sigma })_K +\langle \mathbb{P}\zeta -\mathsf{P}_{\partial }\zeta ,\boldsymbol{\sigma }\cdot \boldsymbol{n}\rangle _{{\partial K}} &&\quad \text{by (iii)}, \\ &=(\mathbb{P}\zeta ,\nabla \cdot \boldsymbol{\sigma })_K -\langle \mathsf{P}_{\partial }\zeta ,\boldsymbol{\sigma }\cdot \boldsymbol{n}\rangle _{{\partial K}}, \\ &=(\mathbb{P}\zeta ,\nabla \cdot \boldsymbol{\sigma })_K -\langle \mathsf{P}_{\partial }\zeta ,\mathsf{P}_{\partial }\boldsymbol{\sigma }\cdot \boldsymbol{n}\rangle _{{\partial K}}, \end{alignat*}$$

by the definition of the projection $\mathsf{P}_{\partial }$, Equation 1.5. This completes the proof of (iv).

3.1.4. Proof of the estimates (v) and (vi)

Note that, by the definition of the projections $\boldsymbol{\Pi }$, Equation 2.1, and $\boldsymbol{\Pi }^{{\scriptscriptstyle {\mathrm{RT}}}}$, Equation 2.3, we have that

$$\begin{align*} (\boldsymbol{\Pi }^{{\scriptscriptstyle {\mathrm{RT}}}}\boldsymbol{\sigma }-\boldsymbol{\Pi }\boldsymbol{\sigma },\boldsymbol{v})_K&=0,\\ \langle (\boldsymbol{\Pi }^{{\scriptscriptstyle {\mathrm{RT}}}}\boldsymbol{\sigma }-\boldsymbol{\Pi }\boldsymbol{\sigma })\cdot \boldsymbol{n}, \omega \rangle _{e}&=\langle \mathsf{P}_{\partial }\boldsymbol{\sigma }\cdot \boldsymbol{n}-\boldsymbol{\Pi }\boldsymbol{\sigma }\cdot \boldsymbol{n}, \omega \rangle _{e\cap e_{K}^{\tau }}. \end{align*}$$

for all $\boldsymbol{v}\in \boldsymbol{\mathscr{P}}^{k-1}(K)$, if $k>1$, and for all $\omega \in \mathscr{P}^{k}(e)$ and all faces $e$ of $K$. By a well-known scaling argument, we immediately obtain that

$$\begin{equation*} \|\,\boldsymbol{\Pi }^{{\scriptscriptstyle {\mathrm{RT}}}}\boldsymbol{\sigma }-\boldsymbol{\Pi }\boldsymbol{\sigma }\,\|_{\boldsymbol{L}^2(K)}\le C\,h^{1/2}_K\, \|\,\mathsf{P}_{\partial }\boldsymbol{\sigma }\cdot \boldsymbol{n}-\boldsymbol{\Pi }\boldsymbol{\sigma }\cdot \boldsymbol{n}\,\|_{L^2(e_{K}^{\tau })}. \end{equation*}$$

It remains to estimate the above right-hand side. To do that, we note that, for any $\omega$ in $\mathscr{P}^k(K)$, we have that

$$\begin{equation*} \langle \omega , \boldsymbol{\Pi }\boldsymbol{\sigma }\cdot \boldsymbol{n}-\mathsf{P}_{\partial }\boldsymbol{\sigma }\cdot \boldsymbol{n}\rangle _{e_{K}^{\tau }} =\langle \omega ,(\boldsymbol{\Pi }\boldsymbol{\sigma }-\boldsymbol{\sigma })\cdot \boldsymbol{n}\rangle _{{\partial K}} =(\omega , \nabla \cdot (\boldsymbol{\Pi }\boldsymbol{\sigma }-\boldsymbol{\sigma }))_K, \end{equation*}$$

by the definition of the projections $\boldsymbol{\Pi }$, Equation 2.1. Taking $\omega =Z$, where $Z$ is given by Lemma 3.1 with $z:=\boldsymbol{\Pi }\boldsymbol{\sigma }\cdot \boldsymbol{n}-\mathsf{P}_{\partial }\boldsymbol{\sigma }\cdot \boldsymbol{n}$, we get that, for any $p$ in $\mathscr{P}^{k-1}(K)$,

$$\begin{alignat*}{1} \|\,z\,\|^2_{L^2(e_{K}^{\tau })} =& \|\,\boldsymbol{\Pi }\boldsymbol{\sigma }\cdot \boldsymbol{n}-\mathsf{P}_{\partial }\boldsymbol{\sigma }\cdot \boldsymbol{n}\,\|^2_{L^2(e_{K}^{\tau })} \\ =&-(\mathsf{P}\nabla \cdot \boldsymbol{\sigma }-p,Z)_K \\ \le &\; C\, \,h_K^{1/2}\,\|\,\mathsf{P}\nabla \cdot \boldsymbol{\sigma }-p\,\|_{L^2(K)}\,\|\,z\,\|_{L^2(e_{K}^{\tau })}, \end{alignat*}$$

and, after a direct application of the Bramble-Hilbert lemma, we get

$$\begin{equation*} \|\,\mathsf{P}_{\partial }\boldsymbol{\sigma }\cdot \boldsymbol{n}-\boldsymbol{\Pi }\boldsymbol{\sigma }\cdot \boldsymbol{n}\,\|_{L^2(e_{K}^{\tau })} \le C\,h_K^{r+1/2}\,|\,\mathsf{P}\nabla \cdot \boldsymbol{\sigma }\,|_{H^r(K)}, \end{equation*}$$

where $r\in [0,k]$. This completes the proof of the estimates (v) and (vi).

3.1.5. Proof of the estimate (vii)

Note that, by the definition of the projections $\mathbb{P}$, Equation 2.2, and $\mathsf{P}$, Equation 2.4, we have that

$$\begin{alignat*}{2} (\mathbb{P}\zeta -\mathsf{P}\zeta , \mathsf{w})_K&=0,\\ \langle \mathbb{P}\zeta -\mathsf{P}\zeta , \omega \rangle _{e_{K}^{\tau }}&= \langle \mathsf{P}_{\partial }\zeta -\mathsf{P}\zeta , \omega \rangle _{e_{K}^{\tau }}, \end{alignat*}$$

for all $\mathsf{w}\in \mathscr{P}^{k-1}(K)$, if $k\ge 1$, and for all $\omega \in \mathscr{P}^{k}(e_{K}^{\tau })$. This implies that Lemma 3.1 holds with $z:=\mathsf{P}_{\partial }\zeta -\mathsf{P}\zeta$ and $Z=\mathbb{P}\zeta -\mathsf{P}\zeta$. As a consequence,

$$\begin{equation*} \|\,\mathbb{P}\zeta -\mathsf{P}\zeta \,\|_{L^2(K)} \le C\, h^{1/2}_K\, \|\,\mathsf{P}_{\partial }\zeta -\mathsf{P}\zeta \,\|^2_{L^2(e_{K}^{\tau })}. \end{equation*}$$

It remains to estimate the above right-hand side.

To do that, we note that, for any $\boldsymbol{v}$ in $\boldsymbol{\mathscr{P}}^k(K)\oplus \boldsymbol{x}\,\mathscr{P}^k(K)$, we have that

$$\begin{equation} \langle \mathsf{P}\zeta -\mathsf{P}_{\partial }\zeta ,\boldsymbol{v}\cdot \boldsymbol{n}\rangle _{{\partial K}} =\langle \mathsf{P}\zeta -\zeta ,\boldsymbol{v}\cdot \boldsymbol{n}\rangle _{{\partial K}} =(\nabla \mathsf{P}\zeta -\nabla \zeta , \boldsymbol{v})_K, \cssId{auxiliary}{\tag{3.1}} \end{equation}$$

by the definition of the projection $\mathsf{P}$, Equation 2.4 and that of the projection $\mathsf{P}_{\partial }$, Equation 1.5. A well-known scaling argument states that given any function $z$ such that its restriction to each face $e$ of $k$ belongs to $\mathscr{P}^k(e)$, there is a function $\boldsymbol{Z}$ in $\boldsymbol{\mathscr{P}}^k(K)\oplus \boldsymbol{x}\,\mathscr{P}^k(K)$ such that

$$\begin{align*} (\text{i})&\quad \langle \boldsymbol{Z}\cdot \boldsymbol{n},\omega \rangle _e =\langle z,\omega \rangle _{e}\qquad \forall \;\omega \in \mathscr{P}^k(e), \\ (\text{ii})&\quad (\boldsymbol{Z},\boldsymbol{v})_K=0\qquad \forall \;\boldsymbol{v}\in \boldsymbol{\mathscr{P}}^{k-1}(K). \\ (\text{iii})&\quad \|\,\boldsymbol{Z}\,\|_{\boldsymbol{L}^2(K)} \le \,C\,h^{1/2}_K\|\,z\,\|_{L^2({\partial K})}, \end{align*}$$

where $h_K$ is the diameter of the simplex $K$ and $C$ depends solely on the shape-regularity constants of the simplex $K$. Taking $\boldsymbol{v}:=\boldsymbol{Z}$ with $z=\mathsf{P}\zeta -\mathsf{P}_{\partial }\zeta$ in equation Equation 3.1, we obtain that

$$\begin{alignat*}{1} \|\,z\,\|^2_{L^2({\partial K})} =&(\nabla \mathsf{P}\zeta -\nabla \zeta , \boldsymbol{Z})_K \\ =&(\boldsymbol{p}-\nabla \zeta , \boldsymbol{Z})_K \\ \le &\; C\, \,h_K^{1/2}\,\|\,\nabla \zeta -\boldsymbol{p}\,\|_{L^2(K)}\,\|\,z\,\|_{L^2({\partial K})}, \end{alignat*}$$

for any $\boldsymbol{p}\in \boldsymbol{\mathscr{P}}^{k-1}(K)$. Thus, after a direct application of the Bramble-Hilbert lemma, we get

$$\begin{equation*} \|\,\mathsf{P}\zeta -\mathsf{P}_{\partial }\zeta \,\|_{L^2({\partial K})} \le C\,h_K^{r+1/2}\,|\,\nabla \zeta \,|_{\boldsymbol{H}^r(K)}, \end{equation*}$$

where $r\in [0,k]$. This completes the proof of estimate (vii).

This completes the proof of Proposition 2.1.

3.2. Characterization of the approximate solution

To prove the results of the characterization of the approximate solution of the SF-H method, we begin by proving two auxiliary results concerning key properties of the local solvers.

3.2.1. Two auxiliary results about the local solvers

To state the first auxiliary result, we need to introduce the following decomposition of our local spaces:

$$\begin{align*} \boldsymbol{\mathscr{P}}^{k}(K)&=\boldsymbol{\mathcal{V}}(K) \oplus \boldsymbol{\mathcal{V}}^\perp (K),\\ \mathscr{P}^{k}(K) &= {\mathcal{W}}(K) \oplus {\mathcal{W}}^\perp (K), \end{align*}$$

where

$$\begin{alignat*}{1} \boldsymbol{\mathcal{V}}(K) :=&\{\boldsymbol{v}\in \boldsymbol{\mathscr{P}}^{k}(K):\;\nabla \cdot \boldsymbol{v}=0\},\\ \boldsymbol{\mathcal{V}}^\perp (K):=&\{\boldsymbol{v}\in \boldsymbol{\mathscr{P}}^{k}(K):\; (\boldsymbol{c}\;\boldsymbol{v},\boldsymbol{\sigma })_K=0\;\;\forall \;\boldsymbol{\sigma }\in \boldsymbol{\mathcal{V}}(K)\}, \end{alignat*}$$

and

$$\begin{alignat*}{1} {\mathcal{W}}(K):=&\mathscr{P}^{k-1}(K),\\ {\mathcal{W}}^\perp (K):=&\{\omega \in \mathscr{P}^{k}(K):\; (\omega ,\zeta )_K=0\;\;\forall \;\zeta \in \mathcal{W}(K)\}, \end{alignat*}$$

Lemma 3.3.

Let $K$ be any simplex of the triangulation ${\Omega _h}$. Then the local mapping $(\boldsymbol{\mathcal{Q}}{\mathsf{m}},\,\mathcal{U}{\mathsf{m}})$ given by equations Equation 2.5 can be obtained as follows:

(i): Set$$\begin{equation*} \,\mathcal{U}{\mathsf{m}}|_{e_{K}^{\tau }}=\mathsf{P}_{\partial }\mathsf{m}|_{e_{K}^{\tau }}. \end{equation*}$$
(ii): Compute $\boldsymbol{\mathcal{Q}}{\mathsf{m}}\in \boldsymbol{\mathcal{V}}(K)$ by solving$$\begin{equation*} (\boldsymbol{c}\,\boldsymbol{\mathcal{Q}}{\mathsf{m}}, \boldsymbol{v})_K =-\langle {\mathsf{m}},{\boldsymbol{v}\cdot \boldsymbol{n}}\rangle _{{\partial K}} \quad \forall \;\boldsymbol{v}\in \boldsymbol{{\mathcal{V}}}(K). \end{equation*}$$
(iii): Compute $\mathsf{P}^{k-1}\,\mathcal{U}{\mathsf{m}}$ by solving$$\begin{equation*} (\mathsf{P}^{k-1}\,\mathcal{U}{\mathsf{m}}, \nabla \cdot \boldsymbol{v})_K =\langle {\mathsf{m}},{\boldsymbol{v}\cdot \boldsymbol{n}}\rangle _{{\partial K}} \quad \forall \;\boldsymbol{v}\in \boldsymbol{{\mathcal{V}}}^\perp (K). \end{equation*}$$

Similarly, the local mapping $(\boldsymbol{\mathcal{Q}}{f},\,\mathcal{U}{f})$ given by equations Equation 2.6 can be obtained as follows:

($\alpha$): Compute $\,\mathcal{U}{f}|_{e_{K}^{\tau }}$ by solving$$\begin{equation*} \langle \omega , \tau \,\mathcal{U}{f}\rangle _{e_{K}^{\tau }}=(f,\omega )_K \quad \forall \;\omega \in {\mathcal{W}}^\perp (K). \end{equation*}$$
($\beta$): Compute $\boldsymbol{\mathcal{Q}}{f}\in \boldsymbol{\mathcal{V}}^\perp (K)$ by solving$$\begin{equation*} (\omega ,\nabla \cdot \boldsymbol{\mathcal{Q}}{f})_K =-\langle \omega , \tau \,\mathcal{U}{f}\rangle _{e_{K}^{\tau }}+(f,\omega )_K \quad \forall \;\omega \in \mathcal{W}(K). \end{equation*}$$
($\gamma$): Compute $\mathsf{P}^{k-1}\,\mathcal{U}{f}$ by solving$$\begin{equation*} (\mathsf{P}^{k-1}\,\mathcal{U}{f}, \nabla \cdot \boldsymbol{v})_K =(\boldsymbol{c}\,\boldsymbol{\mathcal{Q}}{f}, \boldsymbol{v})_K \quad \forall \;\boldsymbol{v}\in \boldsymbol{{\mathcal{V}}}^\perp (K). \end{equation*}$$

Proof.

Let us begin by proving the properties of the first local mapping. Thus, integrating by parts in the equation Equation 2.5b, we obtain

$$\begin{equation*} (\omega , \nabla \cdot \boldsymbol{\mathcal{Q}}{\mathsf{m}})_K =\langle \omega ,(\widehat{\boldsymbol{\mathcal{Q}}}^{}{\mathsf{m}}-\boldsymbol{\mathcal{Q}}{\mathsf{m}})\cdot \boldsymbol{n}\rangle _{{\partial K}} =\langle \omega ,\tau (\,\mathcal{U}{\mathsf{m}}-\mathsf{m})\rangle _{e_{K}^{\tau }} \end{equation*}$$

for all $\omega \in \mathscr{P}^{k}(K)$, by the definition of the numerical trace $\widehat{\boldsymbol{\mathcal{Q}}}^{}{\mathsf{m}}$, Equation 1.4b and Equation 1.6. Taking $\omega \in {\mathcal{W}}^\perp (K)$, we see that

$$\begin{equation*} \langle \omega ,\tau (\,\mathcal{U}{\mathsf{m}}-\mathsf{m})\rangle _{e_{K}^{\tau }}=0 \quad \forall \;\omega \in {\mathcal{W}}^\perp (K). \end{equation*}$$

Using the fact that ${\mathcal{W}}^\perp (K)|_{e_{K}^{\tau }}=\mathscr{P}^k(e_{K}^{\tau })$ which follows by a simple application of Lemma 3.1, we have that (i) holds. As a consequence, we see that

$$\begin{equation*} (\omega , \nabla \cdot \boldsymbol{\mathcal{Q}}{\mathsf{m}})_K=0 \quad \forall \;\omega \in \mathscr{P}^{k}(K), \end{equation*}$$

and hence that $\boldsymbol{\mathcal{Q}}{\mathsf{m}}\in \boldsymbol{\mathcal{V}}(K)$. The property (ii) can now be obtained by restricting the test functions $\boldsymbol{v}$ to the space $\boldsymbol{\mathcal{V}}(K)$ in the equation Equation 2.5a. Now that we know $\boldsymbol{\mathcal{Q}}{\mathsf{m}}$, we obtain the formulation (iii) for $\mathsf{P}^{k-1}\,\mathcal{U}{\mathsf{m}}$ by restricting the test functions $\boldsymbol{v}$ to the space $\boldsymbol{\mathcal{V}}^\perp (K)$ in the equation Equation 2.5a. It remains to show that $\mathsf{P}^{k-1}\,\mathcal{U}{\mathsf{m}}$ is uniquely defined by those equations. But this follows from the fact that the system of equations is square and that $\nabla \cdot \boldsymbol{\mathcal{V}}^\perp (K)=\mathscr{P}^{k-1}(K)=\mathcal{W}(K)$, which in turn follows from the fact that $\nabla \cdot \boldsymbol{\mathcal{V}}(K)=\lbrace 0\rbrace$ and $\nabla \cdot \boldsymbol{\mathscr{P}}^{k}(K)=\mathscr{P}^{k-1}(K)$. This completes the proof of the properties of the first local lifting.

The proof the properties ($\alpha$) and ($\gamma$) of the second local mapping is similar to the proof of the properties (i) and (iii) of the first local mapping, respectively. Let us prove property ($\beta$). If we take $\boldsymbol{v}\in \boldsymbol{\mathcal{V}}$ in the equation Equation 2.6a, we see that $\boldsymbol{\mathcal{Q}}{f}\in \boldsymbol{\mathcal{V}}^\perp (K)$. Since the equation in ($\beta$) is obtained from Equation 2.6b by restricting the tests functions $\omega$ to $\mathcal{W}(K)$, we only have to prove that $\boldsymbol{\mathcal{Q}}{f}$ given by ($\beta$) is well defined. But this follows from the fact that the system is a square system and $\nabla \cdot \boldsymbol{\mathcal{V}}^\perp (K)=\mathscr{P}^{k-1}(K)=\mathcal{W}(K)$. This completes the proof.

■

The second auxiliary result concerns the jumps of the local solvers.

Lemma 3.4.

For each simplex $K\in {\Omega _h}$, we have that, on ${\partial K}$,

$$\begin{alignat*}{1} &(\widehat{\boldsymbol{\mathcal{Q}}}^{}{\mathsf{m}}-\boldsymbol{\mathcal{Q}}{\mathsf{m}})\cdot \boldsymbol{n}=\tau \,(\,\mathcal{U}{\mathsf{m}}-\mathsf{P}_{\partial }\mathsf{m})=0,\\ &(\widehat{\boldsymbol{\mathcal{Q}}}^{}{f}-\boldsymbol{\mathcal{Q}}{f})\cdot \boldsymbol{n}=\tau \,\,\mathcal{U}{f}=\mathsf{P}_{\partial }\boldsymbol{q}\cdot \boldsymbol{n}-\boldsymbol{\Pi }\boldsymbol{q}\cdot \boldsymbol{n}. \end{alignat*}$$

Proof.

Let us begin by proving the second identity since its proof is more involved. Taking $\omega =Z$ in the identity ($\beta$) of Lemma 3.3, where $Z$ is given by Lemma 3.1 with $e=e_{K}^{\tau }$, we obtain that

$$\begin{alignat*}{1} \langle z, (\widehat{\boldsymbol{\mathcal{Q}}}^{}{f}-\boldsymbol{\mathcal{Q}}{f})\cdot \boldsymbol{n}\rangle _{e_{K}^{\tau }} =&\langle z, \tau \,\,\mathcal{U}{f}\rangle _{e_{K}^{\tau }} \\ =&(Z,\nabla \cdot (\boldsymbol{q}-\boldsymbol{\Pi }\boldsymbol{q}))_K \\ =& -(\nabla Z, \boldsymbol{q}-\boldsymbol{\Pi }\boldsymbol{q})_K+\langle Z, (\boldsymbol{q}-\boldsymbol{\Pi }\boldsymbol{q})\cdot \boldsymbol{n}\rangle _{{\partial K}} \\ =& \langle z, \mathsf{P}_{\partial }\boldsymbol{q}\cdot \boldsymbol{n}-\boldsymbol{\Pi }\boldsymbol{q}\cdot \boldsymbol{n}\rangle _{e_{K}^{\tau }}, \end{alignat*}$$

by the properties of the projection $\boldsymbol{\Pi }$, Equation 2.1. As a consequence, we immediately obtain that, on $e_{K}^{\tau }$,

$$\begin{equation*} (\widehat{\boldsymbol{\mathcal{Q}}}^{}{f}-\boldsymbol{\mathcal{Q}}{f})\cdot \boldsymbol{n}=\tau _K\,\,\mathcal{U}{f}=\mathsf{P}_{\partial }\boldsymbol{q}\cdot \boldsymbol{n}-\boldsymbol{\Pi }\boldsymbol{q}\cdot \boldsymbol{n}. \end{equation*}$$

A similar argument gives that, on $e_{K}^{\tau }$,

$$\begin{equation*} (\widehat{\boldsymbol{\mathcal{Q}}}^{}{\mathsf{m}}-\boldsymbol{\mathcal{Q}}{\mathsf{m}})\cdot \boldsymbol{n}=\tau _K\,(\,\mathcal{U}{\mathsf{m}}-\mathsf{P}_{\partial }\mathsf{m})=0. \end{equation*}$$

This completes the proof of Lemma 3.4.

■

3.2.2. Proof of Theorem 2.3: Characterization of the approximate solution

The following result is a particular case of a general result proven in Reference 11.

Theorem 3.5.

The approximate solution $(\boldsymbol{q}_h,u_h,\lambda _h)\in \boldsymbol{V}_h\times W_h\times M_h$ given by the $\text{SF-H}_k$ method is well defined. Moreover, we have that

where $\lambda _h$ can be characterized as the function in $M_h$ satisfying

$$\begin{equation*} a_h ( \lambda _h, \mu ) = b_h(\mu ) \qquad \forall \mu \in M_h, \end{equation*}$$

where

$$\begin{alignat*}{2} a_h(\eta ,\mu ) =& ( c\, \boldsymbol{\mathcal{Q}}{\eta }, \boldsymbol{\mathcal{Q}}{\mu })_{{\Omega _h}} &&-\langle \mu -\,\mathcal{U}{\mu }, (\widehat{\boldsymbol{\mathcal{Q}}}^{}{\eta }-\boldsymbol{\mathcal{Q}}{\eta })\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}}, \\ b_h(\mu ) =&\langle g,\widehat{\boldsymbol{\mathcal{Q}}}^{}{\mu }\cdot \boldsymbol{n}\rangle _{{\partial \Omega _D}} &&+(f, \,\mathcal{U}{\mu })_{{\Omega _h}}-\langle \mu , \mathsf{q_N}\rangle _{{\partial \Omega _N}} \\ &&&-\langle \mu -\,\mathcal{U}{\mu }, (\widehat{\boldsymbol{\mathcal{Q}}}^{}{f}-\boldsymbol{\mathcal{Q}}{f})\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}} \\ &&&+\langle -\,\mathcal{U}{f}, (\widehat{\boldsymbol{\mathcal{Q}}}^{}{\mu }-\boldsymbol{\mathcal{Q}}{\mu })\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}} \\ &&&+\langle \mu -\,\mathcal{U}{\mu }, (\widehat{\boldsymbol{\mathcal{Q}}}^{}{g}-\boldsymbol{\mathcal{Q}}{g})\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}} \\ &&&-\langle g-\,\mathcal{U}{g} ,(\widehat{\boldsymbol{\mathcal{Q}}}^{}{\mu }-\boldsymbol{\mathcal{Q}}{\mu })\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}}, \end{alignat*}$$

for all $\eta$ and $\mu \in M_h$.

Theorem 2.3 follows from this result if we show that on $\mathscr{E}_h$,

$$\begin{equation*} (\widehat{\boldsymbol{\mathcal{Q}}}^{}{\mu }-\boldsymbol{\mathcal{Q}}{\mu })\cdot \boldsymbol{n}=0 \quad \text{and}\quad (\mu -\,\mathcal{U}{\mu })(\widehat{\boldsymbol{\mathcal{Q}}}^{}{f}-\boldsymbol{\mathcal{Q}}{f})=0, \end{equation*}$$

for all $\mu \in M_h$. Since this is a straightforward consequence of Lemma 3.4, this completes the proof of Theorem 2.3.

3.2.3. Proof of Proposition 2.2: Characterization of the jumps

By the definition of the numerical traces Equation 1.4 and Equation 1.6, we have that

$$\begin{alignat*}{1} (\widehat{\boldsymbol{q}}_h-\boldsymbol{q}_h)\cdot \boldsymbol{n}=&\begin{cases} \tau _K\,(u_h-\widehat{u}_h)&\qquad \text{ on }e_{K}^{\tau },\\ 0 &\qquad \text{ otherwise} \end{cases} \\ =&\begin{cases} \tau _K\,(u_h-\widehat{u}_h)&\qquad \text{ on }e_{K}^{\tau },\\ \mathsf{P}_{\partial }\boldsymbol{q}\cdot \boldsymbol{n}-\boldsymbol{\Pi }\boldsymbol{q}\cdot \boldsymbol{n}&\qquad \text{ otherwise,} \end{cases} \end{alignat*}$$

by the definition of the projection $\boldsymbol{\Pi }$, Equation 2.1b, and that of the projection $\mathsf{P}_{\partial }$, Equation 1.5. So, we only have to prove that

$$\begin{equation*} (\widehat{\boldsymbol{q}}_h-\boldsymbol{q}_h)\cdot \boldsymbol{n}=\mathsf{P}_{\partial }\boldsymbol{q}\cdot \boldsymbol{n}-\boldsymbol{\Pi }\boldsymbol{q}\cdot \boldsymbol{n}\qquad \text{ on }e_{K}^{\tau }. \end{equation*}$$

But, by Theorem 2.3, we have that

$$\begin{alignat*}{1} (\widehat{\boldsymbol{q}}_h-\boldsymbol{q}_h)\cdot \boldsymbol{n}=& (\widehat{\boldsymbol{\mathcal{Q}}}^{}{\lambda _h}-\boldsymbol{\mathcal{Q}}{\lambda _h})\cdot \boldsymbol{n}+ (\widehat{\boldsymbol{\mathcal{Q}}}^{}{g}-\boldsymbol{\mathcal{Q}}{g})\cdot \boldsymbol{n}+ (\widehat{\boldsymbol{\mathcal{Q}}}^{}{f}-\boldsymbol{\mathcal{Q}}{f})\cdot \boldsymbol{n}\\ =&\mathsf{P}_{\partial }\boldsymbol{q}\cdot \boldsymbol{n}-\boldsymbol{\Pi }\boldsymbol{q}\cdot \boldsymbol{n} \end{alignat*}$$

on the face $e_{K}^{\tau }$. This completes the proof of Proposition 2.2.

3.2.4. Proof of Theorem 2.4

The statement (i) of Theorem 2.4 follows directly from Theorem 2.3 and from Lemma 3.3.

To prove the remaining statements, we are going to use the fact that the RT$_k$, BDM$_k$ and $\text{SF-H}_k$ methods have exactly the same structure and satisfy the characterization Theorem 2.3; see Reference 11. The only difference between these methods is the choice of local spaces (see Table 1) and the choice of the local stabilization parameters $\tau$; see Table 2. Thus, to prove statement (ii) we only have to show that the functions $(\boldsymbol{\mathcal{Q}}{\mathsf{m}},\mathsf{P}^{k-1}\,\mathcal{U}{\mathsf{m}})$ and $(\boldsymbol{\mathcal{Q}}{f},\mathsf{P}^{k-1}\,\mathcal{U}{f})$ are the same for all these methods whenever $f|_K\in \mathscr{P}^{k-1}(K)$ for all $K\in {\Omega _h}$. Similarly, to prove statement (iii), we only have to show that $\boldsymbol{\mathcal{Q}}{\mathsf{m}}$ is the same for all these methods.

To do that, we begin by noting that we have, by Lemma 3.3, that the function $(\boldsymbol{\mathcal{Q}}{\mathsf{m}},\mathsf{P}^{k-1}\,\mathcal{U}{\mathsf{m}})\in \boldsymbol{\mathcal{V}}(K)\times \mathcal{W}(K)$ is determined by

$$\begin{alignat*}{2} (\boldsymbol{c}\,\boldsymbol{\mathcal{Q}}{\mathsf{m}}, \boldsymbol{v})_K =&-\langle {\mathsf{m}},{\boldsymbol{v}\cdot \boldsymbol{n}}\rangle _{{\partial K}} &&\quad \forall \;\boldsymbol{v}\in \boldsymbol{{\mathcal{V}}}(K), \\ (\mathsf{P}^{k-1}\,\mathcal{U}{\mathsf{m}}, \nabla \cdot \boldsymbol{v})_K =&(\boldsymbol{c}\,\boldsymbol{\mathcal{Q}}{\mathsf{m}}, \boldsymbol{v})_K+\langle {\mathsf{m}},{\boldsymbol{v}\cdot \boldsymbol{n}}\rangle _{{\partial K}} &&\quad \forall \;\boldsymbol{v}\in \boldsymbol{{\mathcal{V}}}^\perp (K). \end{alignat*}$$

and that the function $(\boldsymbol{\mathcal{Q}}{f},\mathsf{P}^{k-1}\,\mathcal{U}{f})\in \boldsymbol{\mathcal{V}}^\perp (K)\times \mathcal{W}(K)$ is determined by the equations

$$\begin{alignat*}{2} (\omega ,\nabla \cdot \boldsymbol{\mathcal{Q}}{f})_K =&-\langle \omega , \tau \,\mathcal{U}{f}\rangle _{e_{K}^{\tau }}+(f,\omega )_K &&\quad \forall \;\omega \in \mathcal{W}(K), \\ (\mathsf{P}^{k-1}\,\mathcal{U}{f}, \nabla \cdot \boldsymbol{v})_K =&(\boldsymbol{c}\,\boldsymbol{\mathcal{Q}}{\mathsf{m}}, \boldsymbol{v})_K &&\quad \forall \;\boldsymbol{v}\in \boldsymbol{{\mathcal{V}}}^\perp (K), \end{alignat*}$$

where $\,\mathcal{U}{f}|_{e_{K}^{\tau }}=0$, by ($\alpha$) of Lemma 3.3, if $f|_K\in \mathscr{P}^{k-1}(K)$. Since the four equations above also hold (the third whenever $f|_K\in \mathscr{P}^{k-1}(K)$) for the BDM$_k$ method, we conclude that the statements (ii) and (iii) hold if we exclude the RT$_k$ method.

To show that these statements also hold if we include it, we note that the above equations hold for the RT$_k$ method if we modify the definition of the spaces $\boldsymbol{\mathcal{V}}(K)$ and $\boldsymbol{\mathcal{V}}(K)$ by

$$\begin{alignat*}{1} \boldsymbol{\mathcal{V}}_{\mathrm{{RT}}}(K) :=&\{\boldsymbol{v}\in \boldsymbol{\mathscr{P}}^{k}(K)\oplus \boldsymbol{x}\,\mathscr{P}^{k}(K):\;\nabla \cdot \boldsymbol{v}=0\},\\ \boldsymbol{\mathcal{V}}^\perp _{\mathrm{{RT}}}(K):=&\{\boldsymbol{v}\in \boldsymbol{\mathscr{P}}^{k}(K)\oplus \boldsymbol{x}\,\mathscr{P}^{k}(K):\; (\boldsymbol{c}\;\boldsymbol{v},\boldsymbol{\sigma })_K=0\;\;\forall \;\boldsymbol{\sigma }\in \boldsymbol{\mathcal{V}}_{\mathrm{{RT}}}(K)\}, \end{alignat*}$$

and if we replace the third equation by

$$\begin{equation*} \nabla \cdot \boldsymbol{\mathcal{Q}}{f}=\mathsf{P}f. \end{equation*}$$

Thus, the result follows from the fact that

$$\begin{equation*} \boldsymbol{\mathcal{V}}_{\mathrm{{RT}}}(K)= \{\boldsymbol{v}\in \boldsymbol{\mathscr{P}}^{k}(K):\;\nabla \cdot \boldsymbol{v}=0\}=\boldsymbol{\mathcal{V}}(K), \end{equation*}$$

and from the fact that, if $\boldsymbol{\mathcal{Q}}{f}\in \boldsymbol{\mathcal{V}}_{\mathrm{{RT}}}^\perp (K)$ and $\nabla \cdot \boldsymbol{\mathcal{Q}}{f}=\mathsf{P}f\in \mathscr{P}^{k-1}(K)$, then $\boldsymbol{\mathcal{Q}}{f}$ belongs to the space $\{\boldsymbol{v}\in \boldsymbol{\mathscr{P}}^{k}(K):\; (\boldsymbol{c}\;\boldsymbol{v},\boldsymbol{\sigma })_K=0\;\;\forall \;\boldsymbol{\sigma }\in \boldsymbol{\mathcal{V}}(K)\}=\boldsymbol{\mathcal{V}}^\perp (K)$. This completes the proof of Theorem 2.4.

3.3. Proof of the error estimates

The proof of the error estimates is based on the error equations and the properties of the projection $(\boldsymbol{\Pi },\mathbb{P})$ gathered in Proposition 2.1. The error equations are

$$\begin{alignat}{1} &(\boldsymbol{c}\;(\boldsymbol{q}-\boldsymbol{q}_h), \boldsymbol{v})_{{\Omega _h}}-(u-u_h,\nabla \cdot \boldsymbol{v})_{{\Omega _h}}+\langle u-\widehat{u}_h,\boldsymbol{v}\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}}=0, \cssId{error_equations-a}{\tag{3.2a}}\\ &(\omega ,\nabla \cdot (\boldsymbol{q}-\boldsymbol{q}_h))_{{\Omega _h}} -\langle \omega ,(\widehat{\boldsymbol{q}}_h-\boldsymbol{q}_h)\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}}=0, \cssId{error_equations-b}{\tag{3.2b}}\\ &u-\widehat{u}_h=g-\mathsf{P}_{\partial }g\qquad \text{ on } {\partial \Omega _D},\cssId{error_equations-c}{\tag{3.2c}}\\ &(\boldsymbol{q}-\widehat{\boldsymbol{q}}_h)\cdot \boldsymbol{n}=\mathsf{q_N}-\mathsf{P}_{\partial }\mathsf{q_N} \qquad \text{ on } {\partial \Omega _N}, \cssId{error_equations-d}{\tag{3.2d}} \end{alignat}$$

for all $(\boldsymbol{v},\omega )\in \boldsymbol{V}_h\times W_h$.

A direct consequence of the weak commutativity identity (iv) of Proposition 2.1 that we find convenient to use in our analysis is contained in the following result.

Corollary 3.6.

For all $(\boldsymbol{\sigma },\zeta )\in \boldsymbol{H}^1({\Omega _h})\times H^1({\Omega _h})$, we have

$$\begin{alignat*}{1} (\alpha )&\quad (\zeta , \nabla \cdot \boldsymbol{\Pi }\boldsymbol{\sigma })_{\Omega _h}= (\mathbb{P}\zeta , \nabla \cdot \boldsymbol{\sigma })_{\Omega _h}+\langle \mathsf{P}_{\partial }\zeta , \boldsymbol{\Pi }\boldsymbol{\sigma }\cdot \boldsymbol{n}-\mathsf{P}_{\partial }\boldsymbol{\sigma }\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}}, \\ (\beta )& \quad \|\,\boldsymbol{\Pi }\boldsymbol{\sigma }-\boldsymbol{\sigma }\,\|_{\boldsymbol{L}^2({\Omega _h})} \le \;C\,h^{r+1}\,|\,\boldsymbol{\sigma }\,|_{\boldsymbol{H}^{r+1}({\Omega _h})}, \\ (\gamma )& \quad \|\,\mathbb{P}\zeta -\zeta \,\|_{L^2({\Omega _h})} \le \;C\,h^{s+1}\,|\,\zeta \,|_{H^{s+1}({\Omega _h})}, \end{alignat*}$$

where $r,s\in [0,k]$ and $C$ depends only on $k$ and the shape-regularity parameters of the simplexes $K\in {\Omega _h}$.

3.3.1. Proof of Theorem 2.5: The error in the flux

Theorem 2.5 follows immediately from the following auxiliary result.

Lemma 3.7.

We have $(\boldsymbol{c}\,(\boldsymbol{q}-\boldsymbol{q}_h),\boldsymbol{\Pi }\boldsymbol{q}-\boldsymbol{q}_h)_{{\Omega _h}}=0.$

Indeed, this implies that

$$\begin{alignat*}{1} (\boldsymbol{c}\,(\boldsymbol{q}-\boldsymbol{q}_h),\boldsymbol{q}-\boldsymbol{q}_h)_{{\Omega _h}}=&(\boldsymbol{c}\,(\boldsymbol{q}-\boldsymbol{q}_h),\boldsymbol{q}-\boldsymbol{\Pi }\boldsymbol{q})_{{\Omega _h}}, \end{alignat*}$$

and hence, that

$$\begin{equation*} \|\,\boldsymbol{q}-\boldsymbol{q}_h\,\|_{\boldsymbol{L}^2({\Omega _h},\boldsymbol{c})} \le \|\,\boldsymbol{\Pi }\boldsymbol{q}-\boldsymbol{q}\|_{\boldsymbol{L}^2({\Omega _h},\boldsymbol{c})} \le C\,h^{s+1}\;|\,\boldsymbol{q}\,|_{\boldsymbol{H}^{s+1}({\Omega _h})}, \end{equation*}$$

for some $s\in [0,k]$, by the estimate (vi) of Proposition 2.1. This proves Theorem 2.5.

Let us prove Lemma 3.7.

Proof.

By the error equation Equation 3.2a with $\boldsymbol{v}:=\boldsymbol{\Pi }\boldsymbol{q}-\boldsymbol{q}_h$, we have

$$\begin{alignat*}{1} (\boldsymbol{c}\,(\boldsymbol{q}-\boldsymbol{q}_h),\boldsymbol{\Pi }\boldsymbol{q}-\boldsymbol{q}_h)_{{\Omega _h}} =&(u-u_h,\nabla \cdot (\boldsymbol{\Pi }\boldsymbol{q}-\boldsymbol{q}_h))_{{\Omega _h}} \\ &-\langle u-\widehat{u}_h,(\boldsymbol{\Pi }\boldsymbol{q}-\boldsymbol{q}_h)\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}}. \end{alignat*}$$

By the identity ($\alpha$) of Corollary 3.6 with $(\boldsymbol{\sigma },\zeta ):=(\boldsymbol{q}-\boldsymbol{q}_h,u-u_h)$, we get that

$$\begin{alignat*}{1} (\boldsymbol{c}\,(\boldsymbol{q}-\boldsymbol{q}_h),\boldsymbol{\Pi }\boldsymbol{q}-\boldsymbol{q}_h)_{{\Omega _h}} =&(\mathbb{P}u-u_h,\nabla \cdot (\boldsymbol{q}-\boldsymbol{q}_h))_{{\Omega _h}} \\ & +\langle \mathsf{P}_{\partial }u-u_h,(\boldsymbol{\Pi }\boldsymbol{q}-\boldsymbol{q})\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}} \\ & -\langle u-\widehat{u}_h,(\boldsymbol{\Pi }\boldsymbol{q}-\boldsymbol{q}_h)\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}}, \end{alignat*}$$

and by the error equation Equation 3.2b with $\omega :=\mathbb{P}u-u_h$,

$$\begin{alignat*}{1} (\boldsymbol{c}\,(\boldsymbol{q}-\boldsymbol{q}_h),\boldsymbol{\Pi }\boldsymbol{q}-\boldsymbol{q}_h)_{{\Omega _h}} =&\langle \mathbb{P}u-u_h,(\widehat{\boldsymbol{q}}_h-\boldsymbol{q}_h)\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}} \\ & +\langle \mathsf{P}_{\partial }u-u_h,(\boldsymbol{\Pi }\boldsymbol{q}-\boldsymbol{q})\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}} \\ & -\langle u-\widehat{u}_h,(\boldsymbol{\Pi }\boldsymbol{q}-\boldsymbol{q}_h)\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}}. \end{alignat*}$$

It we denote by $T$ the right-hand side of the above equations, it is not difficult to see that, after a few simple algebraic manipulations, we have that $T=\sum _{i=1}^5\,T_i,$ where

$$\begin{alignat*}{1} T_1:=&\langle \widehat{u}_h-u_h,(\widehat{\boldsymbol{q}}_h-\boldsymbol{q}_h)\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}},\\ T_2:=&\langle \widehat{u}_h-u_h,(\boldsymbol{\Pi }\boldsymbol{q}-\boldsymbol{q})\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}},\\ T_3:=&\langle \widehat{u}_h-u,(\boldsymbol{q}-\widehat{\boldsymbol{q}}_h)\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}},\\ T_4:=&\langle \mathsf{P}_{\partial }u-u,(\boldsymbol{\Pi }\boldsymbol{q}-\boldsymbol{q})\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}},\\ T_5:=&\langle \mathbb{P}u -u,(\widehat{\boldsymbol{q}}_h-\boldsymbol{q}_h)\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}}. \end{alignat*}$$

We are going to show that $T=0$.

We begin by noting that,

$$\begin{alignat*}{1} T_1=&\sum _{K\in {\Omega _h}}\langle \widehat{u}_h -u_h,(\widehat{\boldsymbol{q}}_h-\boldsymbol{q}_h)\cdot \boldsymbol{n}\rangle _{{\partial K}} \\ =&\sum _{K\in {\Omega _h}}\langle \widehat{u}_h-u_h,\mathsf{P}_{\partial }\boldsymbol{q}\cdot \boldsymbol{n}-\boldsymbol{\Pi }\boldsymbol{q}\cdot \boldsymbol{n}\rangle _{{\partial K}}, \end{alignat*}$$

by Proposition 2.2. By the definition of the projection $\mathsf{P}_{\partial }$, Equation 1.5,

$$\begin{alignat*}{1} T_1=&\sum _{K\in {\Omega _h}}\langle \widehat{u}_h-u_h,(\boldsymbol{q}-\boldsymbol{\Pi }\boldsymbol{q})\cdot \boldsymbol{n}\rangle _{{\partial K}}\\ =&-T_2, \end{alignat*}$$

by the definition of the projection $\boldsymbol{\Pi }$, Equation 2.1b. Thus, $T_1+T_2=0$.

Next, let us show that $T_3+T_4=0$. By the fact that the numerical trace $\widehat{u}_h$ and the normal component of the numerical trace $\widehat{\boldsymbol{q}}_h$ are single-valued on the interior faces, by definition of $\widehat{u}_h$, Equation 1.4a, and the equation Equation 1.3c satisfied by $\widehat{\boldsymbol{q}}_h$, we have that

$$\begin{alignat*}{1} T_3=&\langle \widehat{u}_h-u,(\boldsymbol{q}-\widehat{\boldsymbol{q}}_h)\cdot \boldsymbol{n}\rangle _{{\partial \Omega }},\\ =& \langle \mathsf{P}_{\partial }g-g,(\boldsymbol{q}-\widehat{\boldsymbol{q}}_h)\cdot \boldsymbol{n}\rangle _{{\partial \Omega _D}} +\langle \lambda _h-u,\mathsf{q_N}-\mathsf{P}_{\partial }\mathsf{q_N}\rangle _{{\partial \Omega _N}}, \end{alignat*}$$

by the definition of the numerical traces at the boundary. By using the definition of the projection $\mathsf{P}_{\partial }$, Equation 1.5, we get

$$\begin{alignat*}{1} T_3 =& \langle \mathsf{P}_{\partial }u-u,\boldsymbol{q}\cdot \boldsymbol{n}-\mathsf{P}_{\partial }\boldsymbol{q}\cdot \boldsymbol{n}\rangle _{{\partial \Omega _D}} +\langle \mathsf{P}_{\partial }u-u,\boldsymbol{q}\cdot \boldsymbol{n}-\mathsf{P}_{\partial }\boldsymbol{q}\cdot \boldsymbol{n}\rangle _{{\partial \Omega _N}}\\ =& \langle \mathsf{P}_{\partial }u-u,\boldsymbol{q}\cdot \boldsymbol{n}-\mathsf{P}_{\partial }\boldsymbol{q}\cdot \boldsymbol{n}\rangle _{{\partial \Omega }}\\ =&\langle \mathsf{P}_{\partial }u-u,\boldsymbol{q}\cdot \boldsymbol{n}-\mathsf{P}_{\partial }\boldsymbol{q}\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}}\\ =&\langle \mathsf{P}_{\partial }u-u,\boldsymbol{q}\cdot \boldsymbol{n}-\boldsymbol{\Pi }\boldsymbol{q}\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}}\\ =&-T_4, \end{alignat*}$$

Finally, let us show that $T_5=0$. By the definition of the numerical trace $\widehat{\boldsymbol{q}}_h$, Equation 1.4b,

$$\begin{alignat*}{2} T_5=&\sum _{K\in {\Omega _h}}\langle \mathbb{P}u -u,(\widehat{\boldsymbol{q}}_h-\boldsymbol{q}_h)\cdot \boldsymbol{n}\rangle _{{\partial K}} \\ =&\sum _{K\in {\Omega _h}}\langle \mathbb{P}u -u,(\widehat{\boldsymbol{q}}_h-\boldsymbol{q}_h)\cdot \boldsymbol{n}\rangle _{e_{K}^{\tau }} \\ =&\;0, \end{alignat*}$$

by the definition of the projection $\mathbb{P}$, Equation 2.2b. This completes the proof.

■

3.3.2. Proof of Theorem 2.6: Superconvergence of $u_h$

Since

$$\begin{equation*} \|\,\mathbb{P}u-u_h\,\|_{H^{-s}({\Omega _h})}=\sup _{\theta \in \mathcal{C}^\infty _0(\Omega )}\frac{(\mathbb{P}u-u_h,\theta )_{\Omega }}{\|\,\theta \,\|_{H^{s}(\Omega )}}, \end{equation*}$$

we need to estimate the number $(\mathbb{P}u-u_h,\theta )_{\Omega }$. It is expressed in a suitable way in the following auxiliary result. Let us recall that $\mathsf{P}^{k-1}$ is defined by Equation 2.4 for $k\ge 1$, and is $\mathsf{P}^{k-1}\equiv 0$ for $k=0$.

Lemma 3.8.

We have

$$\begin{alignat*}{1} (\mathbb{P}u-u_h,\theta )_{{\Omega _h}} =& (\boldsymbol{c}\;(\boldsymbol{q}-\boldsymbol{q}_h),\boldsymbol{\Pi }\boldsymbol{\psi }-\boldsymbol{\psi })_{{\Omega _h}} +(\mathsf{P}^{k-1}\nabla \varphi -\nabla \varphi ,\boldsymbol{q}-\boldsymbol{\Pi }\boldsymbol{q})_{\Omega _h}\\ & -\sum _{K\in {\Omega _h}}\tau ^{-1}\langle \boldsymbol{\Pi }\boldsymbol{q}\cdot \boldsymbol{n}-\mathsf{P}_{\partial }\boldsymbol{q}\cdot \boldsymbol{n}, \boldsymbol{\Pi }\boldsymbol{\psi }\cdot \boldsymbol{n}-\mathsf{P}_{\partial }\boldsymbol{\psi }\cdot \boldsymbol{n}\rangle _{e_{K}^{\tau }}. \end{alignat*}$$

Assume that $k\ge 1$. Then, applying the Cauchy-Schwarz inequality and using the estimate of $\boldsymbol{q}-\boldsymbol{q}_h$ in Theorem 2.5, and the approximation properties of the projections $\mathsf{P}^{k-1}$ and $\mathbb{P}$, ($v$) in Proposition 2.1 and ($\beta$) in Corollary 3.6, we readily obtain

$$\begin{alignat*}{1} (\mathbb{P}u-u_h,\theta )_{{\Omega _h}} \le & C\,h^{r+1}\,|\,\boldsymbol{q}\,|_{\boldsymbol{H}^{r+1}({\Omega _h})}\,h^{s+1}\,|\,\boldsymbol{\psi }\,|_{\boldsymbol{H}^{s+1}({\Omega _h})} \\ &+ C\,h^{r+1}\,|\,\boldsymbol{q}\,|_{\boldsymbol{H}^{r+1}({\Omega _h})}\,h^{s+1}\,|\,\nabla \varphi \,|_{\boldsymbol{H}^{s+1}({\Omega _h})} \\ & +\max _{K\in {\Omega _h}} \frac{1}{h_K\,\tau _K}\; C\,h^{r+1}\,|\,f\,|_{H^{r}({\Omega _h})}\,h^{s+1}\,|\,\theta \,|_{H^{s}({\Omega _h})}, \end{alignat*}$$

where $r,s\in [0,k-1]$. Since $\kappa :=\max _{K\in {\Omega _h}} \frac{1}{h_K\,\tau _K}$ and using the elliptic regularity assumption Equation 2.8, we get

$$\begin{equation*} (\mathbb{P}u-u_h,\theta )_{{\Omega _h}}\le C\,\mathfrak{C}_\kappa ^{r,s}(\boldsymbol{q})\,h^{r+s+2}\,|\,\theta \,|_{H^{s}({\Omega _h})}. \end{equation*}$$

This completes the proof of Theorem 2.6 for $k\ge 1$.

In the case $k=0$, we have that

$$\begin{alignat*}{1} (\mathbb{P}u-u_h,\theta )_{{\Omega _h}} \le & C\,h\,|\,\boldsymbol{q}\,|_{\boldsymbol{H}^{1}({\Omega _h})}\,h\,|\,\boldsymbol{\psi }\,|_{\boldsymbol{H}^{1}({\Omega _h})} \\ &+ C\,h\,|\,\boldsymbol{q}\,|_{\boldsymbol{H}^{1}({\Omega _h})}\,|\,\nabla \varphi \,|_{\boldsymbol{H}^{1}({\Omega _h})} \\ & +\kappa \, C\,h\,|\,f\,|_{L^2({\Omega _h})}\,h\,|\,\theta \,|_{L^2({\Omega _h})}, \end{alignat*}$$

and, after using the elliptic regularity assumption Equation 2.8, we get

$$\begin{equation*} (\mathbb{P}u-u_h,\theta )_{{\Omega _h}}\le C\,\mathcal{C}_\kappa (\boldsymbol{q})\,h\,|\,\theta \,|_{L^2({\Omega _h})}. \end{equation*}$$

Finally, let us consider the case $k=0$ and $f=0$. By the identity (v) of Proposition 2.1 we have that $\boldsymbol{\Pi }\boldsymbol{\sigma }\cdot \boldsymbol{n}=\mathsf{P}_{\partial }\boldsymbol{\sigma }\cdot \boldsymbol{n}$, and by the identity (vi) of Proposition 2.1 we have that $\boldsymbol{\Pi }\boldsymbol{q}=\boldsymbol{\Pi }^{{\scriptscriptstyle {\mathrm{RT}}}}\boldsymbol{q}$. This implies that

$$\begin{alignat*}{1} (\mathbb{P}u-u_h,\theta )_{{\Omega _h}} =& (\boldsymbol{c}\;(\boldsymbol{q}-\boldsymbol{q}_h),\boldsymbol{\Pi }\boldsymbol{\psi }-\boldsymbol{\psi })_{{\Omega _h}} +\langle \varphi ,(\boldsymbol{q}-\boldsymbol{\Pi }\boldsymbol{q})\cdot \boldsymbol{n}\rangle _{{\partial \Omega }} \\ =& (\boldsymbol{c}\;(\boldsymbol{q}-\boldsymbol{q}_h),\boldsymbol{\Pi }^{{\scriptscriptstyle {\mathrm{RT}}}}\boldsymbol{\psi }-\boldsymbol{\psi })_{{\Omega _h}} +\langle \varphi -\mathsf{P}_{\partial }\varphi ,\mathsf{q_N}-\mathsf{P}_{\partial }\mathsf{q_N}\rangle _{{\partial \Omega _N}} \end{alignat*}$$

by the adjoint equation Equation 2.7c and the boundary condition Equation 1.1d. As a consequence, we get

$$\begin{alignat*}{1} (\mathbb{P}u-u_h,\theta )_{{\Omega _h}} \le & C\,h\,|\,\boldsymbol{q}\,|_{\boldsymbol{H}^{1}({\Omega _h})}\,h\,|\,\boldsymbol{\psi }\,|_{\boldsymbol{H}^{1}({\Omega _h})} \\ &+ C\,h\,|\,\mathsf{q_N}\,|_{H^{1}({\partial \Omega _N})}\,h\,|\,\varphi \,|_{H^{1}({\partial \Omega _N})}, \end{alignat*}$$

and since

$$\begin{equation*} |\,\varphi \,|_{H^{1}({\partial \Omega _N})}\le C\,|\,\varphi \,|_{H^{2}(\Omega )}, \end{equation*}$$

by the elliptic regularity assumption Equation 2.8, we get

$$\begin{alignat*}{1} (\mathbb{P}u-u_h,\theta )_{{\Omega _h}} \le & C\,\mathfrak{C}_\kappa ^{0,0}(\boldsymbol{q})\,h^2\,|\,\theta \,|_{L^2(\Omega )}. \end{alignat*}$$

It remains to prove Lemma 3.8.

Proof.

By the adjoint equation Equation 2.7b, we have that

$$\begin{alignat*}{1} (\mathbb{P}u-u_h,\theta )_{{\Omega _h}} =&(\mathbb{P}u-u_h,\nabla \cdot \boldsymbol{\psi })_{{\Omega _h}}\\ =&( u-u_h,\nabla \cdot \boldsymbol{\Pi }\boldsymbol{\psi })_{{\Omega _h}} -\langle \mathsf{P}_{\partial }u-u_h, (\boldsymbol{\Pi }\boldsymbol{\psi }-\boldsymbol{\psi })\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}}, \end{alignat*}$$

by the identity ($\alpha$) of Corollary 3.6 with $(\boldsymbol{\sigma },\zeta ):=(\boldsymbol{\psi },u-u_h)$. By the error equation Equation 3.2a with $\boldsymbol{v}:=\boldsymbol{\Pi }\boldsymbol{\psi }$, we get

$$\begin{alignat*}{1} (\mathbb{P}u-u_h,\theta )_{{\Omega _h}} =&(\boldsymbol{c}\;(\boldsymbol{q}-\boldsymbol{q}_h),\boldsymbol{\Pi }\boldsymbol{\psi })_{{\Omega _h}} +\langle u-\widehat{u}_h, \boldsymbol{\Pi }\boldsymbol{\psi }\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}} \\ & -\langle \mathsf{P}_{\partial }u-u_h, (\boldsymbol{\Pi }\boldsymbol{\psi }-\boldsymbol{\psi })\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}} \\ =&(\boldsymbol{c}\;(\boldsymbol{q}-\boldsymbol{q}_h),\boldsymbol{\Pi }\boldsymbol{\psi }-\boldsymbol{\psi })_{{\Omega _h}} +(\boldsymbol{c}\;(\boldsymbol{q}-\boldsymbol{q}_h),\boldsymbol{\psi })_{{\Omega _h}} \\ & +\langle u-\widehat{u}_h, \boldsymbol{\Pi }\boldsymbol{\psi }\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}} -\langle \mathsf{P}_{\partial }u-u_h, (\boldsymbol{\Pi }\boldsymbol{\psi }-\boldsymbol{\psi })\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}}, \end{alignat*}$$

and, by the adjoint equation Equation 2.7a,

$$\begin{alignat*}{1} (\mathbb{P}u-u_h,\theta )_{{\Omega _h}} =&(\boldsymbol{c}\;(\boldsymbol{q}-\boldsymbol{q}_h),\boldsymbol{\Pi }\boldsymbol{\psi }-\boldsymbol{\psi })_{{\Omega _h}} -(\boldsymbol{q}-\boldsymbol{q}_h,\nabla \varphi )_{{\Omega _h}} \\ & +\langle u-\widehat{u}_h, \boldsymbol{\Pi }\boldsymbol{\psi }\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}} -\langle \mathsf{P}_{\partial }u-u_h, (\boldsymbol{\Pi }\boldsymbol{\psi }-\boldsymbol{\psi })\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}}. \end{alignat*}$$

By the orthogonality property (ii) of Proposition 2.1, we get that

$$\begin{alignat*}{1} (\mathbb{P}u-u_h,\theta )_{{\Omega _h}} =&(\boldsymbol{c}\;(\boldsymbol{q}-\boldsymbol{q}_h),\boldsymbol{\Pi }\boldsymbol{\psi }-\boldsymbol{\psi })_{{\Omega _h}} +(\boldsymbol{q}-\boldsymbol{\Pi }\boldsymbol{q},\mathsf{P}^{k-1}\nabla \varphi -\nabla \varphi )_{{\Omega _h}} \\ &-(\boldsymbol{\Pi }\boldsymbol{q}-\boldsymbol{q}_h,\nabla \varphi )_{{\Omega _h}} +\langle u-\widehat{u}_h, \boldsymbol{\Pi }\boldsymbol{\psi }\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}} \\ &-\langle \mathsf{P}_{\partial }u-u_h, (\boldsymbol{\Pi }\boldsymbol{\psi }-\boldsymbol{\psi })\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}}. \end{alignat*}$$

If we denote by $T$ the last three terms of the above right-hand side, we see that, after some simple algebraic manipulations, we can write $T=\sum _{i=1}^4 T_i$, where

$$\begin{alignat*}{1} T_1=&-\langle \widehat{u}_h-u_h,(\boldsymbol{\Pi }\boldsymbol{\psi }-\boldsymbol{\psi })\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}},\\ T_2=&-\langle \mathsf{P}_{\partial }u-u,(\boldsymbol{\Pi }\boldsymbol{\psi }-\boldsymbol{\psi })\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}},\\ T_3=&-\langle \widehat{u}_h-u,\boldsymbol{\psi }\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}},\\ T_4=&-(\boldsymbol{\Pi }\boldsymbol{q}-\boldsymbol{q}_h,\nabla \varphi )_{{\Omega _h}}. \end{alignat*}$$

By the definition of the numerical trace $\widehat{u}_h$, Equation 1.4a and Equation 1.6, we have that

$$\begin{alignat*}{1} T_1=&-\sum _{K\in {\Omega _h}}\langle \widehat{u}_h-u_h, \boldsymbol{\Pi }\boldsymbol{\psi }\cdot \boldsymbol{n}-\mathsf{P}_{\partial }\boldsymbol{\psi }\cdot \boldsymbol{n}\rangle _{e_{K}^{\tau }} \\ =&-\sum _{K\in {\Omega _h}}\tau ^{-1}\langle \boldsymbol{\Pi }\boldsymbol{q}\cdot \boldsymbol{n}-\mathsf{P}_{\partial }\boldsymbol{q}\cdot \boldsymbol{n}, \boldsymbol{\Pi }\boldsymbol{\psi }\cdot \boldsymbol{n}-\mathsf{P}_{\partial }\boldsymbol{\psi }\cdot \boldsymbol{n}\rangle _{e_{K}^{\tau }}, \end{alignat*}$$

by Proposition 2.2.

It remains to show that $T_2+T_3+T_4=0$. By the definition of the projection $\mathsf{P}_{\partial }$, Equation 1.5,

$$\begin{alignat*}{1} T_2=&-\langle \mathsf{P}_{\partial }u-u,-\boldsymbol{\psi }\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}}\\ =&-\langle \mathsf{P}_{\partial }u-u,-\boldsymbol{\psi }\cdot \boldsymbol{n}\rangle _{{\partial \Omega _D}}\\ =&\phantom {-}\langle \mathsf{P}_{\partial }u-u,\boldsymbol{\psi }\cdot \boldsymbol{n}\rangle _{{\partial \Omega _D}}. \end{alignat*}$$

By the definition of the numerical trace $\widehat{u}_h$, Equation 1.4a,

$$\begin{alignat*}{1} T_3=&-\langle \widehat{u}_h-u,\boldsymbol{\psi }\cdot \boldsymbol{n}\rangle _{{\partial \Omega }}\\ =&-\langle \widehat{u}_h-u,\boldsymbol{\psi }\cdot \boldsymbol{n}\rangle _{{\partial \Omega _D}}\\ =&-\langle \mathsf{P}_{\partial }u-u,\boldsymbol{\psi }\cdot \boldsymbol{n}\rangle _{{\partial \Omega _D}}\\ =&-T_2. \end{alignat*}$$

Next, we show that $T_4=0$. Integrating by parts, we obtain

$$\begin{alignat*}{1} T_4=&(\nabla \cdot (\boldsymbol{\Pi }\boldsymbol{q}-\boldsymbol{q}_h),\varphi )_{{\Omega _h}} -\langle (\boldsymbol{\Pi }\boldsymbol{q}-\boldsymbol{q}_h)\cdot \boldsymbol{n}, \varphi \rangle _{{\partial \Omega _h}} \\ =&(\nabla \cdot (\boldsymbol{q}-\boldsymbol{q}_h),\mathbb{P}\varphi )_{{\Omega _h}} +\langle (\boldsymbol{\Pi }\boldsymbol{q}-\boldsymbol{q})\cdot \boldsymbol{n}, \mathsf{P}_{\partial }\varphi \rangle _{{\partial \Omega _h}} \\ &-\langle (\boldsymbol{\Pi }\boldsymbol{q}-\boldsymbol{q}_h)\cdot \boldsymbol{n}, \varphi \rangle _{{\partial \Omega _h}} \end{alignat*}$$

by the identity ($\alpha$) of Corollary 3.6 with $(\boldsymbol{\sigma },\zeta ):=(\boldsymbol{q}-\boldsymbol{q}_h,\varphi )$. By the error equation Equation 3.2b with $\omega :=\mathbb{P}\varphi$,

$$\begin{alignat*}{1} T_4=&\langle \mathbb{P}\varphi , (\widehat{\boldsymbol{q}}_h-\boldsymbol{q}_h)\cdot \boldsymbol{n}\rangle _{\partial \Omega _h}+\langle (\boldsymbol{\Pi }\boldsymbol{q}-\boldsymbol{q})\cdot \boldsymbol{n}, \mathsf{P}_{\partial }\varphi \rangle _{{\partial \Omega _h}} \\ &-\langle (\boldsymbol{\Pi }\boldsymbol{q}-\boldsymbol{q}_h)\cdot \boldsymbol{n}, \varphi \rangle _{{\partial \Omega _h}} \\ =&\langle \mathsf{P}_{\partial }\varphi , (\widehat{\boldsymbol{q}}_h-\boldsymbol{q}_h)\cdot \boldsymbol{n}\rangle _{\partial \Omega _h}+\langle (\boldsymbol{\Pi }\boldsymbol{q}-\boldsymbol{q})\cdot \boldsymbol{n}, \mathsf{P}_{\partial }\varphi \rangle _{{\partial \Omega _h}} \\ & -\langle (\boldsymbol{\Pi }\boldsymbol{q}-\boldsymbol{q}_h)\cdot \boldsymbol{n}, \mathsf{P}_{\partial }\varphi \rangle _{{\partial \Omega _h}}, \end{alignat*}$$

by the definition of the projection $\mathsf{P}_{\partial }$, Equation 1.5, the definition of the projection $\mathbb{P}$, Equation 2.2b, and the definition of the numerical trace $\widehat{\boldsymbol{q}}_h$, Equation 1.4b. Hence

$$\begin{alignat*}{1} T_4=&\langle \mathsf{P}_{\partial }\varphi , (\widehat{\boldsymbol{q}}_h-\boldsymbol{q})\cdot \boldsymbol{n}\rangle _{{\partial \Omega _h}} =\langle \mathsf{P}_{\partial }\varphi , (\widehat{\boldsymbol{q}}_h-\boldsymbol{q})\cdot \boldsymbol{n}\rangle _{{\partial \Omega }} =0, \end{alignat*}$$

by the adjoint equation Equation 2.7c and the equation Equation 1.3c for $\widehat{\boldsymbol{q}}_h$.

This completes the proof.

■

3.3.3. Proof of Theorem 2.8: Superconvergence of $\widehat{u}_h$

To prove this theorem, let us begin by estimating $\|\,\mathsf{P}_{\partial }u-\widehat{u}_h\,\|^2_{L^2(e)}$ for each face $e$ of each simplex $K$. For the face $e_{K}^{\tau }$, we have that, by definition of the projection $\mathbb{P}$, Equation 2.2,

$$\begin{alignat*}{1} \|\,\mathsf{P}_{\partial }u-\widehat{u}_h \,\|_{L^2(e_{K}^{\tau })} =&\|\,\mathbb{P}u-\widehat{u}_h \,\|_{L^2(e_{K}^{\tau })} \\ \le & \|\,\mathbb{P}u-u_h \,\|_{L^2(e_{K}^{\tau })} + \|\,u_h-\widehat{u}_h \,\|_{L^2(e_{K}^{\tau })} \\ \le & \|\,\mathbb{P}u-u_h \,\|_{L^2(e_{K}^{\tau })} + C\,\frac{h^{r+1/2}_K}{\tau _K}\,|\,f\,|_{H^r(K)} \end{alignat*}$$

by Proposition 2.2 and the identity (v) of Proposition 2.1. By using a classical inverse inequality, we can conclude that

$$\begin{alignat*}{1} h^{1/2}_K\,\|\,\mathsf{P}_{\partial }u-\widehat{u}_h \,\|_{L^2(e_{K}^{\tau })}\le & C\,\left(\|\,\mathbb{P}u-u_h \,\|_{L^2(K)} +\frac{h^{r+1}_K}{\tau _K}\,|\,f\,|_{H^r(K)} \right). \end{alignat*}$$

Now we consider the error in the faces $e$ of $K$ which are different from the face $e_{K}^{\tau }$. By the error equation Equation 3.2a, we have that, for all $\boldsymbol{v}\in \boldsymbol{\mathscr{P}}^k(K)$,

$$\begin{alignat*}{1} \langle \widehat{u}_h-\mathsf{P}_{\partial }u, \boldsymbol{v}\cdot \boldsymbol{n}\rangle _{{\partial K}\setminus e_{K}^{\tau }} =&(\boldsymbol{c}\,(\boldsymbol{q}-\boldsymbol{q}_h), \boldsymbol{v})_K-(\mathbb{P}u-u_h, \nabla \cdot \boldsymbol{v})_K \\ &-\langle \widehat{u}_h-\mathsf{P}_{\partial }u, \boldsymbol{v}\cdot \boldsymbol{n}\rangle _{e_{K}^{\tau }}. \end{alignat*}$$

Taking $\boldsymbol{v}:=\boldsymbol{Z}$ given by Lemma 3.2 with $z= \widehat{u}_h- \mathsf{P}_{\partial }u$, we obtain that

$$\begin{alignat*}{1} \|\, \widehat{u}_h- \mathsf{P}_{\partial }u \,\|_{L^2({\partial K}\setminus e_{K}^{\tau })} \le C\,\left(\right.& h_K^{1/2}\,\|\,\boldsymbol{q}-\boldsymbol{q}_h \,\|_{\boldsymbol{L}^2(K)} +h_K^{-1/2} \|\, \mathbb{P}u-u_h \,\|_{L^2(K)} \\ & + \|\,\widehat{u}_h- \mathsf{P}_{\partial }u \,\|_{L^2(e_{K}^{\tau })}\left.\right), \end{alignat*}$$

and using the estimate for the error in $e_{K}^{\tau }$,

$$\begin{alignat*}{1} h_K^{1/2}\|\, \widehat{u}_h- \mathsf{P}_{\partial }u \,\|_{L^2({\partial K}\setminus e_{K}^{\tau })} \le C\,\left(\right.&\|\, \mathbb{P}u-u_h \,\|_{L^2(K)} + h_K\,\|\,\boldsymbol{q}-\boldsymbol{q}_h \,\|_{\boldsymbol{L}^2(K)}^2\\ & + \frac{h^{r+1}_K}{\tau _K}\,|\,f\,|_{H^r(K)}\left.\right). \end{alignat*}$$

As a consequence, we get

$$\begin{alignat*}{1} \|\,\mathsf{P}_{\partial }u-\widehat{u}_h\,\|_{L^2(\mathscr{E}_h;h)} \le \;C\; (&\|\,\mathbb{P}u-u_h\,\|_{L^2({\Omega _h})} +h \|\,\boldsymbol{q}-\boldsymbol{q}_h\,\|_{\boldsymbol{L}^2({\Omega _h})} \\ &+\kappa \,h^{r+2}\,|\,f\,|_{H^r({\Omega _h})}), \end{alignat*}$$

where $\kappa :=\max _{K\in {\Omega _h}}1/(\tau _K\,h_K)$. The result now follows from Theorems 2.6, 2.5 and 2.4 (i). This completes the proof of Theorem 2.8.

3.3.4. Proof of Theorem 2.9: The error estimate for ${u}^\star _h$

By the definition of ${u}^\star _h$, Equation 2.9a, we have that

$$\begin{alignat*}{1} \|\, u-{u}^\star _h\,\|_{L^2(K)} \le & \|\, \overline{u}-\overline{u}_h\,\|_{L^2(K)} +\|\,\tilde{u}-\tilde{u}_h \,\|_{L^2(K)}, \end{alignat*}$$

where $\overline{u}$ is defined in Equation 2.9b and $\tilde{u}=u-\overline{u}$. We estimate each of the two terms of the right-hand side separately.

We begin by estimating the second term. Since, by Poincaré’s inequality, we have

$$\begin{equation*} \|\, \tilde{u}-\tilde{u}_h \,\|_{L^2(K)} \le C\, h_K\, \|\,\nabla (\tilde{u}-\tilde{u}_h) \,\|_{L^2(K)}, \end{equation*}$$

it is enough to estimate the error in the gradient. To do that, we note that, by the definition of $\tilde{u}_h$, Equation 2.9c, we have

$$\begin{equation*} (\boldsymbol{a}\;\nabla (\tilde{u}-\tilde{u}_h), \nabla w)_K=- \langle w, (\boldsymbol{q}-\widehat{\boldsymbol{q}}_h)\cdot \boldsymbol{n}\rangle _{{\partial K}} \qquad \forall w \in \mathscr{P}_0^{k+1}(K). \end{equation*}$$

Then

$$\begin{alignat*}{1} \|\,\nabla (\mathsf{P}^{k+1}\tilde{u}-\tilde{u}_h)\,\|_{L^2(K;\boldsymbol{a})}^2 =& (\boldsymbol{a}\;\nabla (\tilde{u}-\tilde{u}_h), \nabla (\mathsf{P}^{k+1}\tilde{u}-\tilde{u}_h))_K \\ &+(\boldsymbol{a}\;\nabla (\mathsf{P}^{k+1}\tilde{u}-\tilde{u}), \nabla (\mathsf{P}^{k+1}\tilde{u}-\tilde{u}_h))_{ K} \\ =& -\langle \mathsf{P}^{k+1}\tilde{u}-\tilde{u}_h, (\boldsymbol{q}-\widehat{\boldsymbol{q}}_h)\cdot \boldsymbol{n}\rangle _{{\partial K}} \\ &+ (\boldsymbol{a}\nabla (\mathsf{P}^{k+1}\tilde{u}-\tilde{u}), \nabla (\mathsf{P}^{k+1}\tilde{u}-\tilde{u}_h))_{ K}. \end{alignat*}$$

Let us estimate the first term of the right-hand side. For any arbitrary $\omega \in \mathscr{P}^{k+1}_0(K)$, we have

$$\begin{alignat*}{1} \langle \omega , (\boldsymbol{q}-\widehat{\boldsymbol{q}}_h)\cdot \boldsymbol{n}\rangle _{{\partial K}} &=\langle \omega , (\boldsymbol{q}-{\boldsymbol{q}}_h)\cdot \boldsymbol{n}\rangle _{{\partial K}} + \langle \omega , (\boldsymbol{q}_h-\widehat{\boldsymbol{q}}_h)\cdot \boldsymbol{n}\rangle _{{\partial K}}\\ &=\sum _{i=1}^3 T_i, \end{alignat*}$$

where

$$\begin{alignat*}{1} T_1=&(\nabla \omega , \boldsymbol{q}-{\boldsymbol{q}}_h)_K,\\ T_2=&(\omega ,\nabla \cdot (\boldsymbol{q}-\boldsymbol{q}_h))_K,\\ T_3=&\langle \omega , (\boldsymbol{q}_h-\widehat{\boldsymbol{q}}_h)\cdot \boldsymbol{n}\rangle _{{\partial K}}. \end{alignat*}$$

By using Cauchy-Schwarz inequality, we get that

$$\begin{alignat*}{1} T_1\le &\|\,\nabla \omega \,\|_{L^2(K;\boldsymbol{a})}\,\|\,\boldsymbol{q}-{\boldsymbol{q}}_h\,\|_{L^2(K;\boldsymbol{c})}. \end{alignat*}$$

By using the definition of the Raviart-Thomas projection $\boldsymbol{\Pi }^{{\scriptscriptstyle {\mathrm{RT}}}}$, Equation 2.3, and by using its commutativity property, we get that, for any $r\in [0,k]$,

$$\begin{alignat*}{1} T_2=&(\omega , f-\mathsf{P}f)_K + (\omega ,\nabla \cdot (\boldsymbol{\Pi }^{{\scriptscriptstyle {\mathrm{RT}}}}\boldsymbol{q}-\boldsymbol{q}_h))_K\\ \le &\|\,\omega \,\|_{L^2(K)}\,\left(h_K^r\,|\,f\,|_{H^r(K)}+h_K^{-1}\|\,\boldsymbol{q}-\boldsymbol{q}_h\,\|_{L^2(K;\boldsymbol{c})}\right) \\ \le &\|\,\nabla \omega \,\|_{L^2(K;\boldsymbol{a})}\,\left(h_K^{r+1}\,|\,f\,|_{H^r(K)}+\|\,\boldsymbol{q}-\boldsymbol{q}_h\,\|_{L^2(K;\boldsymbol{c})}\right) \end{alignat*}$$

by Poincaré’s inequality. Finally, by the definition of the numerical trace $\widehat{\boldsymbol{q}}_h$, Equation 1.4b,

$$\begin{alignat*}{1} T_3=&\|\,\omega \,\|_{L^2(e_{K}^{\tau })}\,\|\,(\boldsymbol{q}_h-\widehat{\boldsymbol{q}}_h)\cdot \boldsymbol{n}\,\|_{L^2(e_{K}^{\tau })} \\ \le &\,C\,\|\,\omega \,\|_{L^2(e_{K}^{\tau })}\, h_K^{r+1/2}\,|\,f\,|_{H^r(K)}, \end{alignat*}$$

by Proposition 2.2 and identity (v) of Proposition 2.1. Applying a simple inverse inequality, we get

$$\begin{alignat*}{1} T_3 \le &\,C\,\|\,\nabla \omega \,\|_{L^2(K;\boldsymbol{a})}\, h_K^{r+1}\,|\,f\,|_{H^r(K)}. \end{alignat*}$$

As a consequence,

$$\begin{alignat*}{1} \langle \omega , (\boldsymbol{q}-\widehat{\boldsymbol{q}}_h)\cdot \boldsymbol{n}\rangle _{{\partial K}} \le C\,\|\,\nabla \omega \,\|_{L^2(K;\boldsymbol{a})}\,\left(\|\,\boldsymbol{q}-{\boldsymbol{q}}_h\,\|_{L^2(K;\boldsymbol{c})} + h_K^{r+1}\,|\,f\,|_{H^r(K)}\right). \end{alignat*}$$

This implies that

$$\begin{alignat*}{1} \|\, \nabla (\mathsf{P}^{k+1} \tilde{u} -\tilde{u}_h)\,\|_{L^2(K;\boldsymbol{a})} \le & \|\, \nabla ( \mathsf{P}^{k+1} \tilde{u}-\tilde{u})\,\|_{L^2(K)} \\ &+C\,\left(\|\,\boldsymbol{q}-{\boldsymbol{q}}_h\,\|_{L^2(K;\boldsymbol{c})} + h_K^{r+1}\,|\,f\,|_{H^r(K)}\right), \end{alignat*}$$

and so,

$$\begin{alignat*}{1} \|\, \mathsf{P}^{k+1} \tilde{u} -\tilde{u}_h\,\|_{L^2({\Omega _h};\boldsymbol{a})} \le & Ch\,\|\, \nabla ( \mathsf{P}^{k+1} \tilde{u}-\tilde{u})\,\|_{L^2({\Omega _h})} \\ &+C\,\left(\,h\,\|\,\boldsymbol{q}-{\boldsymbol{q}}_h\,\|_{L^2({\Omega _h};\boldsymbol{c})} + h^{r+2}\,|\,f\,|_{H^r({\Omega _h})}\right)\\ \le & C\,h^{r+2}\,\left(|\,\boldsymbol{q}\,|_{\boldsymbol{H}^{r+1}({\Omega _h})}+|\,u\,|_{H^{r+2}({\Omega _h})}\right), \end{alignat*}$$

by Theorem 2.5 and the well-known approximation properties of $\mathsf{P}^{k+1}$.

Let us now estimate the error $\overline{u}-\overline{u}_h$. We begin by considering the case $k\ge 1$. In this case, since $\overline{u}-\overline{u}_h=\overline{\mathbb{P}(u-u_h)}$, we get

$$\begin{equation*} \|\, \overline{u}-\overline{u}_h \,\|_{L^2(K)} \le \|\, \mathbb{P}u-u_h\,\|_{L^2(K)} \le C\,\mathfrak{C}_\kappa ^{r,0}(\boldsymbol{q})\,h^{r+2} \end{equation*}$$

by Theorem 2.6. Note that by Theorem 2.4, $\mathsf{P}^{k-1}u_h$ is independent of the value of the local stabilization parameters $\tau$. This implies that the same is true for $\overline{u}_h$ and so, we get that

$$\begin{equation*} \|\, \overline{u}-\overline{u}_h \,\|_{L^2(K)} \le C\,\mathfrak{C}_0^{r,0}(\boldsymbol{q})\,h^{r+2}. \end{equation*}$$

It remains to consider the case $k=0$ and $f=0$. We have that

$$\begin{alignat*}{1} \overline{u}-\overline{u}_h =&\frac{1}{|K|}\int _K u(x)\,dx-\frac{1}{d}\sum _{e\in {\partial K}}\widehat{u}_h|_e\\ =&\frac{1}{|K|}\int _K (u-\mathsf{P}^1 u)(x)\,dx \\ &+\frac{1}{|K|}\int _K \mathsf{P}^1 u(x)\,dx -\frac{1}{d}\sum _{e\in {\partial K}}\frac{1}{|e|}\int _e \mathsf{P}_{\partial }(\mathsf{P}^1 u)\,d\gamma \\ &-\frac{1}{d}\sum _{e\in {\partial K}}\frac{1}{|e|}\int _e (\mathsf{P}_{\partial }(u-\mathsf{P}^1 u))\,d\gamma \\ &-\frac{1}{d}\sum _{e\in {\partial K}}\frac{1}{|e|}\int _e (\widehat{u}_h-\mathsf{P}_{\partial }u)\,d\gamma . \end{alignat*}$$

Since, for any function $\omega \in \mathscr{P}^1(K)$, we have that

$$\begin{equation*} \frac{1}{|K|}\int _K \omega (x)\,dx -\frac{1}{d}\sum _{e\in {\partial K}}\frac{1}{|e|}\int _e \mathsf{P}_{\partial }\omega \,d\gamma =0, \end{equation*}$$

we readily obtain that

$$\begin{alignat*}{1} \|\, \overline{u}-\overline{u}_h \,\|_{L^2(K)} \le &C\,\|\, u-\mathsf{P}^1 u \,\|_{L^2(K)} + C\,h_K\,|\, u-\mathsf{P}^1 u \,|_{H^1(K)} \\ &+ C\,h_k\,\|\,\widehat{u}_h-\mathsf{P}_{\partial }u\,\|_{L^2({\partial K})}, \end{alignat*}$$

and so,

$$\begin{alignat*}{1} \|\, \overline{u}-\overline{u}_h \,\|_{L^2({\Omega _h})} \le &C\,h^2\,\left(\mathcal{C}_\kappa (\boldsymbol{q})+ |\, u \,|_{H^2({\Omega _h})}\right). \end{alignat*}$$

Since, by Theorem 2.4, $\lambda _h$ is independent of the value of the local stabilization parameter $\tau$, so is $\overline{u}_h$ and so

$$\begin{alignat*}{1} \|\, \overline{u}-\overline{u}_h \,\|_{L^2(K)} \le &C\,h^2\,\left(\mathcal{C}_0(\boldsymbol{q})+ |\, u \,|_{H^2({\Omega _h})}\right). \end{alignat*}$$

This completes the proof of Theorem 2.9.

4. Numerical experiments

In this section, we carry out numerical experiments to validate the theoretical convergence properties of the $\text{SF-H}_k$ method.

To do that, we use uniform meshes obtained by discretizing $\Omega =(-{\frac{1}{2}},{\frac{1}{2}})\times (-{\frac{1}{2}},{\frac{1}{2}})$ with squares of side $2^{-l}$ which are then divided into two triangles as indicated in Figure 1; the resulting mesh is denoted by “mesh=$l$”.

The test problem is obtained by taking ${\partial \Omega _N}=\emptyset , \boldsymbol{c}=\boldsymbol{I}$ and choosing $g$ and $f$ so that the exact solution is $u(x,y)=\cos (\pi x) \cos (\pi y)$ on the domain $\Omega$. The history of convergence of the SF-H method with

$$\begin{equation*} \tau _K=1/h=2^{l}, \end{equation*}$$

on the “mesh=$l$”, is displayed in Table 5 for polynomials of degree $k=0$, $k=1$ and $k=2$. We observe optimal convergence rates of the quantities $\|u-u_h\|_{L^2(\Omega )}$ and $\|q-q_h\|_{L^2(\Omega )}$ for $k=0,1,2$ as predicted by Theorems 2.5 and 2.7. We also see that $\|P_{\partial } u-\lambda _h\|_{L^2(\mathcal{E}_h;h)}$ and $\|u-u_h^*\|_{L^2(\Omega )}$ superconverges with rate $O(h^{k+2})$ for $k=1,2$ just as predicted by Theorems 2.8 and 2.9. These results do not guarantee that these quantities are superconvergent if $k=0$ and $f \not \equiv 0$. Since we do not observe superconvergence, we can conclude that the theoretical results for such a case are actually sharp.

Next we explore the effect of the size of $\tau _K$ on the quality of the approximation. In Table 6, we see that as $\tau$ diminishes the quality of the approximation to $u$ deteriorates. However, the effect of taking $\tau =1/h^2$ or $\tau =1/h$ is almost negligible especially when the grids are not coarse. We also see that the order of convergence is $k+1$ for $\tau =1/h^2, \tau =1/h$ and $\tau =1$, but it is only $k$ for $\tau =h$. This is in perfect agreement with Corollary 2.7.

We end with an example where the exact solution is harmonic, that is, $\boldsymbol{c}=\boldsymbol{I}, f \equiv 0$, and display the convergence rates for $k=0$ in Table 7. We take ${\partial \Omega _N}=\emptyset$ and choose $g$ and so that $u(x,y)= e^x \sin (y)$ is the solution. We see that the quantities $\|P_{\partial } u-\lambda _h\|_{L^2(\mathcal{E}_h;h)}$ and $\|u-u_h^*\|_{L^2(\Omega )}$ superconverge with the rate $O(h^{2})$ as our theoretical results predict.

5. Concluding remarks

The error analysis carried out here for the $\text{SF-H}_k$ method also holds for the hybridized versions of the RT$_K$ and the BDM$_k$ methods. We simply have to replace the local space $\boldsymbol{\mathscr{P}}^{k}(K)\times \mathscr{P}^k(K)$ by the local space $\boldsymbol{V}(K)\times W(K)$ given by Table 1, use the definition of the local stabilization parameter $\tau$ given in Table 2, and suitably define the projection $(\boldsymbol{\Pi },\mathbb{P})$. Indeed, with such changes, the first four properties of Proposition 2.1, on which the whole analysis is based, hold. For this reason, we can consider this analysis to be a unifying analysis of these three methods.

A study of the optimal way to choose the local stabilization parameter $\tau$ falls beyond the scope of this paper and will be carried out elsewhere. Extensions of these results to more general second-order elliptic equations and other boundary conditions are straightforward. The extension of these results to the case of hanging nodes, variable-degree approximations and curved domains constitute the subject of ongoing work.

	mesh	$\\|u-u_h\\|_{L^2(\Omega )}$		$\\|q-q_h\\|_{L^2(\Omega )}$		$\\|P_{\partial } u-\lambda _h\\|_{L^2(\mathcal{E}_h;h)}$		$\\|u-u_h^\star \\|_{L^2(\Omega )}$
$k$	$\ell$	error	order	error	order	error	order	error	order

	1	.11e+1	-	.17e+1	-	.28e-0	-	.22e-0	-
	2	.36e-0	1.54	.78e-0	1.12	.92e-1	1.61	.57e-1	1.96
0	3	.12e-0	1.50	.41e-0	0.94	.35e-1	1.37	.19e-1	1.57
	4	.53e-1	1.29	.21e-0	0.97	.14e-1	1.21	.79e-2	1.27
	5	.24e-1	1.13	.10e-0	0.98	.69e-2	1.15	.36e-2	1.14
	6	.12e-1	1.05	.53e-1	0.99	.32e-2	1.12	.17e-2	1.10


	1	.21e-0	-	.23e-0	-	.31e-1	-	.21e-1	-
	2	.43e-1	2.27	.12e-0	0.94	.75e-2	2.02	.40e-2	2.38
1	3	.78e-2	2.47	.31e-1	1.94	.10e-2	2.96	.53e-3	2.91
	4	.17e-2	2.19	.79e-2	1.99	.12e-3	3.00	.68e-4	2.98
	5	.42e-3	2.05	.20e-2	2.00	.15e-4	3.01	.85e-5	2.99
	6	.10e-3	2.01	.50e-3	2.00	.19e-5	3.00	.11e-6	2.99


	1	.68e-1	-	.89e-1	-	.72e-2	-	.81e-2	-
	2	.38e-2	4.12	.91e-2	3.29	.41e-3	4.12	.40e-3	4.35
2	3	.32e-3	3.58	.12e-2	2.96	.27e-4	3.93	.25e-4	3.97
	4	.32e-4	3.31	.15e-3	2.98	.18e-5	3.96	.16e-5	3.99
	5	.37e-5	3.12	.19e-4	2.99	.11e-6	3.98	.10e-6	4.00
	6	.45e-7	3.01	.23e-5	3.00	.70e-8	3.99	.63e-8	4.00

	mesh	$\tau =1/h^2$		$\tau =1/h$		$\tau =1$		$\tau =h$
$k$	$\ell$	error	order	error	order	error	order	error	order

	1	.61e+0	-	.11e+1	-	.21e+1	-	.40e+1	-
	2	.21e+0	1.50	.36e-0	1.54	.11e+1	0.88	.42e+1	-0.06
0	3	.95e-1	1.17	.12e-0	1.50	.57e-0	0.97	.43e+1	-0.02
	4	.46e-1	1.04	.53e-1	1.29	.28e-0	1.00	.43e+1	-0.00
	5	.23e-1	1.02	.24e-1	1.13	.14e-0	1.01	.43e+1	0.00
	6	.11e-1	1.01	.12e-1	1.05	.70e-1	1.01	.43e+1	0.00


	1	.15e+0	-	.21e-0	-	.35e-0	-	.67e-0	-
	2	.27e-1	2.47	.43e-1	2.27	.14e-0	1.34	.54e-0	0.29
1	3	.66e-2	2.06	.78e-2	2.47	.35e-1	1.98	.28e-0	0.97
	4	.16e-2	2.00	.17e-2	2.19	.88e-2	2.00	.14e-0	0.99
	5	.41e-3	2.00	.42e-3	2.05	.22e-2	2.00	.67e-1	1.00
	6	.10e-3	2.00	.10e-3	2.01	.60e-3	2.00	.35e-1	1.00


	1	.33e-1	-	.68e-1	-	.14e-0	-	.28e-0	-
	2	.19e-2	4.10	.38e-2	4.12	.14e-1	3.31	.56e-1	2.33
2	3	.23e-3	3.07	.32e-3	3.58	.18e-2	2.96	.14e-1	1.98
	4	.28e-4	3.00	.32e-4	3.31	.23e-3	2.99	.35e-2	2.00
	5	.36e-5	3.00	.37e-5	3.12	.28e-4	3.00	.88e-3	2.00
	6	.45e-6	3.00	.45e-7	3.01	.35e-5	3.00	.22e-3	2.00

	mesh	$\\|u-u_h\\|_{L^2(\Omega )}$		$\\|q-q_h\\|_{L^2(\Omega )}$		$\\|P_{\partial } u-\lambda _h\\|_{L^2(\mathcal{E}_h;h)}$		$\\|u-u_h^\star \\|_{L^2(\Omega )}$
$k$	$\ell$	error	order	error	order	error	order	error	order

	1	.17e-0	-	.22e-0	-	.29e-1	-	.23e-1	-
	2	.87e-1	0.94	.11e-0	0.96	.79e-2	1.84	.62e-2	1.87
0	3	.44e-1	0.99	.57e-1	0.98	.21e-2	1.90	.16e-2	1.93
	4	.22e-1	0.99	.29e-1	0.99	.55e-3	1.96	.41e-3	1.97
	5	.11e-1	1.00	.14e-1	1.00	.10e-3	1.98	.10e-3	1.99
	6	.55e-2	1.00	.72e-2	1.00	.35e-4	2.00	.26e-4	2.00

A superconvergent LDG-hybridizable Galerkin method for second-order elliptic problems

Abstract

1. Introduction

2. The main results

2.1. The projection $(\boldsymbol{\Pi },\mathbb{P})$

2.2. Characterization of the approximate solution

2.3. A priori error estimates

2.4. Postprocessing

3. Proofs

3.1. Proof of Proposition 2.1: The properties of $(\boldsymbol{\Pi }, \mathbb{P})$

3.1.1. Two key auxiliary results about polynomials

3.1.2. Proof of the orthogonality properties

3.1.3. Proof of the weak commutativity property

3.1.4. Proof of the estimates (v) and (vi)

3.1.5. Proof of the estimate (vii)

3.2. Characterization of the approximate solution

3.2.1. Two auxiliary results about the local solvers

3.2.2. Proof of Theorem 2.3: Characterization of the approximate solution

3.2.3. Proof of Proposition 2.2: Characterization of the jumps

3.2.4. Proof of Theorem 2.4

3.3. Proof of the error estimates

3.3.1. Proof of Theorem 2.5: The error in the flux

3.3.2. Proof of Theorem 2.6: Superconvergence of $u_h$

3.3.3. Proof of Theorem 2.8: Superconvergence of $\widehat{u}_h$

3.3.4. Proof of Theorem 2.9: The error estimate for ${u}^\star _h$

4. Numerical experiments

5. Concluding remarks

Table of Contents

Figures

Mathematical Fragments

References

Article Information

Settings

method	$\boldsymbol{V}(K)$	$W(K)$	$M(e)$
RT$_k$	$\boldsymbol{\mathscr{P}}^k(K) \oplus \boldsymbol{x}\,\mathscr{P}^k(K)$	$\mathscr{P}^k(K)$	$\mathscr{P}^k(e)$
LDG-H$_k$	$\boldsymbol{\mathscr{P}}^k(K)$	$\mathscr{P}^k(K)$	$\mathscr{P}^k(e)$
BDM$_k$	$\boldsymbol{\mathscr{P}}^k(K)$	$\mathscr{P}^{k-1}(K)$	$\mathscr{P}^k(e)$

method	$\tau \|_{{\partial K}}$
RT$_k$	$\equiv 0$
LDG-H$_k$	$\ge 0, \not \equiv 0$
BDM$_k$	$\equiv 0$

method	$\\|\,\boldsymbol{q}-\boldsymbol{q}_h\,\\|_{L^2({\Omega _h})}$	$\\|\, u-u_h\,\\|_{L^2({\Omega _h})}$	condition
RT$_k$	$k+1$	$k+1$	$k\ge 0$
LDG-H$_k$ [5]	$k$	$k+1$	$k\ge 1$ and $\tau =\mathcal{O}(1/h)$
LDG-H$_k$ [5]	$k+1/2$	$k+1$	$k\ge 0$ and $\tau =\mathcal{O}(1)$
$\mathop{\text{SF-H}}_k$	$k+1$	$k+1$	$k\ge 0$
BDM$_k$	$k+1$	$k$	$k\ge 1$