1 Introduction

Given a bounded domain \(\Omega \) in \(\mathbb {R}^N, N\ge 2\), with Lipschitz boundary \(\partial \Omega \), we study a double phase implicit obstacle problem with a multivalued operator given in the form

$$\begin{aligned} \begin{aligned} -\mathrm{div}\big (|\nabla u|^{p-2}\nabla u+\mu (x)|\nabla u|^{q-2}\nabla u\big )+\partial j(x,u)&\ni f(x)\quad&\text {in } \Omega ,\\ u&= 0&\text {on } \partial \Omega ,\\ T(u)&\le U(u), \end{aligned} \end{aligned}$$
(1.1)

where \(1<p<q<N, \mu :\overline{\Omega }\rightarrow [0,\infty )\) and \(T,U:W_0^{1,\mathcal H}(\Omega )\rightarrow \mathbb {R}\) are given functions satisfying appropriate conditions (see Sect. 2). Here \(W_0^{1,\mathcal H}(\Omega )\) is a subspace of the Sobolev–Musielak–Orlicz space \(W^{1,\mathcal H}(\Omega )\) and \(j:\Omega \times \mathbb {R}\rightarrow \mathbb {R}\) is supposed to be locally Lipschitz with respect to the second variable.

In this paper we prove the existence of at least one weak solution (see Definition (3.4)) of problem (1.1) by applying a surjectivity theorem for multivalued mappings, Kluge’s fixed point principle and tools from nonsmooth analysis. In general, problem (1.1) combines several interesting phenomena like a double phase operator along with a multivalued mapping in form of Clarke’s generalized gradient and an implicit obstacle given by the functions \(T:W_0^{1,\mathcal H}(\Omega )\rightarrow \mathbb {R}\) and \(U:W_0^{1,\mathcal H}(\Omega )\rightarrow (0,+\infty )\), see H(T) and H(U) in Sect. 3 for the precise conditions. Indeed, a solution \(u \in W_0^{1,\mathcal H}(\Omega )\) of (1.1) has to belong to K(u) which is the image of the multivalued map \(K:W_0^{1,\mathcal H}(\Omega )\rightarrow 2^{W_0^{1,\mathcal H}(\Omega )}\) defined by

$$\begin{aligned} K(u):=\left\{ v\in W_0^{1,\mathcal H}(\Omega )\ \mid \ T(v)-U(u)\le 0\right\} . \end{aligned}$$

To the best of our knowledge, this is the first work which combines a double phase phenomena along with Clarke’s generalized gradient and an implicit obstacle. A main tool in our treatment will be a surjectivity result of Le [24] for multivalued mappings generated by the sum of a maximal monotone multivalued operator and a bounded multivalued pseudomonotone mapping.

One difficulty in the study of (1.1) is the occurrence of the so-called double phase operator defined by

$$\begin{aligned} -\mathrm{div}\left( |\nabla u|^{p-2}\nabla u+\mu (x)|\nabla u|^{q-2}\nabla u\right) , \quad u \in W_0^{1,\mathcal H}(\Omega ), \end{aligned}$$

where \(1<p<q<N\). Although this operator looks like the (pq)-differential operator the difference is the weight function \(\mu :\overline{\Omega }\rightarrow [0,\infty )\) which can have values in zero. That means we cannot search for weak solutions in the usual Sobolev space \(W^{1,p}_0(\Omega )\), we need a certain type of a Sobolev–Musielak–Orlicz space equipped with the Luxemburg norm, see Sect. 2 for its definition. The idea to treat problems driven by the double phase operators goes back to the 1980s and the work of Zhikov [36] who introduced such classes of operators to describe models of strongly anisotropic materials by treating the functional

$$\begin{aligned} \omega \mapsto \int \left( |\nabla \omega |^p+\mu (x)|\nabla \omega |^q\right) \,dx, \end{aligned}$$
(1.2)

see also Zhikov [37, 38] and the monograph of Zhikov–Kozlov–Oleinik [39]. Integral functionals of the form (1.2) have been considered by several authors concerning regularity results and non-standard growth, see for example, Baroni–Colombo–Mingione [4, 5, 7], Baroni–Kussi–Mingione [6], Cupini–Marcellini–Mascolo [16], Colombo–Mingione [14, 15], Marcellini [27, 28] and the references therein.

In the case of single-valued equations in the whole space we refer to the works of Colasuonno–Squassina [13], Gasiński–Papageorgiou [17, Proposition 3.4], Gasiński–Winkert [20, 21], Liu–Dai [26], Perera-Squassina [33] concerning existence and multiplicity results.

Works which are closely related to our paper dealing with certain types of double phase problems or multivalued problems can be found in Bahrouni–Rădulescu–Repovš [1], Bahrouni–Rădulescu–Winkert [2, 3], Carl–Le-Motreanu [9], Cencelj–Rădulescu–Repovš [10], Clarke [12], Gasiński–Papageorgiou [18, 19], Papageorgiou–Rădulescu–Repovš [30, 31], Rădulescu [34], Zhang–Rădulescu [35] and the references therein.

The paper is organized as follows. In Sect. 2 we recall the definition of the used function spaces, some embedding results and we state the surjectivity results of Le [24] for multivalued mappings as well as Kluge’s fixed point theorem. In Sect. 3 we present the full assumptions on the data of problem (1.1), give the definition of the weak solution and consider an auxiliary problem defined in (3.7). Next, we prove some properties of the solution set of (3.7) stated as Theorem 3.6 whose proof is mainly based on tools from nonsmooth analysis in terms of multivalued mappings. Taking these results into account we are able to prove our main result which says that the solution set of (1.1) is nonempty, bounded and weakly closed in \(W_0^{1,\mathcal H}(\Omega )\), see Theorem 3.5.

2 Preliminaries

In the whole paper we suppose that \(\Omega \) is a bounded domain in \(\mathbb {R}^N\). Given \(1\le r\le \infty , L^r(\Omega )\) and \(L^r(\Omega ;\mathbb {R}^N)\) stand for the usual Lebesgue spaces equipped with the norm \(\Vert \cdot \Vert _p\) while \(W^{1,r}(\Omega )\) and \(W^{1,r}_0(\Omega )\) denote the Sobolev spaces endowed with the norms \(\Vert \cdot \Vert _{1,r}\) and \(\Vert \cdot \Vert _{1,r,0}\), respectively. By \(r'\), we denote the conjugate of \(r \in (1,\infty )\), that is, \(\frac{1}{r}+\frac{1}{r'}=1\).

For the weight function in (1.1) we suppose the following condition:

\(\hbox {H}(\mu ):\):

\(\mu :\overline{\Omega }\rightarrow [0,\infty )\) is Lipschitz continuous and \(1<p<q<N\) are chosen such that

$$\begin{aligned} \frac{q}{p}<1+\frac{1}{N}. \end{aligned}$$

Set \(\mathbb {R}_+:=[0,\infty )\) and consider the modular function \(\mathcal H:\Omega \times \mathbb {R}_+\rightarrow \mathbb {R}_+\) given by

$$\begin{aligned} \mathcal H(x,t)=t^p+\mu (x)t^q \quad \text {for all }(x,t)\in \Omega \times \mathbb {R}_+. \end{aligned}$$

The Musielak–Orlicz space \(L^\mathcal H(\Omega )\) is defined by

$$\begin{aligned} L^\mathcal {H}(\Omega )=\left\{ u ~ \Big | ~ u: \Omega \rightarrow \mathbb {R}\text { is measurable and } \rho _{\mathcal {H}}(u):=\int _{\Omega }\mathcal {H}(x,|u|)\,dx< +\infty \right\} \end{aligned}$$

equipped with the Luxemburg norm

$$\begin{aligned} \Vert u\Vert _{\mathcal {H}} = \inf \left\{ \tau >0 \ \big | \ \rho _{\mathcal {H}}\left( \frac{u}{\tau }\right) \le 1 \right\} . \end{aligned}$$

The space \(L^\mathcal H(\Omega )\) is uniformly convex and so a reflexive Banach space. Furthermore, we introduce the seminormed function space \(L_\mu ^q(\Omega )\)

$$\begin{aligned} L^q_\mu (\Omega )=\left\{ u ~ \Big | ~ u: \Omega \rightarrow \mathbb {R}\text { is measurable and } \int _{\Omega }\mu (x) |u|^q \,dx< +\infty \right\} \end{aligned}$$

endowed with the seminorm

$$\begin{aligned} \Vert u\Vert _{q,\mu } = \left( \int _{\Omega }\mu (x) |u|^q \,dx \right) ^{\frac{1}{q}}. \end{aligned}$$

From Colasuonno–Squassina [13, Proposition 2.15 (i), (iv) and (v)] we know that the embeddings

$$\begin{aligned} L^{q}(\Omega ) \hookrightarrow L^\mathcal {H}(\Omega ) \hookrightarrow L^{p}(\Omega ) \cap L^q_\mu (\Omega ) \end{aligned}$$

are continuous and so, by a simple calculation, we have

$$\begin{aligned} \min \left\{ \Vert u\Vert _\mathcal {H}^p,\Vert u\Vert _\mathcal {H}^q \right\} \le \Vert u\Vert _p^p+\Vert u\Vert ^q_{q,\mu } \le \max \left\{ \Vert u\Vert _\mathcal {H}^p,\Vert u\Vert _\mathcal {H}^q\right\} \end{aligned}$$
(2.1)

for all \(u\in L^\mathcal {H}(\Omega )\).

The corresponding Sobolev–Musielak–Orlicz space \(W^{1,\mathcal {H}}(\Omega )\) is defined by

$$\begin{aligned} W^{1,\mathcal {H}}(\Omega )= \left\{ u \in L^\mathcal {H}(\Omega ) : |\nabla u| \in L^{\mathcal {H}}(\Omega ) \right\} \end{aligned}$$

and is equipped with the norm

$$\begin{aligned} \Vert u\Vert _{1,\mathcal {H}}= \Vert \nabla u \Vert _{\mathcal {H}}+\Vert u\Vert _{\mathcal {H}}, \end{aligned}$$

where \(\Vert \nabla u\Vert _\mathcal {H}=\Vert |\nabla u|\Vert _{\mathcal {H}}\).

The Sobolev–Musielak–Orlicz space with zero traces, denoted by \(W_0^{1,\mathcal H}(\Omega )\), is the completion of \(C_0^\infty (\Omega )\) in \(W^{1,\mathcal H}(\Omega )\), that is,

$$\begin{aligned} W_0^{1,\mathcal H}(\Omega )=\overline{C^\infty _0(\Omega )}^{W^{1,\mathcal H}(\Omega )}. \end{aligned}$$

From the assumption on \(\mu :\overline{\Omega }\rightarrow \mathbb {R}_+\) in H(\(\mu \)) combined with Colasuonno–Squassina [13, Proposition 2.18], it is known that

$$\begin{aligned} \Vert u\Vert _{1,\mathcal {H},0}=\Vert \nabla u\Vert _{\mathcal {H}} \quad \text {for all }u\in W_0^{1,\mathcal H}(\Omega ), \end{aligned}$$

defines an equivalent norm on \(W_0^{1,\mathcal H}(\Omega )\). Based on this we obtain directly from (2.1) that

$$\begin{aligned} \min \left\{ \Vert u\Vert _{1,\mathcal {H},0}^p,\Vert u\Vert _{1,\mathcal {H},0}^q \right\} \le \Vert \nabla u\Vert _p^p+\Vert \nabla u\Vert ^q_{q,\mu } \le \max \left\{ \Vert u\Vert _{1,\mathcal {H},0}^p,\Vert u\Vert _{1,\mathcal {H},0}^q\right\} \end{aligned}$$
(2.2)

is true for all \(u\in W^{1,\mathcal {H}}_0(\Omega )\). Moreover, both spaces \(W^{1,\mathcal {H}}(\Omega )\) and \(W^{1,\mathcal {H}}_0(\Omega )\) are uniformly convex and so reflexive Banach spaces as well.

From Colasuonno–Squassina [13, Proposition 2.15] we know that the embedding

$$\begin{aligned} W^{1,\mathcal {H}}_0(\Omega ) \hookrightarrow L^{r}(\Omega ) \end{aligned}$$
(2.3)

is compact for each \(1<r<p^*\), where \(p^*\) stands for the critical exponent to p given by

$$\begin{aligned} p^*:=\frac{Np}{N-p}. \end{aligned}$$

Let us recall some properties of the eigenvalue problem for the r-Laplacian (\(1<r<\infty \)) with homogeneous Dirichlet boundary condition given by

$$\begin{aligned} \begin{aligned} -\Delta _r u&=\lambda |u|^{r-2}u\quad&\text {in } \Omega ,\\ u&= 0&\text {on } \partial \Omega . \end{aligned} \end{aligned}$$
(2.4)

It is known that the set \(\sigma _r\) has a smallest element \(\lambda _{1,r}\) which is positive, isolated, simple and it can be variationally characterized through

$$\begin{aligned} \lambda _{1,r} =\inf \left\{ \frac{\Vert \nabla u\Vert _{r}^r}{\Vert u\Vert _{r}^r}: u \in W^{1,r}_0(\Omega ), u \ne 0 \right\} , \end{aligned}$$

see Lê [25].

Let \(A:W^{1,\mathcal {H}}_0(\Omega )\rightarrow W^{1,\mathcal {H}}_0(\Omega )^*\) be the operator defined by

$$\begin{aligned} \langle A(u),v\rangle _{\mathcal {H}} :=\int _{\Omega }\left( |\nabla u|^{p-2}\nabla u+\mu (x)|\nabla u|^{q-2}\nabla u \right) \cdot \nabla v \,dx, \end{aligned}$$
(2.5)

for \(u,v\in W^{1,\mathcal {H}}_0(\Omega )\), where \(\langle \cdot ,\cdot \rangle _{\mathcal {H}}\) stands for the duality pairing between \(W^{1,\mathcal {H}}_0(\Omega )\) and its dual space \(W^{1,\mathcal {H}}_0(\Omega )^*\). The properties of the operator \(A:W^{1,\mathcal {H}}_0(\Omega )\rightarrow W^{1,\mathcal {H}}_0(\Omega )^*\) can be summarized as follows, see Liu–Dai [26].

Proposition 2.1

The operator A defined by (2.5) is bounded, continuous, monotone (hence maximal monotone) and of type \((S_+)\).

The notion of pseudomonotonicity for multivalued operators is recalled in the next definition.

Definition 2.2

Let X be a real reflexive Banach space. The operator \(A:X\rightarrow 2^{X^*}\) is called pseudomonotone if the following conditions hold:

  1. (i)

    the set A(u) is nonempty, bounded, closed and convex for all \(u \in X\);

  2. (ii)

    A is upper semicontinuous from each finite-dimensional subspace of X to the weak topology on \(X^*\);

  3. (iii)

    if \(\{u_n\} \subset X\) with \(u_n\rightharpoonup u\) in X and if \(u_n^*\in A(u_n)\) is such that

    $$\begin{aligned} \limsup _{n\rightarrow \infty }\langle u_n^*,u_n-u\rangle _{X^*\times X}\le 0, \end{aligned}$$

    then to each element \(v \in X\), exists \(u^*(v) \in A(u)\) with

    $$\begin{aligned} \langle u^*(v),u-v\rangle _{X^*\times X}\le \liminf _{n\rightarrow \infty }\langle u_n^*,u_n-v\rangle _{X^*\times X}. \end{aligned}$$

Let X be a real Banach space with its dual space \(X^*\). A function \(J:X\rightarrow \mathbb {R}\) is said to be locally Lipschitz at \(u\in X\) if there exist a neighborhood N(u) of u and a constant \(L_u>0\) such that

$$\begin{aligned} |J(w)-J(v)|\le L_u\Vert w-v\Vert _X \quad \text {for all } w, v\in N(u). \end{aligned}$$

Definition 2.3

Let \(J :X \rightarrow \mathbb {R}\) be a locally Lipschitz function and let \(u,v\in X\). The generalized directional derivative \(J^\circ (u; v)\) of J at the point u in the direction v is defined by

$$\begin{aligned} J^\circ (u;v): = \limsup \limits _{w\rightarrow u,\, t\downarrow 0 } \frac{J(w+t v)-J(w)}{t}. \end{aligned}$$

The generalized gradient \(\partial J:X\rightarrow 2^{X^*}\) of \(J :X \rightarrow \mathbb {R}\) is defined by

$$\begin{aligned} \partial J(u): =\left\{ \, \xi \in X^{*} \ \mid \ J^\circ (u; v)\ge \langle \xi , v\rangle _{X^*\times X} \ \text { for all } v \in X \right\} \quad \text {for all } u\in X. \end{aligned}$$

The next proposition collects some basic results, see for example, Migórski–Ochal–Sofonea [29, Proposition 3.23].

Proposition 2.4

Let \(J:X \rightarrow \mathbb {R}\) be locally Lipschitz with Lipschitz constant \(L_{u}>0\) at \(u\in X\). Then we have the following:

  1. (i)

    The function \(v\mapsto J^\circ (u;v)\) is positively homogeneous, subadditive, and satisfies

    $$\begin{aligned} |J^\circ (u;v)|\le L_{u}\Vert v\Vert _X \quad \text {for all }v\in X. \end{aligned}$$
  2. (ii)

    The function \((u,v)\mapsto J^\circ (u;v)\) is upper semicontinuous.

  3. (iii)

    For each \(u\in X, \partial J(u)\) is a nonempty, convex, and weak\(^*\) compact subset of \(X^*\) with \( \Vert \xi \Vert _{X^{*}}\le L_{u}\) for all \(\xi \in \partial J(u)\).

  4. (iv)

    \(J^\circ (u;v) = \max \left\{ \langle \xi ,v\rangle _{X^*\times X} \mid \xi \in \partial J(u)\right\} \) for all \(v\in X\).

  5. (v)

    The multivalued function \(X\ni u\mapsto \partial J(u)\subset X^*\) is upper semicontinuous from X into w\(^*\)-\(X^*\).

Since our results are based on fixed point results, so we now recall the fixed point theorem of Kluge [23].

Theorem 2.5

Let Z be a real reflexive Banach space and let \(C\subset Z\) be nonempty, closed and convex. Assume that \(\Psi :C\rightarrow 2^C\) is a multivalued mapping such that for every \(u\in C\), the set \(\Psi (u)\) is nonempty, closed, and convex and the graph of \(\Psi \) is sequentially weakly closed. If either C is bounded or \(\Psi (C)\) is bounded, then the map \(\Psi \) has at least one fixed point in C.

Finally, we end this section by recalling the following surjectivity theorem for multivalued mappings which was proved by Le [24, Theorem 2.2]. We use the notation \(B_R(0):=\{u\in X \ : \ \Vert u\Vert _X<R\}\).

Theorem 2.6

Let X be a real reflexive Banach space, let \(F:D(F)\subset X\rightarrow 2^{X^*}\) be a maximal monotone operator, let \(G:D(G)=X\rightarrow 2^{X^*}\) be a bounded multivalued pseudomonotone operator and let \(L\in X^*\). Assume that there exist \(u_0\in X\) and \(R\ge \Vert u_0\Vert _X\) such that \(D(F)\cap B_R(0)\ne \emptyset \) and

$$\begin{aligned} \langle \xi +\eta -L,u-u_0\rangle _{X^*\times X}>0 \end{aligned}$$

for all \(u\in D(F)\) with \(\Vert u\Vert _X=R\), for all \(\xi \in F(u)\) and for all \(\eta \in G(u)\). Then the inclusion

$$\begin{aligned} F(u)+G(u)\ni L \end{aligned}$$

has a solution in D(F).

3 Main results

We impose the following assumptions for the data of problem (1.1).

\(\hbox {H}(f):\):

\(f\in L^{p'}(\Omega )\).

\(\hbox {H}(j):\):

\(j:\Omega \times \mathbb {R}\rightarrow \mathbb {R}\) is such that

(i):

\(x\mapsto j(x,s)\) is measurable for all \(s\in \mathbb {R}\) and there exists a function \(l\in L^{q_1}(\Omega )\) with \(q_1\in (1,p^*)\) such that the function \(x\mapsto j(x,l(x))\) belongs to \(L^1(\Omega )\);

(ii):

\(s\mapsto j(x,s)\) is locally Lipschitz continuous for a. a. \(x\in \Omega \).

(iii):

there exist \(\theta \ge 1\) with \(\theta \le \min \{q_1,p\}, \alpha _{j}\ge 0\) with \(\alpha _j< \lambda _{1,p}^{-1}\delta _\theta \) and \(\beta _{j}\in L^1_+(\Omega )\) such that

$$\begin{aligned} j^\circ (x,s;-s)\le \alpha _{j}|s|^\theta +\beta _{j}(x) \quad \end{aligned}$$

for a. a. \(x\in \Omega \) and for all \(s\in \mathbb {R}\), where \(\delta _\theta \) is defined by

$$\begin{aligned} \delta _\theta = {\left\{ \begin{array}{ll} 1 &{}\text {if }\theta =p,\\ +\infty &{} \text {otherwise}, \end{array}\right. } \end{aligned}$$

and \(\lambda _{1,p}\) denotes the first eigenvalue of the Dirichlet eigenvalue problem for the p-Laplacian, see (2.4);

(iv):

there exist \(c_j\ge 0\) and \(\gamma _j\in L_+^{\frac{q_1}{q_1-1}}(\Omega )\) satisfying

$$\begin{aligned} |\xi |\le c_j|r|^{q_1-1}+\gamma _j(x) \end{aligned}$$

for a. a. \(x\in \Omega \), for all \(\xi \in \partial j(x,s)\) and for all \(s\in \mathbb {R}\), where \(\partial j(x,s)\) stands for the generalized gradient of j with respect to the variable s and \(q_1\) is given in (i);

(v):

there exists a constant \(m_j\ge 0\) such that

$$\begin{aligned} (\xi _1-\xi _2)(s_1-s_2)\ge -m_{j}|s_1-s_2|^p \end{aligned}$$

for a. a. \(x\in \Omega \) and for all \(s_1, s_2\in \mathbb {R}\) whenever \(\xi _1\in \partial j(x,s_1)\) and \(\xi _2\in \partial j(x,s_2)\).

H(T)::

\(T:W_0^{1,\mathcal H}(\Omega )\rightarrow \mathbb {R}\) is positively homogeneous and subadditive such that

$$\begin{aligned} T(u)\le \limsup _{n\rightarrow \infty } T(u_n) \end{aligned}$$
(3.1)

whenever \(\{u_n\}\subset W_0^{1,\mathcal H}(\Omega )\) is such that \(u_n\rightharpoonup u\) in \(W_0^{1,\mathcal H}(\Omega )\) for some \(u\in W_0^{1,\mathcal H}(\Omega )\).

\(\hbox {H}(U):\):

\(U:W_0^{1,\mathcal H}(\Omega )\rightarrow (0,+\infty )\) is weakly continuous, that is, for any sequence \(\{u_n\}\subset W_0^{1,\mathcal H}(\Omega )\) such that \(u_n\rightharpoonup u\) for some \(u\in W_0^{1,\mathcal H}(\Omega )\), we have

$$\begin{aligned} U(u_n)\rightarrow U(u). \end{aligned}$$

Remark 3.1

(a) Assumption \(\mathrm{H}(j)\mathrm{(v)}\) is usually called the relaxed monotone condition for the locally Lipschitz function \(s\mapsto j(x,s)\), see for example, Migórski–Ochal–Sofonea [29]. It is equivalent to the inequality

$$\begin{aligned} j^\circ \left( x,s_1;s_2-s_1\right) +j^\circ \left( x,s_2;s_1-s_2\right) \le m_j|s_1-s_2|^p \end{aligned}$$

for a. a. \(x\in \Omega \) and for all \(s_1, s_2\in \mathbb {R}\).

(b) Positive homogeneity and subadditivity of T imply that T is also a convex function. On the other hand, it is not difficult to see that if \(T:W_0^{1,\mathcal H}(\Omega )\rightarrow \mathbb {R}\) is lower semicontinuous, then inequality (3.1) holds automatically.

We introduce the following multivalued map \(K:W_0^{1,\mathcal H}(\Omega )\rightarrow 2^{W_0^{1,\mathcal H}(\Omega )}\) defined by

$$\begin{aligned} K(u):=\left\{ v\in W_0^{1,\mathcal H}(\Omega )\ \mid \ T(v)-U(u)\le 0\right\} \end{aligned}$$
(3.2)

for all \(u\in W_0^{1,\mathcal H}(\Omega )\).

Lemma 3.2

Let \(U:W_0^{1,\mathcal H}(\Omega )\rightarrow (0,+\infty )\) and \(T:W_0^{1,\mathcal H}(\Omega )\rightarrow \mathbb {R}\) be such that H(T) holds. Then, for each \(u\in W_0^{1,\mathcal H}(\Omega )\), the set K(u) is nonempty, closed and convex in \(W_0^{1,\mathcal H}(\Omega )\).

Proof

For any fixed \(u\in W_0^{1,\mathcal H}(\Omega )\) it is clear that \(U(u)>0\). Since T is positively homogeneous, we have \(T(0)=0\). This implies \(T(0)=0<U(u)\), that is, \(0\in K(u)\) and so, K(u) is nonempty.

Let \(\{v_n\}\subset K(u)\) be a sequence such that \(v_n\rightarrow v\) in \(W_0^{1,\mathcal H}(\Omega )\) for some \(v\in W_0^{1,\mathcal H}(\Omega )\). Hence, for each \(n\in \mathbb N\), one has

$$\begin{aligned} T(v_n)\le U(u). \end{aligned}$$

Passing to the upper limit as \(n\rightarrow \infty \) and taking inequality (3.1) into account, we obtain

$$\begin{aligned} T(v)\le \limsup _{n\rightarrow \infty }T(v_n)\le U(u). \end{aligned}$$

Hence, \(v\in K(u)\) which shows that K(u) is closed.

Let \(v_1,v_2\in K(u)\) and \(t\in (0,1)\) be arbitrary. We set \(v_t=tv_1+(1-t)v_2\). Then, we have \(T(v_i)\le U(u)\) for \(i=1,2\). By the convexity of T, see Remark 3.1, it follows that

$$\begin{aligned} T(v_t)\le tT(v_1)+(1-t)T(v_2)\le tU(u)+(1-t)U(u)=U(u). \end{aligned}$$

Thus \(v_t\in K(u)\). Therefore, we conclude that K(u) is a convex set in \(W_0^{1,\mathcal H}(\Omega )\). \(\square \)

Let us introduce the functional \(J:L^{q_1}(\Omega ) \rightarrow \mathbb {R}\) defined by

$$\begin{aligned} J(u):=\int _\Omega j(x,u(x))\,dx \quad \text {for all }u\in L^{q_1}(\Omega ). \end{aligned}$$
(3.3)

From hypotheses H(j) and the definition of J in (3.3), the next lemma is a direct consequence of Migórski–Ochal–Sofonea [29, Theorem 3.47].

Lemma 3.3

Under the assumptions \(\mathrm{H}(j)\mathrm{(i)-(iv)}\), the following hold:

  1. (i)

    \(J:L^{q_1}(\Omega ) \rightarrow \mathbb {R}\) is locally Lipschitz continuous;

  2. (ii)

    there hold

    $$\begin{aligned} J^\circ (u;v)&\le \int _\Omega j^\circ (x,u(x);v(x))\,dx,\\ J^\circ (u;-u)&\le \alpha _j\Vert u\Vert _\theta ^\theta +\Vert \beta _j\Vert _1 \end{aligned}$$

    for all \(u,v\in L^{q_1}(\Omega )\);

  3. (iii)

    for each \(u\in L^{q_1}(\Omega )\), one has

    $$\begin{aligned} \partial J(u)&\subset \int _\Omega \partial j(x,u(x))\,dx,\\ \Vert \xi \Vert _{q_1'}&\le c_J(1+\Vert u\Vert _{q_1}^{q_1-1})\quad \text { for all }\xi \in \partial J(u) \end{aligned}$$

    with some \(c_J>0\).

Moreover, if condition \(\mathrm{H}(j)\mathrm{(v)}\) holds, then the inequality

$$\begin{aligned} J^\circ (u;v-u)+J^\circ (v;u-v)\le m_j\Vert u-v\Vert _{p}^p \end{aligned}$$
(3.4)

is satisfied for all \(u,v\in W_0^{1,\mathcal H}(\Omega )\).

The weak solutions for problem (1.1) are understood in the following sense.

Definition 3.4

We say that \(u\in W_0^{1,\mathcal H}(\Omega )\) is a weak solution of problem (1.1) if \(u\in K(u)\) and

$$\begin{aligned}&\int _\Omega \left( |\nabla u|^{p-2} \nabla u \cdot \nabla (v-u) +\mu (x)|\nabla u|^{q-2} \nabla u \cdot \nabla (v-u)\right) \,dx\\&\quad +\int _\Omega j^\circ (x,u;v-u)\,dx\ge \int _\Omega f(x) \left( v-u\right) \,dx \end{aligned}$$

for all \(v\in K(u)\), where the multivalued function K is given by (3.2).

Our main results read as follows.

Theorem 3.5

Assume that \(\mathrm{H}(\mu ), \mathrm{H}(f), \mathrm{H}(j), \mathrm{H}(T)\,\mathrm{and}\,\mathrm{H}(U)\) are satisfied. If \(p\ge 2\) and the smallness condition

$$\begin{aligned} m_j\lambda _{1,p}^{-1}< c(p), \end{aligned}$$
(3.5)

holds, then the set of solutions for problem (1.1), denoted by \(\mathcal S\), is nonempty, bounded and weakly closed in \(W_0^{1,\mathcal H}(\Omega )\), where \(c(p)>0\) is the largest constant such that

$$\begin{aligned} \left( |x|^{p-2}x-|y|^{p-2}y \right) \cdot \left( x-y\right) \ge c(p)|x-y|^p \quad \text {for all }x,y\in \mathbb {R}^N. \end{aligned}$$

From Lemma 3.3(ii) we see that if \(u\in W_0^{1,\mathcal H}(\Omega )\) solves the following problem:

Find \(u\in W_0^{1,\mathcal H}(\Omega )\) such that \(u\in K(u)\) and

$$\begin{aligned} \begin{aligned}&\int _\Omega \left( |\nabla u|^{p-2} \nabla u \cdot \nabla (v-u)+\mu (x)|\nabla u|^{q-2} \nabla u \cdot \nabla (v-u)\right) \,dx\\&\quad +J^\circ (u;v-u)\ge \int _\Omega f(x)\left( v-u\right) \,dx \end{aligned} \end{aligned}$$
(3.6)

for all \(v\in K(u)\), then u is also a weak solution of problem (1.1). Based on this fact, in the sequel, we are going to explore the existence of solutions for problem (3.6).

To this end, we now introduce the following auxiliary problem:

For given \(w\in W_0^{1,\mathcal H}(\Omega )\), find \(u\in K(w)\) such that

$$\begin{aligned} \begin{aligned}&\int _\Omega \left( |\nabla u|^{p-2}\nabla u \cdot \nabla (v-u)+\mu (x)|\nabla u|^{q-2}\nabla u \cdot \nabla (v-u)\right) \,dx\\&\quad +J^\circ (u;v-u)\ge \int _\Omega f(x)\left( v-u\right) \,dx \end{aligned} \end{aligned}$$
(3.7)

for all \(v\in K(w)\). Setting the multivalued \(\Gamma :W_0^{1,\mathcal H}(\Omega )\rightarrow 2^{W_0^{1,\mathcal H}(\Omega )}\) by

$$\begin{aligned} \Gamma (w):=\left\{ u\in W_0^{1,\mathcal H}(\Omega ) \ \mid \ u\text { solves problem } \text {corresponding to} w\right\} \end{aligned}$$

for all \(w\in W_0^{1,\mathcal H}(\Omega )\), it is not difficult to verify that \(u\in W_0^{1,\mathcal H}(\Omega )\) is a fixed point of \(\Gamma \) if and only if u solves problem (3.6). Motivated by this fact, we shall employ Kluge’s fixed point principle, see Theorem 2.5, in order to show that \(\Gamma \) has at least one fixed point in \(W_0^{1,\mathcal H}(\Omega )\).

Theorem 3.6

Let \(U:W_0^{1,\mathcal H}(\Omega )\rightarrow (0,+\infty )\). Under the assumptions \(\mathrm{H}(\mu ), \hbox {H}(f), \hbox {H}(j)\hbox {(i)--(iv)}\) and \(\mathrm{H}(T)\), the following hold:

  1. (i)

    for each \(w\in W_0^{1,\mathcal H}(\Omega )\), the set of solutions to problem (3.7) is nonempty, bounded and closed in \(W_0^{1,\mathcal H}(\Omega )\), that is, \(\Gamma \) has nonempty, bounded and closed values.

  2. (ii)

    if, in addition, \(p\ge 2\), H(j)(v) and the smallness condition (3.5) are fulfilled, then for each \(w\in W_0^{1,\mathcal H}(\Omega )\), problem (3.7) has a unique solution \(u_w\in W_0^{1,\mathcal H}(\Omega )\), namely, \(\Gamma (w)=\{u_w\}\).

Proof

(i) For any fixed \(w\in W_0^{1,\mathcal H}(\Omega )\), we consider the indicator function \(I_{K(w)}:W_0^{1,\mathcal H}(\Omega )\rightarrow \overline{\mathbb {R}}:=\mathbb {R}\cup \{+\infty \}\) of K(w) defined by

$$\begin{aligned} I_{K(w)}(u):= {\left\{ \begin{array}{ll} 0 &{}\text {if }u \in K(w),\\ +\infty &{}\text {otherwise}. \end{array}\right. } \end{aligned}$$

From the fact that \(f\in L^{p'}(\Omega )\subset W_0^{1,\mathcal H}(\Omega )^*\), problem (3.7) can be expressed as the variational-hemivariational inequality: Find \(u\in W_0^{1,\mathcal H}(\Omega )\) such that

$$\begin{aligned} \langle Au,v-u\rangle _\mathcal H +J^\circ (u;v-u)+I_{K(w)}(v)-I_{K(w)}(u) \ge \langle f,v-u\rangle _\mathcal H \end{aligned}$$
(3.8)

for all \(v\in W_0^{1,\mathcal H}(\Omega )\), where \(A:W^{1,\mathcal H}_0(\Omega )\rightarrow W_0^{1,\mathcal H}(\Omega )^*\) is the double phase operator defined in (2.5). Employing the first separation theorem, see for example, Papageorgiou–Winkert [32, Theorem 3.1.57], it is not difficult to see that inequality problem (3.8) is equivalent to the following inclusion problem: Find \(u\in W_0^{1,\mathcal H}(\Omega )\) such that

$$\begin{aligned} Au+\partial J(u)+\partial _C I_{K(w)}(u)\ni f, \end{aligned}$$
(3.9)

where the notation \(\partial _C I_{K(w)}\) stands for the subdifferential of \(I_{K(w)}\) in the sense of convex analysis.

First, we are going to apply the surjectivity theorem for multivalued mappings, see Theorem 2.6, in order to prove that problem (3.9) has at least one solution in \(W_0^{1,\mathcal H}(\Omega )\). In fact, we have the following claims.

Claim 1

\(A+\partial J:W_0^{1,\mathcal H}(\Omega )\rightarrow 2^{ W_0^{1,\mathcal H}(\Omega )^*}\) is a bounded pseudomonotone multivalued operator such that for each \(u\in W_0^{1,\mathcal H}(\Omega )\), the set \(A(u)+\partial J(u)\) is closed and convex in \(W_0^{1,\mathcal H}(\Omega )^*\).

Indeed, Proposition 2.4 and Lemma 3.3 imply that for each \(u\in W_0^{1,\mathcal H}(\Omega )\), the set \(A(u)+\partial J(u)\) is closed and convex in \(W_0^{1,\mathcal H}(\Omega )^*\). Besides, Proposition 2.1, Lemma 3.3(iii), (2.3) and the fact that \(p_1<p^*\) guarantee that

$$\begin{aligned} W_0^{1,\mathcal H}(\Omega )\ni u\mapsto A(u)+\partial J(u)\subset W_0^{1,\mathcal H}(\Omega )^*\ \text {is a bounded map.} \end{aligned}$$

Next are going to apply Proposition 3.8 in Migórski–Ochal–Sofonea [29] in order to conclude that \(W_0^{1,\mathcal H}(\Omega )\ni u\mapsto A(u)+\partial J(u)\subset W_0^{1,\mathcal H}(\Omega )^*\) is upper semicontinuous from \(W_0^{1,\mathcal H}(\Omega )\) to \(W_0^{1,\mathcal H}(\Omega )^*\) equipped with the weak topology. It is sufficient to show that for any weakly closed subset D in \(W_0^{1,\mathcal H}(\Omega )^*\), the set \((A+\partial J)^-(D)\) is closed in \(W_0^{1,\mathcal H}(\Omega )\).

Let \(\{u_n\}\subset (A+\partial J)^-(D)\) be a sequence such that

$$\begin{aligned} u_n\rightarrow u \text { in }W_0^{1,\mathcal H}(\Omega ) \text { for some } u\in W_0^{1,\mathcal H}(\Omega ). \end{aligned}$$
(3.10)

Hence, for each \(n\in \mathbb N\), we are able to find \(\xi _n\in \partial J(u_n)\) such that

$$\begin{aligned} u_n^*:=Au_n+\xi _n\in \big (A(u_n)+\partial J(u_n)\big )\cap D. \end{aligned}$$

The continuity of A, see Proposition 2.1, implies that \(A(u_n)\rightarrow A(u)\) in \(W_0^{1,\mathcal H}(\Omega )^*\). Lemma 3.3(iii) and convergence (3.10) imply that the sequence \(\{\xi _n\}\) is bounded in \(W_0^{1,\mathcal H}(\Omega )^*\). So, without any loss of generality, we may assume that

$$\begin{aligned} \xi _n\rightharpoonup \xi \text { in }W_0^{1,\mathcal H}(\Omega )^*. \end{aligned}$$

Recall that \(\partial J\) is upper semicontinuous from \(W_0^{1,\mathcal H}(\Omega )\) to \(W_0^{1,\mathcal H}(\Omega )^*\) equipped with the weak topology and has bounded, convex, closed values, see Proposition 2.4(iv). Hence, it has a closed graph in \(W_0^{1,\mathcal H}(\Omega )\times W_0^{1,\mathcal H}(\Omega )^*\), see Kamenskii–Obukhovskii–Zecca [22, Theorem 1.1.4]. But, thanks to the weak closedness of D, we derive that \(A(u)+\xi \in D\) and \(\xi \in \partial J(u)\), which provides that \(u\in \big (A+\partial J\big )^-(D)\). Consequently, \(A+\partial J\) is upper semicontinuous from \( W_0^{1,\mathcal H}(\Omega )\) to \( W_0^{1,\mathcal H}(\Omega )^*\) equipped with the weak topology.

We now prove that \(A+\partial J\) is pseudomonotone. Let \(\{u_n\}\) and \(\{u_n^*\}\) be sequences such that

$$\begin{aligned} u_n&\rightharpoonup u \quad \text {in } W_0^{1,\mathcal H}(\Omega ), \end{aligned}$$
(3.11)
$$\begin{aligned} u_n^*&\in A(u_n)+\partial J(u_n)\quad \text {with } \limsup _{n\rightarrow \infty }\langle u_n^*,u_n-u\rangle _\mathcal H\le 0. \end{aligned}$$
(3.12)

Our goal is to produce for each \(v\in W_0^{1,\mathcal H}(\Omega )\) an element \(u^*(v)\in A(u)+\partial J(u)\) such that

$$\begin{aligned} \liminf _{n\rightarrow \infty }\langle u_n^*,u_n-v\rangle _\mathcal H \ge \langle u^*(v),u-v\rangle _\mathcal H. \end{aligned}$$
(3.13)

From (3.12), there is a sequence \(\{\xi _n\}\subset W_0^{1,\mathcal H}(\Omega )^*\) such that for each \(n\in \mathbb N, \xi _n\in \partial J(u_n)\) and

$$\begin{aligned} u_n^*= A(u_n)+\xi _n. \end{aligned}$$

From (3.12), and the above equality it follows that

$$\begin{aligned} \limsup _{n\rightarrow \infty }\langle Au_n,u_n-u\rangle _\mathcal H +\liminf _{n\rightarrow \infty }\langle \xi _n,u_n-u\rangle _\mathcal H\le 0. \end{aligned}$$
(3.14)

Applying (3.11) and the compact embedding of \(W_0^{1,\mathcal H}(\Omega )\) into \(L^{q_1}(\Omega )\), see (2.3), we have

$$\begin{aligned} u_n\rightarrow u \quad \text {in }L^{q_1}(\Omega ). \end{aligned}$$

By virtue of Theorem 2.2 of Chang [11], we know that

$$\begin{aligned} \partial \left( J|_{W_0^{1,\mathcal H}(\Omega )}\right) (u)\subset \partial \left( J|_{L^{q_1}(\Omega )}\right) (u)\quad \text {for all }u\in W_0^{1,\mathcal H}(\Omega ), \end{aligned}$$

which shows that

$$\begin{aligned} \langle \xi _n,u_n-u\rangle _\mathcal H= \langle \xi _n,u_n-u\rangle _{L^{q_1}(\Omega )}. \end{aligned}$$
(3.15)

Additionally, Lemma 3.3(iii) and the boundedness of \(\{u_n\}\) in \(W_0^{1,\mathcal H}(\Omega )\) entail that the sequence \(\{\xi _n\}\) is bounded both in \(L^{q_1}(\Omega )\) and \(W_0^{1,\mathcal H}(\Omega )^*\). Then, we pass to the limit in (3.15) as \(n\rightarrow \infty \), to get

$$\begin{aligned} \lim _{n\rightarrow \infty }\langle \xi _n,u_n-u\rangle _\mathcal H = \lim _{n\rightarrow \infty }\langle \xi _n,u_n-u\rangle _{L^{q_1}(\Omega )}=0. \end{aligned}$$

Combining this with (3.14) leads to

$$\begin{aligned} \limsup _{n\rightarrow \infty }\langle Au_n,u_n-u\rangle _\mathcal H =\limsup _{n\rightarrow \infty }\langle Au_n,u_n-u\rangle _\mathcal H +\liminf _{n\rightarrow \infty }\langle \xi _n,u_n-u\rangle _\mathcal H\le 0. \end{aligned}$$

Hence, since A is of type \((S_+)\), see Proposition 2.1, and (3.11) yields \(u_n\rightarrow u\) in \(W_0^{1,\mathcal H}(\Omega )\). On the other hand, by the reflexivity of \(W_0^{1,\mathcal H}(\Omega )^*\) and the boundedness of \(\{\xi _n\}\subset W_0^{1,\mathcal H}(\Omega )^*\), we can suppose that

$$\begin{aligned} \xi _n\rightharpoonup \xi \quad \text {in } W_0^{1,\mathcal H}(\Omega )^* \text { for some }\xi \in W_0^{1,\mathcal H}(\Omega )^*. \end{aligned}$$

Then, from Kamenskii–Obukhovskii–Zecca [22, Theorem 1.1.4], we can conclude that \(\xi \in \partial J(u)\). Because of

$$\begin{aligned} \liminf _{n\rightarrow \infty }\langle u_n^*,u_n-v\rangle _\mathcal H =\liminf _{n\rightarrow \infty }\langle A(u_n)+\xi _n,u_n-v\rangle _\mathcal H =\langle A(u)+\xi ,u-v\rangle _\mathcal H, \end{aligned}$$

it is clear that (3.13) holds with \(u^*= A(u)+\xi \in A(u)+\partial J(u)\). We conclude that \(A+\partial J\) is pseudomonotone. This proves Claim 1.

Claim 2

There exists \(R>0\) such that

$$\begin{aligned} \langle Au+\xi +\eta -f,u\rangle _\mathcal H>0 \end{aligned}$$
(3.16)

for all \(u\in K(w)\) with \(\Vert u\Vert _{1,\mathcal H,0}=R\), for all \(\xi \in \partial J(u)\) and for all \(\eta \in \partial _C(I_{K(w)})(u)\).

Let \(u\in W_0^{1,\mathcal H}(\Omega )\) be fixed. Since \(0\in K(w)\) and \(f\in L^{p'}(\Omega )\subset W_0^{1,\mathcal H}(\Omega )^*\), for any \(\xi \in \partial J(u)\) and \(\eta \in \partial _C(I_{K(w)})(u)\), we can find

$$\begin{aligned} \begin{aligned}&\langle Au+\xi +\eta -f,u\rangle _\mathcal H\\&\quad \ge \int _\Omega |\nabla u|^{p-2}\nabla u \cdot \nabla u\,dx+ \int _\Omega \mu (x)|\nabla u|^{q-2}\nabla u \cdot \nabla u\,dx\\&\qquad +\int _\Omega \xi u \,dx +I_{K(w)}(u)-I_{K(w)}(0)-\Vert f\Vert _{W^{1,\mathcal H}_0(\Omega )^*}\Vert u\Vert _{1,\mathcal H,0}\\&\quad \ge \Vert \nabla u\Vert _p^p+\Vert \nabla u\Vert _{q,\mu }^q -\int _\Omega \xi [-u]\,dx+I_{K(w)}(u) -\Vert f\Vert _{W^{1,\mathcal H}_0(\Omega )^*}\Vert u\Vert _{1,\mathcal H,0}\\&\quad \ge \Vert \nabla u\Vert _p^p+\Vert \nabla u\Vert _{q,\mu }^q-J^\circ (u;-u) +I_{K(w)}(u)-\Vert f\Vert _{W^{1,\mathcal H}_0(\Omega )^*}\Vert u\Vert _{1,\mathcal H,0}. \end{aligned} \end{aligned}$$
(3.17)

Note that \(I_{K(w)}:W_0^{1,\mathcal H}(\Omega )\rightarrow \overline{\mathbb {R}}\) is a proper, convex and lower semicontinuous function. Hence, we can apply Proposition 1.10 of Brezis [8] to find \(a_{K(w)},b_{K(w)}>0\) such that

$$\begin{aligned} I_{K(w)}(v)\ge -a_{K(w)}\Vert v\Vert _{1,\mathcal H,0}-b_{K(w)}\quad \text {for all }v\in W_0^{1,\mathcal H}(\Omega ). \end{aligned}$$
(3.18)

In addition, by Lemma 3.3(ii), we have

$$\begin{aligned} J^\circ (u;-u)\le \alpha _j\Vert u\Vert _\theta ^\theta +\Vert \beta _j\Vert _1. \end{aligned}$$
(3.19)

We consider now the two cases \(\theta <p\) and \(\theta =p\). Suppose first \(\theta <p\) and let \(c(\theta )>0\) be such that

$$\begin{aligned} \Vert u\Vert _\theta \le c(\theta )\Vert u\Vert _{1,\mathcal H,0}\quad \text {for all }u\in W_0^{1,\mathcal H}(\Omega ) \end{aligned}$$
(3.20)

due to the continuity of the embedding from \(W_0^{1,\mathcal H}(\Omega )\) to \(L^r(\Omega )\) for all \(r\in (1,p^*)\). Applying (3.18) and (3.19) in (3.17) and using (3.20) we get

$$\begin{aligned} \begin{aligned}&\langle Au+\xi +\eta -f,u\rangle _\mathcal H\\&\quad \ge \Vert \nabla u\Vert _p^p+\Vert \nabla u\Vert _{q,\mu }^q-\alpha _j\Vert u\Vert _\theta ^\theta -\Vert \beta _j\Vert _1 -a_{K(w)}\Vert u\Vert _{1,\mathcal H,0}-b_{K(w)}\\&\qquad -\Vert f\Vert _{W^{1,\mathcal H}_0(\Omega )^*}\Vert u\Vert _{1,\mathcal H,0}\\&\quad \ge \Vert \nabla u\Vert _p^p+\Vert \nabla u\Vert _{q,\mu }^q-\alpha _jc(\theta )^\theta \Vert u\Vert _{1,\mathcal H,0}^\theta -\Vert \beta _j\Vert _1 -a_{K(w)}\Vert u\Vert _{1,\mathcal H,0}\\&\qquad -b_{K(w)} -\Vert f\Vert _{W^{1,\mathcal H}_0(\Omega )^*}\Vert u\Vert _{1,\mathcal H,0}\\&\quad \ge \min \big \{\Vert u\Vert _{1,\mathcal H,0}^p,\Vert u\Vert _{1,\mathcal H,0}^q\big \}-\alpha _jc(\theta )^\theta \Vert u\Vert _{1,\mathcal H,0}^\theta -\Vert \beta _j\Vert _1 \\&\qquad -a_{K(w)}\Vert u\Vert _{1,\mathcal H,0}-b_{K(w)}-\Vert f\Vert _{W^{1,\mathcal H}_0(\Omega )^*}\Vert u\Vert _{1,\mathcal H,0}, \end{aligned} \end{aligned}$$
(3.21)

where the last inequality is obtained via inequality (2.2). Since \(\theta<p<q\), it is clear that we can find a constant \(R_0>0\) large enough such that \(R_0^p<R_0^q\) and

$$\begin{aligned} R_0^p-\alpha _jc(\theta )^\theta R_0^\theta -\Vert \beta _j\Vert _1 -a_{K(w)}R_0-b_{K(w)} -\Vert f\Vert _{W^{1,\mathcal H}_0(\Omega )^*}R_0>0. \end{aligned}$$

Therefore, for each \(R\ge R_0\) fixed, the desired inequality (3.16) holds.

Suppose now \(\theta =p\). Then, taking \(W_0^{1,\mathcal H}(\Omega )\subset W_0^{1,p}(\Omega )\) into account and the fact that

$$\begin{aligned} \Vert u\Vert _p^p\le \lambda _{1,p}^{-1}\Vert \nabla u\Vert _p^p \quad \text {for all }u \in W_0^{1,p}(\Omega ), \end{aligned}$$
(3.22)

we obtain

$$\begin{aligned} \begin{aligned}&\langle Au+\xi +\eta -f,u\rangle _\mathcal H\\&\quad \ge \Vert \nabla u\Vert _p^p+\Vert \nabla u\Vert _{q,\mu }^q-\alpha _j\Vert u\Vert _p^p-\Vert \beta _j\Vert _1 -a_{K(w)}\Vert u\Vert _{1,\mathcal H,0}-b_{K(w)}\\&\qquad -\Vert f\Vert _{W^{1,\mathcal H}_0(\Omega )^*}\Vert u\Vert _{1,\mathcal H,0}\\&\quad \ge \left( 1-\alpha _j\lambda _{1,p}^{-1}\right) \Vert \nabla u\Vert _p^p+\Vert \nabla u\Vert _{q,\mu }^q -\Vert \beta _j\Vert _1 -a_{K(w)}\Vert u\Vert _{1,\mathcal H,0}-b_{K(w)}\\&\quad -\Vert f\Vert _{W^{1,\mathcal H}_0(\Omega )^*}\Vert u\Vert _{1,\mathcal H,0}\\&\quad \ge \big (1-\alpha _j\lambda _{1,p}^{-1}\big )\big (\Vert \nabla u\Vert _p^p+\Vert \nabla u\Vert _{q,\mu }^q\big ) -\Vert \beta _j\Vert _1 -a_{K(w)}\Vert u\Vert _{1,\mathcal H,0}-b_{K(w)}\\&\qquad -\Vert f\Vert _{W^{1,\mathcal H}_0(\Omega )^*}\Vert u\Vert _{1,\mathcal H,0}\\&\quad \ge \big (1-\alpha _j\lambda _{1,p}^{-1}\big )\min \big \{\Vert u\Vert _{1,\mathcal H,0}^p,\Vert u\Vert _{1,\mathcal H,0}^q\big \} -\Vert \beta _j\Vert _1 -a_{K(w)}\Vert u\Vert _{1,\mathcal H,0}-b_{K(w)}\\&\qquad -\Vert f\Vert _{W^{1,\mathcal H}_0(\Omega )^*}\Vert u\Vert _{1,\mathcal H,0}. \end{aligned} \end{aligned}$$
(3.23)

Since \(1<p<q\) and \(\alpha _j\lambda _{1,p}^{-1}<1\), we can take \(R_0>0\) large enough with \(R_0^p<R_0^q\) such that for all \(R\ge R_0\) it holds

$$\begin{aligned} \left( 1-\alpha _j\lambda _{1,p}^{-1}\right) R^p -\Vert \beta _j\Vert _1 -a_{K(w)}R-b_{K(w)} -\Vert f\Vert _{W^{1,\mathcal H}_0(\Omega )^*}R>0. \end{aligned}$$

Therefore, inequality (3.16) holds and Claim 2 is proved.

Recall that \(I_{K(w)}:W_0^{1,\mathcal H}(\Omega )\rightarrow \overline{\mathbb {R}}\) is a proper, convex and lower semicontinuous function. Hence \(\partial _C I_{K(w)}:W_0^{1,\mathcal H}(\Omega )\rightarrow 2^{W^{1,\mathcal H}_0(\Omega )^*}\) is maximal monotone. This fact combined with Claims 1 and 2 allows us to apply Theorem 2.6 which provides \(u_w\in W_0^{1,\mathcal H}(\Omega )\) satisfying inclusion (3.9). Thus \(\Gamma (w)\ne \emptyset \) for each \(w\in W_0^{1,\mathcal H}(\Omega )\).

Now we want to verify that \(\Gamma (w)\) is closed in \(W_0^{1,\mathcal H}(\Omega )\). Let \(\{u_n\}\subset \Gamma (w)\) be a sequence such that

$$\begin{aligned} u_n\rightarrow u \quad \text {in }W_0^{1,\mathcal H}(\Omega ) \end{aligned}$$

for some \(u\in W_0^{1,\mathcal H}(\Omega )\). So, for each \(n\in \mathbb N\), we have

$$\begin{aligned} \langle Au_n,v-u_n\rangle _\mathcal H+J^\circ (u_n;v-u_n)+I_{K(w)}(v)-I_{K(w)}(u_n)\ge \langle f,v-u_n\rangle _\mathcal H \end{aligned}$$

for all \(v\in W_0^{1,\mathcal H}(\Omega )\). Passing to the upper limit as \(n\rightarrow \infty \), we get

$$\begin{aligned}&\langle Au,v-u\rangle _\mathcal H+J^\circ (u;v-u)+I_{K(w)}(v)-I_{K(w)}(u)\\&\quad \ge \limsup _{n\rightarrow \infty }\left[ \langle Au_n,v-u_n\rangle _\mathcal H+J^\circ (u_n;v-u_n)+I_{K(w)}(v)-I_{K(w)}(u_n)\right] \\&\quad \ge \limsup _{n\rightarrow \infty }\langle f,v-u_n\rangle _\mathcal H\\&\quad =\langle f,v-u\rangle _\mathcal H \end{aligned}$$

for all \(v\in W_0^{1,\mathcal H}(\Omega )\), where we have used the continuity of A, see Proposition 2.1, the upper semicontinuity of \((u,v)\mapsto J^\circ (u;v)\), see Proposition 2.4(iv), and the lower semicontinuity of \(I_{K(w)}\). This shows that \(u\in \Gamma (w)\). Hence, \(\Gamma (w)\) is closed.

Finally, we need to show that \(\Gamma (w)\) is bounded. Arguing by contradiction, we suppose that \(\Gamma (w)\) is unbounded. Then, there exists a sequence \(\{u_n\}\) in \(\Gamma (w)\) such that

$$\begin{aligned} \Vert u_n\Vert _{1,\mathcal H,0}\rightarrow +\infty . \end{aligned}$$
(3.24)

By a simple computing, see (3.21) and (3.23) for example, we are able to find \(N_0\in \mathbb N\) such the for all \(n\ge N_0\), one has

$$\begin{aligned} 0\ge \langle Au_n,u_n\rangle _\mathcal H-J^\circ (u_n;-u_n)+I_{K(w)}(u_n)>0, \end{aligned}$$

where we have used the fact that \(0\in K\) and (3.24) which leads to a contradiction. Therefore, \(\Gamma (w)\) is bounded.

(ii) Now assume that H(f)(v) holds. Let \(u_1,u_2\in W_0^{1,\mathcal H}(\Omega )\) be two solutions of problem (3.7), that is,

$$\begin{aligned} \langle Au_i,v-u_i\rangle _\mathcal H+J^\circ (u_i;v-u_i)+I_{K(w)}(v)-I_{K(w)}(u_i)\ge \langle f,v-u_i\rangle _\mathcal H \end{aligned}$$

for all \(v\in W_0^{1,\mathcal H}(\Omega )\) and for \(i=1,2\). Taking \(v=u_2\) and \(v=u_1\) into the inequalities above for \(i=1\) and \(i=2\), respectively, and applying (3.4) we obtain

$$\begin{aligned} 0&\ge \langle Au_1-Au_2,u_1-u_2\rangle _\mathcal H-\left( J^\circ (u_1;u_2-u_1)+J^\circ (u_2;u_1-u_2)\right) \\&= \int _\Omega \left( |\nabla u_1|^{p-2}\nabla u_1-|\nabla u_2|^{p-2}\nabla u_2\right) \cdot \nabla (u_1-u_2)\,dx\\&\quad +\int _\Omega \mu (x)\left( |\nabla u_1|^{q-2}\nabla u_1-|\nabla u_2|^{q-2}\nabla u_2\right) \cdot \nabla (u_1-u_2)\,dx\\&\quad -m_j\Vert u_1-u_2\Vert _p^p\\&\ge \int _\Omega \left( |\nabla u_1|^{p-2}\nabla u_1-|\nabla u_2|^{p-2}\nabla u_2\right) \cdot \nabla (u_1-u_2)\,dx-m_j\Vert u_1-u_2\Vert _p^p, \end{aligned}$$

where we have used the fact \(\mu (x)\ge 0\) for a. a. \(x\in \Omega \). Taking

$$\begin{aligned} \int _\Omega \left( |\nabla u_1|^{p-2}\nabla u_1-|\nabla u_2|^{p-2}\nabla u_2 \right) \cdot \nabla (u_1- u_2)\,dx\ge c(p)\Vert \nabla (u_1-u_2)\Vert _p^p, \end{aligned}$$

into account gives

$$\begin{aligned} 0&\ge \langle Au_1-Au_2,u_1-u_2\rangle _\mathcal H-\left( J^\circ (u_1;u_2-u_1)+J^\circ (u_2;u_1-u_2)\right) \\&\ge c(p)\Vert \nabla (u_1-u_2)\Vert _p^p-m_j\Vert u_1-u_2\Vert _p^p\\&\ge \big (c(p)-m_j\lambda _{1,p}^{-1}\big )\Vert \nabla (u_1-u_2)\Vert _p^p, \end{aligned}$$

where the last inequality is obtained by using inequality (3.22). Since \(W_0^{1,\mathcal H}(\Omega )\subset W_0^{1,p}(\Omega )\) we deduce that \(u_1=u_2\). Consequently, problem (3.7) has a unique solution in \(W_0^{1,\mathcal H}(\Omega )\). \(\square \)

Now we are in the position to prove Theorem 3.5 by applying Theorem 3.6 and Kluge’s fixed point theorem.

Proof of Theorem 3.5

We have already mentioned that the fixed point set of \(\Gamma \) is the corresponding set of solutions to problem (3.6). Besides, Lemma 3.2 points out that the set of solutions for problem (3.6) is a subset of the set of solutions for problem (1.1). Consequently, it suffices to show that the set of fixed points of \(\Gamma \) is nonempty. \(\square \)

Claim 3

The graph of \(\Gamma \) is sequentially weakly closed.

Let \(\{w_n\}\subset W_0^{1,\mathcal H}(\Omega )\) be a sequence such that \(w_n\rightharpoonup w\) and \(u_n:=\Gamma (w_n)\rightharpoonup u\) in \(W_0^{1,\mathcal H}(\Omega )\) for some \(w,u\in W_0^{1,\mathcal H}(\Omega )\). Then, for each \(n\in \mathbb N\) we have \(u_n\in K(w_n)\), namely \(T(u_n)\le U(w_n)\), and

$$\begin{aligned} \langle Au_n,v-u_n\rangle _\mathcal H+J^\circ (u_n;v-u_n) \ge \langle f,v-u_n\rangle _\mathcal H \end{aligned}$$
(3.25)

for all \(v\in K(w_n)\). The inequality

$$\begin{aligned}&\langle Au_1-Au_2,u_1-u_2\rangle _\mathcal H-\left( J^\circ (u_1;u_2-u_1)+J^\circ (u_2;u_1-u_2)\right) \\&\quad \ge \left( c(p)-m_j\lambda _{1,p}^{-1}\right) \Vert \nabla \left( u_1-u_2\right) \Vert _p^p\ge 0 \end{aligned}$$

for all \(u_1,u_2\in W_0^{1,\mathcal H}(\Omega )\) and smallness condition (3.5) indicate that \(u_n\) solves the following problem

$$\begin{aligned} \langle Av,v-u_n\rangle _\mathcal H+J^\circ (v;v-u_n) \ge \langle f,v-u_n\rangle _\mathcal H \end{aligned}$$
(3.26)

for all \(v\in K(w_n)\).

From hypotheses H(T) and H(U) we know that

$$\begin{aligned} T(u)\le \limsup _{n\rightarrow \infty }T(u_n)\le \limsup _{n\rightarrow \infty }U(w_n)\le U(w). \end{aligned}$$

This means that \(u\in K(w)\).

For any fixed \(v\in K(w)\), since \(U(w)>0\), we can consider the sequence \(\{v_n\}\) defined by

$$\begin{aligned} v_n:=\frac{U(w_n)}{U(w)}v\quad \text {for all }n\in \mathbb N. \end{aligned}$$

The nonnegativity of U, the positive homogeneity of T and \(v\in K(w)\), that is, \(T(v)\le U(w)\), imply

$$\begin{aligned} T(v_n)=T\left( \frac{U(w_n)}{U(w)}v\right) =\frac{U(w_n)}{U(w)}T(v)\le \frac{U(w_n)U(w)}{U(w)}=U(w_n). \end{aligned}$$

Hence \(v_n\in K(w_n)\). A simple calculating gives

$$\begin{aligned} \lim _{n\rightarrow \infty }\Vert v_n-v\Vert _{1,\mathcal H,0}=\lim _{n\rightarrow \infty }|U(w_n)-U(w)|\frac{\Vert v\Vert _{1,\mathcal H,0}}{U(w)}=0. \end{aligned}$$

This shows that \(v_n\rightarrow v\). Taking \(v=v_n\) in (3.26) and passing to the upper limit as \(n\rightarrow \infty \) one has

$$\begin{aligned}&\langle Av,v-u\rangle _\mathcal H+J^\circ (v;v-u)\\&\quad \ge \limsup _{n\rightarrow \infty }\langle Av_n,v_n-u_n\rangle _\mathcal H +\limsup _{n\rightarrow \infty } J^\circ (v_n;v_n-u_n)\\&\quad \ge \limsup _{n\rightarrow \infty }\left[ \langle Av_n,v_n-u_n\rangle _\mathcal H+J^\circ (v_n;v_n-u_n)\right] \\&\quad \ge \limsup _{n\rightarrow \infty }\langle f,v_n-u_n\rangle _\mathcal H=\langle f,v-u\rangle _\mathcal H, \end{aligned}$$

where we have used the compact embedding of \(W_0^{1,\mathcal H}(\Omega )\) in \(L^{q_1}(\Omega )\) and the fact that \(L^{q_1}(\Omega )\times L^{q_1}(\Omega )\ni (v,u)\rightarrow J^\circ (u;v)\in \mathbb {R}\) is upper semicontinuous, see (2.3) and Proposition 2.4. Hence,

$$\begin{aligned} \langle Av,v-u\rangle _\mathcal H+J^\circ (v;v-u)\ge \langle f,v-u\rangle _\mathcal H \end{aligned}$$
(3.27)

for all \(v\in K(w)\). Note that \(u\in K(w)\) and K(w) is closed and convex. Let \(t\in (0,1)\) be arbitrary and set \(v_t:=tv+(1-t)u\). Taking \(v=v_t\) in (3.27) and applying Proposition 2.4(i), we get that

$$\begin{aligned} \langle Av_t,v-u\rangle _\mathcal H+J^\circ (v_t;v-u)\ge \langle f,v-u\rangle _\mathcal H. \end{aligned}$$

Passing to the upper limit in the inequality above as \(t\rightarrow 0^+\) yields

$$\begin{aligned} \langle Au,v-u\rangle _\mathcal H+J^\circ (u;v-u)\ge \langle f,v-u\rangle _\mathcal H \end{aligned}$$

for all \(v\in K(w)\) which means \(u =\Gamma (w)\). Therefore, we conclude that the graph of \(\Gamma \) is sequentially weakly closed and so Claim 3 is proved.

Claim 4

The set \(\Gamma (W_0^{1,\mathcal H}(\Omega ))\) is bounded in \(W_0^{1,\mathcal H}(\Omega )\).

Arguing by contradiction and suppose the claim is not true. Then there exists a sequence \(\{w_n\}\) such that

$$\begin{aligned} \Vert u_n\Vert _{1,\mathcal H,0}\rightarrow \infty , \end{aligned}$$

where \(u_n=\Gamma (w_n)\). For every \(n\in \mathbb N\), one has (3.25) for all \(v\in K(w_n)\). Having in mind that \(0\in K(w)\) for each \(w\in W_0^{1,\mathcal H}(\Omega )\), we take \(v=0\) as test function in (3.25) to obtain

$$\begin{aligned} \langle Au_n,u_n\rangle _\mathcal H-J^\circ \left( u_n;-u_n\right) \le \Vert f\Vert _{W_0^{1,\mathcal H}(\Omega )^*}\Vert u_n\Vert _{1,\mathcal H,0}. \end{aligned}$$

By using the same arguments as in the proof of Theorem 3.6, see (3.21) and (3.23), we can find \(N_0\in \mathbb N\) large enough with \(\Vert u_n\Vert _{1,\mathcal H,0}^p<\Vert u_n\Vert _{1,\mathcal H,0}^q\) for all \(n\ge N_0\) such that

$$\begin{aligned} 0<\langle Au_n,u_n\rangle _\mathcal H-J^\circ \left( u_n;-u_n\right) -\Vert f\Vert _{W_0^{1,\mathcal H}(\Omega )^*}\Vert u_n\Vert _{1,\mathcal H,0}\le 0 \end{aligned}$$

for all \(n\ge N_0\). This is a contradiction. Therefore, we conclude that the set \(\Gamma (W_0^{1,\mathcal H}(\Omega ))\) is bounded in \(W_0^{1,\mathcal H}(\Omega )\), which proves Claim 4.

Now we can apply Theorem 2.5 for the mapping \(\Psi =\Gamma \). This shows that \(\Gamma \) admits a fixed point in \(W_0^{1,\mathcal H}(\Omega )\) which implies that problem (1.1) has at least one weak solution in \(W_0^{1,\mathcal H}(\Omega )\). We still need to show that the set \(\mathcal {S}\) is bounded and weakly closed.

The boundedness of \(\mathcal S\) can be obtained directly via using analogous arguments as in the proof of Claim 4.

It remains to show the weak closedness of \(\mathcal S\). Let \(\{u_n\}\subset \mathcal S\) be a sequence such that \(u_n\rightharpoonup u\) in \(W_0^{1,\mathcal H}(\Omega )\) for some \(u\in W_0^{1,\mathcal H}(\Omega )\). Hence, for each \(n\in \mathbb N\), we see that \(u_n\in K(u_n)\) and

$$\begin{aligned} \langle Av,v-u_n\rangle _\mathcal H+\int _\Omega j^\circ (v(x);v(x)-u_n(x))\,dx\ge \langle f,v-u_n\rangle _\mathcal H \end{aligned}$$
(3.28)

for all \(v\in K(u_n)\). Because the graph of K is sequentially weakly closed, see Claim 3, this implies \(u\in K(u)\). For any \(v\in K(u)\), we set \(v_n:=\frac{U(u_n)}{U(u)}v\). We have \(v_n\in K(u_n)\) and \(v_n\rightarrow v\) in \(W_0^{1,\mathcal H}(\Omega )\). Taking \(v=v_n\) in (3.28) and passing to the upper limit as \(n\rightarrow \infty \) we obtain

$$\begin{aligned} \langle Av,v-u\rangle _\mathcal H+\int _\Omega j^\circ (v(x);v(x)-u(x))\,dx\ge \langle f,v-u\rangle _\mathcal H \end{aligned}$$

for all \(v\in K(u)\), where we have applied Fatou’s Lemma. Invoking the Minty approach gives \(u\in \mathcal S\). Therefore, \(\mathcal S\) is weakly closed in \(W_0^{1,\mathcal H}(\Omega )\). \(\square \)