1 Introduction

Let \(\mathbb {T}=\mathbb {R}/\mathbb {Z}\) and let \(T:\mathbb {T}\rightarrow \mathbb {T}\) be the expanding map \(T(x)=bx ~( \text{ mod } 1)\), where \(b\ge 2\) is an integer. In this note we consider the Schrödinger cocycle on \(\mathbb {T}\times \mathbb {R}^2\) defined by

$$\begin{aligned} F_E:(x,y)\mapsto (T(x),A_E(x)y) \end{aligned}$$

where

$$\begin{aligned} A_E(x)=\left( \begin{matrix} 0 &{} 1 \\ -1 &{} \lambda v(x)-E \end{matrix} \right) \in SL(2,\mathbb {R}) \end{aligned}$$

and \(v:\mathbb {T}\rightarrow \mathbb {R}\) is a continuous function, \(\lambda \in \mathbb {R}\) is a coupling constant and \(E\in \mathbb {R}\) is the energy parameter.

We letFootnote 1

$$\begin{aligned} A^n_E(x)= A_E(T^{n-1}(x))\cdots A_E(x), ~ n\ge 1, \end{aligned}$$

and define the (maximal) Lyapunov exponent by

$$\begin{aligned} L(E)=\lim _{n\rightarrow \infty }\frac{1}{n}\int _\mathbb {T}\log \Vert A_E^n(x)\Vert dx ~(\ge 0). \end{aligned}$$

Recall that the Lebesgue measure on \(\mathbb {T}\) is an invariant measure for T. Since T is ergodic with respect to this measure we have

$$\begin{aligned} \lim _{n\rightarrow \infty }\frac{1}{n}\log \Vert A_E^n(x)\Vert =L(E) \text { for a.e. } x\in \mathbb {T}. \end{aligned}$$
(1.1)

For the important connection to the discrete Schrödinger operator we refer to the articles [4,5,6, 12] and references therein.

A natural question is to ask under which conditions (on \(\lambda ,v,b\) and E) we have \(L(E)>0\). Here we are especially interested in conditions on \(\lambda ,v\) and b which guarantees \(L(E)>\text {const.}>0\) for all \(E\in \mathbb {R}\). Besides the problem in itself, which has a general interest in the theory of non-uniformly hyperbolic dynamical systems, such uniform lower bounds are many times important for deriving finer properties of the associated Schrödinger operator (see, e.g., [3]).

Next follows a brief summary of previous results. It should be stressed that all the results hold for any \(b\ge 2\).

In [6] it is shown that if v is measurable, bounded and non-constant, and \(\lambda >0\), then \(L(E)>0\) for a.e. \(E\in \mathbb {R}\). Moreover, for small \(\lambda \) and smooth non-constant v one has \(L(E)\approx \lambda ^2\) for \(\sqrt{\lambda }<|E|<2-\sqrt{\lambda }\) [4]; and for large \(\lambda \), and under quite general conditions on v, one has \(L(E)\gtrsim \log \lambda \) for all E outside an exponentially small (in \(\lambda \)) set [10, 12].

Furthermore, from Herman’s subharmonic argument [7] it follows that if v is a non-constant trigonometric polynomial, then \(L(E)\gtrsim \log \lambda \) for all \(E\in \mathbb {R}\) and all large \(\lambda \). However, whether the corresponding result holds for v a non-constant real-analytic function does not seem to be known (if instead \(T(x)=x+\omega \), \(\omega \in \mathbb {R}\setminus \mathbb {Q}\), this is a well-known result [11]). Steps towards a proof of this result are taken in [9]. Another situation where one has \(L(E)\gtrsim \log \lambda \) for all \(E\in \mathbb {R}\) and all large \(\lambda \) is when v is \(C^1\) and monotone on (0, 1), with a discontinuity at \(x=0\) [15].

On the other hand, if \(\varphi :\mathbb {T}\rightarrow \mathbb {R}\) is any bounded and measurable function such that \(\int _{\mathbb {T}}\varphi (x)dx=0\), and we let \(v(x)=\exp (\varphi (T(x)))+\exp (-\varphi (x))\) and take \(\lambda =1\), then \(L(0)=0\) (see [2]). Thus there are obstacles for obtaining uniform (in E) lower bounds on L(E).

The aim of this paper is to extend the above results to new situations. We first define the collection of potential functions v with which we shall work.

Definition 1.1

Let \(\mathcal {V}^1(\mathbb {T},\mathbb {R})\) denote the class of \(C^1\)-functions \(v:\mathbb {T}\rightarrow \mathbb {R}\) which satisfy the following condition: there exist \(\varepsilon _0>0, \beta >0\) and an integer \(s\ge 1\) such that for all \(0<\varepsilon \le \varepsilon _0\) and all \(a\in \mathbb {R}\) the set \(\{x\in \mathbb {T}: |v(x)-a|<\varepsilon \}\) consists of at most s intervals, each of length at most \(\varepsilon ^{\beta }\).

Remark 1

It is easy to check that all non-constant real-analytic functions \(v:\mathbb {T}\rightarrow \mathbb {R}\) belongs to \(\mathcal {V}^1(\mathbb {T},\mathbb {R})\). Note also that the assumption on v is very similar to [8, Definition 2.2].

Our main result is the following:

Theorem 1

Assume that \(v\in \mathcal {V}^1(\mathbb {T},\mathbb {R})\). Then there is a \(\lambda _0=\lambda _0(v)>0\) such that for all \(\lambda \ge \lambda _0\) we have \(L(E)>(\log \lambda )/4\) for all \(E\in \mathbb {R}\) provided that \(b\ge \lambda ^3\).

Remark 2

  1. (a)

    We do not aim for optimal conditions on any of the constants or required size of b (as function of \(\lambda \)).

  2. (b)

    It would be very interesting to know if the statement of Theorem  1 holds true for a fixed (large) b independent of \(\lambda \). Unfortunately the method we use in the proof requires b to be (much) larger than \(\lambda \). (Heuristically it should be more likely to have \(L(E)>0\) the bigger is \(\lambda \).)

  3. (c)

    We would like to stress that the proof of Theorem 1 does not use the fact that we have a linear system. In fact one can extend it to non-linear systems (as, e.g., we did in [1]) since what we analyze is the dynamics of forced circle diffeomorphisms (see the map (3.1)). However, since there is an interest in the Schrödinger cocycle, and for (hopefully!) transparency, we perform our analysis for this explicit system.

  4. (d)

    The proof is based on ideas developed in [13]. Related problems (for systems which are not homotopic to the identity) are investigated in [1, 14].

2 Preliminaries

We adopt the following convention on the coupling constant \(\lambda \): In the statements of the lemmas below (where applicable) we always assume (without explicitly stating so) that \(\lambda >0\) is sufficiently large. There are only finitely many largeness conditions on \(\lambda \), and they only depend on v. This will yield the constant \(\lambda _0\) in the statement of Theorem 1.

2.1 Assumption on v

We assume from now on that \(v\in \mathcal {V}(\mathbb {T},\mathbb {R})\) is fixed, and that \(\varepsilon _0,\beta \) and s are as in Definition 1.1. Without loss of generality we assume, for simplicity, that \(|v(x)|\le 1/3\) for all \(x\in \mathbb {T}\) (this only scales \(\lambda \)).

2.2 Projective action

Since

$$\begin{aligned} A_E(x)\left( {\begin{array}{c}1\\ r\end{array}}\right) =r\left( {\begin{array}{c}1\\ \lambda v(x)-E-1/r\end{array}}\right) \end{aligned}$$

we see that the cocycle \(F_E\) induces an action on the projective space \(\mathbb {P}^1(\mathbb {R}^2)\) (with coordinates \(\left( {\begin{array}{c}1\\ r\end{array}}\right) \)) given by

$$\begin{aligned} G_E(x,r)=(T(x),\lambda v(x)-E-1/r). \end{aligned}$$

If \(r_n=\pi _2(G_E^n(x,r)), n\ge 0,\) it is easy to verify thatFootnote 2

$$\begin{aligned} A^{n+1}_E(x)\left( {\begin{array}{c}1\\ r\end{array}}\right) =\left( {\begin{array}{c}r_{n}\cdots r\\ r_{n+1}\cdots r\end{array}}\right) . \end{aligned}$$
(2.1)

Thus, if there for some parameter \(E\in \mathbb {R}\) exists a set \(X\subset \mathbb {T}\) of positive measure such that for each \(x\in X\) there is an \(r\in \mathbb {R}\) such that

$$\begin{aligned} \limsup _{n\rightarrow \infty }\frac{1}{n}\log \Vert r_n\cdots r\Vert \ge C, \end{aligned}$$

then it follows from (1.1) and (2.1) that \(L(E)\ge C\).

2.3 Bounds

Lemma 2.1

If \(|E|\ge \lambda /3+2\sqrt{\lambda }\), then \(L(E)\ge (\log \lambda )/2\).

Proof

Take any \(x\in \mathbb {T}\) and \(|r|>\sqrt{\lambda }\), and define \(r_n=\pi _2(G_E^n(x,r)), n\ge 1\). We note that \(|r_1|=|\lambda v(x)-E-1/r|\ge |E|-|\lambda v(x)|-|1/r|>\sqrt{\lambda }\). By induction we get \(|r\cdot r_1\cdots r_n|>\sqrt{\lambda }^{n+1}\) for all \(n\ge 1\). \(\quad \square \)

Thus, in order to prove Theorem 1, we only need to consider \(|E|<\lambda /3+2\sqrt{\lambda }\) (which of course is the cumbersome region).

The next lemma provides lower bounds on products \(|r_1\cdots r_N|\) under the assumption that we know how many of the iterates \(r_j\) that are in the interval \((-\sqrt{\lambda },\sqrt{\lambda })\).

Lemma 2.2

Assume that \(|E|\le \lambda /3+2\sqrt{\lambda }\). For any \(N\ge 1\) the following holds for all \((x,r)\in \mathbb {T}\times (\mathbb {R}\cup \{\infty \})\): Let \(r_j=\pi _2(G_E^j(x,r))\) and let k (\(0\le k\le N\)) be the number of indices j in [1, N] for which \(|r_j|<\sqrt{\lambda }\). Then

$$\begin{aligned} |r_1\cdots r_N|\ge \sqrt{\lambda }^{N-3k} \text { if } |r_N|\ge 1/\lambda , \text { and } |r_1\cdots r_{N+1}|\ge \sqrt{\lambda }^{N+1-3k} \text { if } |r_N|< 1/\lambda . \end{aligned}$$

Proof

We first note that if \(|r_j|<1/\lambda \), then \(|r_jr_{j+1}|=|r_j(\lambda v(x_j)-E)-1|>1/4\); thus we also have \(|r_{j+1}|>\sqrt{\lambda }\). From these two facts the statement of the lemma follows easily by induction over N. \(\quad \square \)

2.4 Elementary probability

We begin by defining some natural partitions of \(\mathbb {T}\) (relative the transformation T). First, let

$$\begin{aligned} I_j=\left[ \frac{j-1}{b},\frac{j}{b}\right) , \quad j=1,2,\ldots , b. \end{aligned}$$

Note that \(I_1\cup I_2\cup \cdots \cup I_b=\mathbb {T}\) and \(T(I_j)=\mathbb {T}\) for all j.

Next we define the intervalsFootnote 3

$$\begin{aligned} I_{j_1j_2\cdots j_n}=I_{j_1}\cap T^{-1}(I_{j_2})\cap \ldots \cap T^{-n+1}(I_{j_n}), \quad n\ge 2. \end{aligned}$$

Thus, by definition, if \(x\in I_{j_1j_2\cdots j_n}\), then \(x\in I_{j_1}\), \(T(x)\in I_{j_2}, \ldots , T^{n-1}(x)\in I_{j_n}\); and clearly \(I_{j_1\ldots j_n}\subset I_{j_1\ldots j_{n-1}}\subset \ldots \subset I_{j_1}\). Note that the length of each interval \(I_{j_1j_2\cdots j_n}\) is \(b^{-n}\). Note also that for any fixed interval \(I_{j_1j_2\cdots j_n}\) the intervals \(I_{j_1j_2\cdots j_nj}\), \(1\le j\le b\), form a partition of \(I_{j_1j_2\cdots j_n}\) into b pieces of equal length \(b^{-{(n+1)}}\).

In the following lemmaFootnote 4 we use the word “bad” to indicate that an interval does not have a certain property:

Lemma 2.3

Let \(1\le q\le b\) be an integer. Assume that q of the intervals \(I_j\) (\(j=1,\ldots , b\)) are bad. Furthermore, for each interval \(I_{j_1\ldots j_n}\) assume that q of the intervals \(I_{j_1\ldots j_n j}\) (\(j=1,2,\ldots , b\)) are bad. Then for each \(n\ge 1\) and \(0\le m\le n\) the set

$$\begin{aligned} \{x\in \mathbb {T}: x\in I_{j_1\ldots j_n} \text { and exactly } m \text { of the intervals } I_{j_1}, I_{j_1j_2}, \ldots , I_{j_1\ldots j_n} \text { are bad}\} \end{aligned}$$

has measure \(\left( {\begin{array}{c}n\\ m\end{array}}\right) \frac{q^m(b-q)^{n-m}}{b^n}\).

Proof

Easy combinatorics. From the assumption it follows that there are \(\left( {\begin{array}{c}n\\ m\end{array}}\right) q^m(b-q)^{n-m}\)n-tuples \((j_1,\ldots ,j_n)\in \{1,\ldots ,b\}^n\) such that exactly m of the intervals \(I_{j_1}, I_{j_1j_2}, \ldots , I_{j_1\ldots j_n}\) are bad. Using the fact that \(|I_{j_1\ldots j_n}|=b^{-n}\) yields the result. \(\quad \square \)

Moreover,

Lemma 2.4

With the same assumptions as in the previous lemma, the measure of the set

$$\begin{aligned}&M_n=\{x\in \mathbb {T}: x\in I_{j_1\ldots j_n} \text { and at most } [2(q/b)n]\\&\qquad \text { of the intervals } I_{j_1}, I_{j_1j_2}, \ldots , I_{j_1\ldots j_n} \text { are bad}\} \end{aligned}$$

goes to 1 as \(n\rightarrow \infty \).

Proof

By the previous lemma we have (provided that \([2(q/b)n]\le n\))

$$\begin{aligned} |M_n|=\sum _{m=0}^{[2(q/b)n]}\left( {\begin{array}{c}n\\ m\end{array}}\right) \frac{q^m(b-q)^{n-m}}{b^n}=\sum _{m=0}^{[2(q/b)n]}\left( {\begin{array}{c}n\\ m\end{array}}\right) (q/b)^m(1-q/b)^{n-m}. \end{aligned}$$

Applying the de Moivre-Laplace theorem yields the result. \(\quad \square \)

3 Geometry

We write the circle \(S^1=[-\pi /2,\pi /2]/\sim \). A point \(\left( {\begin{array}{c}1\\ r\end{array}}\right) \) in projective coordinates corresponds to \(\arctan (r)\in S^1\). Thus, in these coordinates the map \(G_E\) becomes \(H_E:\mathbb {T}\times S^1\rightarrow \mathbb {T}\times S^1\) given by

$$\begin{aligned} H_E(x,y)=(T(x),\arctan (\lambda v(x)-E-1/\tan (y))). \end{aligned}$$
(3.1)

The following lemma is crucial for the proof of Theorem 1 (compare with “admissible curves” in [13]; and also the idea in [13] that the image of an admissible curve “spreads out” in the y-direction). Recall that the constants \(\beta \) and s come from the assumption on v.

Lemma 3.1

Assume that \(|E|<\lambda /2\) and \(b\ge \lambda ^3\). Assume further that the function \(\varphi :[0,1]\rightarrow S^1\) is \(C^1\) and satisfies

$$\begin{aligned} |\varphi '(x)|<\frac{\lambda \Vert v'\Vert }{b}(1+(2\lambda ^2)/b+\cdots +(2\lambda ^2)^m/b^m) \end{aligned}$$

for all \(x\in [0,1]\) and some integer \(m\ge 0\). For \(j=1,2,\ldots ,b\), let \(\varphi ^j:[0,1]\rightarrow S^1\) be defined by

$$\begin{aligned} \varphi ^j(x)=\arctan \left( \lambda v\left( \frac{x+j-1}{b}\right) -E-\frac{1}{\tan \left( \varphi \left( \frac{x+j-1}{b}\right) \right) }\right) . \end{aligned}$$

Then the following hold:

  1. (1)

    There are at most \((s+1)(2+[2^\beta \lambda ^{-\beta /2}b])\) indices \(j\in \{1,\ldots ,b\}\) for which

    $$\begin{aligned} \min _{x\in [0,1]}|\tan \varphi ^j(x)|<\sqrt{\lambda }. \end{aligned}$$
  2. (2)

    The estimate \(|(\varphi ^j)'(x)|<\frac{\lambda \Vert v'\Vert }{b}(1+(2\lambda ^2)/b+\cdots +(2\lambda ^2)^{m+1}/b^{m+1})\) holds for all \(x\in [0,1]\) and all j.

Remark 3

That the functions \(\varphi ^j\) are defined as above implies that if \(\Gamma _j=\{(x,\varphi (x)): x\in I_j\}\), then \(H_E(\Gamma _j)=\{(x,\varphi ^j(x)):x\in [0,1)\}\). Thus, if \(\Gamma =\{(x,\varphi (x)): x\in [0,1)\}\) is the graph of \(\varphi \), we have \(H_E(\Gamma )=\bigcup _{j=1}^b \{(x,\varphi ^j(x)):x\in [0,1)\}\), i.e., the union of the graphs of the \(\varphi ^j\).

The statement thus says that we have a bound of the number of indices j for which the graph \(\{(x,\varphi ^j(x)):x\in [0,1)\}\) intersects the “bad region” \([0,1]\times (-\arctan \sqrt{\lambda },\arctan \sqrt{\lambda })\); and the derivative estimate shows that we can iterate this process (iterate each of the j graphs) and still have a good control on the derivative (provided that b is large enough, independently of m).

Proof

By the assumptions on b and \(\varphi '\) we have \(|\varphi '|<(\lambda \Vert v'\Vert /b)\sum _{i=0}^\infty (2\lambda ^2/b)^i<2\Vert v'\Vert /\lambda ^2\). We let \(g(x)=-1/\tan (\varphi (x))\).

We first prove that the set \(B:=\{x\in [0,1]:|\lambda v(x)-E+g(x)|<\sqrt{\lambda }\}\) can intersect at most \((s+1)(2+[2^\beta \lambda ^{-\beta /2}b])\) of the intervals \(I_j\) (\(j=1,\ldots , b\)). This clearly gives the first statement of the lemma.

If \(|\varphi (x_0)|<1/\lambda \) for some \(x_0\), then the estimate on \(|\varphi '|\) implies that \(|\varphi (x)|<1/\lambda + 2\Vert v'\Vert /\lambda ^2\) for all x; thus, since \(|\lambda v(x)-E|\le \lambda /3+\lambda /2\), we get \(|\lambda v(x)-E+g(x)|>\sqrt{\lambda }\) for all \(x\in [0,1]\). We conclude that \(B=\emptyset \) in this case.

Assume now that \(|\varphi (x)|\ge 1/\lambda \) for all x. Then we have \(|g'(x)|=|\varphi '(x)|/|\sin ^2(\varphi (x))|<(2\Vert v'\Vert /\lambda ^2)(2\lambda ^2)=4\Vert v'\Vert \), and therefore \(|g(x_1)-g(x_0)|<4\Vert v'\Vert \) for all \(x_0,x_1\in [0,1]\) Hence

$$\begin{aligned} B\subset \{x\in [0,1]:|\lambda v(x)-E+g(0)|<2\sqrt{\lambda }\}. \end{aligned}$$

By the assumption on v the latter set consists of at most \(s+1\) intervals (s intervals on \(\mathbb {T}\) can be at most \(s+1\) intervals in [0, 1]), each of a length \(<(2/\sqrt{\lambda })^\beta \). Since each interval \(I_j\) (\(j=1,\ldots , b\)) has length 1/b, it follows that the set B can intersect at most \((s+1)(2+[2^\beta \lambda ^{-\beta /2}b])\) of them.

We turn to the derivative estimate in (2). From the assumptions on v and E we have \(|\lambda v(t)-E|<\lambda \) for all t. An easy computation shows that \(|(\varphi ^j)'(x)|\le (\lambda \Vert v'\Vert +\Vert \varphi '\Vert 2\lambda ^2)/b\), from which the desired bound follows. To obtain the estimate in the second term we have used the fact that if \(|a|\le \lambda \), and \(\lambda \) is sufficiently large (larger than a numerical constant), then \(\sin ^2t+(a\sin t-\cos t)^2>1/(2\lambda ^2)\) for all t. \(\quad \square \)

4 Proof of Theorem 1

By Lemma 2.1 we only need to consider \(|E|<\lambda /2\). We therefore assume that \(|E|<\lambda /2\) is fixed. We also fix \(b\ge \lambda ^3\) (so that we can apply Lemma 3.1). Given a point (xr) we denote by \(r_j\) the iterate \(r_j=\pi _2(G_E^j(x,r))\).

Let the intervals \(I_{j_1\ldots j_n}\) be defined as in Sect. 2.4. We say that the interval \(I_{j_1\ldots j_n}\) (\(n\ge 1\)) is “good” if for each \(x\in I_{j_1\ldots j_n}\) and \(r=\lambda \) we have \(|r_n|\ge \sqrt{\lambda }\); otherwise the interval is “bad”.

Lemma 4.1

Assume that at most \(q=[b/12]\) of the intervals \(I_j\)\((j=1,\ldots , b)\) are bad, and that for each interval \(I_{j_1\ldots j_n}\) at most q of the intervals \(I_{j_1\ldots j_nj}\)\((j=1,\ldots , b)\) are bad. Then \(L(E)\ge (\log \lambda )/4\).

Proof

Let the sets \(M_n\) be defined as in Lemma 2.4, and let \(\displaystyle M=\limsup _{n\rightarrow \infty } M_n\). Since \(|M_n|\rightarrow 1\) as \(n\rightarrow \infty \) we have that \(|M|=1\), i.e., the set M has full measure.

Take \(x\in M_n\) (\(n\ge 1\)) and \(r=\lambda \). Then \(x\in I_{j_1\ldots j_n}\subset I_{j_1\ldots j_{n-1}}\subset \ldots \subset I_{j_1}\) and at most [2(q/b)n] of the intervals \(I_{j_1},\dots , I_{j_1\ldots j_n}\) are bad. Thus, by definition we get that \(|r_k|<\sqrt{\lambda }\) for at most \([2(q/b)n]\le n/6\) indices \(k\in [1,n]\). It hence follows from Lemma 2.2 that \(|r_1\cdots r_n|\ge \sqrt{\lambda }^{n-3n/6}\) or \(|r_1\cdots r_{n+1}|\ge \sqrt{\lambda }^{n+1-3n/6}\).

Consequently, if \(x\in M\) (and thus \(x\in M_n\) for infinitely many n) and \(r=\lambda \) we have

$$\begin{aligned} \limsup _{n\rightarrow \infty }\frac{1}{n}\log |r_1\cdots r_n|\ge (1/4)\log \lambda . \end{aligned}$$

Recalling (1.1) and (2.1) finishes the proof. \(\quad \square \)

Combining the previous lemma with the next one finishes the proof of Theorem 1.

Lemma 4.2

At most \(q=[b/12]\) of the intervals \(I_j\) (\(j=1,\ldots , b\)) are bad; and for each interval \(I_{j_1\ldots j_n}\) at most q of the intervals \(I_{j_1\ldots j_nj}\) (\(j=1,\ldots , b\)) are bad.

Proof

The strategy is to apply Lemma 3.1 inductively, and we shall begin by iterating the constant graph \(\{(x,\arctan \lambda ):x\in [0,1]\}\).

Note that \((s+1)(2+[2^\beta \lambda ^{-\beta /2}b])<[b/12]\) if \(\lambda \) is sufficiently large (recall that \(b\ge \lambda ^3\)), where the left hand side is the quantity in Lemma 3.1(1). Since E is fixed we write \(H=H_E\) and \(G=G_E\).

For \(j=1,2,\ldots , b\), let \(\varphi _j:[0,1]\rightarrow S^1\) be defined by

$$\begin{aligned} \varphi _j(x)=\pi _2\left( H((x+j-1)/b,\arctan {\lambda })\right) . \end{aligned}$$

Applying Lemma 3.1 with \(\varphi (x)=\arctan (\lambda )\) and \(m=0\) shows that we for each j have \(|\varphi _j'(x)|<\frac{\lambda \Vert v'\Vert }{b}(1+(2\lambda ^2)/b)\) and that there are at most [b/12] indices \(j\in \{1,\ldots , b\}\) for which \(\min _{x\in [0,1]}|\tan \varphi _j(x)|<\sqrt{\lambda }\). We note that \( H(I_{j}\times \{\arctan \lambda \})=\{(x,\varphi _{j}(x)):x\in [0,1)\}, \) and thus

$$\begin{aligned} G(I_{j}\times \{\lambda \})=\{(x,\tan (\varphi _{j}(x))):x\in [0,1)\}. \end{aligned}$$

Consequently, at most [b/12] of the intervals \(I_j\) are bad. This proves the first statement of the lemma.

Inductively (over \(n\ge 1\)) we define, for fixed \((j_1,\ldots ,j_n)\in \{1,\ldots , b\}^n\), the functions \(\varphi _{j_1\ldots j_n j}:[0,1]\rightarrow S^1\) by

$$\begin{aligned} \varphi _{j_1\ldots j_nj}(x)=\pi _2\left( H((x+j-1)/b,\varphi _{j_1\ldots j_n}((x+j-1)/b))\right) , \quad j=1,2,\ldots , b. \end{aligned}$$

Lemma 3.1 gives \(|\varphi _{j_1\ldots j_nj}'(x)|<\frac{\lambda \Vert v'\Vert }{b}(1+(2\lambda ^2)/b+\cdots + (2\lambda ^2)^{n+1}/b^{n+1})\) and that there are at most [b/12] indices \(j\in \{1,\ldots , b\}\) for which \(\min _{x\in [0,1]}|\tan \varphi _{j_1\ldots j_nj}(x)|<\sqrt{\lambda }\).

From the above definition it is easy to verify that for each \(n\ge 1\) and each fixed \((j_1,\ldots ,j_n)\in \{1,\ldots , b\}^n\) we have

$$\begin{aligned} \begin{aligned} H^{n+1}(I_{j_1\ldots j_nj}\times \{\arctan \lambda \})&=\{(x,\varphi _{j_1\ldots j_nj}(x)):x\in [0,1)\}; \text { and } \\ G^{n+1}(I_{j_1\ldots j_nj}\times \{\lambda \})&=\{(x,\tan \varphi _{j_1\ldots j_nj}(x)):x\in [0,1)\}. \end{aligned} \end{aligned}$$

Thus, at most [b/12] of the intervals \(I_{j_1\ldots j_nj}\) (\(1\le j\le b\)) are bad. This finishes the proof of the lemma. \(\quad \square \)