Inexact rational Krylov method for evolution equations

Hashimoto, Yuka; Nodera, Takashi

doi:10.1007/s10543-020-00829-w

Inexact rational Krylov method for evolution equations

Published: 19 January 2021

Volume 61, pages 473–502, (2021)
Cite this article

BIT Numerical Mathematics Aims and scope Submit manuscript

341 Accesses
Explore all metrics

A Correction to this article was published on 06 September 2021

This article has been updated

Abstract

Linear and nonlinear evolution equations have been formulated to address problems in various fields of science and technology. Recently, methods using an exponential integrator for solving evolution equations, where matrix functions must be computed repeatedly, have been investigated and refined. In this paper, we propose a new method for computing these matrix functions which is called an inexact rational Krylov method. This is a more efficient version of the rational Krylov method with appropriate shifts, which was proposed by Hashimoto and Nodera (ANZIAM J 58:C149–C161, 2016). The advantage of the inexact rational Krylov method is that it computes linear equations that appear in the rational Krylov method efficiently while guaranteeing the accuracy of the final solution.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Shift-invert Rational Krylov method for an operator $$\phi $$ -function of an unbounded linear operator

Article 25 February 2019

A Lanczos-like method for non-autonomous linear ordinary differential equations

Article 25 June 2022

A computational procedure for exact solutions of Burgers’ hierarchy of nonlinear partial differential equations

Article 15 July 2020

Change history

06 September 2021
A Correction to this paper has been published: https://doi.org/10.1007/s10543-021-00894-9

References

Beckermann, B., Reichel, L.: Error estimates and evaluation of matrix functions via the Faber transform. SIAM J. Numer. Anal. 47(5), 3849–3883 (2009)
Article MathSciNet MATH Google Scholar
Benzi, M., Boito, P., Razouk, N.: Decay properties of spectral projectors with applications to electronic structure. SIAM Rev. 55(1), 3–64 (2013)
Article MathSciNet MATH Google Scholar
Berljafa, M., Güttel, S.: Parallelization of the rational Arnoldi algorithm. SIAM J. Sci. Comput. 39(5), S197–S221 (2017)
Article MathSciNet MATH Google Scholar
Börner, R., Ernst, O.G., Güttel, S.: Three-dimensional transient electromagnetic modelling using rational Krylov methods. Geophys. J. Int. 202(3), 2025–2043 (2015)
Article Google Scholar
Botchev, M.A.: Krylov subspace exponential time domain solution of Maxwell’s equations in photonic crystal modeling. J. Comput. Appl. Math. 293, 20–34 (2016)
Article MathSciNet MATH Google Scholar
Bouras, A., Frayssé, V., Giraud, L.: A relaxation strategy for inner-outer linear solvers in domain decomposition methods. Technical report TR/PA/00/17 (2000)
Crouzeix, M., Palencia, C.: The numerical range is a $(1+\sqrt{2})$-spectral set. SIAM J. Matrix Anal. Appl. 38(2), 649–655 (2017)
Article MathSciNet MATH Google Scholar
Dinh, K., Sidje, R.: An application of the Krylov-FSP-SSA method to parameter fitting with maximum likelihood. Phys. Biol. 14(6), 065001 (2017)
Article Google Scholar
Druskin, V., Lieberman, C., Zaslavsky, M.: On adaptive choice of shifts in rational Krylov subspace reduction of evolutionary problems. SIAM J. Sci. Comput. 32(5), 2485–2496 (2010)
Article MathSciNet MATH Google Scholar
Elsworth, S., Güttel, S.: The block rational Arnoldi method. SIAM J. Matrix Anal. Appl. 41(2), 365–388 (2020)
Article MathSciNet MATH Google Scholar
Gallopoulos, E., Saad, Y.: Efficient solution of parabolic equations by Krylov approximation methods. SIAM J. Sci. Stat. Comput. 13(5), 1236–1264 (1992)
Article MathSciNet MATH Google Scholar
Göckler, T.: Rational Krylov Subspace Methods for ’$\phi $’-functions in Exponential Integrators. Karlsruher Instituts für Technologie, Karlsruhe (2014)
Google Scholar
Giraud, L., Langou, J., Rozložník, M., van den Eshof, J.: Rounding error analysis of the classical Gram-Schmidt orthogonalization process. Numer. Math. 101(1), 87–100 (2005)
Article MathSciNet MATH Google Scholar
Golub, G.H., Zhang, Z., Zha, H.: Large sparse symmetric eigenvalue problems with homogeneous linear constraints: the Lanczos process with inner-outer iterations. Linear Algebra Its Appl. 309(1), 289–306 (2000)
Article MathSciNet MATH Google Scholar
Grimm, V.: Resolvent Krylov subspace approximation to operator functions. BIT Numer. Math. 52, 639–659 (2012)
Article MathSciNet MATH Google Scholar
Güttel, S.: Rational Krylov Methods for Operator Functions. Technischen Universität Bergakademie Freiberg, Freiberg (2010)
Google Scholar
Güttel, S.: Rational Krylov approximation of matrix functions: numerical methods and optimal pole selection. GAMM-Mitteilungen 36(1), 8–31 (2013)
Article MathSciNet MATH Google Scholar
Hashimoto, Y., Nodera, T.: Inexact shift-invert Arnoldi method for evolution equations. ANZIAM J. 58, E1–E27 (2016)
Article MathSciNet Google Scholar
Hashimoto, Y., Nodera, T.: Shift-invert rational Krylov method for evolution equations. ANZIAM J. 58, C149–C161 (2016)
Article Google Scholar
Higham, N.J.: The scaling and squaring method for the matrix exponential revisited. SIAM J. Matrix Anal. Appl. 26(4), 1179–1193 (2005)
Article MathSciNet MATH Google Scholar
Hochbruck, M., Lubich, C.: On Krylov subspace approximations to the matrix exponential operator. SIAM J. Numer. Anal. 34(5), 1911–1925 (1998)
Article MathSciNet MATH Google Scholar
Hochbruck, M., Lubich, C., Selhofer, H.: Exponential integrators for large systems of differential equations. SIAM J. Sci. Comput. 19(5), 1552–1574 (1997)
Article MathSciNet MATH Google Scholar
Hochbruck, M., Ostermann, A.: Exponential Runge–Kutta methods for parabolic problems. Appl. Numer. Math. 53(2–4), 323–339 (2005)
Article MathSciNet MATH Google Scholar
Hochbruck, M., Ostermann, A.: Exponential integrators. Acta Numer. 19, 209–286 (2010)
Article MathSciNet MATH Google Scholar
Horn, R.A., Johnson, C.R.: Topics in Matrix Analysis. Cambridge University Press, New York (1991)
Book MATH Google Scholar
Lopez, L., Simoncini, V.: Analysis of projection methods for rational function approximation to the matrix exponential. SIAM J. Numer. Anal. 44(2), 613–635 (2006)
Article MathSciNet MATH Google Scholar
Moler, C., Van Loan, C.: Nineteen dubious ways to compute the exponential of a matrix, twenty-five years later. SIAM Rev. 45(1), 3–49 (2003)
Article MathSciNet MATH Google Scholar
Moret, I.: On RD-rational Krylov approximations to the core-functions of exponential integrators. Numer. Linear Algebra Appl. 14(5), 445–457 (2007)
Article MathSciNet MATH Google Scholar
Moret, I., Novati, P.: RD-rational approximations of the matrix exponential. BIT Numer. Math. 44(3), 595–615 (2004)
Article MathSciNet MATH Google Scholar
Moret, I., Novati, P.: On the convergence of Krylov subspace methods for matrix Mittag-Leffler functions. SIAM J. Numer. Anal. 49(5), 2144–2164 (2011)
Article MathSciNet MATH Google Scholar
Moret, I., Popolizio, M.: The restarted shift-and-invert Krylov method for matrix functions. Numer. Linear Algebra Appl. 21(1), 68–80 (2014)
Article MathSciNet MATH Google Scholar
Novati, P.: Using the restricted-denominator rational Arnoldi method for exponential integrators. SIAM J. Matrix Anal. Appl. 32(4), 1537–1558 (2011)
Article MathSciNet MATH Google Scholar
Popolizio, M., Simoncini, V.: Acceleration techniques for approximating the matrix exponential operator. SIAM J. Matrix Anal. Appl. 30(2), 657–683 (2008)
Article MathSciNet MATH Google Scholar
Qiu, C., Güttel, S., Ren, X., Yin, C., Liu, Y., Zhang, B., Egbert, G.: A block rational Krylov method for 3-D time-domain marine controlled-source electromagnetic modelling. Geophys. J. Int. 218(1), 100–113 (2019)
Article Google Scholar
Ruhe, A.: Rational Krylov: a practical algorithm for large sparse nonsymmetric matrix pencils. SIAM J. Sci. Comput. 19(5), 1535–1551 (1998)
Article MathSciNet MATH Google Scholar
Saad, Y., Schultz, M.H.: GMRES: a generalized minimal residual algorithm for solving nonsymmetric linear systems. SIAM J. Sci. Stat. Comput. 7(3), 856–869 (1986)
Article MathSciNet MATH Google Scholar
Saff, E.B., Schönhage, A., Varga, R.S.: Geometric convergence to $e^{-z}$ by rational functions with real poles. Numer. Math. 25(3), 307–322 (1975)
Article MathSciNet MATH Google Scholar
Sidje, R.B., Winkles, N.: Evaluation of the performance of inexact GMRES. J. Comput. Appl. Math. 235(8), 1956–1975 (2011)
Article MathSciNet MATH Google Scholar
Simoncini, V., Szyld, D.B.: Theory of inexact Krylov subspace methods and applications to scientific computing. SIAM J. Sci. Comput. 25(2), 454–477 (2003)
Article MathSciNet MATH Google Scholar
Skoogh, D.: A parallel rational Krylov algorithm for eigenvalue computations. In: Applied Parallel Computing Large Scale Scientific and Industrial Problems, pp. 521–526 (1998)
Svoboda, Z.: The convective-diffusion equation and its use in building physics. Int. J. Archit. Sci. 1(2), 68–79 (2000)
Google Scholar
Van den Eshof, J., Sleijpen, G.L.G.: Inexact Krylov subspace methods for linear systems. SIAM J. Matrix Anal. Appl. 26(1), 125–153 (2004)
Article MathSciNet MATH Google Scholar
Van den Eshof, J., Hochbruck, M.: Preconditioning Lanczos approximations to the matrix exponential. SIAM J. Sci. Comput. 27(4), 1438–1457 (2006)
Article MathSciNet MATH Google Scholar
Van der Vorst, H.A.: Bi-CGSTAB: a fast and smoothly converging variant of Bi-CG for the solution of nonsymmetric linear systems. SIAM J. Sci. Stat. Comput. 13(2), 631–644 (1992)
Article MathSciNet MATH Google Scholar
Wu, G., Feng, T., Wei, Y.: An inexact shift-and-invert Arnoldi algorithm for Toeplitz matrix exponential. Numer. Linear Algebra Appl. 22(4), 777–792 (2015)
Article MathSciNet MATH Google Scholar
Wu, G., Feng, T., Zhang, L., Yang, M.: Inexact implementation using Krylov subspace methods for large scale exponential discriminant analysis with applications to high dimensionality reduction problems. Pattern Recognit. 66, 328–341 (2017)
Article Google Scholar
Zhang, D.S., Wei, G.W., Kouri, D.J., Hoffman, D.K.: Burgers’ equation with high Reynolds number. Phys. Fluids 9(6), 1853–1855 (1997)
Article MathSciNet MATH Google Scholar
Zhu, H., Shu, H., Ding, M.: Numerical solutions of two-dimensional Burgers’ equations by discrete Adomian decomposition method. Comput. Math. Appl. 60(3), 840–848 (2010)
Article MathSciNet MATH Google Scholar

Download references

Acknowledgements

We would like to thank the anonymous referee and associate editor Michiel Hochstenbach, whose suggestions greatly improved the paper.

Author information

Authors and Affiliations

Graduate School of Science and Technology, Keio University, 3-14-1 Hiyoshi, Kohoku, Yokohama, Kanagawa, 223-8522, Japan
Yuka Hashimoto
Faculty of Science and Technology, Keio University, 3-14-1 Hiyoshi, Kohoku, Yokohama, Kanagawa, 223-8522, Japan
Takashi Nodera

Authors

Yuka Hashimoto
View author publications
You can also search for this author in PubMed Google Scholar
Takashi Nodera
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yuka Hashimoto.

Additional information

Communicated by Michiel E. Hochstenbach.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

A Proof of Proposition 3.1

We use the Householder reflectors for transforming the matrix $\tilde{K}_m$ into an upper Hessenberg matrix. Let $u_j:=(\tilde{k}_{j+1:m,j}-\eta _j e_1)/\Vert \tilde{k}_{j+1:m,j}-\eta _j e_1\Vert $ and let $\tilde{Q}_{j+1}:=-2u_ju_j^*$. Then, $I_{m-j}+\tilde{Q}_{j+1}$ is a unitary matrix and satisfies $(I_{m-j}+\tilde{Q}_{j+1})k_{j+1:m,j}=\eta _je_1$. Thus, the matrix $Q_m$ which is defined as $Q_m:=(I_m+\hat{Q}_{m-1})\cdots (I_m+\hat{Q}_2)$ is a unitary matrix, and there exists an upper Hessenberg matrix $H_m$ such that $Q_m\tilde{K}_mQ_m^*=H_m$. Here, $\hat{Q}_{j+1}:={\text {diag}}\{O_j,\ \tilde{Q}_{j+1}\}$ and $O_j$ is the $j\times j$ zero matrix.

Our first goal is to evaluate the magnitude of each element of the matrix $\tilde{Q}_{j+1}$. By the assumption (3.11), the inequality $\Vert \tilde{k}_{j+1:m,j}-\eta _j e_1\Vert \ge \eta $ holds. In addition, since the vector $\tilde{k}_{j+1:m,j}$ satisfies the condition (3.10), we have

$$\begin{aligned} \vert \eta _j\vert =\Vert \tilde{k}_{j+1:m,j}\Vert \le \sqrt{\sum _{k=1}^{\infty }(\hat{\alpha }\hat{\lambda }^k)^2}= \hat{\alpha }\sqrt{1/(1-\hat{\lambda }^2)}\hat{\lambda }. \end{aligned}$$

The first element of the vector $\tilde{k}_{j+1:m,j}-\eta _je_1$ is equal to $\tilde{k}_{j+1,j}-\eta _j$, and since the identity $\vert \eta _j\vert =\Vert \tilde{k}_{j+1:m,j}\Vert $ holds, we obtain $\vert \tilde{k}_{j+1,j}-\eta _j\vert \le 2\vert \eta _j\vert $. All the elements except for the first element of the vector $\tilde{k}_{j+1:m,j}-\eta _je_1$ are the same as the element $\tilde{k}_{j+1:m,j}$. For these reasons, on the basis of the assumption (3.10), $|u_{i,j}|<\tilde{\alpha }\hat{\lambda }^i$ is satisfied for

$$\begin{aligned} \tilde{\alpha }:=\max (\hat{\alpha }/\eta ,2\hat{\alpha } \sqrt{1/(1-\hat{\lambda }^2)}\hat{\lambda }/\eta ), \end{aligned}$$

where $u_{i,j}$ is the ith element of the vector $u_j\in \mathbb {C}^{m-j}$. Thus, we have

$$\begin{aligned} |(\tilde{Q}_{j+1})_{k,l}|=2|(u_ju_j^*)_{k,l}|=2|u_{k,j}u_{l,j}|\le 2\tilde{\alpha }^2\hat{\lambda }^{k+l}. \end{aligned}$$

Next, we evaluate the magnitude of each element of the matrix $Q_m$. Let $\check{\alpha }:=2\tilde{\alpha }^2$. For $l\ge 2$ and $l<k_1<k_2<\cdots <k_r$, we have

$$\begin{aligned} |(\hat{Q}_{k_r}\cdots \hat{Q}_{k_1}\hat{Q}_l)_{i,j}|&\le \displaystyle \frac{\check{\alpha }^{r+1}\hat{\lambda }^{-2(l-r-1)}}{(1-\hat{\lambda }^2)^r} \hat{\lambda }^{i+j}=\check{\alpha }^{r+1}\alpha ''(l,r)\hat{\lambda }^{i+j}\quad (i,j\le k_r),\nonumber \\ |(\hat{Q}_{k_r}\cdots \hat{Q}_{k_1}\hat{Q}_l)_{i,j}|&=0\quad (i>k_r\ \text {or}\ j>k_r), s \end{aligned}$$

(A.1)

where $\alpha ''(l,r):=\hat{\lambda }^{-2(l-r-1)}/(1-\hat{\lambda }^2)^r$. The inequality (A.1) is proved by the induction on r. For $r=1$, we have

$$\begin{aligned}&|(\hat{Q}_{k_1}\hat{Q}_l)_{i,j}| \le \sum _{a=k_1}^m\check{\alpha }\hat{\lambda }^{i-k_1+1+a-k_1+1} \check{\alpha }\hat{\lambda }^{a-l+1+j-l+1}\\&\qquad =\check{\alpha }^2\hat{\lambda }^{i+j}\hat{\lambda }^{-2(k_1+l-2)} \sum _{a=k_1}^m\hat{\lambda }^{2a}\le \frac{\check{\alpha }^2 \hat{\lambda }^{-2(l-2)}}{1-\hat{\lambda }^2}\hat{\lambda }^{i+j} \quad (i,j\le k_1). \end{aligned}$$

For $r\ge 2$, if the inequality (A.1) is satisfied with $r-1$, then we obtain

$$\begin{aligned}&|(\hat{Q}_{k_r}\cdots \hat{Q}_{k_1}\hat{Q}_l)_{i,j}| \le \sum _{a=k_r}^{m}\check{\alpha }\hat{\lambda }^{i-k_r+1+a-k_r+1} \frac{\check{\alpha }^r\hat{\lambda }^{-2(l-r)}}{(1-\hat{\lambda }^2)^{r-1}} \hat{\lambda }^{a+j}\\&\qquad \le \hat{\lambda }^{i+j}\frac{\check{\alpha }^{r+1} \hat{\lambda }^{-2(k_r+l-r-1)}}{(1-\hat{\lambda }^2)^{r-1}}\sum _{a=k_r}^m \hat{\lambda }^{2a} =\frac{\check{\alpha }^{r+1}\hat{\lambda }^{-2(l-r-1)}}{(1-\hat{\lambda }^2)^r}\hat{\lambda }^{i+j} \quad (i,j\le k_r), \end{aligned}$$

and the inequality (A.1) is also satisfied with r. This is the proof of the inequality (A.1). In the inequality (A.1), if $\hat{\lambda }\le 1/\sqrt{2}$ is satisfied, then the inequality $\alpha ''(l,r+1)\le \alpha ''(l,r)$ holds for any l. This results in $\alpha ''(l,r)\le \alpha ''(l,1)$ for any r and l. The matrix $Q_m$ can be represented as

$$\begin{aligned} Q_m&=(I_m+\hat{Q}_{m-1})\cdots (I_m+\hat{Q}_2)\nonumber \\&=I_m+\sum _{k=3}^{m-1}\sum _{l=2}^{k-1}\sum _{(a_1,a_2,\ldots , a_{k-l-1})\in \{0,1\}^{k-l-1}}\hat{Q}_{k}\hat{Q}_{k-1}^{a_1} \hat{Q}_{k-2}^{a_2}\cdots \hat{Q}_{l+1}^{a_{k-l-1}} \hat{Q}_l+\sum _{k=2}^{m-1}\hat{Q}_k. \end{aligned}$$

(A.2)

As a result, for $2\le \min {\{i,j\}}\le m-1$, the Eq. (A.2) and inequality (A.1) derive

$$\begin{aligned} |(Q_m-I_m)_{i,j}|&\le \check{\alpha }^2\sum _{k=3}^{\min {\{i,j\}}}\sum _{l=2}^{k-1} \left( 1+(k-l-1)\check{\alpha }+\left( \begin{array}{c}k-l-1\\ 2\end{array} \right) \check{\alpha }^2+\cdots +\check{\alpha }^{k-l-1}\right) \\&\qquad \cdot \alpha ''(l,1)\hat{\lambda }^{i+j} +\check{\alpha }^2\sum _{k=2}^{\min {\{i,j\}}}\alpha ''(k,1)\hat{\lambda }^{i+j}\\&\le \check{\alpha }^2\sum _{k=3}^{\min {\{i,j\}}}\sum _{l=2}^{k-1} (1+\check{\alpha })^{k-l-1} \alpha ''(l,1)\hat{\lambda }^{i+j}+\check{\alpha }^2 \sum _{k=2}^{\min {\{i,j\}}}\alpha ''(k,1)\hat{\lambda }^{i+j}. \end{aligned}$$

Therefore, for $2\le \min {\{i,j\}}\le m-1$ and $i\le j$, under the assumptions of (3.11) and (3.12), we have

$$\begin{aligned}&|(Q_m-I_m)_{i,j}|\\&\quad \le \sum _{k=3}^i\frac{(1+\check{\alpha })^{k-1}\check{\alpha }^2 \hat{\lambda }^4}{1-\hat{\lambda }^2} \hat{\lambda }^{i+j}\frac{((1+\check{\alpha })\hat{\lambda }^2)^{-2}}{((1+\check{\alpha })\hat{\lambda }^2)^{-1}-1}\left( ((1+\check{\alpha }) \hat{\lambda }^2)^{-k+2}-1\right) \\&\quad + \frac{\check{\alpha }^2\hat{\lambda }^4}{1-\hat{\lambda }^2}\hat{\lambda }^{i+j} \frac{\hat{\lambda }^{-4}}{\hat{\lambda }^{-2}-1}((\hat{\lambda }^{-2})^{i-1}-1)\\&\quad \le \frac{(1+\check{\alpha })\check{\alpha }^2\hat{\lambda }^4((1 +\check{\alpha })\hat{\lambda }^2)^{-1}}{(1-\hat{\lambda }^2)(1-(1+\check{\alpha })\hat{\lambda }^2)} \hat{\lambda }^{i+j}\sum _{k=3}^i(\hat{\lambda }^{-2})^{k-2} +\frac{\check{\alpha }^2\hat{\lambda }^4}{(1-\hat{\lambda }^2)^2} \hat{\lambda }^{i+j}\hat{\lambda }^{-2i}\\&\quad \le \frac{\check{\alpha }^2\hat{\lambda }^2}{(1-\hat{\lambda }^2)(1-(1+\check{\alpha })\hat{\lambda }^2)} \hat{\lambda }^{i+j}\frac{\hat{\lambda }^{-2}}{\hat{\lambda }^{-2}-1}(\hat{\lambda }^{-2})^{i-2} +\frac{\check{\alpha }^2\hat{\lambda }^4}{(1-\hat{\lambda }^2)^2} \hat{\lambda }^{i+j}\hat{\lambda }^{-2i}\\&\quad =\left( \frac{\check{\alpha }^2\hat{\lambda }^2}{(1-\hat{\lambda }^2) (1-(1+\check{\alpha })\hat{\lambda }^2)}\frac{\hat{\lambda }^4}{1-\hat{\lambda }^2} +\frac{\check{\alpha }^2\hat{\lambda }^4}{(1-\hat{\lambda }^2)^2}\right) \hat{\lambda }^{j-i}\\&\quad =:\alpha '\hat{\lambda }^{j-i}, \end{aligned}$$

where the sum $\sum _{k=3}^i$ becomes 0 for $i=2$, and we use the assumption (3.12) to derive the second inequality. In a similar manner, it can be deduced that $|(Q_m-I_m)_{i,j}|\le \alpha '\hat{\lambda }^{i-j}$ for $i>j$. In the case of $\min {\{i,j\}}=m$, the identity $i=j=m$ holds and we obtain

$$\begin{aligned} |(Q_m-I_m)_{m,m}|\le \check{\alpha }^2\sum _{k=3}^{m-1} \sum _{l=2}^{k-1}(1+\check{\alpha })^{k-l+1}\alpha ''(l,1) \hat{\lambda }^{i+j}+\check{\alpha }^2\sum _{k=2}^{m-1} \alpha ''(k,1)\hat{\lambda }^{i+j}\le \alpha '. \end{aligned}$$

For $\min {\{i,j\}}=1$, by the definition of the matrix $Q_m$, we have

$$\begin{aligned} \left\{ \begin{aligned} |(Q_m)_{1,1}|&=1\\ |(Q_m)_{i,j}|&=0\quad (i\ne 1\ \text {or}\ j\ne 1). \end{aligned} \right. \end{aligned}$$

Since the matrix $I_m$ is a diagonal matrix, redefining the constant $\alpha '$ as the sum of 1 and the previous $\alpha '$ completes the proof.

B Proof of Proposition 3.2

First, we apply Lemma 3.1 and Proposition 3.1 to the matrix $\phi _k\left( D_m-H_m^{-1}T_m\right) H_m^{-1}$. Since the matrix $H_m$ is an upper Hessenberg matrix satisfying the condition (3.13), by setting $f(z)=z^{-1}$ and applying Lemma 3.1, there exist constants $\hat{\alpha }>0$ and $0<\hat{\lambda }<1$ which satisfy the following inequality:

$$\begin{aligned} |(H_m^{-1})_{i,j}|\le \hat{\alpha }\hat{\lambda }^{i-j}\quad (i\ge j). \end{aligned}$$

(B.1)

Since the matrix $D_m$ is a diagonal matrix and the matrix $T_m$ is defined as the Eq. (2.5), redefining the constant $\hat{\alpha }$ as the sum of $\Vert D_m\Vert =N-h$ and the previous $\hat{\alpha }$ leads to

$$\begin{aligned} |(D_m-H_m^{-1}T_m)_{i,j}|\le \hat{\alpha }\hat{\lambda }^{i-j}\quad (i\ge j), \end{aligned}$$

where the constants $\hat{\alpha }$ and $\hat{\lambda }$ do not depend on m. Let

$$\begin{aligned} \mathscr {G}^{{\text {exp}}}(\alpha ,\lambda ):=\{A:\text {a square matrix}\mid |(A)_{i,j}| \le \alpha \lambda ^{|i-j|}\quad (\forall i,j)\}. \end{aligned}$$

By Proposition 3.1, there exists a unitary matrix $Q_m$ and an upper Hessenberg matrix $\tilde{H}_m$ such that $D_m-H_m^{-1}T_m=Q_m^*\tilde{H}_mQ_m$ and $Q_m\in \mathscr {G}^{{\text {exp}}}(\alpha ',\hat{\lambda })$, where $\alpha '>0$ is a constant which does not depend on m. Thus, it is deduced that there exists a matrix $\hat{H}_m\in \mathscr {G}^{{\text {exp}}}(\hat{\alpha },\hat{\lambda })$ such that

$$\begin{aligned} \phi _k\left( D_m-H_m^{-1}T_m\right) H_m^{-1}e_1&=\phi _k(Q_m^*\tilde{H}_mQ_m)H_m^{-1}e_1 =Q_m^*\phi _k(\tilde{H}_m)Q_m\hat{H}_me_1. \end{aligned}$$

The second equality holds, since by the inequality (B.1), there exists a matrix $\hat{H}_m\!\in \mathscr {G}^{{\text {exp}}}(\hat{\alpha },\hat{\lambda })$ which satisfies $H_m^{-1}e_1=\hat{H}_me_1$.

Our task is now to derive an upper bound of the magnitude of each element of the matrix $Q_m^*\phi _k(\tilde{H}_m)Q_m\hat{H}_m$, which is composed of the products of the matrices that have the decay properties. According to Benzi and Boito [2, Theorem 9.2], there exist constants $\alpha ''>0$ and $\lambda ''$ which do not depend on m and satisfy $Q_m\hat{H}_m\in \mathscr {G}^{{\text {exp}}}(\alpha '',\lambda '')$. In addition, because the function $\phi _k$ is an entire function, by setting $f=\phi _k$ in Proposition 3.1, there exist constants $\check{\alpha }>0$ and $0<\check{\lambda }<1$ satisfying $\vert (\phi _k(\tilde{H}_m))_{i,j}\vert \le \check{\alpha }\check{\lambda }^{i-j}$ for $i\ge j$. Let $\varSigma :=\bigcup _{m=1}^n W(D_m-H_m^{-1}T_m)$. Then, the set $\varSigma $ is closed and bounded, and the matrix $\tilde{H}_m$ satisfies

$$\begin{aligned} W(\tilde{H}_m)=W\left( Q_m(D_m-H_m^{-1}T_m)Q_m^*\right) =W(D_m-H_m^{-1}T_m) \subseteq \varSigma . \end{aligned}$$

(B.2)

Therefore, $|e^z|\le C'$ for a constant $C'>0$ is satisfied for $z\in W(\tilde{H}_m)$, and we have

$$\begin{aligned} |\phi _k(z)|=\left| \int _0^1e^{sz}\frac{(1-s)^{k-1}}{(k-1)!}ds \right| \le |e^z|\left| \int _0^1\frac{(1-s)^{k-1}}{(k-1)!}ds \right| \le \frac{C'}{k!}\quad (z\in W(\tilde{H}_m)). \end{aligned}$$

(B.3)

Using the result by Crouzeix [7], the condition (B.2), and the inequality (B.3), there exists a constant $2\le C\le 1+\sqrt{2}$ such that

$$\begin{aligned} \Vert \phi _k(\tilde{H}_m)\Vert \le C\sup _{z\in W(\tilde{H}_m)}|\phi _k(z)|\le \frac{CC'}{k!}. \end{aligned}$$

Redefining the constant $\check{\alpha }$ as the sum of the constant $CC'/(k!)$ and the previous $\check{\alpha }$ results in

$$\begin{aligned}&\left| \left( \phi _k(\tilde{H}_m)\right) _{i,j}\right| \le \check{\alpha }\check{\lambda }^{i-j}\quad (i\ge j), \end{aligned}$$

(B.4)

$$\begin{aligned}&\left| \left( \phi _k(\tilde{H}_m)\right) _{i,j}\right| \le \Vert \phi _k(\tilde{H}_m)\Vert \le \check{\alpha }\quad (i<j). \end{aligned}$$

(B.5)

By the upper bounds (B.4) and (B.5), we obtain

$$\begin{aligned}&\left| \left( \phi _k(\tilde{H}_m)Q_m\hat{H}_m\right) _{i,1}\right| \le \sum _{k=1}^{i}\check{\alpha }\check{\lambda }^{i-k}\alpha ''\lambda ''^{k-1} +\sum _{k=i+1}^m\check{\alpha }\alpha '' \lambda ''^{k-1}\nonumber \\&\qquad \le i\check{\alpha }\alpha ''\bar{\lambda }^{i-1}+\check{\alpha }\alpha '' \frac{\lambda ''^i}{1-\lambda ''}\le i\check{\alpha }\alpha ''\left( 1+\frac{\lambda ''}{1-\lambda ''}\right) \bar{\lambda }^{i-1} =i\bar{\alpha }\bar{\lambda }^{i-1}, \end{aligned}$$

(B.6)

where $\bar{\alpha }:=\check{\alpha }\alpha ''/(1-\lambda '')$ and $\bar{\lambda }:=\max \{\check{\lambda },\lambda ''\}<1$. As a result, using the fact $Q_m\in \mathscr {G}^{{\text {exp}}}(\alpha ',\hat{\lambda })$ and the upper bound (B.6), we have

$$\begin{aligned}&\left| \left( \phi _k(D_m-H_m^{-1}T_m)H_m^{-1}\right) _{i,1}\right| =\left| \left( Q_m^*\phi _k(\tilde{H}_m)Q_m\hat{H}_m\right) _{i,1}\right| \\&\quad \le \sum _{k=1}^{i}\alpha '\hat{\lambda }^{i-k}k\bar{\alpha }\bar{\lambda }^{k-1} +\sum _{k=i+1}^{m}\alpha '\hat{\lambda }^{k-i}k\bar{\alpha }\bar{\lambda }^{k-1}\\&\quad \le \frac{1}{2}(i+1)i\alpha '\bar{\alpha }\lambda ^{i-1}+\alpha '\bar{\alpha } \frac{i+1}{(1-\lambda ^2)^2}\lambda ^{i+1}\\&\quad \le \frac{1}{2}(i+1)i\alpha '\bar{\alpha }\left( 1+ \frac{2\lambda ^2}{(1-\lambda ^2)^2}\right) \lambda ^{i-1} =\frac{1}{2}(i+1)i\alpha \lambda ^{i-1}, \end{aligned}$$

where $\alpha :=\alpha '\bar{\alpha }(1+2\lambda ^2/(1- \lambda ^2)^2)$ and $\lambda :=\max \{\hat{\lambda },\bar{\lambda }\}<1$. This completes the proof of Proposition 3.2.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hashimoto, Y., Nodera, T. Inexact rational Krylov method for evolution equations. Bit Numer Math 61, 473–502 (2021). https://doi.org/10.1007/s10543-020-00829-w

Download citation

Received: 13 August 2019
Accepted: 22 September 2020
Published: 19 January 2021
Issue Date: June 2021
DOI: https://doi.org/10.1007/s10543-020-00829-w

Keywords

Mathematics Subject Classification

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Inexact rational Krylov method for evolution equations

Abstract

Access this article

Similar content being viewed by others

Shift-invert Rational Krylov method for an operator $$\phi $$ -function of an unbounded linear operator

A Lanczos-like method for non-autonomous linear ordinary differential equations

A computational procedure for exact solutions of Burgers’ hierarchy of nonlinear partial differential equations

Change history

06 September 2021

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendices

A Proof of Proposition 3.1

B Proof of Proposition 3.2

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification

Navigation

Inexact rational Krylov method for evolution equations

Abstract

Access this article

Similar content being viewed by others

Shift-invert Rational Krylov method for an operator $$\phi $$ -function of an unbounded linear operator

A Lanczos-like method for non-autonomous linear ordinary differential equations

A computational procedure for exact solutions of Burgers’ hierarchy of nonlinear partial differential equations

Change history

06 September 2021

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendices

A Proof of Proposition 3.1

B Proof of Proposition 3.2

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation