An exact power series representation of the Baker–Campbell–Hausdorff formula

Jordan C Moodie; M W Long

doi:10.1088/1751-8121/abcbae

1. Introduction

In physics and mathematics [1–7] it is often useful to write the product e^X e^Y as e^Z, for some Z. When the objects X and Y do not commute, as is often the case when dealing with matrices, it may not be simple to find such a Z. Many authors [5, 8–15] attempted to deal with this problem by targeting $Z\left(X,Y\right)\equiv \mathrm{log}\left({e}^{X}{e}^{Y}\right)$ . Such attempts resulted in the Baker–Campbell–Hausdorff formula,

$\begin{equation*}Z\left(X,Y\right)=X+Y+\frac{1}{2}\left[X,Y\right]+\frac{1}{12}\left(\left[X,\left[X,Y\right]\right]+\left[Y,\left[Y,X\right]\right]\right)+\cdots .\end{equation*}$

Dynkin [16] found this formula explicitly in terms of commutators for every order, where order means combined powers of X and Y. Unfortunately, this means that if a truncation of the series is to give a good approximation to the full expansion, both X and Y must be sufficiently close to zero. More recently, work has been done to represent this formula in more convenient ways for specific algebras [2–4, 17–22].

There exists an alternative representation to all orders in X but linear in Y. Letting L_X Y ≡ [X, Y] denote commutator operators, it is given (in, say, [23]) by

$\begin{equation}Z\left(X,Y\right)=X+\frac{\frac{1}{2}{L}_{X}}{\mathrm{sinh}\left(\frac{1}{2}{L}_{X}\right)}\left({e}^{\frac{1}{2}{L}_{X}}Y\right)+\mathcal{O}\left({Y}^{2}\right).\end{equation} \tag{ 1.1 }$

The aim of this work will be to extend this representation to all powers of Y. That is, express Z(X, Y) as

$\begin{equation*}Z\left(X,Y\right)=X+\sum _{n=1}^{\infty }{\hat{G}}_{n}{\left({e}^{\frac{1}{2}{L}_{X}}Y\right)}^{n},\end{equation*}$

finding explicitly the operators ${\hat{G}}_{n}$ , which will depend non-trivially on commutator operators L_X. This series may be truncated and give a good approximation to the full expansion if only Y is small, as opposed to both X and Y in the previous. A discussion of what is meant by small is given in appendix A.

The paper is structured as follows. Section 2 contains the derivation of the main result, that is calculating the operators ${\hat{G}}_{n}$ . Section 3 argues, based upon a conjecture, that the result remains a sum of commutators, as would be expected. These sections can be safely ignored by any reader who wishes to avoid mathematical detail. Instead they may prefer to proceed to section 4, where finite examples are given and discussed which provides immediately usable formulae for the ${\hat{G}}_{n}$ . Section 5 proves an alternative representation for the operators ${\hat{G}}_{n}$ , which is perhaps more practical as it deals with some apparent singularities which shall be encountered. Finally in section 6 it is argued that this result is particularly useful in the basis where the perturbative matrix is diagonal. In this case the operators become merely functions of real numbers and so it is elementary to perform calculations with them.

2. Derivation of main result

Consider a symmetric version of of the Baker–Campbell–Hausdorff formula,

$\begin{equation}\mathcal{S}\left(A,B\right)\equiv \mathrm{log}\left({e}^{A}{e}^{2B}{e}^{A}\right),\end{equation} \tag{ 2.1 }$

for two matrices A and B. While this formulation is more natural to work with than (1.1), each may be transformed into the other and so are equivalent. Employing the notation for commutators which shall be used throughout this article, LB ≡ [A, B] and Lⁿ B ≡ [A, [A, ⋯, [A, B], ⋯, ]], the Baker–Hausdorff formula is given by

$\begin{equation}{e}^{A}B{e}^{-A}={e}^{L}B.\end{equation} \tag{ 2.2 }$

This then implies

$\begin{equation*}{e}^{A}{e}^{B}{e}^{-A}={e}^{{e}^{L}B},\end{equation*}$

from which it is easily seen that $\mathcal{S}\left(A,B\right)=Z\left(2A,2\enspace \mathrm{exp}\left(-L\right)B\right)=Z\left(2\enspace \mathrm{exp}\left(L\right)B,2A\right)$ and additionally $Z\left(X,Y\right)=\mathcal{S}\left(X/2,\mathrm{exp}\left({L}_{X}/2\right)Y/2\right)$ . That is,

$\begin{equation*}{e}^{2A}{e}^{2B}={e}^{A}{e}^{A}{e}^{2B}{e}^{-A}{e}^{A}={e}^{A}{e}^{2{e}^{L}B}{e}^{A},\end{equation*}$

and so all one needs to do is replace any B in the symmetric formula with e^L B to obtain the non-symmetric formula. The factors of two have been introduced here in order to simplify the final representation.

The task ahead is to expand equation (2.1). The matrix B will be the focus, with the aim being to write the expansion as a power series in this matrix. Once this is achieved, the coefficients of the power series will be examined in depth and closed form expressions obtained.

The identity

$\begin{equation}\mathrm{log}\enspace M=-\sum _{l=1}^{\infty }\frac{1}{l}\sum _{m=0}^{l}{\left(-1\right)}^{m}\frac{l!}{m!\left(l-m\right)!}{M}^{m},\end{equation} \tag{ 2.3 }$

will be employed, setting M = exp(A)exp(2B)exp(A). It will be found that M^m separates into the sum of several parts. Each of these parts will take the form f_i exp(2mA)g_i, for m-independent quantities f_i and g_i. The f_i and g_i may then each be pulled out of the above sums, leaving exp(2mA) in place of M^m. The identity then may be used in reverse to obtain log(M) = ∑_i f_i2Ag_i. This then constitutes the fundamental mathematical approach which shall be taken.

2.1. Expanding M^m in powers of B

The focus will now be on calculating M^m. The Baker–Hausdorff formula (2.2) may be used to symmetrically move exponentials of A to the edges, obtaining

$\begin{equation*}{M}^{m}={e}^{mA}\left[\prod _{n=-\frac{m-1}{2}}^{\frac{m-1}{2}}\mathrm{exp}\left(2{e}^{2nL}B\right)\right]{e}^{mA},\end{equation*}$

where the product must be taken in the correct order, namely increasing n. The exponentials involving B may then be Taylor expanded

$\begin{equation*}{M}^{m}={e}^{mA}\left[\prod _{n=-\frac{m-1}{2}}^{\frac{m-1}{2}}\sum _{{k}_{n}=0}^{\infty }\frac{1}{{k}_{n}!}{\left(2{e}^{2nL}B\right)}^{{k}_{n}}\right]{e}^{mA},\end{equation*}$

and terms gathered in orders of B,

$\begin{align*}\hfill {M}^{m}& ={e}^{mA}\left[1+2\left(\sum _{-\frac{m-1}{2}{\leqslant}{n}_{1}{\leqslant}\frac{m-1}{2}}{e}^{2{n}_{1}L}B\right)\right.\hfill \\ \hfill & \quad \left.\left.+{2}^{2}\left(\sum _{-\frac{m-1}{2}{\leqslant}{n}_{1}{< }{n}_{2}{\leqslant}\frac{m-1}{2}}{e}^{2{n}_{1}L}B{e}^{2{n}_{2}L}B+\frac{1}{2!}\sum _{-\frac{m-1}{2}{\leqslant}{n}_{1}{\leqslant}\frac{m-1}{2}}\right)\right.,{e}^{2{n}_{1}L}B{e}^{2{n}_{1}L}B+\cdots \enspace \right]{e}^{mA}.\hfill \end{align*}$

In the above expression, each term exp(2n_i L)B must be thought of as one object—that particular commutator operator L is acting on that particular matrix B and so the two are intrinsically linked. It is helpful to formalise this link, labelling the pair with an index. Then it is understood that the operator L_i acts on only the matrix B_i, and no other. Each such pair may then be labelled. This allows the commutation of operators and matrices with different labels, enabling all matrices B in the above expression to be pulled out of each sum. Explicitly,

$\begin{align}\hfill {M}^{m}& ={e}^{mA}\left[1+2\left(\sum _{-\frac{m-1}{2}{\leqslant}{n}_{1}{\leqslant}\frac{m-1}{2}}{e}^{2{n}_{1}{L}_{1}}\right){B}_{1}\right.\hfill \\ \hfill & \quad \left.+{2}^{2}\left(\sum _{-\frac{m-1}{2}{\leqslant}{n}_{1}{< }{n}_{2}{\leqslant}\frac{m-1}{2}}{e}^{2{n}_{1}{L}_{1}}{e}^{2{n}_{2}{L}_{2}}+\frac{1}{2!}\right.\right.\left.\left.\sum _{-\frac{m-1}{2}{\leqslant}{n}_{1}{\leqslant}\frac{m-1}{2}}{e}^{2{n}_{1}\left({L}_{1}+{L}_{2}\right)}\right){B}_{1}{B}_{2}+\cdots \enspace \right]{e}^{mA}\hfill \end{align} \tag{ 2.4 }$

$\begin{equation}\equiv {e}^{mA}\left[{F}_{0}+{F}_{1}\left({L}_{1}\right){B}_{1}+{F}_{2}\left({L}_{1},{L}_{2}\right){B}_{1}{B}_{2}+{F}_{3}\left({L}_{1},{L}_{2},{L}_{3}\right){B}_{1}{B}_{2}{B}_{3}+\cdots \enspace \right]{e}^{mA}.\end{equation} \tag{ 2.5 }$

2.2. Rewriting F_N in terms of fundamental sums S_N

The first aim has thus been achieved; the formula (2.1) has been expanded with a power series in the matrix B. The next is to find closed form expressions for the coefficients F_N. First define the sum S_N as

$\begin{equation}{S}_{N}\left({L}_{1},{L}_{2},\dots ,{L}_{N}\right)\equiv {2}^{N}\sum _{-\frac{m-1}{2}{\leqslant}{n}_{1}{< }{n}_{2}{< }\cdots {< }{n}_{N}{\leqslant}\frac{m-1}{2}}{e}^{2{n}_{1}{L}_{1}}{e}^{2{n}_{2}{L}_{2}}\cdots {e}^{2{n}_{N}{L}_{N}},\end{equation} \tag{ 2.6 }$

then the first few of the coefficients F_N are given by

$\begin{align}\hfill & {F}_{0}=1,\hfill \\ \hfill & {F}_{1}\left({L}_{1}\right)={S}_{1}\left({L}_{1}\right),\hfill \\ \hfill & {F}_{2}\left({L}_{1},{L}_{2}\right)={S}_{2}\left({L}_{1},{L}_{2}\right)+\frac{2}{2!}{S}_{1}\left({L}_{1}+{L}_{2}\right),\hfill \\ \hfill & {F}_{3}\left({L}_{1},{L}_{2},{L}_{3}\right)={S}_{3}\left({L}_{1},{L}_{2},{L}_{3}\right)+\frac{2}{2!}\left({S}_{2}\left({L}_{1}+{L}_{2},{L}_{3}\right)+{S}_{2}\left({L}_{1},{L}_{2}+{L}_{3}\right)\right)\hfill \\ \hfill & \quad \quad \quad \quad \quad \enspace \quad +\frac{{2}^{2}}{3!}{S}_{1}\left({L}_{1}+{L}_{2}+{L}_{3}\right).\hfill \end{align} \tag{ 2.7 }$

Writing the coefficients F_N for an arbitrary order N is a problem in partitioning. As seen in the above examples, the string L₁ + L₂ + ⋯ + L_N is split in all possible ways. The resultant substrings are then used as arguments for the sums S_n. However, each sum is also divided by factorials. These factorials are determined by the length of the substrings used as arguments. For example, the string L₁ + L₂ + L₃ may be split in the following ways giving the following factorials:

$\begin{equation}\begin{aligned}{L}_{1}+{L}_{2}+{L}_{3}& \quad \rightarrow \enspace \quad 3!\enspace ,\\ {L}_{1}+{L}_{2}\enspace ,\enspace {L}_{3}& \quad \rightarrow \quad \enspace 2!\enspace 1!\enspace ,\\ {L}_{1}\enspace ,\enspace {L}_{2}+{L}_{3}& \quad \rightarrow \quad \enspace 1!\enspace 2!\enspace ,\\ {L}_{1}\enspace ,\enspace {L}_{2}\enspace ,\enspace {L}_{3}& \quad \rightarrow \quad 1!\enspace 1!\enspace 1!,\end{aligned}\end{equation} \tag{ 2.8 }$

demonstrating how F₃ was constructed in equation (2.7).

There are then two major hurdles to finding closed form expressions for each coefficient of the power series. The first is to calculate the explicit sum S_N. As the sum S_N may be thought of as N finite geometric series, it may be expected to have 2^N terms. However, it may be split into N + 1 parts, each of which is a collection of infinite geometric series. This lifting of the constraint is crucial and will be discussed shortly. The second hurdle is then to perform the partition sum, that is to calculate F_N given the functions S_r.

2.3. Calculating S_N

It is useful at this point to deal with a concrete example. Consider the sum

$\begin{equation*}{S}_{2}\left({L}_{1},{L}_{2}\right)\equiv \sum _{-\frac{m-1}{2}{\leqslant}{n}_{1}{< }{n}_{2}{\leqslant}\frac{m-1}{2}}{2}^{2}{e}^{{n}_{1}{L}_{1}}{e}^{{n}_{2}{L}_{2}}.\end{equation*}$

The summation variables, n₁ and n₂, are constrained from both above and below. These constraints may be thought of as forming a triangle, as depicted in figure 1. The sum may then be thought of as the combination of three semi-constrained sums, constructed by taking a given vertex of the triangle and extending the constraining lines to form infinite sectors. Explicitly,

$\begin{align}\hfill \sum _{-\frac{m-1}{2}{\leqslant}{n}_{1}{< }{n}_{2}{\leqslant}\frac{m-1}{2}}{2}^{2}{e}^{2{n}_{1}{L}_{1}}{e}^{2{n}_{2}{L}_{2}}& ={2}^{2}\left(\sum _{{n}_{1}{< }{n}_{2}{\leqslant}\frac{m-1}{2}}{e}^{2{n}_{1}{L}_{1}}{e}^{2{n}_{2}{L}_{2}}\right)\hfill \\ \hfill & \quad -{2}^{2}\left(\sum _{{n}_{1}{< }-\frac{m-1}{2}}{e}^{2{n}_{1}{L}_{1}}\sum _{{n}_{2}{\leqslant}\frac{m-1}{2}}{e}^{2{n}_{2}{L}_{2}}\right)\hfill \\ \hfill & \quad +{2}^{2}\left(\sum _{{n}_{2}{\leqslant}{n}_{1}{< }-\frac{m-1}{2}}{e}^{2{n}_{1}{L}_{1}}{e}^{2{n}_{2}{L}_{2}}\right),\hfill \end{align} \tag{ 2.9 }$

or, using the labels for regions shown in figure 1,

The sums on the right-hand side may then be evaluated to obtain

$\begin{align*}\hfill {S}_{2}\left({L}_{1},{L}_{2}\right)& =\frac{\mathrm{coth}\left({L}_{1}\right)-1}{\mathrm{sinh}\left({L}_{1}+{L}_{2}\right)}{e}^{m\left({L}_{1}+{L}_{2}\right)}+\frac{1}{\mathrm{sinh}\left(-{L}_{1}\right)}\frac{1}{\mathrm{sinh}\left({L}_{2}\right)}{e}^{m\left(-{L}_{1}+{L}_{2}\right)}\hfill \\ \hfill & \quad +\frac{\mathrm{coth}\left(-{L}_{2}\right)-1}{\mathrm{sinh}\left(-{L}_{1}-{L}_{2}\right)}{e}^{m\left(-{L}_{1}-{L}_{2}\right)}.\hfill \end{align*}$

**Figure 1.** A depiction of the parameter space of n₁ and n₂ in equation (2.9). Solid lines imply inclusiveness of that line in a given sum, while dashed imply the line of parameters is excluded. The variables of the original sum are constrained to the triangle formed from the vertices marked with a red circle.
Download figure:
Standard image High-resolution image

Generalising this idea to the sum S_N involves N + 1 vertices of an N-dimensional tetrahedron. The constraining lines are extended, creating N + 1 sums similar to those in equation (2.9). A careful treatment of this is necessary and is done in appendix B. The final result is

$\begin{align}\hfill {S}_{N}\left({L}_{1},\dots ,{L}_{N}\right)& =\sum _{r=0}^{N}{\tilde {S}}_{r}\left(-{L}_{r},-{L}_{r-1},\dots ,-{L}_{1}\right){\tilde {S}}_{N-r}\left({L}_{r+1},{L}_{r+2},\dots ,{L}_{N}\right)\hfill \\ \hfill & \quad {\times}{e}^{m\left(-{L}_{1}-\cdots -{L}_{r}+{L}_{r+1}+\dots +{L}_{N}\right)},\hfill \end{align} \tag{ 2.10 }$

where ${\tilde {S}}_{0}\equiv 1$ and

$\begin{equation}{\tilde {S}}_{r}\left({x}_{1},\dots ,{x}_{r}\right)=\frac{{s}_{r-1}\left({x}_{1},\dots ,{x}_{r-1}\right)}{\mathrm{sinh}\left({x}_{1}+{x}_{2}+\dots +{x}_{r}\right)}\quad \text{for}\quad r\in {\mathbb{Z}}^{+},\end{equation} \tag{ 2.11 }$

where similarly s₀ ≡ 1 and

$\begin{equation}{s}_{r-1}\left({x}_{1},\dots ,{x}_{r-1}\right)=\prod _{j=1}^{r-1}\left[\mathrm{coth}\left({x}_{1}+{x}_{2}+\dots +{x}_{j}\right)-1\right]\quad \text{for}\enspace \left(r-1\right)\in {\mathbb{Z}}^{+}.\end{equation} \tag{ 2.12 }$

There are several things to note from this result. Firstly, the N + 1 different forms that the exponential above may take clearly correspond to the vertices of the N-dimensional tetrahedron discussed previously. As mentioned earlier, this exponential, containing all the m-dependence, is crucial in reversing the identity (2.3). Next, note the splitting of each term into ${\tilde {S}}_{r}$ and ${\tilde {S}}_{N-r}$ functions. This structure remains for the coefficients F_N, as shall be seen shortly, and appears fundamental to the problem. Furthermore, the representation of the result in hyperbolic functions is perhaps not unexpected; previous results showed that the order B term is best written with a sinh function. Finally, the arguments of the hyperbolic functions only ever contain sums of the commutator operators L_i. As such it is mathematically sensible to think of the active variables not as these commutator operators L₁, L₂, L₃ etc, but rather as strings of such operators, for example L₁, L₁ + L₂, L₁ + L₂ + L₃ etc. More will be said of such strings in later sections, in particular section 6.

2.4. Rewriting F_N as a partition sum in terms of f_r

The next task is to perform the partition sum, or in other words calculate F_N given the now known S_r. Once again it is useful to turn to an example. Using the above results, it is simple to read off that

$\begin{align}\hfill {F}_{3}\left({L}_{1},{L}_{2},{L}_{3}\right)& \equiv {S}_{3}\left({L}_{1},{L}_{2},{L}_{3}\right)+\frac{2}{2!}\left[{S}_{2}\left({L}_{1},{L}_{2}+{L}_{3}\right)+{S}_{2}\left({L}_{1}+{L}_{2},{L}_{3}\right)\right]\hfill \\ \hfill & \quad +\frac{{2}^{2}}{3!}{S}_{1}\left({L}_{1}+{L}_{2}+{L}_{3}\right)\hfill \end{align} \tag{ 2.13 }$

$\begin{align}\hfill & ={C}_{0}{e}^{m\left({L}_{1}+{L}_{2}+{L}_{3}\right)}+{C}_{1}{e}^{m\left(-{L}_{1}+{L}_{2}+{L}_{3}\right)}+{C}_{2}{e}^{m\left(-{L}_{1}-{L}_{2}+{L}_{3}\right)}\hfill \\ \hfill & \quad +{C}_{3}{e}^{m\left(-{L}_{1}-{L}_{2}-{L}_{3}\right)},\hfill \end{align} \tag{ 2.14 }$

where

$\begin{align*}\hfill {C}_{0}& =\frac{\left(\mathrm{coth}\left({L}_{1}\right)-1\right)\left(\mathrm{coth}\left({L}_{1}+{L}_{2}\right)-1\right)+\frac{2}{2!}\left[\left(\mathrm{coth}\left({L}_{1}\right)-1\right)+\left(\mathrm{coth}\left({L}_{1}+{L}_{2}\right)-1\right)\right]+\frac{{2}^{2}}{3!}}{\mathrm{sinh}\left({L}_{1}+{L}_{2}+{L}_{3}\right)},\hfill \\ \hfill {C}_{1}& =\left[\frac{1}{\mathrm{sinh}\left(-{L}_{1}\right)}\right]\left[\frac{\left(\mathrm{coth}\left({L}_{2}\right)-1\right)+\frac{2}{2!}}{\mathrm{sinh}\left({L}_{2}+{L}_{3}\right)}\right],\hfill \\ \hfill {C}_{2}& =\left[\frac{\left(\mathrm{coth}\left(-{L}_{2}\right)-1\right)+\frac{2}{2!}}{\mathrm{sinh}\left(-{L}_{1}-{L}_{2}\right)}\right]\left[\frac{1}{\mathrm{sinh}\left({L}_{3}\right)}\right],\hfill \\ \hfill {C}_{3}& =\frac{\left(\mathrm{coth}\left(-{L}_{3}\right)-1\right)\left(\mathrm{coth}\left(-{L}_{2}-{L}_{3}\right)-1\right)+\frac{2}{2!}\left[\left(\mathrm{coth}\left(-{L}_{3}\right)-1\right)+\left(\mathrm{coth}\left(-{L}_{2}-{L}_{3}\right)-1\right)\right]+\frac{{2}^{2}}{3!}}{\mathrm{sinh}\left(-{L}_{1}-{L}_{2}-{L}_{3}\right)}.\hfill \end{align*}$

Within this example many of the previous themes are exposed. As in the sums S_N, the result splits into N + 1 terms. Each of these terms likewise separate into an m-dependent exponential and an m-independent function (the C_i above). The final similarity is the factorisation of these functions, shown clearly in C₁ and C₂. More generally, this factorisation arises from partitioning. Any sums which contribute to the coefficient of a given exponential with argument m(−L₁ − ⋯ − L_r + L_r+1 + ⋯ + L_N) must contain a partition between L_r and L_r+1. Any other partitioning which occurs to the left of the split affects a given sums contribution to the term independently of any partitioning to the right. More concretely, in the example above the function C₁, associated with the exponential with argument m(−L₁ + L₂ + L₃), is contributed to by any sums in equation (2.13) with a partition between L₁ and L₂. These are S₃(L₁, L₂, L₃) and S₂(L₁, L₂ + L₃). In the former there is another partition between L₂ and L₃, giving rise to the coth term in the right factor of C₁, while in the latter there is no such extra partition.

These arguments necessitate the partition sum to take the form

$\begin{align}\hfill {F}_{N}\left({L}_{1},{L}_{2},\dots ,{L}_{N}\right)& =\sum _{r=0}^{N}{\tilde {F}}_{r}\left(-{L}_{r},-{L}_{r-1},\dots ,-{L}_{1}\right){\tilde {F}}_{N-r}\left({L}_{r+1},{L}_{r+2},\dots ,{L}_{N}\right)\hfill \\ \hfill & \quad {\times}{e}^{m\left(-{L}_{1}-\cdots -{L}_{r}+{L}_{r+1}+\dots +{L}_{N}\right)},\hfill \end{align} \tag{ 2.15 }$

where ${\tilde {F}}_{0}\equiv 1$ and

$\begin{equation}{\tilde {F}}_{r}\left({x}_{1},\dots ,{x}_{r}\right)=\frac{{f}_{r-1}\left({x}_{1},\dots ,{x}_{r-1}\right)}{\mathrm{sinh}\left({x}_{1}+{x}_{2}+\dots +{x}_{r}\right)}\quad \text{for}\enspace r\in {\mathbb{Z}}^{+}.\end{equation} \tag{ 2.16 }$

2.5. A partition formula for f_r

The function f_r−1(x₁, x₂, ⋯, x_r−1) will be a partition sum of the functions s_n which are given from (2.12). For example,

$\begin{align*}\hfill {f}_{2}\left({x}_{1},{x}_{2}\right)& ={s}_{2}\left({x}_{1},{x}_{2}\right)+\frac{2}{2!}\left({s}_{1}\left({x}_{1}\right)+{s}_{1}\left({x}_{2}\right)\right)+\frac{{2}^{2}}{3!}\hfill \\ \hfill & =\left[\mathrm{coth}\left({x}_{1}\right)-1\right]\left[\mathrm{coth}\left({x}_{1}+{x}_{2}\right)-1\right]+\frac{2}{2!}\left(\right.\left[\mathrm{coth}\left({x}_{1}\right)-1\right]\hfill \\ \hfill & \quad +\left[\mathrm{coth}\left({x}_{1}+{x}_{2}\right)-1\right]\left.\right)+\frac{{2}^{2}}{3!},\hfill \end{align*}$

is found in both C₀ and C₃ above. In general, f_r−1(x₁, x₂, ⋯, x_r−1) is a sum of terms, each involving a product of coth functions minus one. As shown for f₂, in each of these terms there will be a number of these functions missed out. In a term where m such functions in a row have been missed out, a_m+1 ≡ 2^m/(m + 1)! will be the coefficient. This then implies that

$\begin{align}\hfill {f}_{r-1}& ={a}_{r}+\sum _{n=1}^{r-1}\sum _{{p}_{1}=1}^{\infty }\cdots \sum _{{p}_{n+1}=1}^{\infty }{a}_{{p}_{1}}{a}_{{p}_{2}}\cdots {a}_{{p}_{n+1}}\left[\mathrm{coth}\left({x}_{1}+{x}_{2}+\dots +{x}_{{p}_{1}}\right)-1\right]\cdots \hfill \\ \hfill & \quad {\times}\left[\mathrm{coth}\left({x}_{1}+{x}_{2}+\dots +{x}_{{p}_{1}+\dots +{p}_{n}}\right)-1\right]{\delta }_{r,{p}_{1}+\dots +{p}_{n+1}},\hfill \end{align} \tag{ 2.17 }$

where δ_i,j is the Kronecker delta. The combinatorial aspect of partitioning expressed in this sum is the next thing to be understood.

While superficially complicated, this sum is actually very simple. In essence, the sum index n counts how many coth functions have not been missed out and the numbers p_i give the positions of these. Alternatively, the numbers p_i − 1 can be interpreted as counting how many functions have been missed out in a row. As an example, one of the terms in the functions f₄ which has two coth functions missing (so n = 2 remain) is

In the function f_r−1 there are r − 1 different coth functions; for example, f₂(x₁, x₂) has coth(x₁) and coth(x₁ + x₂). The sum index n indicates the number of coth functions in a given term. If there are only n such functions in a term, that means (r − 1) − n are missing. These missing coth functions determine the numerical coefficient of the term, given by the numbers a_m+1. However, how each function was missed out is important—if m in a row are missed out then they are replaced with a_m+1. The indices of the second sum, p_i, are designed to convey this information. For example, if p₂ is 1 then there has been nothing missed out between the first coth and the second. If, however, it took any other value then p₂ − 1 possible coth functions must have been missed out between these two functions. Continuing this logic gives all terms in the above sum.

2.6. Resumming the partition formula

A simpler form of this function may be obtained. The brackets in the sum may be expanded, putting the function into the form

$\begin{align}\hfill {f}_{r-1}& ={t}_{r}+\sum _{n=1}^{r-1}\sum _{{p}_{1}=1}^{\infty }\cdots \sum _{{p}_{n+1}=1}^{\infty }{t}_{{p}_{1}}{t}_{{p}_{2}}\cdots {t}_{{p}_{n+1}}\mathrm{coth}\left({x}_{1}+{x}_{2}+\dots +{x}_{{p}_{1}}\right)\cdots \hfill \\ \hfill & \quad {\times}\mathrm{coth}\left({x}_{1}+{x}_{2}+\dots +{x}_{{p}_{1}+\dots +{p}_{n}}\right){\delta }_{r,{p}_{1}+\dots +{p}_{n+1}}.\hfill \end{align} \tag{ 2.18 }$

The coefficient ${t}_{{p}_{1}}{t}_{{p}_{2}}\cdots {t}_{{p}_{n+1}}$ is of course still a product of equivalent numbers ${t}_{{p}_{i}}$ as the same partitioning arguments apply. In other words, the numbers p_i still label the size of gaps in the product of coth functions and each provide a number ${t}_{{p}_{i}}$ which depends only upon this size, independent of the location of the gap. Comparing the constant term, that is when all coth functions have been missed out, of (2.17) with that of (2.18) gives

$\begin{equation}{t}_{r}=\sum _{n=0}^{r-1}{\left(-1\right)}^{n}\sum _{{p}_{1}=1}^{\infty }\cdots \sum _{{p}_{n+1}=1}^{\infty }{a}_{{p}_{1}}\cdots {a}_{{p}_{n+1}}{\delta }_{r,{p}_{1}+\dots +{p}_{n+1}}.\end{equation} \tag{ 2.19 }$

This sum, once computed for an arbitrary index, will give all numbers ${t}_{{p}_{i}}$ which appear in equation (2.18). The key to computation is to lift the constraint imposed by the Kronecker delta, and as such generating functions may be employed. First multiply both sides by x^r, and sum over r:

$\begin{equation}\sum _{r=1}^{\infty }{t}_{r}{x}^{r}=\sum _{r=1}^{\infty }\sum _{n=0}^{r-1}{\left(-1\right)}^{n}\sum _{{p}_{1}=1}^{\infty }\cdots \sum _{{p}_{n+1}=1}^{\infty }{a}_{{p}_{1}}{a}_{{p}_{2}}\cdots {a}_{{p}_{n+1}}{\delta }_{r,{p}_{1}+\dots +{p}_{n+1}}{x}^{r}\end{equation} \tag{ 2.20 }$

$\begin{equation}=\sum _{n=0}^{\infty }{\left(-1\right)}^{n}\left[\sum _{{p}_{1}=1}^{\infty }{a}_{{p}_{1}}{x}^{{p}_{1}}\right]\cdots \left[\sum _{{p}_{n+1}=1}^{\infty }{a}_{{p}_{n+1}}{x}^{{p}_{n+1}}\right].\end{equation} \tag{ 2.21 }$

Now each of the sums over p_i can be done freely, resulting in

$\begin{equation}\left(1-\sum _{k=1}^{\infty }{t}_{k}{x}^{k}\right)\left(1+\sum _{k=1}^{\infty }{a}_{k}{x}^{k}\right)=1.\end{equation} \tag{ 2.22 }$

The above is an expression of a kind of 'partition duality'. It is true for any sequence {a_k} and defines a dual sequence {t_k} which satisfies equation (2.19). This also implies that equation (2.19) is invertible, that is one can exchange a_k and −t_k and the equation will still hold. Of course, what has been done here is to replace the (coth(x) − 1) of equation (2.17) with (coth(x) − 0) in equation (2.18). One could instead replace it with a more general (coth(x) − λ), with the analysis being analogous to that which has been performed, though λ = −1, 0, 1 are the only useful cases.

In the present case recall a_k ≡ 2^k−1/k! and hence it is simple to calculate that

$\begin{equation}\sum _{k=0}^{\infty }{t}_{k}{x}^{k}=\mathrm{tanh}\left(x\right)=x-\frac{1}{3}{x}^{3}+\frac{2}{15}{x}^{5}-\frac{17}{315}{x}^{7}+\cdots \enspace ,\end{equation} \tag{ 2.23 }$

demonstrating the numbers t_k are generated by tanh. When combined with equation (2.18), this then gives a clean formula for f_r−1 and thus F_N. That is, f_r−1 is a sum of products of coth functions. In each term of this sum, some even number of these functions in a row will be missed out and replaced with the numbers t_k which come from the Taylor expansion of tanh(x). Finite examples of this concept will be given for clarity in section 4.

2.7. Revisiting M^m and implementing the fundamental mathematical approach

Focus will now turned to the exponential in equation (2.15). It is here that the identity (2.3) will be reversed. Equation (2.5) may now be rewritten as

$\begin{equation*}{M}^{m}={e}^{\mathrm{2}mA}+\sum _{N=1}^{\infty }{e}^{mA}\left(\sum _{r=0}^{N}{\tilde {F}}_{r}{\tilde {F}}_{N-r}{e}^{m\left(-{L}_{1}-\cdots -{L}_{r}+{L}_{r+1}+\dots +{L}_{N}\right)}\right){B}_{1}\cdots {B}_{N}{e}^{mA},\end{equation*}$

where the arguments of the functions have been suppressed for brevity. Upon repeated application of the Baker–Hausdorff formula (2.2) this can be seen as

$\begin{equation}{M}^{m}={e}^{\mathrm{2}mA}+\sum _{N=1}^{\infty }\sum _{r=0}^{N}{\tilde {F}}_{r}{\tilde {F}}_{N-r}\enspace {B}_{1}\cdots {B}_{r}{e}^{\mathrm{2}mA}{B}_{r+1}\cdots {B}_{N}.\end{equation} \tag{ 2.24 }$

The identity (2.3) may then be employed in reverse, obtaining

$\begin{equation*}\mathrm{log}\enspace M=2A+\sum _{N=1}^{\infty }\sum _{r=0}^{N}{\tilde {F}}_{r}{\tilde {F}}_{N-r}{B}_{1}\cdots {B}_{r}2A{B}_{r+1}\cdots {B}_{N}.\end{equation*}$

Using the commutator operators L_i, the matrix A in the above expression may be moved to either side of the matrices B, via

$\begin{equation}{B}_{1}\cdots {B}_{r}A{B}_{r+1}\cdots {B}_{N}=\left(-{L}_{1}-{L}_{2}-\cdots -{L}_{r}\right){B}_{1}\cdots {B}_{N}+A{B}_{1}\cdots {B}_{N}\end{equation} \tag{ 2.25 }$

$\begin{equation}=\left({L}_{r}+{L}_{r+1}+\dots +{L}_{N}\right){B}_{1}\cdots {B}_{N}+{B}_{1}\cdots {B}_{N}A.\end{equation} \tag{ 2.26 }$

The case m = 0 in equation (2.24) gives

$\begin{equation*}1=1+\sum _{N=1}^{\infty }\sum _{r=0}^{N}{\tilde {F}}_{r}{\tilde {F}}_{N-r}{B}_{1}\cdots {B}_{N},\end{equation*}$

which implies that for all N > 0,

$\begin{equation}\sum _{r=0}^{N}{\tilde {F}}_{r}{\tilde {F}}_{N-r}=0.\end{equation} \tag{ 2.27 }$

This identity is extremely useful and will appear again later in this work. For now it allows the extraneous final terms in equations (2.25) and (2.26) to be dropped and hence log M to be written in the form

$\begin{equation*}\mathrm{log}\enspace M=2A+\sum _{N=1}^{\infty }\left[\sum _{r=0}^{N}{\tilde {F}}_{r}{\tilde {F}}_{N-r}\left(-{L}_{1}-\cdots -{L}_{r}+{L}_{r+1}+\dots +{L}_{N}\right)\right]{B}_{1}\cdots {B}_{N}.\end{equation*}$

This then gives the promised expansion in powers of the matrix B.

2.8. Final form

To summarise, it has been found that

$\begin{equation}\mathrm{log}\left({e}^{A}{e}^{2B}{e}^{A}\right)=2A+\sum _{N=1}^{\infty }{\hat{G}}_{N}\enspace {B}_{1}\cdots {B}_{N},\end{equation} \tag{ 2.28 }$

where

$\begin{align}\hfill {\hat{G}}_{N}& =\sum _{r=0}^{N}{\tilde {F}}_{r}\left(-{L}_{r},-{L}_{r-1},\dots ,-{L}_{1}\right){\tilde {F}}_{N-r}\left({L}_{r+1},{L}_{r+2},\dots ,{L}_{N}\right)\hfill \\ \hfill & \quad {\times}\left(-{L}_{1}-\cdots -{L}_{r}+{L}_{r+1}+\dots +{L}_{N}\right),\hfill \end{align} \tag{ 2.29 }$

$\begin{equation}{\tilde {F}}_{r}\left({x}_{1},\dots ,{x}_{r}\right)=\frac{{f}_{r-1}\left({x}_{1},\dots ,{x}_{r-1}\right)}{\mathrm{sinh}\left({x}_{1}+{x}_{2}+\dots +{x}_{r}\right)}\quad \text{for}\enspace r\in {\mathbb{Z}}^{+},\quad {\tilde {F}}_{0}\equiv 1,\end{equation} \tag{ 2.30 }$

and

$\begin{align}\hfill {f}_{r-1}& ={t}_{r}+\sum _{n=1}^{r-1}\sum _{{p}_{1}=1}^{\infty }\cdots \sum _{{p}_{n+1}=1}^{\infty }{t}_{{p}_{1}}{t}_{{p}_{2}}\cdots {t}_{{p}_{n+1}}\mathrm{coth}\left({x}_{1}+{x}_{2}+\dots +{x}_{{p}_{1}}\right)\cdots \hfill \\ \hfill & \quad {\times}\mathrm{coth}\left({x}_{1}+{x}_{2}+\dots +{x}_{{p}_{1}+\dots +{p}_{n}}\right){\delta }_{r,{p}_{1}+\dots +{p}_{n+1}}.\hfill \end{align} \tag{ 2.31 }$

Here the numbers ${t}_{{p}_{i}}$ are given from the Taylor expansion of tanh(x).

3. Representation as a sum of commutators

It is well known that, beyond the initial terms, the Baker–Campbell–Hausdorff formula may be written as the sum of commutators. Unfortunately, for the new representation (2.28) this is not immediately evident. Of course, the commutator operators L_i contained within G_N will be applied to each matrix B_i to form commutators. However, this would naturally lead to products of commutators when, say, a term like L_i L_j is applied to B_i B_j. In this section a representation will be given for which each term is a single commutator. This representation will rely on unproved identities of the function ${\hat{G}}_{N}$ , which have been demonstrated for up to N = 10.

The first identity involves picking one argument of ${\hat{G}}_{N}$ , say L₁, then changing its position while preserving the order of the other arguments. Explicitly for ${\hat{G}}_{4}$ , the following identity is true:

$\begin{align*}\hfill & {\hat{G}}_{4}\left({L}_{1},{L}_{2},{L}_{3},{L}_{4}\right)+{\hat{G}}_{4}\left({L}_{2},{L}_{1},{L}_{3},{L}_{4}\right)+{\hat{G}}_{4}\left({L}_{2},{L}_{3},{L}_{1},{L}_{4}\right)\hfill \\ \hfill & \quad +{\hat{G}}_{4}\left({L}_{2},{L}_{3},{L}_{4},{L}_{1}\right)=0.\hfill \end{align*}$

The next identity involves picking two arguments, say L₁ and L₂. This time the position of both arguments is allowed to change, preserving both their own order and the order of the remaining arguments. Explicitly for ${\hat{G}}_{4}$ ,

$\begin{align*}\hfill & {\hat{G}}_{4}\left({L}_{1},{L}_{2},{L}_{3},{L}_{4}\right)+{\hat{G}}_{4}\left({L}_{1},{L}_{3},{L}_{2},{L}_{4}\right)+{\hat{G}}_{4}\left({L}_{1},{L}_{3},{L}_{4},{L}_{2}\right)\hfill \\ \hfill & \quad +{\hat{G}}_{4}\left({L}_{3},{L}_{1},{L}_{2},{L}_{4}\right)+{\hat{G}}_{4}\left({L}_{3},{L}_{1},{L}_{4},{L}_{2}\right)+{\hat{G}}_{4}\left({L}_{3},{L}_{4},{L}_{1},{L}_{2}\right)=0.\hfill \end{align*}$

In general it is conjectured that identities hold where n < N arguments of ${\hat{G}}_{N}$ are picked and are dealt with in an analogous way to above. Again, it should be noted that this has been tested successfully up to N = 10 and there is no reason to believe this should fail at any higher order.

From these identities it follows, see appendix C for proof, that

$\begin{equation*}{\hat{G}}_{N}{B}_{1}\cdots {B}_{N}=\frac{1}{N}{\hat{G}}_{N}\left[\left[\cdots \left[\left[{B}_{1},{B}_{2}\right],{B}_{3}\right],\cdots \enspace \right],{B}_{N}\right],\end{equation*}$

and hence that

$\begin{equation*}\mathrm{log}\left({e}^{A}{e}^{2B}{e}^{A}\right)=2A+{\hat{G}}_{1}{B}_{1}+\sum _{N=2}^{\infty }\frac{1}{N}{\hat{G}}_{N}\left[\left[\cdots \left[\left[{B}_{1},{B}_{2}\right],{B}_{3}\right],\cdots \enspace \right],{B}_{N}\right].\end{equation*}$

4. Finite examples

While the general formula has been derived in the preceding sections, it may be helpful to examine several low-order terms explicitly. This section will begin with the functions f_r, for r = 0, ⋯, 5, highlighting the patterns previously discussed. From these the operators ${\hat{G}}_{N}$ , the targets of this work, may be immediately written down and indeed will be for N = 1, ⋯, 5.

To begin, consider the functions f_r. The first few of these functions are given by

$\begin{align*}\hfill {f}_{0}& \equiv 1,\hfill \\ \hfill {f}_{1}& ={c}_{1},\hfill \\ \hfill {f}_{2}& ={c}_{1}{c}_{12}-\frac{1}{3},\hfill \\ \hfill {f}_{3}& ={c}_{1}{c}_{12}{c}_{\mathrm{123}}-\frac{1}{3}\left({c}_{1}+{c}_{\mathrm{123}}\right),\hfill \\ \hfill {f}_{4}& ={c}_{1}{c}_{12}{c}_{\mathrm{123}}{c}_{\mathrm{1234}}-\frac{1}{3}\left({c}_{1}{c}_{12}+{c}_{1}{c}_{\mathrm{1234}}+{c}_{\mathrm{123}}{c}_{\mathrm{1234}}\right)+\frac{2}{15},\hfill \end{align*}$

where compact notation (c₁₂₃ = coth(x₁ + x₂ + x₃), for example) has been used. Here the structure previously discussed becomes apparent. In equation (2.31) the term in the sum where n = r − 1 forces each p_i to be equal to one, giving the full product of coth functions with none missing. This is the leading term in each of the examples above. To generate the rest of the terms, neighbouring pairs of coth functions in this term are replaced with −1/3, neighbouring quadruplets are replaced with 2/15, and so on. All possible such replacements appear in the above functions, where the replacing numbers are given from

$\begin{equation*}\mathrm{tanh}\enspace x=x-\frac{1}{3}{x}^{3}+\frac{2}{15}{x}^{5}-\frac{17}{315}{x}^{7}+\cdots \enspace .\end{equation*}$

The targets of this work, the operators ${\hat{G}}_{N}$ , will now be examined. It was previously mentioned that the leading term ${\hat{G}}_{1}$ is already well known and while this was calculated for the regular Baker–Campbell–Hausdorff formula Z(X, Y), it is of course trivial to map it to the symmetric version $\mathcal{S}\left(A,B\right)$ considered here. Using the general formulae of the preceding section, it would be natural to write

$\begin{equation*}{\hat{G}}_{1}=\left[\frac{1}{\mathrm{sinh}\left({L}_{1}\right)}\right]{L}_{1}+\left[\frac{1}{\mathrm{sinh}\left(-{L}_{1}\right)}\right]\left(-{L}_{1}\right).\end{equation*}$

Of course, as both x and sinh(x) are odd functions, the minus signs are irrelevant and there is only really one term.

Next, at second order and third order it is found that

$\begin{align*}\hfill {\hat{G}}_{2}& =\left[\frac{\mathrm{coth}\left({L}_{1}\right)}{\mathrm{sinh}\left({L}_{1}+{L}_{2}\right)}\right]\left({L}_{1}+{L}_{2}\right)+\left[\frac{1}{\mathrm{sinh}\left(-{L}_{1}\right)}\right]\left[\frac{1}{\mathrm{sinh}\left({L}_{2}\right)}\right]\left(-{L}_{1}+{L}_{2}\right)\hfill \\ \hfill & \quad +\left[\frac{\mathrm{coth}\left(-{L}_{2}\right)}{\mathrm{sinh}\left(-{L}_{1}-{L}_{2}\right)}\right]\left(-{L}_{1}-{L}_{2}\right),\hfill \end{align*}$

and

$\begin{align*}\hfill {\hat{G}}_{3}& =\left[\frac{\mathrm{coth}\left({L}_{1}\right)\mathrm{coth}\left({L}_{1}+{L}_{2}\right)-\frac{1}{3}}{\mathrm{sinh}\left({L}_{1}+{L}_{2}+{L}_{3}\right)}\right]\left({L}_{1}+{L}_{2}+{L}_{3}\right)\hfill \\ \hfill & \quad +\left[\frac{1}{\mathrm{sinh}\left(-{L}_{1}\right)}\right]\left[\frac{\mathrm{coth}\left({L}_{2}\right)}{\mathrm{sinh}\left({L}_{2}+{L}_{3}\right)}\right]\left(-{L}_{1}+{L}_{2}+{L}_{3}\right)\hfill \\ \hfill & \quad +\left[\frac{\mathrm{coth}\left(-{L}_{2}\right)}{\mathrm{sinh}\left(-{L}_{1}-{L}_{2}\right)}\right]\left[\frac{1}{\mathrm{sinh}\left({L}_{3}\right)}\right]\left(-{L}_{1}-{L}_{2}+{L}_{3}\right)\hfill \\ \hfill & \quad +\left[\frac{\mathrm{coth}\left(-{L}_{3}\right)\mathrm{coth}\left(-{L}_{2}-{L}_{3}\right)-\frac{1}{3}}{\mathrm{sinh}\left(-{L}_{1}-{L}_{2}-{L}_{3}\right)}\right]\left(-{L}_{1}-{L}_{2}-{L}_{3}\right).\hfill \end{align*}$

With these, some general themes begin to emerge. It is immediately seen that each term factorises into two parts, written above with square brackets. In a given term, all commutator operators with a plus sign gather into one of these parts while those with a minus sign gather into the other. The only question that remains is how the arguments to each coth function are determined.

Consider, for example, the term involving −L₁ − L₂ − L₃ + L₄ + L₅ + L₆ + L₇ in ${\hat{G}}_{7}$ . Pictorially, the arguments for each function can be found from the diagram.

Here, the top red lines highlight the arguments of each sinh function, while the blue lines show the arguments to the coth functions. Combined with the previous discussion on how to write down these coth functions to form the numerators, this says how to write ${\hat{G}}_{N}$ for any order N. Of course equation (2.29) already provides such a formula, but perhaps observing these patterns for finite results may provide a more intuitive understanding.

For reference, the next two orders in the expansion are given by

$\begin{align*}\hfill {\hat{G}}_{4}& =\left[\frac{{c}_{1}{c}_{12}{c}_{\mathrm{123}}-\frac{1}{3}\left({c}_{1}+{c}_{\mathrm{123}}\right)}{{s}_{\mathrm{1234}}}\right]\left({L}_{1}+{L}_{2}+{L}_{3}+{L}_{4}\right)\hfill \\ \hfill & \quad +\left[\frac{1}{{s}_{\bar{1}}}\right]\left[\frac{{c}_{2}{c}_{23}-\frac{1}{3}}{{s}_{\mathrm{234}}}\right]\left(-{L}_{1}+{L}_{2}+{L}_{3}+{L}_{4}\right)\hfill \\ \hfill & \quad +\left[\frac{{c}_{\bar{2}}}{{s}_{\bar{12}}}\right]\left[\frac{{c}_{3}}{{s}_{34}}\right]\left(-{L}_{1}-{L}_{2}+{L}_{3}+{L}_{4}\right)\hfill \\ \hfill & \quad +\left[\frac{{c}_{\bar{3}}{c}_{\bar{23}}-\frac{1}{3}}{{s}_{\bar{123}}}\right]\left[\frac{1}{{s}_{4}}\right]\left(-{L}_{1}-{L}_{2}-{L}_{3}+{L}_{4}\right)\hfill \\ \hfill & \quad +\left[\frac{{c}_{\bar{4}}{c}_{\bar{34}}{c}_{\bar{234}}-\frac{1}{3}\left({c}_{\bar{4}}+{c}_{\bar{234}}\right)}{{s}_{\bar{1234}}}\right]\left(-{L}_{1}-{L}_{2}-{L}_{3}-{L}_{4}\right),\hfill \end{align*}$

and

$\begin{align*}\hfill {\hat{G}}_{5}& =\left[\frac{{c}_{1}{c}_{12}{c}_{\mathrm{123}}{c}_{\mathrm{1234}}-\frac{1}{3}\left({c}_{1}{c}_{12}+{c}_{1}{c}_{\mathrm{1234}}+{c}_{\mathrm{123}}{c}_{\mathrm{1234}}\right)+\frac{2}{15}}{{s}_{\mathrm{12345}}}\right]\hfill \\ \hfill & \quad {\times}\left({L}_{1}+{L}_{2}+{L}_{3}+{L}_{4}+{L}_{5}\right)\hfill \\ \hfill & \quad +\left[\frac{1}{{s}_{\bar{1}}}\right]\left[\frac{{c}_{2}{c}_{23}{c}_{\mathrm{234}}-\frac{1}{3}\left({c}_{2}+{c}_{\mathrm{234}}\right)}{{s}_{\mathrm{2345}}}\right]\left(-{L}_{1}+{L}_{2}+{L}_{3}+{L}_{4}+{L}_{5}\right)\hfill \\ \hfill & \quad +\left[\frac{{c}_{\bar{2}}}{{s}_{\bar{12}}}\right]\left[\frac{{c}_{3}{c}_{34}-\frac{1}{3}}{{s}_{\mathrm{345}}}\right]\left(-{L}_{1}-{L}_{2}+{L}_{3}+{L}_{4}+{L}_{5}\right)\hfill \\ \hfill & \quad +\left[\frac{{c}_{\bar{3}}{c}_{\bar{23}}-\frac{1}{3}}{{s}_{\bar{123}}}\right]\left[\frac{{c}_{4}}{{s}_{45}}\right]\left(-{L}_{1}-{L}_{2}-{L}_{3}+{L}_{4}+{L}_{5}\right)\hfill \\ \hfill & \quad +\left[\frac{{c}_{\bar{4}}{c}_{\bar{34}}{c}_{\bar{234}}-\frac{1}{3}\left({c}_{\bar{4}}+{c}_{\bar{234}}\right)}{{s}_{\bar{1234}}}\right]\left[\frac{1}{{s}_{5}}\right]\left(-{L}_{1}-{L}_{2}-{L}_{3}-{L}_{4}+{L}_{5}\right)\hfill \\ \hfill & \quad +\left[\frac{{c}_{\bar{5}}{c}_{\bar{45}}{c}_{\bar{345}}{c}_{\bar{2345}}-\frac{1}{3}\left({c}_{\bar{5}}{c}_{\bar{45}}+{c}_{\bar{5}}{c}_{\bar{2345}}+{c}_{\bar{345}}{c}_{\bar{2345}}\right)+\frac{2}{15}}{{s}_{\bar{12345}}}\right]\hfill \\ \hfill & \quad {\times}\left(-{L}_{1}-{L}_{2}-{L}_{3}-{L}_{4}-{L}_{5}\right).\hfill \end{align*}$

Here the notation has been made compact by writing, for example, s₁ = sinh(L₁) and ${c}_{\bar{23}}=\mathrm{coth}\left(-{L}_{2}-{L}_{3}\right)$ .

5. Apparent singularities and an alternative representation

One may, upon reading section 4 and the examples therein, be concerned that the operators ${\hat{G}}_{N}$ appear divergent. Both coth(x) and 1/ sinh(x) have simple poles when their argument is zero. This section, however, will provide the framework for removing these apparent singularities at will. While this can be done using the operators as given in the preceding section, it is better to rewrite and potentially simplify using hyperbolic identities, creating alternative representations. In this section one such alternative shall be discussed and used as the basis for an algorithmic approach to removing singularities, which is performed in detail in appendix D. Also in appendix D is an exhaustive list of possible singularities in the operators ${\hat{G}}_{1}$ , ${\hat{G}}_{2}$ , ${\hat{G}}_{3}$ , and ${\hat{G}}_{4}$ , and the result of removing them. A general approach, rather than the algorithmic method we demonstrate on low order examples, is an open problem worthy of study.

The starting point for obtaining this alternative representation is the m = 0 identity (2.27),

$\begin{equation*}\sum _{r=0}^{N}{\tilde {F}}_{r}\left(-{L}_{r},-{L}_{r-1},\dots ,-{L}_{1}\right){\tilde {F}}_{N-r}\left({L}_{r+1},{L}_{r+2},\dots ,{L}_{N}\right)=0.\end{equation*}$

The two outer terms, that is r = 0, N, can be extracted to give

$\begin{align*}\hfill & \frac{{f}_{N-1}\left({L}_{1},\dots ,{L}_{N-1}\right)-{f}_{N-1}\left(-{L}_{N},\dots ,-{L}_{2}\right)}{\mathrm{sinh}\left({L}_{1}+\dots +{L}_{N}\right)}\hfill \\ \hfill & \quad =-\sum _{r=1}^{N-1}\frac{{f}_{r-1}\left(-{L}_{r},\dots ,-{L}_{2}\right)}{\mathrm{sinh}\left(-{L}_{1}-\cdots -{L}_{r}\right)}\frac{{f}_{N-r-1}\left({L}_{r+1},\dots ,{L}_{N-1}\right)}{\mathrm{sinh}\left({L}_{r+1}+\dots +{L}_{N}\right)}.\hfill \end{align*}$

The key to this representation is to eliminate all sinh functions. To that end, the hyperbolic identity

$\begin{align}\hfill & \frac{1}{\mathrm{sinh}\left(-{L}_{1}-\cdots -{L}_{r}\right)}\frac{1}{\mathrm{sinh}\left({L}_{r+1}+\dots +{L}_{N}\right)}\hfill \\ \hfill & \quad =\frac{\mathrm{coth}\left(-{L}_{1}-\cdots -{L}_{r}\right)-\mathrm{coth}\left({L}_{r+1}+\dots +{L}_{N}\right)}{\mathrm{sinh}\left({L}_{1}+\dots +{L}_{N}\right)},\hfill \end{align} \tag{ 5.1 }$

may be used to rewrite the right-hand side of the above equation, and the sinh function is thus eliminated. At this point there is a clear divide, with half of the terms containing the variable L₁ but not L_N and the other half containing L_N but not L₁. The equation can be reorganised to separate each half by the equals sign which, along with linear independence of the functions involving L₁ and L_N, implies that each half separately must be equal to some constant. That is, for the L₁ dependent half,

$\begin{align}\hfill & {f}_{N-1}\left({L}_{1},\dots ,{L}_{N-1}\right)+\sum _{r=1}^{N-1}\enspace \mathrm{coth}\left(-{L}_{1}-\cdots -{L}_{r}\right){f}_{r-1}\left(-{L}_{r},\dots ,-{L}_{2}\right)\hfill \\ \hfill & \quad {\times}{f}_{N-r-1}\left({L}_{r+1},\dots ,{L}_{N-1}\right)=\text{const}\equiv {a}_{N}^{\text{odd}}.\hfill \end{align} \tag{ 5.2 }$

One of the striking features of this representation is the factorisation structure which has been ubiquitous in this work. Its presence here gives reassurance that this formula is natural. Secondly, outside of the f_N−1, all dependence on the variable L₁ appears only in the outer coth terms. These terms can be thought of as a linearly independent basis functions, with the f_r−1 f_N−r−1 terms cast as coefficients. This then gives a more controlled way of dealing with these formulae. This equation will be used to rewrite the overall operator ${\hat{G}}_{N}$ , but first the constant must be found.

Finding this constant term may be done by taking the limit L₁ → ±∞ in f_N−1, which has the effect of setting each coth to one or minus one. Adapting equation (2.17) then,

$\begin{equation*}\underset{{L}_{1}\to {\pm}\infty }{\mathrm{lim}}\enspace {f}_{N-1}={\left({\pm}1\right)}^{N}{a}_{N}\equiv {\left({\pm}1\right)}^{N}\frac{{2}^{N-1}}{N!},\end{equation*}$

and so taking the same limits on equation (5.2) gives

$\begin{equation}{\left({\pm}1\right)}^{N-1}{a}_{N}{\pm}\sum _{r=1}^{N-1}{f}_{r-1}{f}_{N-r-1}={a}_{N}^{\text{odd}}.\end{equation} \tag{ 5.3 }$

The two equations contained in (5.3) can then be summed to find the constant

$\begin{equation*}{a}_{N}^{\text{odd}}=\begin{cases}0,\hfill & N\enspace \text{even},\hfill \\ {a}_{N},\hfill & N\enspace \text{odd},\hfill \end{cases}\end{equation*}$

with generating function

$\begin{equation*}\sum _{N=1}^{\infty }{a}_{N}^{\text{odd}}{x}^{N}=\mathrm{cosh}\left(x\right)\mathrm{sinh}\left(x\right)=x+\frac{2}{3}{x}^{3}+\frac{2}{15}{x}^{5}+\cdots \enspace .\end{equation*}$

Similarly the equation (5.3) may be subtracted, giving a set of identities which will prove useful when dealing with apparent singularities,

$\begin{equation}\sum _{r=1}^{N-1}{f}_{r-1}{f}_{N-r-1}=\begin{cases}_{N},\hfill & N\enspace \text{even,}\hfill \\ 0,\hfill & N\enspace \text{odd},\hfill \end{cases}\equiv {a}_{N}^{\text{even}}\end{equation} \tag{ 5.4 }$

where

$\begin{equation*}\sum _{n}{a}_{n}^{\text{even}}{x}^{n}={\mathrm{sinh}}^{2}\left(x\right)={x}^{2}+\frac{1}{3}{x}^{4}+\frac{2}{45}{x}^{6}+\cdots \enspace .\end{equation*}$

Returning to the alternative representation, the overall operators ${\hat{G}}_{N}$ may now be rewritten. Using the hyperbolic identity (5.1) to combine all sinh functions and the recursion relation (5.2) to eliminate both instances of f_N−1, it can be seen that

$\begin{equation}{\hat{G}}_{N}=2s\left({L}_{1}+\dots +{L}_{N}\right){g}_{N},\end{equation} \tag{ 5.5 }$

where

$\begin{equation}s\left(x\right)=\frac{x}{\mathrm{sinh}\left(x\right)},\end{equation} \tag{ 5.6 }$

$\begin{align}\hfill {g}_{N}& ={a}_{N}^{\text{odd}}+\sum _{r=1}^{N-1}E\left({L}_{1}+\dots +{L}_{r},{L}_{r+1}+\dots +{L}_{N}\right){f}_{r-1}\left(-{L}_{r},\dots ,-{L}_{2}\right)\hfill \\ \hfill & \quad {\times}{f}_{N-r-1}\left({L}_{r+1},\dots ,{L}_{N-1}\right),\hfill \end{align} \tag{ 5.7 }$

and

$\begin{equation}E\left(x,y\right)=\frac{x\enspace \mathrm{coth}\left(x\right)-y\enspace \mathrm{coth}\left(y\right)}{x+y}.\end{equation} \tag{ 5.8 }$

For reference, the first few terms in this representation are given by

$\begin{align*}\hfill & {\hat{G}}_{1}=2s\left({L}_{1}\right),\hfill \\ \hfill & {\hat{G}}_{2}=2s\left({L}_{1}+{L}_{2}\right)E\left({L}_{1},{L}_{2}\right),\hfill \\ \hfill & {\hat{G}}_{3}=2s\left({L}_{1}+{L}_{2}+{L}_{3}\right)\left[\frac{2}{3}+E\left({L}_{1},{L}_{2}+{L}_{3}\right){f}_{1}\left({L}_{2}\right)+E\left({L}_{1}+{L}_{2},{L}_{3}\right){f}_{1}\left(-{L}_{2}\right)\right],\hfill \end{align*}$

and

$\begin{align*}\hfill {\hat{G}}_{4}& =2s\left({L}_{1}+{L}_{2}+{L}_{3}+{L}_{4}\right)\left[E\left({L}_{1},{L}_{2}+{L}_{3}+{L}_{4}\right){f}_{2}\left({L}_{2},{L}_{3}\right)\right.\hfill \\ \hfill & \quad \left.+E\left({L}_{1}+{L}_{2},{L}_{3}+{L}_{4}\right){f}_{1}\left(-{L}_{2}\right){f}_{1}\left({L}_{3}\right)+E\left({L}_{1}+{L}_{2}+{L}_{3},{L}_{4}\right){f}_{2}\left(-{L}_{3},-{L}_{2}\right)\right].\hfill \end{align*}$

As with the previous representation and the example given in section 4, the patterns demonstrated in these early examples continue. In ${\hat{G}}_{4}$ , for example, the variables L₂ and L₃ move from one argument of the E function to the other. When they do so they similarly move from one multiplying f function to another, recalling that f₀ ≡ 1 and so is not written above. As they move between these f functions, they incur a minus sign. These patters allow one to write all later functions ${\hat{G}}_{N}$ . In appendix D, ${\hat{G}}_{6}$ is written explicitly, if one wishes to test their understanding.

It is fairly clear that both s(x) and E(x, y) are regular and infinitely differentiable and as such any apparent singularity involving L₁ or L_N is automatically safe in this representation. Demonstrating that any other limits are safe involves the identities just introduced in equation (5.4). This is done carefully for a particular example in appendix D, but is also done exhaustively in that same appendix for ${\hat{G}}_{1}$ , ${\hat{G}}_{2}$ , ${\hat{G}}_{3}$ , and ${\hat{G}}_{4}$ . It should be noted that in applications it is usual, rather than unusual, that such singularities are relevant. As such, the representation presented in this section should be considered as the starting point for practical use of the new formula.

6. Choice of basis

In this section the sums of commutator operators, that is strings like L₁ + L₂ + ⋯ + L_r, will be discussed. It was previously suggested that these were mathematically natural to use as arguments to various functions. It turns out that in the basis where the matrix A is diagonal, if such a basis exists, these sums result in the difference between two eigenvalues of A. As shall be seen, this drastically reduces the complexity of using the new representation.

First consider the matrix elements of LB ≡ [A, B]:

$\begin{equation*}{\left[LB\right]}_{{n}_{1}{n}_{2}}={A}_{{n}_{1}{n}^{\prime }}{B}_{{n}^{\prime }{n}_{2}}-{B}_{{n}_{1}{n}^{\prime }}{A}_{{n}^{\prime }{n}_{2}},\end{equation*}$

where summation over repeated indices is assumed. If A is a diagonal matrix then its matrix elements are given in terms of its eigenvalues as A_nm = a_n δ_nm, where δ_nm is the Kronecker delta. Hence in the basis A is diagonal the above is given by

$\begin{equation*}{\left[LB\right]}_{{n}_{1}{n}_{2}}=\left({a}_{{n}_{1}}-{a}_{{n}_{2}}\right){B}_{{n}_{1}{n}_{2}}.\end{equation*}$

More generally, for any Taylor expandable function f, it can be seen that

$\begin{align*}\hfill & {\left[f\left({L}_{i}+{L}_{i+1}+\dots +{L}_{i+r}\right){B}_{1}{B}_{2}\cdots {B}_{N}\right]}_{{n}_{1}{n}_{N+1}}=f\left({a}_{{n}_{i}}-{a}_{{n}_{i+r+1}}\right){B}_{{n}_{1}{n}_{2}}{B}_{{n}_{2}{n}_{3}}\cdots {B}_{{n}_{N}{n}_{N+1}}.\hfill \end{align*}$

This is a simple yet powerful result. If the function f is replaced with sinh or coth functions, then ${\hat{G}}_{N}$ may be determined without difficulty. This would allow calculations to be done numerically with relative ease as all the strings of commutator operators are replaced by real numbers. It is this choice of basis, then, which gives the results of this paper a practical raison d'être.

A few words ought to be said about the full expansion in this basis. Using the notation of section 5, note that an overall factor of s(L₁ + L₂ + ⋯ + L_N) may be extracted as, at each order, the argument of this function is the same difference of eigenvalues. That is,

$\begin{align}\hfill {\left[\mathcal{S}\left(A,B\right)\right]}_{n{n}^{\prime }}& ={\left[2A+\sum _{N=1}^{\infty }{\hat{G}}_{N}{B}_{1}\cdots {B}_{N}\right]}_{n{n}^{\prime }}\hfill \\ \hfill & =2{a}_{n}{\delta }_{n,{n}^{\prime }}+2s\left({a}_{n}-{a}_{{n}^{\prime }}\right)\left({B}_{n{n}^{\prime }}+{g}_{2}\left({a}_{n}-{a}_{{n}_{1}},{a}_{{n}_{1}}-{a}_{{n}^{\prime }}\right){B}_{n{n}_{1}}{B}_{{n}_{1}{n}^{\prime }}\right.\hfill \\ \hfill & \quad \left.+{g}_{3}\left({a}_{n}-{a}_{{n}_{1}},{a}_{{n}_{1}}-{a}_{{n}_{2}},{a}_{{n}_{2}}-{a}_{{n}^{\prime }}\right){B}_{n{n}_{1}}{B}_{{n}_{1}{n}_{2}}{B}_{{n}_{2}{n}^{\prime }}+\cdots \enspace \right),\hfill \end{align} \tag{ 6.1 }$

where again summation over repeated indices is assumed. The function s can be interpreted as a Boltzmann suppression factor, appropriately named when one considers potential applications in quantum and statistical mechanics, which reduces the weight of any matrix element for whom the difference in eigenvalues a_n − a_n' is sufficiently large. It is plotted in figure 2(a), for reference.

The reduced functions g_N, then, are what remains. Figure 2(b) displays g₂, which has several features generic to these functions. Firstly, it appears to be bounded and its extrema occur as its arguments diverge. It can be proved, though it will not be done here, that under such limits the relatively complicated g_N and the comparatively simple f_N−1 coincide. Then finding these extrema is elementary as each constituent coth function within f_N−1 takes values ±1. That these limits do in fact correspond to the extrema of g_N is not proved, but has been numerically verified up to g₈ and the results displayed in table 1. Furthermore, a generating function for the outermost of these bounds can be obtained and is given by

$\begin{equation*}b\left(x\right)=\frac{1-{e}^{-2x}}{1+{e}^{-4x}}=x+{x}^{2}-\frac{4}{3}{x}^{3}-\frac{5}{3}{x}^{4}+\frac{122}{45}{x}^{5}+\cdots \end{equation*}$

This series is absolutely convergent when |x| < π/4, which can easily be seen performing the rotation x → ix in the equation above. It is entirely possible for the series to converge outside of this region, however. This provides reassurance that the series (6.1) converges for sufficiently small B.

Table 1. Bounds of the functions g_N, obtained using the procedure outlined in the text and numerically verified.

Function	Lower bound	Upper bound
g₂	−1	1
g₃	−4/3	2/3
g₄	−5/3	5/3
g₅	−6/5	32/15
g₆	−122/45	122/45
g₇	−1088/315	676/315
g₈	−227/63	227/63

7. Conclusion

A new representation for the Baker–Campbell–Hausdorff formula has been found. This representation is a perturbative expansion in just one of two matrices, as opposed to both in the original representation. The series may then be truncated and give a good approximation to the full expansion for situations where only this second object is small. For physical problems this then would give access to a much larger parameter space than is currently available. Additionally, new problems for which the original representation was unusable may now be tackled. Transfer matrices in statistical mechanics is an example of one such problem, which is under active consideration. Appendix A discusses these briefly.

A final note should be made on practical use of this new formula. First, the representation discussed in section 5 and defined in equation (5.5) is perhaps the best starting point. It is simple to work with and automatically deals with several apparent singularities. Next, appendix E provides the computationally simplest way of obtaining the constituent parts of the representation. Finally, if this formula is to be useful, one of the matrices ought to be diagonal as discussed in section 6. Then all operators instead become functions of real numbers and calculations become easy to perform.

Appendix A.: Future use of the formula

In this appendix the potential future applications of the formula proven in this paper will be discussed. This work is the subject of ongoing research and is presented to aid comprehension as to the practical purpose of some of the formulae.

Simply put, in a quantum mechanical scenario it is common to have a Hamiltonian split into a dominant and perturbative part, say

$\begin{equation*}H=2A+2B.\end{equation*}$

Here the notation is designed to draw parallels with the objects within this paper, with A typically being a diagonal matrix and B being considered perturbatively small. A physicist would then turn to perturbation theory to progress, writing

$\begin{equation*}{E}_{0}=2{a}_{0}+2{B}_{00}+2\sum _{n\ne 0}\frac{1}{{a}_{0}-{a}_{n}}{B}_{0n}{B}_{n0}+\cdots \enspace ,\end{equation*}$

where E₀ is the groundstate of H, a_n the eigenvalues of A, and B_nm the matrix elements of B.

In a statistical mechanics scenario one would instead have the equation

$\begin{equation*}{e}^{H}={e}^{A}{e}^{2B}{e}^{A},\end{equation*}$

when considering transfer matrices, with H = −βF where F is a free-energy operator. At low temperature A is a diagonal matrix and B is small, in the same sense one uses in a quantum mechanical context, and so the formulae derived in this paper are relevant. One can then use perturbation theory on these formulae to obtain the equivalent formula as in quantum mechanics

$\begin{equation*}{E}_{0}=2{a}_{0}+2{B}_{00}+2\sum _{n\ne 0}{B}_{0n}{f}_{1}\left({a}_{0}-{a}_{n}\right){B}_{n0}+\cdots \enspace .\end{equation*}$

This E₀ contains the free-energy of the model in question and hence the partition function. Note, as mentioned previously, this is the subject of ongoing research and a full derivation will appear in a subsequent paper, up to at least sixth order. This amounts to an improved form of the well-known high-temperature expansion technique, leading to much more powerful and accurate results for both high- and low-temperature expansions. It is presented here purely as a guide to help understand the context in which this paper operates.

Appendix B.: Calculation of the sums

Presented here is a direct method of calculating the sum (2.6). As described in the main text, the key is to split the starting constrained sum into N + 1 semi-constrained sums (that is, one of the limits of the sum may be made infinite). To that end, note

$\begin{equation*}\sum _{-\frac{m-1}{2}{\leqslant}{n}_{1}{< }{n}_{2}{< }\cdots {< }{n}_{N}{\leqslant}\frac{m-1}{2}}=\sum _{{n}_{1}{< }{n}_{2}{< }\cdots {< }{n}_{N}{\leqslant}\frac{m-1}{2}}-\sum _{-\frac{m-1}{2}{ >}{n}_{1}{< }{n}_{2}{< }\cdots {< }{n}_{N}{\leqslant}\frac{m-1}{2}},\end{equation*}$

which is demonstrated by the diagram below. In this, circles represent the variables of the sum and their position along the line indicates the value said variables take, while rectangles represent the bounds of the sums. Open rectangles and circles allow equality, while filled do not.

This identity has transformed the constrained sum on the left into two sums. One of these is semi-constrained, as was targeted, while the other has one semi-constrained and N − 1 constrained variables. Applying this idea again gives

$\begin{equation*}\sum _{-\frac{m-1}{2}{ >}{n}_{1}{< }{n}_{2}{< }\cdots {< }{n}_{N}{\leqslant}\frac{m-1}{2}}=\sum _{\begin{subarray}{c}{n}_{1}{< }-\frac{m-1}{2}\\ {n}_{2}{< }\cdots {< }{n}_{N}{\leqslant}\frac{m-1}{2}\end{subarray}}-\sum _{-\frac{m-1}{2}{ >}{n}_{1}{\geqslant}{n}_{2}{< }\cdots {< }{n}_{N}{\leqslant}\frac{m-1}{2}},\end{equation*}$

or pictorially,

The number of constrained variables on the right-hand side is now reduced to N − 2. This can then be continued until there are no such variables remaining, resulting in an identity relating a sum with N constrained variables to N + 1 sums with only semi-constrained variables.

A generic term in this identity for the particular sum in the main text is given by

$\begin{align*}{2}^{N}\sum _{\begin{subarray}{c}{n}_{r}{\leqslant}\cdots {\leqslant}{n}_{1}{< }-\frac{m-1}{2}\\ {n}_{r+1}{< }\cdots {< }{n}_{N}{\leqslant}\frac{m-1}{2}\end{subarray}}{e}^{2{n}_{1}{L}_{1}}{e}^{2{n}_{2}{L}_{2}}\cdots {e}^{2{n}_{N}{L}_{N}}\hfill & =\left[{\left(-1\right)}^{r}{2}^{r}\sum _{{n}_{r}{\leqslant}\cdots {\leqslant}{n}_{1}{< }-\frac{m-1}{2}}{e}^{2{n}_{1}{L}_{1}}\cdots {e}^{2{n}_{r}{L}_{r}}\right]\hfill \\ \hfill & \quad {\times}\left[{2}^{N-r}\sum _{{n}_{r+1}{< }\cdots {< }{n}_{N}{\leqslant}\frac{m-1}{2}}{e}^{2{n}_{r+1}{L}_{r+1}}\cdots {e}^{2{n}_{N}{L}_{N}}\right],\hfill \end{align*}$

that is,

A simple change of variables, indicated on the picture above, gives

$\begin{align*}\hfill & \left[{\left(-1\right)}^{r}{2}^{r}\sum _{{p}_{1}=-\infty }^{-\frac{m-1}{2}-1}{e}^{2{p}_{1}\left({L}_{1}+\dots +{L}_{r}\right)}\sum _{{p}_{2}=-\infty }^{0}{e}^{2{p}_{2}\left({L}_{2}+\dots +{L}_{r}\right)}\cdots \sum _{{p}_{r}=-\infty }^{0}{e}^{2{p}_{2}{L}_{r}}\right]\hfill \\ \hfill & \quad {\times}\left[{2}^{N-r}\sum _{{p}_{r+1}=-\infty }^{-1}{e}^{2{p}_{r+1}{L}_{r+1}}\cdots \sum _{{p}_{N-1}=-\infty }^{-1}{e}^{2{p}_{N}-1\left({L}_{r+1}+\dots +{L}_{N-1}\right)}\right.\left.{\times}\sum _{{p}_{N}=-\infty }^{\frac{m-1}{2}}{e}^{2{p}_{N}\left({L}_{r+1}+\dots +{L}_{N}\right)}\right],\hfill \end{align*}$

which may be trivially calculated. Using the identities

$\begin{align*}\hfill & -2\sum _{n=-\infty }^{-\frac{m-1}{2}-1}{e}^{\mathrm{2}nx}=\frac{-2{e}^{-\left(m+1\right)x}}{1-{e}^{-2x}}=\frac{{e}^{-mx}}{\mathrm{sinh}\left(-x\right)},\quad \hfill \\ \hfill & \quad -2\sum _{n=-\infty }^{0}{e}^{\mathrm{2}nx}=\frac{-2}{1-{e}^{-2x}}=\mathrm{coth}\left(-x\right)-1,\hfill \end{align*}$

and

$\begin{align*}\hfill & 2\sum _{n=-\infty }^{\frac{m-1}{2}}{e}^{\mathrm{2}nx}=\frac{2{e}^{\left(m-1\right)x}}{1-{e}^{-2x}}=\frac{{e}^{mx}}{\mathrm{sinh}\left(x\right)},\quad \hfill \\ \hfill & 2\sum _{n=-\infty }^{-1}{e}^{\mathrm{2}nx}=\frac{2{e}^{-2x}}{1-{e}^{-2x}}=\mathrm{coth}\left(x\right)-1,\hfill \end{align*}$

provides the result required in the main text.

Appendix C.: Proof of commutator representation

In this appendix it will be proven that, subject to the identities described in section 3,

$\begin{equation}{\hat{G}}_{N}{B}_{1}\cdots {B}_{N}=\frac{1}{N}{\hat{G}}_{N}\left[\left[\cdots \left[\left[{B}_{1},{B}_{2}\right],{B}_{3}\right],\cdots \enspace \right],{B}_{N}\right].\end{equation} \tag{ C.1 }$

To begin, the commutator on the right-hand side of the above equation may be written in terms of permutations of the string B₁, ⋯, B_N. That is,

$\begin{align}\hfill & \frac{1}{N}{\hat{G}}_{N}\left[\left[\cdots \left[\left[{B}_{1},{B}_{2}\right],{B}_{3}\right],\cdots \enspace \right],{B}_{N}\right]\hfill \\ \hfill & \quad =\frac{1}{N}{G}_{N}\left\{\left(\right)-{\left(1NN-1\cdots 2\right)}_{B}\right\}\cdots \left\{{\left(\right)}_{B}-{\left(132\right)}_{B}\right\}\left\{{\left(\right)}_{B}-{\left(12\right)}_{B}\right\}{B}_{1}\cdots {B}_{N},\hfill \end{align} \tag{ C.2 }$

where ${\left({n}_{1}{n}_{2}\cdots {n}_{N}\right)}_{B}$ represents a permutation of the string B₁, ⋯, B_N. Next the indices in each term may be relabeled, keeping the order B₁, ⋯, B_N and instead permuting the arguments of the function ${\hat{G}}_{N}\left({L}_{1},\enspace \cdots ,\enspace {L}_{N}\right)$ . For example,

$\begin{align*}\hfill {\left(132\right)}_{B}{\hat{G}}_{4}\left({L}_{1},{L}_{2},{L}_{3},{L}_{4}\right){B}_{1}{B}_{2}{B}_{3}{B}_{4}& ={\hat{G}}_{4}\left({L}_{1},{L}_{2},{L}_{3},{L}_{4}\right){B}_{3}{B}_{1}{B}_{2}{B}_{4}\hfill \\ \hfill & ={\hat{G}}_{4}\left({L}_{2},{L}_{3},{L}_{1},{L}_{4}\right){B}_{1}{B}_{2}{B}_{3}{B}_{4}\hfill \\ \hfill & ={\left(123\right)}_{G}{\hat{G}}_{4}\left({L}_{1},{L}_{2},{L}_{3},{L}_{4}\right){B}_{1}{B}_{2}{B}_{3}{B}_{4}.\hfill \end{align*}$

In general, for any permutation P,

$\begin{equation*}{P}_{B}{\hat{G}}_{N}\left({L}_{1},\dots ,{L}_{N}\right){B}_{1}\cdots {B}_{N}={P}_{G}^{-1}{\hat{G}}_{N}\left({L}_{1},\dots ,{L}_{N}\right){B}_{1}\cdots {B}_{N},\end{equation*}$

and so equation (C.2) may be rewritten in terms of permutations on ${\hat{G}}_{N}$ as

$\begin{align}\hfill \frac{1}{N}& {\hat{G}}_{N}\left[\left[\cdots \left[\left[{B}_{1},{B}_{2}\right],{B}_{3}\right],\cdots \enspace \right],{B}_{N}\right]\hfill \\ \hfill & =\frac{1}{N}\left\{{\left(\right)}_{G}-{\left(12\right)}_{G}\right\}\left\{{\left(\right)}_{G}-{\left(123\right)}_{G}\right\}\cdots \left\{{\left(\right)}_{G}-{\left(1\cdots N\right)}_{G}\right\}{\hat{G}}_{N}{B}_{1}\cdots {B}_{N}\hfill \\ \hfill & =\frac{1}{N}\left[{\left(\right)}_{G}+\sum _{m=1}^{N}{\left(-1\right)}^{m}\sum _{1{< }{n}_{m}{< }\cdots {< }{n}_{1}{\leqslant}N}{\left(1\cdots {n}_{m}\right)}_{G}\cdots {\left(1\cdots {n}_{1}\right)}_{G}\right]{\hat{G}}_{N}{B}_{1}\cdots {B}_{N}.\hfill \end{align} \tag{ C.3 }$

The identities of the function ${\hat{G}}_{N}$ may also be written in this permutation style. Most relevantly, choosing the arguments L_m, L_m−1, ..., L₁ and changing their position with respect to the remaining arguments L_m+1, L_m+2, ..., L_N while keeping the two sets internally ordered may be written as

$\begin{equation*}\sum _{1{\leqslant}{n}_{m}{< }\cdots {< }{n}_{1}{\leqslant}N}{\left(1\cdots {n}_{m}\right)}_{G}\cdots {\left(1\cdots {n}_{1}\right)}_{G}{\hat{G}}_{N}\left({L}_{1},{L}_{2},\dots ,{L}_{N}\right)=0.\end{equation*}$

As an example, for m = 2 and N = 4 the above reads

$\begin{align*}\hfill & {\hat{G}}_{4}\left({L}_{2},{L}_{1},{L}_{3},{L}_{4}\right)+{\hat{G}}_{4}\left({L}_{2},{L}_{3},{L}_{1},{L}_{4}\right)+{\hat{G}}_{4}\left({L}_{2},{L}_{3},{L}_{4},{L}_{1}\right)\hfill \\ \hfill & \quad +{\hat{G}}_{4}\left({L}_{3},{L}_{2},{L}_{1},{L}_{4}\right)+{\hat{G}}_{4}\left({L}_{3},{L}_{2},{L}_{4},{L}_{1}\right)+{\hat{G}}_{4}\left({L}_{3},{L}_{4},{L}_{2},{L}_{1}\right)=0.\hfill \end{align*}$

The sum in the identity can be split into two cases: n_m = 1 and n_m ≠ 1. This gives

$\begin{align*}\hfill & \left(-1\right)\sum _{1{< }{n}_{m}{< }\cdots {< }{n}_{1}{\leqslant}N}{\left(1\cdots {n}_{m}\right)}_{G}\cdots {\left(1\cdots {n}_{1}\right)}_{G}{\hat{G}}_{N}\hfill \\ \hfill & \quad =\sum _{1{< }{n}_{m-1}{< }\cdots {< }{n}_{1}{\leqslant}N}\left(1\cdots {n}_{m-1}\right)\cdots \left(1\cdots {n}_{1}\right){\hat{G}}_{N},\hfill \end{align*}$

where the argument to the function ${\hat{G}}_{N}$ have been suppressed for brevity. This leads naturally to recursion. Using lower order identities, that is starting at m − 1 not m and so on, it can been seen that

$\begin{equation*}{\left(-1\right)}^{m}\sum _{1{< }{n}_{m}{< }\cdots {< }{n}_{1}{\leqslant}N}{\left(1\cdots {n}_{m}\right)}_{G}\cdots {\left(1\cdots {n}_{1}\right)}_{G}{\hat{G}}_{N}={\hat{G}}_{N}.\end{equation*}$

The left-hand side of the above is exactly what is obtained when expanding equation (C.3), collecting all terms involving m permutations multiplied together. Exactly N copies of this occur, thus proving equation (C.1).

Appendix D.: Algorithmically removing apparent singularities

This appendix will provide an algorithmic approach to removing any apparent singularities in the operator ${\hat{G}}_{N}$ , using the representation and identities provided in section 5. For immediate use, the first four operators ${\hat{G}}_{N}$ have formulae provided for all possible singularities are provided towards the end of this appendix. However, first the general trends shall be discussed via a single larger example, namely ${\hat{G}}_{6}$ when all of L₂, L₃, L₄, and L₅ are simultaneously zero.

In the language of section 5, that is equation (5.5), the relevant part of ${\hat{G}}_{6}$ without any singularities may be written as

$\begin{align*}\hfill {g}_{6}\left({L}_{1},{L}_{2},{L}_{3},{L}_{4},{L}_{5},{L}_{6}\right)& =E\left({L}_{1},{L}_{2}+{L}_{3}+{L}_{4}+{L}_{5}+{L}_{6}\right){f}_{4}\left({L}_{2},{L}_{3},{L}_{4},{L}_{5}\right)\hfill \\ \hfill & \quad +E\left({L}_{1}+{L}_{2},{L}_{3}+{L}_{4}+{L}_{5}+{L}_{6}\right){f}_{1}\left(-{L}_{2}\right){f}_{3}\left({L}_{3},{L}_{4},{L}_{5}\right)\hfill \\ \hfill & \quad +E\left({L}_{1}+{L}_{2}+{L}_{3},{L}_{4}+{L}_{5}+{L}_{6}\right){f}_{2}\left(-{L}_{3},-{L}_{2}\right){f}_{2}\left({L}_{4},{L}_{5}\right)\hfill \\ \hfill & \quad +E\left({L}_{1}+{L}_{2}+{L}_{3}+{L}_{4},{L}_{5}+{L}_{6}\right){f}_{3}\left(-{L}_{4},-{L}_{3},-{L}_{2}\right){f}_{1}\left({L}_{5}\right)\hfill \\ \hfill & \quad +E\left({L}_{1}+{L}_{2}+{L}_{3}+{L}_{4}+{L}_{5},{L}_{6}\right){f}_{4}\left(-{L}_{5},-{L}_{4},-{L}_{3},-{L}_{2}\right).\hfill \end{align*}$

There are five limits to be taken and the order in which they should be performed is crucial. For the approach which will be laid out in this section, it is best to work from the outside in. That is, it is best to take the limit L₂ + L₃ + L₄ + L₅ → 0 first, followed by L₃ + L₄ + L₅ → 0, and so on. The reason for this will become apparent shortly. For now, under the first limit, both the first and last lines appear singular while the rest are regular. The identity, associated with ${a}_{6}^{\text{even}}$ in equation (5.4),

$\begin{align*}\hfill {f}_{4}\left({L}_{2},{L}_{3},{L}_{4},{L}_{5}\right)+{f}_{1}\left(-{L}_{2}\right){f}_{3}\left({L}_{3},{L}_{4},{L}_{5}\right)+{f}_{2}\left(-{L}_{3},-{L}_{2}\right){f}_{2}\left({L}_{4},{L}_{5}\right)\\ \hfill +{f}_{3}\left(-{L}_{4},-{L}_{3},-{L}_{2}\right){f}_{1}\left({L}_{5}\right)+{f}_{4}\left(-{L}_{5},-{L}_{4},-{L}_{3},-{L}_{2}\right)=\frac{2}{45},\end{align*}$

allows one to replace the f₄ in the first line. The singular part then becomes

$\begin{align*}\hfill & \left[E\left({L}_{1}+{L}_{2}+{L}_{3}+{L}_{4}+{L}_{5},{L}_{6}\right)-E\left({L}_{1},{L}_{2}+{L}_{2}+{L}_{3}+{L}_{4}+{L}_{5}+{L}_{6}\right)\right]\hfill \\ \hfill & \quad {\times}{f}_{4}\left(-{L}_{5},-{L}_{4},-{L}_{3},-{L}_{2}\right)={E}^{\left(1\right)}\left({L}_{1},{L}_{6}\right){\times}\left(-{L}_{2}-{L}_{3}-{L}_{4}-{L}_{5}\right)\hfill \\ \hfill & \quad {\times}{f}_{4}\left(-{L}_{5},-{L}_{4},-{L}_{3},-{L}_{2}\right)+\mathcal{O}\left({\left({L}_{2}+{L}_{3}+{L}_{4}+{L}_{5}\right)}^{2}\right),\hfill \end{align*}$

where

$\begin{equation}{E}^{\left(n\right)}\left(x,y\right)\equiv \frac{\frac{{\mathrm{d}}^{\mathrm{n}}}{\mathrm{d}{\left(-x\right)}^{n}}\left(x\enspace \mathrm{coth}\enspace x\right)-\frac{{\mathrm{d}}^{\mathrm{n}}}{\mathrm{d}{y}^{n}}\left(y\enspace \mathrm{coth}\enspace y\right)}{x+y}.\end{equation} \tag{ D.1 }$

In the limit L₂ → −L₃ − L₄ − L₅ note, using equation (5.2),

$\begin{equation*}\left(-{L}_{2}-{L}_{3}-{L}_{4}-{L}_{5}\right){f}_{4}\left(-{L}_{5},-{L}_{4},-{L}_{3},-{L}_{2}\right)\to {f}_{3}\left(-{L}_{5},-{L}_{4},-{L}_{3}\right).\end{equation*}$

This is the first of four direct limits that will be taken during this example and is the most simple; more will be said later of the general form of these expressions. For now, when L₂ → −L₃ − L₄ − L₅, it has been found that

$\begin{align*}\hfill {g}_{6}& =\frac{2}{45}E\left({L}_{1},{L}_{6}\right)+{E}^{\left(1\right)}\left({L}_{1},{L}_{6}\right){f}_{3}\left(-{L}_{5},-{L}_{4},-{L}_{3}\right)\hfill \\ \hfill & \quad +\left[E\left({L}_{1}-{L}_{3}-{L}_{4}-{L}_{5},{L}_{3}+{L}_{4}+{L}_{5}+{L}_{6}\right)-E\left({L}_{1},{L}_{6}\right)\right]\hfill \\ \hfill & \quad {\times}{f}_{1}\left({L}_{3}+{L}_{4}+{L}_{5}\right){f}_{3}\left({L}_{3},{L}_{4},{L}_{5}\right)+\left[E\left({L}_{1}-{L}_{4}-{L}_{5},{L}_{4}+{L}_{5}+{L}_{6}\right)\right.\hfill \\ \hfill & \quad \left.-E\left({L}_{1},{L}_{6}\right)\right]{f}_{2}\left(-{L}_{3},{L}_{3}+{L}_{4}+{L}_{5}\right){f}_{2}\left({L}_{4},{L}_{5}\right)+\left[E\left({L}_{1}-{L}_{5},{L}_{5}+{L}_{6}\right)\right.\hfill \\ \hfill & \quad \left.-E\left({L}_{1},{L}_{6}\right)\right]{f}_{3}\left(-{L}_{4},-{L}_{3},{L}_{3}+{L}_{4}+{L}_{5}\right){f}_{1}\left({L}_{5}\right).\hfill \end{align*}$

The next limit to consider is when L₃ + L₄ + L₅ → 0. In this case both lines one and two appear singular, and again an identity should be used to rewrite one of them. The identity now is associated with ${a}_{5}^{\text{even}}$ and states

$\begin{align*}\hfill & {f}_{3}\left({L}_{3},{L}_{4},{L}_{5}\right)+{f}_{1}\left(-{L}_{3}\right){f}_{2}\left({L}_{4},{L}_{5}\right)+{f}_{2}\left(-{L}_{4},-{L}_{3}\right){f}_{1}\left({L}_{5}\right)+{f}_{3}\left(-{L}_{5},-{L}_{4},-{L}_{3}\right)=0.\hfill \end{align*}$

This then can be used to replace the f₃ in the first line. One may wonder about the choice of how to use this identity; should the f₃ in the first line or the opposing f₃ in the second line be replaced? The generic answer to this is to replace the function multiplying the highest E⁽ⁿ⁾, in order to form a simple expression to be limited. With this the singular part becomes

$\begin{align*}\hfill & \left\{\left[E\left({L}_{1}-{L}_{3}-{L}_{4}-{L}_{5},{L}_{3}+{L}_{4}+{L}_{5}+{L}_{6}\right)-E\left({L}_{1},{L}_{6}\right)\right]\right.\hfill \\ \hfill & \quad \left.{\times}{f}_{1}\left({L}_{3}+{L}_{4}+{L}_{5}\right)-{E}^{\left(1\right)}\left({L}_{1},{L}_{6}\right)\right\}{f}_{3}\left({L}_{3},{L}_{4},{L}_{5}\right).\hfill \end{align*}$

This is the second direct limit which shall be taken in this example and contains features that appear in all that remain. First note that Taylor expansion gives

$\begin{equation}E\left(x-y,y+z\right)=E\left(x,z\right)+\sum _{r=1}^{\infty }{E}^{\left(r\right)}\left(x,z\right)\frac{{y}^{r}}{r!},\end{equation} \tag{ D.2 }$

as the denominator of E is unchanged. This then is the reason to take the limits from outside to in as described before, as all subsequent expansions will necessarily be of this form. In general, after n limits have been taken, the singular parts of g_N will take the form

$\begin{align}\hfill & \left[E\left(x-y,y+z\right)-E\left(x,z\right)\right]{f}_{n}\left(y,0,\dots ,0\right)-\sum _{r=1}^{n}\frac{1}{r!}{E}^{\left(r\right)}\left(x,z\right){f}_{n-r}\left(y,0,\dots ,0\right)\hfill \\ \hfill & \quad =\frac{1}{\left(n+1\right)!}{E}^{\left(n+1\right)}\left(x,z\right)y+\mathcal{O}\left({y}^{2}\right).\hfill \end{align} \tag{ D.3 }$

This is proved using a generating function for f_n(y, 0, ⋯, 0) in appendix E. Using this knowledge for the current example, the second limit L₃ → −L₄ − L₅ may be taken to leave

$\begin{align*}\hfill {g}_{6}& =\frac{2}{45}E\left({L}_{1},{L}_{6}\right)+\frac{1}{2}{E}^{\left(2\right)}\left({L}_{1},{L}_{6}\right){f}_{2}\left(-{L}_{5},-{L}_{4}\right)\hfill \\ \hfill & \quad +\left\{\left[E\left({L}_{1}-{L}_{4}-{L}_{5},{L}_{4}+{L}_{5}+{L}_{6}\right)-E\left({L}_{1},{L}_{6}\right)\right]{f}_{2}\left({L}_{4}+{L}_{5},0\right)\right.\hfill \\ \hfill & \quad \left.-{E}^{\left(1\right)}\left({L}_{1},{L}_{6}\right){f}_{1}\left({L}_{4}+{L}_{5}\right)\right\}{f}_{2}\left({L}_{4},{L}_{5}\right)\hfill \\ \hfill & \quad +\left\{\left[E\left({L}_{1}-{L}_{5},{L}_{5}+{L}_{6}\right)-E\left({L}_{1},{L}_{6}\right)\right]{f}_{3}\left(-{L}_{4},{L}_{4}+{L}_{5},0\right)\right.\hfill \\ \hfill & \quad \left.-{E}^{\left(1\right)}\left({L}_{1},{L}_{6}\right){f}_{2}\left(-{L}_{4},{L}_{4}+{L}_{5}\right)\right\}{f}_{1}\left({L}_{5}\right).\hfill \end{align*}$

The third limit to take is that when L₄ + L₅ → 0, with lines one and two appearing singular. The approach now is hopefully becoming familiar. First use the identity, associated with ${a}_{4}^{\text{even}}$ ,

$\begin{equation*}{f}_{2}\left({L}_{4},{L}_{5}\right)+{f}_{1}\left(-{L}_{4}\right){f}_{1}\left({L}_{5}\right)+{f}_{2}\left(-{L}_{5},-{L}_{4}\right)=\frac{1}{3},\end{equation*}$

to replace the f₂ in the first line. The relevant term is then

$\begin{align*}\hfill & \left\{\left[E\left({L}_{1}-{L}_{4}-{L}_{5},{L}_{4}+{L}_{5}+{L}_{6}\right)-E\left({L}_{1},{L}_{6}\right)\right]{f}_{2}\left({L}_{4}+{L}_{5},0\right)\right.\hfill \\ \hfill & \quad \left.-{E}^{\left(1\right)}\left({L}_{1},{L}_{6}\right){f}_{1}\left({L}_{4}+{L}_{5}\right)-\frac{1}{2}{E}^{\left(2\right)}\left({L}_{1},{L}_{6}\right)\right\}{f}_{2}\left({L}_{4},{L}_{5}\right)\to \frac{1}{3!}{E}^{\left(3\right)}\left({L}_{1},{L}_{6}\right){f}_{1}\left(-{L}_{5}\right),\hfill \end{align*}$

and hence in the limit L₄ → −L₅,

$\begin{align*}\hfill {g}_{6}& =\frac{2}{45}E\left({L}_{1},{L}_{6}\right)+\frac{1}{6}{E}^{\left(2\right)}\left({L}_{1},{L}_{6}\right)+\frac{1}{3!}{E}^{\left(3\right)}\left({L}_{1},{L}_{6}\right){f}_{1}\left(-{L}_{5}\right)\hfill \\ \hfill & \quad +\left\{\left[E\left({L}_{1}-{L}_{5},{L}_{5}+{L}_{6}\right)-E\left({L}_{1},{L}_{6}\right)\right]{f}_{3}\left({L}_{5},0,0\right)\right.\hfill \\ \hfill & \quad \left.-{E}^{\left(1\right)}\left({L}_{1},{L}_{6}\right){f}_{2}\left({L}_{5},0\right)-\frac{1}{2}{E}^{\left(2\right)}\left({L}_{1},{L}_{6}\right){f}_{1}\left({L}_{5}\right)\right\}{f}_{1}\left({L}_{5}\right).\hfill \end{align*}$

The final limit in this example is when L₅ → 0. Now a rather trivial identity may be used,

$\begin{equation*}{f}_{1}\left({L}_{5}\right)+{f}_{1}\left(-{L}_{5}\right)=0,\end{equation*}$

to replace the f₁ in the first line. The relevant term under this limit then is

$\begin{align*}\hfill & \left\{\left[E\left({L}_{1}-{L}_{5},{L}_{5}+{L}_{6}\right)-E\left({L}_{1},{L}_{6}\right)\right]{f}_{3}\left({L}_{5},0,0\right)-{E}^{\left(1\right)}\left({L}_{1},{L}_{6}\right){f}_{2}\left({L}_{5},0\right)\right.\hfill \\ \hfill & \quad \left.-\frac{1}{2}{E}^{\left(2\right)}\left({L}_{1},{L}_{6}\right){f}_{1}\left({L}_{5}\right)-\frac{1}{3!}{E}^{\left(3\right)}\left({L}_{1},{L}_{6}\right)\right\}{f}_{1}\left({L}_{5}\right)\to \frac{1}{4!}{E}^{\left(4\right)}\left({L}_{1},{L}_{6}\right),\hfill \end{align*}$

leaving

$\begin{equation*}{g}_{6}=\frac{2}{45}E\left({L}_{1},{L}_{6}\right)+\frac{1}{6}{E}^{\left(2\right)}\left({L}_{1},{L}_{6}\right)+\frac{1}{4!}{E}^{\left(4\right)}\left({L}_{1},{L}_{6}\right).\end{equation*}$

This then concludes the example for this appendix. The lessons to draw from it are as follows. First, take sequential limits from out to in; this allows the expansion (D.2) to be used as the denominator of E is untouched. Second, using this approach all relevant terms under a limit will be of the form defined in equation (D.3). The limit then can be easily taken and the regular formula found.

More complicated situations than those discussed in this appendix can occur, for example if there are gaps in the set of variables tending to zero. Having L₂, L₃ → 0 while simultaneously taking the limit L₅ → 0 is one such example, as there is a gap between variables due to L₄ ≠ 0. These can be dealt with in an analogous fashion to those of this appendix, but it requires more complicated identities and careful handling. Part of this is discussed in appendix E, but otherwise this will not be dealt with here.

What follows is concrete and usable formulae for the first four operators, in all possible cases. The first two operators are trivial, as

$\begin{equation*}{\hat{G}}_{1}=2s\left({L}_{1}\right),\quad {\hat{G}}_{2}\left({L}_{1},{L}_{2}\right)=2s\left({L}_{1}+{L}_{2}\right)E\left({L}_{1},{L}_{2}\right),\end{equation*}$

are clearly regular. The first non-trivial example then is ${\hat{G}}_{3}$ . This has six apparent singularities, of which five involve either L₁ or L₃ and thus are already resolved. What remains then is the limit L₂ → 0. In this case,

$\begin{equation*}{\hat{G}}_{3}\left({L}_{1},0,{L}_{3}\right)=2s\left({L}_{1}+{L}_{3}\right)\left[\frac{2}{3}+{E}^{\left(1\right)}\left({L}_{1},{L}_{3}\right)\right],\end{equation*}$

where E⁽ⁿ⁾ is defined by equation (D.1).

Next, ${\hat{G}}_{4}$ has ten apparent singularities with three of these being independent of L₁ or L₄. Explicitly these are when L₂ → 0, L₃ → 0, L₂ + L₃ → 0. There is also a double singularity when two of these are taken simultaneously.

For the first limit, it can be found that

$\begin{align*}\hfill {\hat{G}}_{4}\left({L}_{1},0,{L}_{3},{L}_{4}\right)& =2s\left({L}_{1}+{L}_{2}+{L}_{3}\right)\left\{\frac{1}{3}E\left({L}_{1},{L}_{3}+{L}_{4}\right)+{E}^{\left(1\right)}\left({L}_{1},{L}_{3}+{L}_{4}\right){f}_{1}\left({L}_{3}\right)\right.\hfill \\ \hfill & \quad \left.+\left[E\left({L}_{1}+{L}_{3},{L}_{4}\right)-E\left({L}_{1},{L}_{3}+{L}_{4}\right)\right]{f}_{2}\left(-{L}_{3},0\right)\right\}.\hfill \end{align*}$

The second limit can be easily found using the identity ${\hat{G}}_{4}\left({L}_{1},{L}_{2},{L}_{3},{L}_{4}\right)={\hat{G}}_{4}\left(-{L}_{4},-{L}_{3},-{L}_{2},-{L}_{1}\right)$ . The third limit yields

$\begin{align*}\hfill {\hat{G}}_{4}\left({L}_{1},{L}_{2},-{L}_{2},{L}_{4}\right)& =2s\left({L}_{1}+{L}_{4}\right)\left\{\frac{1}{3}E\left({L}_{1}+{L}_{2},-{L}_{2}+{L}_{4}\right)\right.\hfill \\ \hfill & \quad \left.+\left[E\left({L}_{1}+{L}_{2},-{L}_{2}+{L}_{4}\right)-E\left({L}_{1},{L}_{4}\right)\right]\right.\hfill \\ \hfill & \quad {\times}\left.{f}_{1}\left(-{L}_{2}\right){f}_{1}\left(-{L}_{2}\right)-{E}^{\left(1\right)}\left({L}_{1},{L}_{4}\right){f}_{1}\left(-{L}_{2}\right)\right\}.\hfill \end{align*}$

Finally, the fourth, double singularity, limit gives

$\begin{equation*}{\hat{G}}_{4}\left({L}_{1},0,0,{L}_{4}\right)=2s\left({L}_{1}+{L}_{4}\right)\left[\frac{1}{3}E\left({L}_{1},{L}_{4}\right)+\frac{1}{2}{E}^{\left(2\right)}\left({L}_{1},{L}_{4}\right)\right].\end{equation*}$

Appendix E.: Generating functions

In this section a generating function for the operators f_r−1, defined in equation (2.31), will be given. This generating function will be perhaps the simplest way of generating these operators in practice. Additionally, it will be used to provide results needed in appendix D to remove apparent singularities in the operators ${\hat{G}}_{N}$ .

Starting from equation (2.31), by multiplying by x^r, summing over r, and using this sum to eliminate the Kronecker delta on the right-hand side, it can be seen that

$\begin{align*}\hfill \sum _{r=1}^{\infty }{f}_{r-1}{x}^{r-1}& =\frac{\mathrm{tanh}\left(x\right)}{x}+\frac{1}{x}\sum _{n=1}^{\infty }\sum _{{p}_{1}=1}^{\infty }\cdots \sum _{{p}_{n+1}=1}^{\infty }\left({t}_{{p}_{1}}{c}_{{p}_{1}}{x}^{{p}_{1}}\right)\hfill \\ \hfill & \quad {\times}\cdots \left({t}_{{p}_{n}}{c}_{{p}_{1}+\dots +{p}_{n}}{x}^{{p}_{n}}\right){t}_{{p}_{n+1}}{x}^{{p}_{n+1}},\hfill \end{align*}$

where the compact notation c_i = coth(x₁ + ⋯ + x_i) has been used. Next the sum over p₁ can be relabelled to a sum over some index m, and pulled out the front. Relabelling the rest of the variables leaves

$\begin{align}\hfill \sum _{r=1}^{\infty }{f}_{r-1}{x}^{r-1}& =\frac{\mathrm{tanh}\left(x\right)}{x}+\sum _{m=1}^{\infty }{t}_{m}{c}_{m}{x}^{m}\left[\frac{\mathrm{tanh}\left(x\right)}{x}+\frac{1}{x}\sum _{n=1}^{\infty }\sum _{{p}_{1}=1}^{\infty }\cdots \right.\hfill \\ \hfill & \quad \left.{\times}\sum _{{p}_{n+1}=1}^{\infty }\left({t}_{{p}_{1}}{c}_{m+{p}_{1}}{x}^{{p}_{1}}\right)\cdots \left({t}_{{p}_{n}}{c}_{m+{p}_{1}+\dots +{p}_{n}}{x}^{{p}_{n}}\right){t}_{{p}_{n+1}}{x}^{{p}_{n+1}}\right].\hfill \end{align} \tag{ E.1 }$

Next, inspired by the structure of the above, define

$\begin{equation}{\mathcal{F}}_{N}\equiv \frac{\mathrm{tanh}\left(x\right)}{x}+\sum _{m=1}^{\infty }{t}_{m}{c}_{N+m}{x}^{m}{\mathcal{F}}_{N+m}.\end{equation} \tag{ E.2 }$

Then clearly, reading off from equation (E.1),

$\begin{equation*}\sum _{r=1}^{\infty }{f}_{r-1}{x}^{r-1}={\mathcal{F}}_{0}=\frac{\mathrm{tanh}\left(x\right)}{x}+x{c}_{1}{\mathcal{F}}_{1}-\frac{1}{3}{x}^{3}{c}_{3}{\mathcal{F}}_{3}+\cdots \enspace ,\end{equation*}$

giving a generating function for the operators f_r−1. In order to find a given operator, then, one would iteratively substitute in

$\begin{align*}\hfill {\mathcal{F}}_{1}& =\frac{\mathrm{tanh}\left(x\right)}{x}+x{c}_{2}{\mathcal{F}}_{2}-\frac{1}{3}{x}^{3}{c}_{4}{\mathcal{F}}_{4}+\cdots \enspace ,\hfill \\ \hfill {\mathcal{F}}_{2}& =\frac{\mathrm{tanh}\left(x\right)}{x}+x{c}_{3}{\mathcal{F}}_{3}-\frac{1}{3}{x}^{3}{c}_{5}{\mathcal{F}}_{5}+\cdots \enspace ,\hfill \end{align*}$

and so on, then Taylor expand the resulting ${\mathcal{F}}_{0}$ . It should be noted that as substituting in an ${\mathcal{F}}_{N}$ only affects the generating function at $\mathcal{O}\left({x}^{N}\right)$ , if one only cared about finding operators up to and including f_r then all subsequent ${\mathcal{F}}_{M}$ , M > r, may be set to zero with no adverse effect.

The next task for this appendix is to prove some results required for removing apparent singularities in the operators ${\hat{G}}_{N}$ . In particular, equation (D.3) shall be proved. That equation deals with the operator f_n(y, 0, ⋯, 0), which in the language of this appendix implies c₁ = c₂ = ⋯. In this special case, equation (E.2) simplifies to

$\begin{equation*}{\mathcal{F}}_{0}=\frac{\mathrm{tanh}\left(x\right)}{x}\left(1+x{c}_{1}{\mathcal{F}}_{0}\right),\end{equation*}$

with all N equations being identical. Subsequent rearrangement and hyperbolic manipulations provide

$\begin{equation*}\sum _{r=0}^{\infty }{f}_{r}\left(y,0,\dots ,0\right){\alpha }^{r}={\mathcal{F}}_{0}\left(\alpha ;y\right)=\frac{\mathrm{sinh}\left(y\right)\mathrm{sinh}\left(\alpha \right)}{\alpha \enspace \mathrm{sinh}\left(y-\alpha \right)},\end{equation*}$

where the x of this section has been replaced with α in order to distinguish it from the x within equation (D.3). Next take equation (D.3), multiply by αⁿ and sum over n to find

$\begin{align*}\hfill & \left[E\left(x-y,y+z\right)-E\left(x,z\right)\right]\sum _{n=1}^{\infty }{f}_{n}\left(y,0,\dots ,0\right){\alpha }^{n}\hfill \\ \hfill & \quad -\sum _{n=1}^{\infty }\sum _{r=1}^{n}\frac{{\alpha }^{r}}{r!}{E}^{\left(r\right)}\left(x,z\right){f}_{n-r}\left(y,0,\dots ,0\right){\alpha }^{n-r}\hfill \\ \hfill & \quad =\left[E\left(x-y,y+z\right)-E\left(x,z\right)-\sum _{r=1}^{\infty }\frac{{\alpha }^{r}}{r!}{E}^{\left(r\right)}\left(x,z\right)\right]{\mathcal{F}}_{0}\left(\alpha ;y\right)\hfill \\ \hfill & \quad =\sum _{r=1}^{\infty }\frac{\left({y}^{r}-{\alpha }^{r}\right)}{r!}{E}^{\left(r\right)}\left(x,z\right){\mathcal{F}}_{0}\left(\alpha ;y\right).\hfill \end{align*}$

Next note

$\begin{equation*}\left({y}^{r}-{\alpha }^{r}\right){\mathcal{F}}_{0}\left(\alpha ;y\right)=y{\alpha }^{r-1}+\mathcal{O}\left({y}^{2}\right),\end{equation*}$

hence proving the result required.

Similar results are required for more complicated singularities than those observed in appendix D, for example having gaps in the set of variables which tend to zero. That is, having L₂, L₃ → 0 while also simultaneously having L₅ → 0, for instance, leaving a non-zero gap via L₄. In that case identities would have to be used which require a generating function for f_r(x, y, 0, ⋯, 0), though this fact will not be proved. This appendix will close with the case c₂ = c₃ = ⋯, while c₁ is distinct, providing such a generating function. In this case, equation (E.2) gives two distinct functions,

$\begin{align*}\hfill {\mathcal{F}}_{0}& =\frac{\mathrm{tanh}\left(x\right)}{x}+\left[\left({c}_{1}-{c}_{2}\right)x+{c}_{2}\enspace \mathrm{tanh}\left(x\right)\right]{\mathcal{F}}_{1},\hfill \\ \hfill {\mathcal{F}}_{1}& =\frac{\mathrm{tanh}\left(x\right)}{x}\left(1+x{c}_{2}{\mathcal{F}}_{1}\right),\hfill \end{align*}$

which can then be rearranged to provide the required generating function. Ever more complicated sets of singularities will require ever more complicated generating functions, but it should be straightforward to see from what has been done here how this can be generalised.

An exact power series representation of the Baker–Campbell–Hausdorff formula

Article metrics

Submit

Author e-mails

Author affiliations

Author notes

ORCID iDs

Dates

Peer review information

Abstract

1. Introduction

2. Derivation of main result

2.1. Expanding M^m in powers of B

2.2. Rewriting F_N in terms of fundamental sums S_N

2.3. Calculating S_N

2.4. Rewriting F_N as a partition sum in terms of f_r

2.5. A partition formula for f_r

2.6. Resumming the partition formula

2.7. Revisiting M^m and implementing the fundamental mathematical approach

2.8. Final form

3. Representation as a sum of commutators

4. Finite examples

5. Apparent singularities and an alternative representation

6. Choice of basis

7. Conclusion

Appendix A.: Future use of the formula

Appendix B.: Calculation of the sums

Appendix C.: Proof of commutator representation

Appendix D.: Algorithmically removing apparent singularities

Appendix E.: Generating functions

An exact power series representation of the Baker–Campbell–Hausdorff formula

Article metrics

Submit

Share this article

Author e-mails

Author affiliations

Author notes

ORCID iDs

Dates

Peer review information

Abstract

1. Introduction

2. Derivation of main result

2.1. Expanding Mm in powers of B

2.2. Rewriting FN in terms of fundamental sums SN

2.3. Calculating SN

2.4. Rewriting FN as a partition sum in terms of fr

2.5. A partition formula for fr

2.6. Resumming the partition formula

2.7. Revisiting Mm and implementing the fundamental mathematical approach

2.8. Final form

3. Representation as a sum of commutators

4. Finite examples

5. Apparent singularities and an alternative representation

6. Choice of basis

7. Conclusion

Appendix A.: Future use of the formula

Appendix B.: Calculation of the sums

Appendix C.: Proof of commutator representation

Appendix D.: Algorithmically removing apparent singularities

Appendix E.: Generating functions

2.1. Expanding M^m in powers of B

2.2. Rewriting F_N in terms of fundamental sums S_N

2.3. Calculating S_N

2.4. Rewriting F_N as a partition sum in terms of f_r

2.5. A partition formula for f_r

2.7. Revisiting M^m and implementing the fundamental mathematical approach