1 Introduction

The physics programs of the Large Hadron Collider experiments promise data sets of unprecedented sizes for a variety of \(B_s\) decays. Consequently, analyses that emerge from these programs now dominate determinations of absolute branching fractions of \(B_s\) decays. The biggest source of uncertainties in these analyses is the poorly known fraction of the b quark fragmentation into \(\bar{B}_s^0\) versus \(\bar{B}^0\) mesons, denoted as \(f_s / f_d\). A promising approach to determine this ratio [1] from data is the measurement of a ratio of branching fractions for hadronic B decays:

$$\begin{aligned}&\frac{\sigma (pp \rightarrow \bar{B}_s^0 X) \times {\mathcal {B}}(\bar{B}_s^0 \rightarrow D_s^{(*)+} \pi ^-)}{\sigma (pp \rightarrow \bar{B}^0 X) \times {\mathcal {B}}(\bar{B}^0 \rightarrow D^{(*)+} K^-)}\nonumber \\&\quad \equiv \frac{\sigma (pp \rightarrow \bar{B}_s^0 X)}{\sigma (pp \rightarrow \bar{B}^0 X)} \times \mathcal {R}^{P(V)}_{s/d}. \end{aligned}$$
(1)

We consider the hadronic decays in this ratio very advantageous from the theory point of view. Since in these decays all valence quarks are distinguishable, we do not have to account for decay topologies involving penguin operators. For the same reason weak annihilation is not an issue either. Moreover, these decays are dominated by the color-allowed tree topology, and the color-suppressed operator enters only through perturbative or power corrections. In Ref. [2] the ratio that we here call \({\mathcal {R}}^P_{s/d}\) is given with a relative uncertainty of \(\sim 9\%\).

The purpose of the present article is threefold. First, we revisit the QCD factorization framework for the non-leptonic decays \(\bar{B}^0 \rightarrow D^{(*)+} K^-\) and \(\bar{B}_s^0 \rightarrow D_s^{(*)+} \pi ^-\) beyond leading power. We update the values of the \(\bar{B}_q\rightarrow D_q^{(*)}\) form factors based on a recent analysis within the heavy-quark expansion (HQE) [3]. We also provide, for the first time, conservative numerical estimates for the necessary hadronic matrix elements that enter at next-to-leading-power in \(\Lambda _\mathrm{QCD}/m_b\). Based on these improvements we predict the branching fractions of the four decays and, for the first time, use their theoretical correlation to reduce the uncertainty on their ratios \({\mathcal {R}}^{P(V)}_{s/d}\).

Second, we challenge existing experimental data on two-body non-leptonic decays into a heavy-light final state. We point out the subtleties in the comparison between theory and experiment. Our analysis reveals a puzzling pattern in the comparison of theory predictions and data on the absolute branching fractions of the tree-dominated \(\bar{B}_{(s)}^0 \rightarrow D_{(s)}^+ \lbrace \pi ^-, K^-\rbrace \) decays, whereas the predicted ratios \(\mathcal {R}^{P(V)}_{s/d}\) are in good agreement with experiment. We subsequently extract the \(f_s / f_d\) fragmentation fraction for the CDF, D0 and LHCb experiments in a variety of scenarios.

Last but not least, we critically assess possible origins for the observed puzzle – which amounts to a discrepancy of up to five standard deviations for the individual absolute branching fractions – with one of them being contributions from physics beyond the Standard Model (BSM).

This article is organised as follows. In Sect. 2 we revisit the QCD factorization framework, including a thorough discussion of next-to-leading power hadronic matrix elements. We then discuss the numerical input parameters that enter our expressions, and give results for the non-leptonic branching fractions and the ratios \(\mathcal {R}^{P(V)}_{s/d}\) at leading power. Our estimate of \(\mathcal {R}^{P(V)}_{s/d}\) at next-to-leading power shows their robustness against power corrections. In Sect. 3 we compare the theoretical predictions to experimental data, explain in detail our extraction of the \(f_s/f_d\) fragmentation fraction for various hadron colliders and uncover the puzzling pattern in non-leptonic \(\bar{B}_{(s)}^0 \rightarrow D_{(s)}^+ \lbrace \pi ^-, K^-\rbrace \) decays. We determine possible solutions to this puzzle, among them effects from BSM physics. We finally discuss prospects for future extractions of \(f_s/f_d\) using the method discussed here and potential alternatives. We conclude in Sect. 4. The article is supplemented by two appendices. In Appendix A we present the light-cone sum rule calculation for the soft-gluon matrix element relegated from Sect. 2, while in Appendix B we give details on the experimental inputs.

2 Theory prediction of \(\varvec{{\mathcal {B}}(\bar{B}^0\rightarrow D^{(*)+}K^-)}\) and \(\varvec{{\mathcal {B}}(\bar{B}_s^0\rightarrow D_s^{(*)+} \pi ^-)}\)

We begin by briefly summarizing the framework of collinear factorization [also known as QCD factorization (QCDF)] [4] for the decays \(\bar{B}^0\rightarrow D^{(*)+} K^-\) and \(\bar{B}_s^0 \rightarrow D_s^{(*)+} \pi ^-\) in Sect. 2.1. Numerical results are presented subsequently in Sect. 2.2.

2.1 Framework

Theory predictions for the decays under consideration are both relatively simple and particularly clean [4]. They are relatively simple, since neither penguin nor annihilation topologies contribute; they are particularly clean, again due to the absence of pollution from weak annihilation, but also because no chirally enhanced hard-scattering contributions are present at order \(\Lambda _\text {QCD}/m_b\). Since the latter two contributions constitute the main limitation of the QCDF approach, the resulting theory predictions are among the most reliable for non-leptonic decays. This is the basis for the phenomenological application of extracting the ratio \(f_s/f_d\) from these modes [1]. We emphasize that this statement does not hold for modes where the flavour of the spectator anti-quark is present in the valence content of the light meson. An example is the decay \(\bar{B}^0\rightarrow D^+\pi ^-\), which factorizes to leading power, but suffers from larger uncertainties due to endpoint divergences at subleading power.

The effective Lagrangian needed for the description of \(\bar{B}^0\rightarrow D^{(*)+}K^-\) and \(\bar{B}_s^0\rightarrow D_s^{(*)+}\pi ^-\) decays reads

$$\begin{aligned} {\mathcal {L}} = -\frac{4 G_F}{\sqrt{2}} V_{cb} V_{uq_2}^* \left[ C_1 {\mathcal {Q}}_1^{q_2} + C_2 {\mathcal {Q}}_2^{q_2}\right] , \end{aligned}$$
(2)

where \(q_2 = d, s\), and \(V_{cb}\), \(V_{uq_2}^*\) are CKM matrix elements. The Wilson coefficients \(C_{1,2}\) are \(q_2\)-flavour universal in the Standard Model (SM). We choose these two current-current operators in the CMM basis [5]:

$$\begin{aligned} {\mathcal {Q}}_{2(1)}^{q_2} \equiv \left[ \bar{c} \gamma ^\mu P_L (T^A) b\right] \, \left[ \bar{q}_2 \gamma _\mu P_L (T^A) u\right] . \end{aligned}$$
(3)

The Wilson coefficients \(C_{1,2}\) are known to next-to-next-to-leading logarithmic accuracy [6], and at the scale \(\mu = m_b\) we use

$$\begin{aligned} C_2 = +1.010\quad C_1 = -0.291. \end{aligned}$$
(4)

Their uncertainties are negligible compared to those of the hadronic matrix elements.

In this work, the light pseudoscalar (\(\bar{u} q_2)\) bound state is denoted as L when the particular quark flavour \(q_2\) is not relevant to the discussion. For the discussion below we adopt the power-counting \(\varepsilon \sim \Lambda _\text {QCD} / E_L \sim \Lambda _\text {QCD} / m_b\). Following this power counting the matrix elements of the current-current operators factorize to leading power [4]:

$$\begin{aligned}&\langle D_q^{(*)+} L^-| {\mathcal {Q}}_i |\bar{B}_q^0\rangle = \sum _{j} F_j^{\bar{B}_q \rightarrow D_q^{(*)}}(M_L^2) \nonumber \\&\quad \times \int _0^1 du\, T_{ij}(u) \phi _L(u)+ {\mathcal {O}}\left( \frac{\Lambda _\text {QCD}}{m_b}\right) . \end{aligned}$$
(5)

The amplitudes to leading power in \(\varepsilon \) read

$$\begin{aligned} \begin{aligned} {\mathcal {A}}(\bar{B}_{q}^0 \rightarrow D_{q}^+ L^-)&= i \frac{G_F}{\sqrt{2}} V_{uq_2}^* V_{cb} a_1(D_q^+ L^-) f_L \\&\quad \times F_0^{\bar{B}_q \rightarrow D_q}(M_L^2) (M_{B_q}^2 - M_{D_q}^2), \\ {\mathcal {A}}(\bar{B}_{q}^0 \rightarrow D_{q}^{*+} L^-)&= -i \frac{G_F}{\sqrt{2}} V_{uq_2}^* V_{cb} a_1(D_q^{*+} L^-) f_L \\&\quad \times A_0^{\bar{B}_q \rightarrow D_q^*}(M_L^2) 2 M_{D_q^*} \varepsilon ^*(\lambda =0)\cdot q \end{aligned} \end{aligned}$$
(6)

for either a pseudoscalar \(D_q\) meson, or a longitudinal (\(\lambda =0\)) \(D_q^*\) vector meson. The structure of these amplitudes holds to all orders in \(\alpha _s\). The effective Wilson coefficients \(a_1(D_q^{(*)+} L^-)\) have been computed to next-to-next-to-leading order in \(\alpha _s\) in Ref. [2]. We emphasize that the \(SU(3)_F\) breaking of the amplitudes is numerically driven by the decay constants of the light-meson. The deviation from the symmetry limit in heavy-to-heavy form factors is additionally suppressed by the heavy-quark masses and therefore expected to be very small [7,8,9]. A recent analysis of the symmetry breaking in heavy-to-heavy form factors yields results that are compatible with this expectation [3]. To ascertain the stability of QCDF predictions for \({\mathcal {R}}_{s/d}\), control of the symmetry-breaking effects is essential, even if they only arise within power corrections.

The formulas for the \(SU(3)_F\) ratios with either a pseudoscalar (P) or vector (V) meson \(D_q^{(*)}\) read

$$\begin{aligned} {\mathcal {R}}^P_{s/d}&= \frac{{\mathcal {B}}(\bar{B}_s^0 \rightarrow D_s^{+} \pi ^-)}{{\mathcal {B}}(\bar{B}^0 \rightarrow D^{+} K^-)} \nonumber \\&= \frac{\tau _{B_s}}{\tau _{B_d}}\left| \frac{V_{ud}}{V_{us}}\right| ^2 \frac{f_\pi ^2}{f_K^2} \left| \frac{F_0^{\bar{B}_s \rightarrow D_s}(M_\pi ^2)}{F_0^{\bar{B} \rightarrow D}(M_K^2)}\right| ^2 \left| \frac{a_1(D_s^+ \pi ^-)}{a_1(D^+ K^-)}\right| ^2\nonumber \\&\quad \times \left( \frac{M_{B_s}^2 - M_{D_s}^2}{M_B^2 - M_D^2}\right) ^2 \frac{M_B^3}{M_{B_s}^3} \frac{\sqrt{\lambda (M_{B_s}^2, M_{D_s}^2, M_\pi ^2)}}{\sqrt{\lambda (M_B^2, M_D^2, M_K^2)}}, \end{aligned}$$
(7)
$$\begin{aligned} {\mathcal {R}}^V_{s/d}&= \frac{{\mathcal {B}}(\bar{B}_s^0 \rightarrow D_s^{*+} \pi ^-)}{{\mathcal {B}}(\bar{B}^0 \rightarrow D^{*+} K^-)} \nonumber \\&= \frac{\tau _{B_s}}{\tau _{B_d}}\left| \frac{V_{ud}}{V_{us}}\right| ^2 \frac{f_\pi ^2}{f_K^2} \left| \frac{A_0^{\bar{B}_s \rightarrow D_s^*}(M_\pi ^2)}{A_0^{\bar{B} \rightarrow D^*}(M_K^2)}\right| ^2 \left| \frac{a_1(D_s^{*+} \pi ^-)}{a_1(D^{*+} K^-)}\right| ^2\nonumber \\&\quad \times \frac{M_B^3}{M_{B_s}^3} \frac{[\lambda (M_{B_s}^2, M_{D_s^*}^2, M_\pi ^2)]^{3/2}}{[\lambda (M_B^2, M_{D^*}^2, M_K^2)]^{3/2}}. \end{aligned}$$
(8)

Here \(\lambda (a,b,c) = a^2+b^2+c^2-2ab-2ac-2bc\) is the Källén function, and the formulas for the individual branching ratios can be found in [4]. To test the consistency of the QCDF predictions, we also study the fixed-flavour ratios of branching fractions to vector \(D_q^{*+}\) mesons over pseudoscalar \(D_q\) mesons:

$$\begin{aligned} {\mathcal {R}}_s^{V/P}&= \frac{{\mathcal {B}}(\bar{B}_s^0\rightarrow D_s^{*+}\pi ^-)}{{\mathcal {B}}(\bar{B}_s^0\rightarrow D_s^{+}\pi ^-)}, \end{aligned}$$
(9)
$$\begin{aligned} {\mathcal {R}}_d^{V/P}&= \frac{{\mathcal {B}}(\bar{B}^0\rightarrow D^{*+}K^-)}{{\mathcal {B}}(\bar{B}^0\rightarrow D^{+}K^-)}, \end{aligned}$$
(10)

which have analogous expressions.

Power corrections to the amplitudes in Eq. (6) arise from a variety of effects. To order \(\varepsilon \), these can potentially include higher-twist corrections to the light-meson light-cone distribution amplitude (LCDA); emission of a hard-collinear gluon from the spectator q, or from the heavy bottom and charm quarks; and exchange of a soft gluon between the \(\bar{B}_q^0\rightarrow D_q\) system and the light meson. We briefly discuss each of these effects and provide updated numerical estimates for their magnitudes. Importantly, using form factors in terms of QCD quark fields in Eq. (6) avoids explicit \(\Lambda /m_c\) power corrections. We note that implicit corrections due to application of the heavy-quark expansion to the form factors are part of our uncertainty budget as discussed below.

The contributions of the two-particle twist-three light-meson LCDA are suppressed with respect to the leading-twist contribution by one power of \(\varepsilon \). As discussed in the literature, see e.g. Ref. [10], twist-three corrections for a pseudoscalar scale like \(\mu _L / E_L\), which can be numerically large due to the normalization in terms of the factor

$$\begin{aligned} \mu _L \equiv \frac{M_{L^-}^2}{m_u + m_{q_2}}. \end{aligned}$$
(11)

However, the twist-three corrections to the hadronic amplitude vanish algebraically at leading-order in \(\alpha _s\) [11]. We find that higher-twist corrections only enter at order \(\alpha _s\varepsilon ^2\) or higher.

Up to \({\mathcal {O}}\left( \varepsilon \right) \), the exchange of a gluon between the heavy-to-heavy transition and the light meson can possibly enter in one of three ways [11].

(i) The exchange of a hard gluon from the b or c quark is a purely perturbative effect, and accounted for by the \({\mathcal {O}}\left( \alpha _s\right) \) correction to the effective Wilson coefficients \(a_1(D_q^{(*)} L)\).

(ii) The exchange of a hard-collinear gluon from the spectator quark is incompatible with the physical picture of the spectator having a soft momentum inside both the \(\bar{B}_q^0\) and the \(D_q\) meson. The exchange of a hard-collinear gluon from the b or c quark is possible, and contributes through a quark–antiquark–gluon Fock state of the light meson through three-particle LCDAs. Due to the \(V-A\) structure of the weak interaction the twist-three three-particle contribution is absent, and the first contribution emerges at the twist-four level [4]. Within our power counting, the twist-four contribution enters at the \({\mathcal {O}}\left( \varepsilon ^2\right) \) level. This power correction breaks \(SU(3)_F\) symmetry maximally, since it is absent for the \(L=\pi ^-\) case by virtue of G parity, but has a finite contribution for the \(L=K^-\) case. Using the expression for these corrections in Ref. [4], a model based on the conformal expansion of the LCDAs and the numerical results of Ref. [10] we obtain

$$\begin{aligned} \frac{{\mathcal {A}}(\bar{B}^0 \rightarrow D^{(*)+} K^-)\big |_\text {NNLP(hc)}}{{\mathcal {A}}(\bar{B}^0 \rightarrow D^{(*)+} K^-)\big |_\text {LP}} \simeq \frac{C_1}{a_1} \times -0.64\%. \end{aligned}$$
(12)

This term has been computed in the limit \(m_c \ll m_b\). Analysis of eq. (56) of Ref. [4] reveals that corrections due to a non-zero charm mass scale like \({\mathcal {O}}\left( \frac{\Lambda _\text {had}^2m_c}{m_b^3}\right) \), which are negligible to the accuracy for which we aim.

(iii) The exchange of a single soft gluon between the heavy-to-heavy transition and the light meson causes one of the light quarks in the light meson to become hard-collinear [4]. This configuration can be expressed through the heavy-to-heavy matrix elements of a non-local \(\bar{c} G b\) current. We estimate the relevant matrix elements using Light-Cone Sum Rules (LCSRs) in Appendix A. Using these estimates, we obtain ranges for the \({\mathcal {O}}\left( \varepsilon \right) \) soft-gluon correction:

$$\begin{aligned} \frac{{\mathcal {A}}(\bar{B}_q^0 \rightarrow D_q^+ L^-)\big |_\text {NLP}}{{\mathcal {A}}(\bar{B}_q^0 \rightarrow D_q^+ L^-)\big |_\text {LP}}&\simeq \frac{C_1}{4a_1} \times [0.76, 7.6]\%, \end{aligned}$$
(13)
$$\begin{aligned} \frac{{\mathcal {A}}(\bar{B}_q^0 \rightarrow D_q^{*+} L^-)\big |_\text {NLP}}{{\mathcal {A}}(\bar{B}_q^0 \rightarrow D_q^{*+} L^-)\big |_\text {LP}}&\simeq \frac{C_1}{4a_1} \times [0.46, 4.6]\%. \end{aligned}$$
(14)

The lower value of these ranges corresponds to the sum rule results for our nominal inputs as discussed in Appendix A. To obtain a more conservative estimate, we increase the values of the scale-setting parameters \(\lambda _E^2\) and \(\lambda _H^2\) by one order of magnitude, yielding the higher of the values. We emphasize that the ratio of the hadronic matrix elements are matching the size expected from naive dimensional analysis. However, the prefactor of \(C_1 / a_1 \sim -1/3\) renders all estimates for the power corrections small, and supports the picture of these decays being very clean.

2.2 Numerical inputs and results

Form factors For the form factors and form-factor ratios entering our predictions, we use the posterior samples for the HQE parametrization from Ref. [3]. This parametrization includes contributions to orders \(1/m_b,1/m_c^2,\alpha _s\), thereby providing a consistent treatment of the form factors and specifically also their uncertainties including \(1/m_c^2\) contributions, see Refs. [3, 12] for details. In Table 1 the values of the form factors used in this work are collected together with the ones used in Ref. [2]. We notice that with respect to Ref. [2] these form factor predictions have reduced uncertainties, especially for the \(\bar{B}_q^0\rightarrow D^+_q\) cases. This is dominantly due to improved lattice QCD determinations of the corresponding form factors presented in Refs. [13, 14], which are the most constraining inputs for the analysis in Ref. [3]. The correlation coefficients between these form factor values are

$$\begin{aligned} \begin{pmatrix} 1.0000 &{} 0.0814 &{} 0.1702 &{} - 0.0255 \\ 0.0814 &{} 1.0000 &{} 0.0059 &{} 0.4787 \\ 0.1702 &{} 0.0059 &{} 1.0000 &{} 0.0134 \\ - 0.0255 &{} 0.4787 &{} 0.0134 &{} 1.0000 \end{pmatrix}, \end{aligned}$$
(15)

with rows and columns ordered as \(F_0^{\bar{B} \rightarrow D}(M_K^2)\), \(A_0^{\bar{B} \rightarrow D^*}(M_K^2)\), \(F_0^{\bar{B}_s \rightarrow D_s}(M_\pi ^2)\), and \(A_0^{\bar{B}_s \rightarrow D_s^*}(M_\pi ^2)\). Accounting for these correlations, we obtain the form factor ratios:

$$\begin{aligned} \left| \frac{F_0^{\bar{B}_s \rightarrow D_s}(M_\pi ^2)}{F_0^{\bar{B} \rightarrow D}(M_K^2)}\right|&= 1.001 \pm 0.021, \end{aligned}$$
(16)
$$\begin{aligned} \left| \frac{A_0^{\bar{B}_s \rightarrow D_s^*}(M_\pi ^2)}{A_0^{\bar{B} \rightarrow D^*}(M_K^2)}\right|&= 0.9729 \pm 0.080, \end{aligned}$$
(17)
$$\begin{aligned} \left| \frac{F_0^{\bar{B}_s \rightarrow D_s}(M_\pi ^2)}{A_0^{\bar{B}_s \rightarrow D_s^*}(M_\pi ^2)}\right|&= 0.9773 \pm 0.095, \end{aligned}$$
(18)
$$\begin{aligned} \left| \frac{F_0^{\bar{B} \rightarrow D}(M_K^2)}{A_0^{\bar{B} \rightarrow D^*}(M_K^2)}\right|&= 0.9507 \pm 0.095. \end{aligned}$$
(19)

Effective Wilson coefficients   Based on the analytic results for the effective Wilson coefficients to NNLO in \(\alpha _s\) from Ref. [2], we produce numbers for their absolute values. The numbers used in the present work contain more significant digits, but are otherwise identical to those in Ref. [2], see Table 1. In addition, we require the following three ratios of effective Wilson coefficients:

$$\begin{aligned} \left| \frac{a_1(D_s^+ \pi ^-)}{a_1(D^+ K^-)}\right|&= 1.0024^{+0.0023}_{-0.0011}\, , \end{aligned}$$
(20)
$$\begin{aligned} \left| \frac{a_1(D_s^+ \pi ^-)}{a_1(D_s^{*+} \pi ^-)}\right|&= 1.0013^{+0.0006}_{-0.0005}\, , \end{aligned}$$
(21)
$$\begin{aligned} \left| \frac{a_1(D^+ K^-)}{a_1(D^{*+} K^-)}\right|&= 1.0013^{+0.0005}_{-0.0005}\, . \end{aligned}$$
(22)

\(\varvec{|V_{cb}|}\)   For \(|V_{cb}|\), we use the value obtained in Refs. [3, 12], which constitutes an average of the values extracted from inclusive and exclusive semileptonic decays. This value is larger than in Ref. [2] where the value for \(|V_{cb}|\) from exclusive decays as of 2016 was used.

\(\varvec{|V_{uq_2}|}\) and light-meson decay constants Throughout this work, we use directly the products of the CKM matrix elements \(|V_{ud}|\) or \(|V_{us}|\) with the respective light-meson decay constants \(f_\pi \) or \(f_K\) as obtained by the PDG from leptonic \(\pi ^-,K^-\) decays [15]. Our current values together with those from Ref. [2] are again collected in Table 1. Due to the partial cancellation of radiative corrections their ratio is even more precisely known than the individual quantities,

$$\begin{aligned} \left| \frac{V_{ud}}{V_{us}}\right| ^2 \times \left| \frac{f_\pi }{f_K}\right| ^2 = 13.128 \pm 0.038, \end{aligned}$$
(23)

leaving a negligible relative uncertainty of \(1.5\permille \).

Leading-power predictions and comparison Using the above inputs we obtain for the QCDF results at leading power (LP) in \(\varepsilon \) the values in Table 1, again listed together with the results of Ref. [2]. The most significant changes arise due to the shifts in \(|V_{cb}|\) (\(+4\%\)), and the \(\bar{B}_s^0\rightarrow D_s^{(*)}\) form factors. The former shift is mostly due to our inclusion of the \(|V_{cb}|\) value obtained from inclusive decays [16], while the latter (\(-4\%\) for \(F_0\) and \(+33\%\) for \(A_0\)) is caused by improved calculations of the \(\bar{B}_s^0\rightarrow D_s^{(*)}\) form factors [13, 17, 18] which have become available since the analysis of Ref. [2], where the only available calculation [19] as of 2016 was used. The resulting shifts in the absolute branching fractions follow correspondingly, with \(+8\%\) for the \(\bar{B}_d^0\) branching fractions, no significant shift in \(\bar{B}_s^0\rightarrow D_s^+ \pi ^-\), and \(+90\%\) in \(\bar{B}_s^0\rightarrow D_s^{*+} \pi ^-\). Note, that the current form factor results shift the QCDF predictions from \(2.24^{+0.56}_{-0.50}\) (in units of \(10^{-3}\)) to the new value \(4.30^{+0.9}_{-0.8}\) which is now \(2.3\sigma \) away from the experimental value \(2.1 \pm 0.5\) from the third column of Table 2. The uncertainties of all our predictions are dominated by the form factor uncertainties, which will be reduced in the future. In the case of the absolute branching fractions of modes with \(D_q\) final states, the uncertainties from \(|V_{cb}|^2\sim 2.4\%\) and \(|a_1|^2\sim 2.4\%\) are comparable to the form factor ones (\(\sim 3.3\%)\), while for absolute branching fractions for \(D_q^*\) final states and ratios of branching fractions the form factor uncertainties dominate by far.

In addition we obtain, still to leading power, the following ratios of branching fractions:

$$\begin{aligned} {\mathcal {R}}_{s/d}^P\big |_\text {LP}&= 13.5^{+0.6}_{-0.5}, \end{aligned}$$
(24)
$$\begin{aligned} {\mathcal {R}}_{s/d}^V\big |_\text {LP}&= 13.1^{+2.3}_{-2.0}, \end{aligned}$$
(25)
$$\begin{aligned} {\mathcal {R}}_{s}^{V/P}\big |_\text {LP}&= 0.97^{+0.20}_{-0.17}, \end{aligned}$$
(26)
$$\begin{aligned} {\mathcal {R}}_{d}^{V/P}\big |_\text {LP}&= 1.01\pm 0.11. \end{aligned}$$
(27)

Two of these values can be compared to the ones in Ref. [2] where

$$\begin{aligned} {\mathcal {R}}_{s/d}^P\big |_\text {LP}&= 14.67^{+1.34}_{-1.28}, \end{aligned}$$
(28)
$$\begin{aligned} {\mathcal {R}}_{d}^{V/P}\big |_\text {LP}&= 0.863^{+0.158}_{-0.147} \end{aligned}$$
(29)

are obtained. We observe that the central values are compatible within uncertainties, and that in our new analysis the latter are \(55\%\) and \(30\%\) smaller, respectively.

Next-to-leading power predictions When including the hadronic matrix elements at next-to-leading power (NLP) in \(\Lambda _\text {QCD}/m_b\), we obtain:

$$\begin{aligned} {\mathcal {R}}_{s/d}^P\big |_\text {NLP}/{\mathcal {R}}_{s/d}^P\big |_\text {LP}-1&\approx -1.7\permille , \end{aligned}$$
(30)
$$\begin{aligned} {\mathcal {R}}_{s/d}^V\big |_\text {NLP}/{\mathcal {R}}_{s/d}^V\big |_\text {LP}-1&\approx -1.7\permille , \end{aligned}$$
(31)
$$\begin{aligned} {\mathcal {R}}_{s}^{V/P}\big |_\text {NLP}/{\mathcal {R}}_{s}^{V/P}\big |_\text {LP}-1&= (0.4\ldots 8.2)\permille , \end{aligned}$$
(32)
$$\begin{aligned} {\mathcal {R}}_{d}^{V/P}\big |_\text {NLP}/{\mathcal {R}}_{d}^{V/P}\big |_\text {LP}-1&= (0.4\ldots 8.2)\permille . \end{aligned}$$
(33)

The shift from LP to NLP is negligible, as anticipated in Sect. 2.1. This underscores why the QCDF predictions for these decays are considered to be among the most reliable for two-body B decays.

Table 1 Numerical inputs and results for the QCDF expressions for the branching fractions at leading power. We compare the results of Ref. [2] with our results. The predictions for the \(\bar{B}_s^0\) branching fractions are not time-integrated, and therefore differ from the measured branching fractions by a factor of \((1-y_s^2)\) [20]

3 Challenging present measurements

We compare our predictions obtained in the previous section with existing data, and aim for a determination of \(f_s/f_d\) at the LHCb experiment and the Tevatron experiments CDF and D0. This is not trivial, for the following reasons:

  1. 1.

    We face a circular dependence between some measurement of branching fractions and \(f_s/f_d\): for the \(\bar{B}_s^0\) decays in question, the values entering the world average [15] are at least partially using \(f_s/f_d\), which we are aiming to extract. In Table 2 we make this dependence explicit, and introduce two independent quantities for the LHCb and CDF experiments, to highlight a potential dependence on the transverse momentum and hence the experiment.

  2. 2.

    Besides correlations due to \(f_s/f_d\), the measurements of the decays in question exhibit additional sizeable correlations. The largest ones are introduced because LHCb only measures ratios of branching fractions, albeit with very high precision. Furthermore, in almost all measurements the same few \(D_{(s)}\) decay modes are used. Finally, we include explicitly the production fraction of neutral B mesons entering absolute branching fraction measurements by the B factories, since in this case isospin symmetry can be violated sizeably.

Based on these considerations, we summarize all relevant experimental results in Table 4 in the Appendix, explicitly highlighting all cross dependence. The individual branching fractions, their ratios, and the different production fractions presented in Table 2 are produced from a series of fits to the data listed in Table 4. In these fits, we allow for independent production fractions \(f_s/f_d\) at Tevatron, LHCb (7 TeV) and LHCb (13 TeV). The latter does not enter any measurement here and is only given for comparison. Since only one measurement by CDF is available for the considered modes, this effectively determines a value for \(f_s/f_d\) at the Tevatron experiments, which can then be compared to other determinations. We consider therefore the following fit scenarios:

  1. 1.

    We first perform a fit without any input on \(f_s/f_d\). The absolute scale of all \(\bar{B}_s^0\) branching fractions is in this case provided by the measurement of \({\mathcal {B}}(\bar{B}_s^0\rightarrow D_s^+\pi ^-)\) at Belle [21], which yields a very low value for \(f_s/f_d\) with large uncertainties.

  2. 2.

    We next include the \(f_s/f_d\) value from LHCb measured at 7 TeV in semileptonic decays [22, 23], which uses a method independent from the one investigated here. We make the dependence on the \(D_s\) branching fraction explicit also in this case, since it is an important uncertainty and correlated with the other measurements in the fit. The dependence on \({\mathcal {B}}(D^-\rightarrow K^+\pi ^-\pi ^-)\) is not as easily included and its contribution to the uncertainty not as large as for the \(D_s\) case.

The fit results for these two scenarios are collected in Table 2 under “our fits (w/o QCDF)”. Both fits describe the available data perfectly, meaning there are no obvious inconsistencies among the measurements. We observe significant shifts compared to the PDG fit results for \(\bar{B}^0\rightarrow D^+ K^-\) and \(\bar{B}^0\rightarrow D^+ \pi ^-\) modes, but overall our results are well compatible with the PDG fit. More importantly, we obtain the full correlation matrix for these branching fractions, which allows to calculate their ratios with reduced uncertainties. Our improvements significantly sharpen the pattern that was apparent already in Refs. [2, 4]: the ratios of branching fractions are well reproduced, the largest difference between measurement and prediction is 1.3\(\sigma \) for \({\mathcal {R}}^{V/P}_s\). On the other hand, what was a tendency to overestimate the individual branching fractions in the past, is now a clear discrepancy: naively we observe a \(4\sigma \) difference between prediction and measurement in \(\bar{B}_s^0\rightarrow D_s ^+\pi ^-\), over \(5\sigma \) difference in \(\bar{B}^0\rightarrow D^+K^-\), about \(2\sigma \) in \(\bar{B}_s^0\rightarrow D_s^{*+}\pi ^-\) and \(3\sigma \) in \(\bar{B}^0\rightarrow D^{*+}K^-\). A fit to the same data as above, but expressing all branching fractions by their QCDF expressions without allowing for corrections results in \(\chi ^2_\mathrm{min}=38.7\) for 9 degrees of freedom. We see the following possibilities to resolve this discrepancy:

Table 2 Our fits to the available data listed in Table 4 with and without constraints from QCDF, in comparison to the PDG values [15] and our QCDF predictions. The branching fractions are given in units of \(10^{-3}\). Results marked with a \(^*\) indicate that the distribution is non-gaussian. The two fits on the left are not using the assumption that QCDF holds. The two fits on the right are using QCDF input to varying degree, see text. Correlations for these results are available upon request from the authors
  1. 1.

    One obvious option is the presence of large non-factorizable contributions of \({\mathcal {O}}(15{-}20\%)\) at amplitude level in each of the modes. This was already discussed in Ref. [2], where the discrepancy has a smaller statistical significance. When taking our new estimates in Eqs. (12)–(14), which allow already for an enhancement by a factor of 10 in the hadronic matrix elements, at face value, this scenario is clearly and significantly disfavoured at the \(4.4\sigma \) level. We emphasize that we do not only see no enhancement in our calculation of next-to-leading power contributions, but instead a systematic suppression by \(C_1/a_1\sim -1/3\), which renders our result particularly small. Therefore even the generic expectation of \(\Lambda _\text {QCD}/m_b\sim 10\%\) seems already on the high side. We pursue this scenario nevertheless, which still allows us to extract \(f_s/f_d\), albeit with increased uncertainties.

  2. 2.

    We entertain also the possibility that this is an experimental issue. For that it is interesting to note that the fit to the QCDF predictions becomes excellent as soon as the measurements of the absolute branching fractions \(\bar{B}^0\rightarrow D^{(*)+}\pi ^-\) are excluded from the fit. Both values are dominated by the BaBar analysis [24], however, even the less precise CLEO data are already in conflict with the QCDF predictions: the fit without these normalization modes and without input on \(f_s/f_d\) yields \({\mathcal {B}}(\bar{B}^0\rightarrow D^+\pi ^-)=(3.94^{+0.24}_{-0.20})10^{-3}\) and \({\mathcal {B}}(\bar{B}^0\rightarrow D^{*+}\pi ^-)=(3.98^{+0.38}_{-0.37})10^{-3}\). This option would therefore imply a serious experimental problem in more than one experiment. We do not consider this option further. Nevertheless, we would like to encourage additional measurements of absolute branching fractions, which is possible with existing data from the Belle experiment and upcoming data at Belle II.

  3. 3.

    The parametric inputs in Eq. (6) are in principle also potential sources of systematic shifts. However, all of them are very well known from several measurements. The quantity \(|V_{cb}|\) has the largest uncertainty, but a \(20\%\) shift would be in direct contradiction with all existing analyses and the global picture of the CKM unitarity fit.

  4. 4.

    If we assume the experimental results to be correct and assume our estimates in Eqs. (12)–(14) to have the correct order of magnitude, only beyond the Standard Model (BSM) effects can explain the data. The fact that all QCDF predictions are above the corresponding measurements requires this BSM physics to interfere with the SM. Since the ratios are predicted well, an approximately universal factor in \(b\rightarrow c\bar{u} d\) and \(b\rightarrow c\bar{u} s\) transitions is preferred.

Also a combination of the above effects is possible. In the following we discuss options 1 and 4 in detail.

3.1 Extracting \(\varvec{f_s/f_d}\)

Allowing for non-factorizable contributions beyond the size indicated by the QCDF results in all decays at hand, we parametrize

$$\begin{aligned} \frac{\mathcal A(\bar{B}^0\rightarrow D^+K^-)}{\left. \mathcal A(\bar{B}^0\rightarrow D^+K^-)\right| _\mathrm{QCDF,LP}}&= 1+\Delta _P, \end{aligned}$$
(34)
$$\begin{aligned} \frac{\mathcal A(\bar{B}^0_s\rightarrow D^+_s\pi ^-)}{\left. \mathcal A(\bar{B}^0_s\rightarrow D^+_s\pi ^-)\right| _\mathrm{QCDF,LP}}&= 1+r_{SU(3)}^P\Delta _P, \end{aligned}$$
(35)
$$\begin{aligned} \frac{\mathcal A(\bar{B}^0\rightarrow D^{*+}K^-)}{\left. \mathcal A(\bar{B}^0\rightarrow D^{*+}K^-)\right| _\mathrm{QCDF,LP}}&= 1+\Delta _V, \end{aligned}$$
(36)
$$\begin{aligned} \frac{\mathcal A(\bar{B}^0_s\rightarrow D^{*+}_s\pi ^-)}{\left. \mathcal A(\bar{B}^0_s\rightarrow D^{*+}_s\pi ^-)\right| _\mathrm{QCDF,LP}}&= 1+r_{SU(3)}^V\Delta _V. \end{aligned}$$
(37)

Here we use the leading-power QCDF amplitudes as in Eq. (6), \(\Delta _{P,V}\) parametrize non-factorizable contributions to the amplitude (including the parts estimated above) and \(r_{SU(3)}^{P,V}\) parametrizes SU(3) breaking beyond that in the leading amplitude. We expect \(r_{SU(3)}^{P,V}\approx 1\), i.e., the non-factorizable parts to still scale as the decay constants, which is justified by the analytic structure of the NLP results. We choose all four parameters \(\Delta _{P,V},r_{SU(3)}^{P,V}\) real without phenomenological consequences, since we only consider branching fractions here.

Leaving all four parameters in Eqs. (34)–(37) arbitrary is equivalent to not using the QCDF calculation at all. This reproduces the previous fit without QCDF and no input for \(f_s/f_d\) in Table 2. Setting \(r_{SU(3)}^{P,V}\equiv 1\), but leaving \(\Delta _{P,V}\) arbitrary corresponds to the assumption that the ratios \({\mathcal {R}}_{s/d}^{P,V}\) are perfectly predicted by QCDF, while the individual branching fractions receive large non-factorizable contributions. This mimics one of the assumptions employed in Refs. [1, 25] and in particular allows for a comparison with the result in Ref. [25]. We list the results of this scenario in Table 2 under “ratios only”. However, given the significant reduction of the parametric theory uncertainty in this work, the assumption of a negligible uncertainty from SU(3) breaking in the non-factorizable part does not seem appropriate anymore. While we do not use the quantitative results for the NLP contributions here, we still make observations from their analytical structure:

  1. 1.

    In general there is no justification to identify \(\Delta _P\) and \(\Delta _V\).

  2. 2.

    The deviation of \(r_{SU(3)}^{P,V}\) from unity is expected to be small: the sizable breaking from the light-meson decay constants is identical to that of the leading amplitude, and the heavy-to-heavy matrix elements for a single soft gluon at NLP exhibit a similar structure to that of the form factors, for which the results in Ref. [3] show explicitly that the breaking is small, in accordance with theoretical expectations from the HQE [7,8,9]. We therefore consider \(r_{SU(3)}^{P,V}\in [0.9,1.1]\) to be a conservative estimate of this breaking, leaving the two parameters independent.

We consequently perform another fit, leaving \(\Delta _{P,V}\) independent and arbitrary while constraining the SU(3) breaking beyond that in the leading amplitude to be below \(10\%\). In this way we account for the possibility of relevant additional SU(3) breaking, while still exploiting the information from QCDF. The results of this fit are reported again in Table 2.

Since this scenario interpolates in a way between the fit without QCDF and without \(f_s/f_d\) input and the one without SU(3) breaking, it is not surprising that also the results for \(f_s/f_d\) lie between the corresponding values. As mentioned before, we observe large non-factorizable contributions of \(15{-}20\%\), independently of the allowed magnitude of SU(3) breaking. Although the minimal \(\chi ^2\) decreases by 0.9 when allowing for SU(3) breaking, such a fit has two fewer degrees of freedom, so there is no indication for sizable SU(3) breaking in the data. We consider this scenario nevertheless preferable over the one without breaking, since the two parameters \(r_{SU(3)}^{P,V}\) account for the related uncertainty and render our results for \(f_s/f_d\) conservative. While this is not the only way to estimate this uncertainty, the sizable shift between the central values and uncertainties in the scenarios with and without breaking, despite the relatively small breaking beyond that of the leading amplitude of \(\le 10\%\), indicates that this source of uncertainty must be taken into account for a meaningful extraction of \(f_s/f_d\).

Our results are compatible with previous extractions of \((f_s/f_d)_\mathrm{LHCb}^\mathrm{7\,TeV}\) from \({\mathcal {R}}_{s/d}\) [23, 25,26,27,28], but have significantly reduced uncertainties, despite allowing for larger SU(3) breaking in the non-factorizable part. Both our values are also in excellent agreement with the value from semileptonic decays. The uncertainties are comparable, and can be reduced with more data in the future. However, the problem of the large apparent corrections of the QCDF results remains. Furthermore, while we consider our estimate for the range of \(r_{SU(3)}^{P,V}\) to be conservative, it is clear from the data in Table 2 that the result for \(f_s/f_d\) sensitively depends on our assumption. An effort should therefore be made to even further improve the understanding of these modes, in order to either understand the source for these large non-factorizable contributions, or to establish the presence of BSM physics in these modes, as discussed in the next subsection.

Finally, let us comment on the value for \((f_s/f_d)_\mathrm{Tev}\) extracted in Table 2: the value quoted in Ref. [29] reads \((f_s/f_d)_\mathrm{Tev}=0.334\pm 0.040\), which differs by \(\sim 2\sigma \) from the value obtained in both fits using QCDF in Table 2. It is worth emphasizing that our value is independent from the values entering that average, and more precise. On the other hand it relies on LHCb-data and is hence not a pure CDF measurement. It is not clear from the information provided in Ref. [29] how the average is obtained. Updating the external inputs for the analyses [30, 31] and performing a correlated average, we obtain

$$\begin{aligned} (f_s/f_d)_\mathrm{Tev,sl}=0.263\pm 0.031, \end{aligned}$$
(38)

which is also more precise than the average quoted in Ref. [29] and perfectly compatible with both the LHCb results (at different transverse momentum) and our result from non-leptonic decays.

We therefore conclude that while we do not see whence the required large non-factorizable contributions could originate, assuming their presence yields a consistent picture for all available data and an improved determination of \(f_s/f_d\), which can be improved further in the future.

3.2 New physics in \(\varvec{b\rightarrow c\bar{u} s(d)}\) transitions

We now explore the possibility that the discrepancy between data and the SM calculation is caused by BSM physics. This is in part motivated by the observation that our fits including the QCDF constraints allow for \(\Delta _P=\Delta _V\), although this would not be expected for NLP contributions. It would, however, be expected for BSM contributions to the Wilson coefficients \(C_{1,2}^{q_2}\). The ratios \({\mathcal {R}}_{d,s}^{V/P}\) therefore provide tests for our BSM hypothesis.

We do not aim at a detailed BSM analysis, only at establishing whether this option is already ruled out by existing data.

The observation that the new contributions should similarly reduce branching fractions with a pseudoscalar meson \(D_q\) and a vector meson \(D_q^*\) has rather strong consequences, implying specifically a minimal-flavour-violation-like scenario where the BSM contributions scale with the CKM factors of the SM ones. Scalar BSM operators generate \(\Delta _P \sim 1/(m_b-m_c)\), and \(\Delta _V\sim 1/(m_b+m_c)\). This asymmetry does not exclude solutions with scalar operators, but requires a careful treatment of the various BSM contributions. For simplicity we content ourselves with the obvious option that BSM physics only modifies the SM Wilson coefficients, which yields \(\Delta _P= \Delta _V\). More precise data will provide a test of this assumption. Using this assumption, we obtain a universal shift in \(a_1\) for each class of transitions \(b\rightarrow c\bar{u} q_2\), i.e., we have in general two contributions \(\Delta a_1^{q_2}\). Again for simplicity, we use the observation that the data allow for \(\Delta a_1^d=\Delta a_1^s\equiv \Delta a_1\) to simplify our analysis further. Clearly this assumption can be easily tested in the ratios \({\mathcal {R}}_{s/d}^{P,V}\). This scenario corresponds to the one above in the limit \(\Delta _P=\Delta _V\equiv \Delta \tilde{a}_1=\Delta a_1/a_1\), since \(a_1\) to very good approximation is universal.

As suggested by the previous fits, our BSM scenario works very well: we obtain \(\chi ^2_\mathrm{min}=4.9\) for 7 degrees of freedom. We also obtain \((f_s/f_d)_\mathrm{LHCb}^\mathrm{7\, TeV}=0.263^{+0.018}_{-0.016}\), which has the same uncertainty as in the “ratios only” scenario; however, clearly this value depends sensitively on our assumption of \(\Delta a_1^d=\Delta a_1^s\). Most importantly, we obtain

$$\begin{aligned} \Delta \tilde{a}_1 = -0.17\pm 0.03,\quad \text {or}\quad \Delta a_1 = -0.18\pm 0.03, \end{aligned}$$
(39)

ignoring potential imaginary parts for now using the same justification as given above.

We check if our BSM physics hypothesis is already excluded by other observables. In the following we list these observables and discuss the impact of their measurements, see Table 3:

  • \(\Gamma _{q}\): since we modify the leading contribution to the total decay rates of the \(\bar{B}_q\) mesons, we expect a strong constraint from \(\Gamma _q\), \(q=d,s\). However, each individual total decay rate is not as precisely predicted as their ratio.

  • \(\tau _{B_s}/\tau _{B_d}\): this observable is both predicted and measured to very high precision. The main contributions to the individual lifetimes cancel in the ratio, therefore the dominant contribution in our scenario is via dimension-6 operators in the OPE to \(\tau _{B_d}\). We calculate the BSM shift at leading order, using the results from Ref. [32] with updated bag factors [33].

  • \(a_d^{fs}\): the flavour-specific CP-asymmetry in the \(B_d-\bar{B}_d\) system receives also a leading contribution from \(b\rightarrow c\bar{u}d\) operators, which is linear in the corresponding coefficient [34]. This constraint is therefore complementary to the others.

We do not consider further observables that are not as cleanly predicted as the ones above. We note, however, that other B meson decays like \(\bar{B}^0\rightarrow D^{(*)+}\pi ^-\) and \(\bar{B}_{s}\rightarrow D_{s}^{(*)+}K^-\) (and their analogues with \(\rho ,K^*\)) are likewise affected by our BSM physics hypothesis. We emphasize that all corresponding measurements [15] are lower than their (updated) SM predictions [2], thereby strengthening the case for the BSM hypothesis.

Table 3 Measurements and predictions for further observables that constrain BSM physics in \(b\rightarrow c\bar{u} (d,s)\)

We find that surprisingly none of the inclusive observables in Table 3 excludes the possibility of having a shift of \(-15\) to \(-20\%\) in \(a_1\) from BSM physics, in line with Refs. [34, 36]. We conclude that BSM physics is a viable possibility that could explain the observed puzzle. Despite requiring a large contribution of \(\sim -17\%\) of a CKM-leading tree-amplitude, available constraints from the above inclusive observables do not exclude such a scenario, which therefore has to be taken seriously. A detailed BSM analysis of these constraints is, however, beyond the scope of this article and left for future work.

3.3 Prospects

Our results reduce both main uncertainties in the predictions for \(\bar{B}_{(s)}^0 \rightarrow D_{(s)}^+ \lbrace \pi ^-, K^-\rbrace \) decays, thereby enabling future precision analyses of these modes. Measurements of absolute branching fractions can and should be carried out at Belle II, if possible also for the \(\bar{B}_s\) decays modes. High-precision measurements of ratios of branching fractions can be obtained by the LHC experiments. With further reduced uncertainties, it would also be helpful to have smaller uncertainties on the \(D_{(s)}^+\) branching fractions.

Experimentally, it might be advantageous to normalize to a different mode, like \(\bar{B}^0\rightarrow D^{(*)+}\pi ^-\), since it has a larger branching fraction and systematic uncertainties due to the pions cancel. As an example, the prediction for the ratio \({\mathcal {B}}(\bar{B}_s^0\rightarrow D_s^{+}\pi ^-)/{\mathcal {B}}(\bar{B}^0\rightarrow D^{+}\pi ^-)\) involves a different ratio of form factors than the ones already provided. Its value obtained from Ref. [3],

$$\begin{aligned} \left| \frac{F_0^{\bar{B}_s\rightarrow D_s}(m_\pi ^2)}{F_0^{\bar{B}\rightarrow D}(m_\pi ^2)} \right| = 1.005\pm 0.021, \end{aligned}$$
(40)

is rather precise. We caution that these modes suffer from significantly larger uncertainties due to presently unquantifiable \({\mathcal {O}}(\Lambda _\text {QCD}/m_b)\) corrections. Therefore, the theoretically cleanest ratios for the purpose of determining \(f_s/f_d\) are \(\mathcal {R}_{s/d}^{P/V}\), discussed in the previous sections.

As a workaround, we suggest a two-staged approach: the measurement of \({\mathcal {B}}(\bar{B}\rightarrow X)/{\mathcal {B}}(\bar{B}^0 \rightarrow D^+ K^-)\) – which, depending on the choice of X, can be determined at a precision of a few percent and potentially further improved at Belle II or LHCb – can convert the more easily measurable ratio \({\mathcal {B}}(\bar{B}_s^0 \rightarrow D_s^+ \pi ^-) / {\mathcal {B}}(\bar{B}\rightarrow X)\) into one of \(\mathcal {R}_{s/d}\), i.e.,

$$\begin{aligned} {\mathcal {R}}_{s/d}^P&= \frac{{\mathcal {B}}(\bar{B}_s\rightarrow D_s^+\pi ^-)}{{\mathcal {B}}(\bar{B}\rightarrow X)}\frac{{\mathcal {B}}(\bar{B}\rightarrow X)}{{\mathcal {B}}(\bar{B}^0\rightarrow D^+K^-)} \end{aligned}$$
(41)
$$\begin{aligned}&=\frac{{\mathcal {B}}(\bar{B}_s\rightarrow D_s^+\pi ^-)}{{\mathcal {B}}(\bar{B}\rightarrow X)} R_{d}^{X}. \end{aligned}$$
(42)

For instance, with \(X=D^+\pi ^-\) the required experimental ratio would already be available at the \(3{-}4\%\) level:

$$\begin{aligned} R_{d}^{\pi K} \equiv \frac{{\mathcal {B}}(\bar{B}^0\rightarrow D^+\pi ^-)}{{\mathcal {B}}(\bar{B}^0\rightarrow D^+K^-)} {\mathop {=}\limits ^\mathrm{exp}} 12.17^{+0.42}_{-0.37}. \end{aligned}$$
(43)

The situation is similar for \(X= D^0\pi ^-\); the determination of the ratio \({\mathcal {R}}_{ud}^{D^0\pi ^-}={\mathcal {B}}(B^-\rightarrow D^0\pi ^-)/{\mathcal {B}}(\bar{B}^0\rightarrow D^+ K^-)\) requires additionally the determination of the production fraction of charged and neutral B meson pairs at the B factories, or the ratio \(f_u/f_d\) at LHCb (which is however expected to be close to unity). We are looking forward to see these prospects realised in future analyses by the ATLAS, Belle II, CMS and LHCb experiments.

4 Discussion and conclusion

In this work we revisit, extend and update the theoretical predictions for the branching fractions of the decays \(\bar{B}^0\rightarrow D^{(*)+} K^-\) and \(\bar{B}_s^0\rightarrow D^{(*)+}_s\pi ^-\). We obtain our predictions in the framework of QCD factorization (QCDF), using next-to-next-to leading order results for the effective Wilson coefficients and including next-to-leading power (NLP) corrections for the first time. Moreover, we employ updated values of the form factors with reduced theoretical uncertainties compared to previous estimates. Beyond the prediction of the branching fractions, we provide updated results for the fragmentation fraction \(f_s/f_d\) in various scenarios. To obtain a reliable error estimate for the fragmentation fraction, we consider it mandatory to account for potential SU(3) breaking in the non-factorizable corrections. Our results agree with the values extracted by LHCb and CDF from semileptonic decays.

Our comparison of the various experimental measurements of these modes and our theoretical predictions shows a clear and very significant discrepancy at the level of \(4.4\sigma \). This high level is due to drastically reduced parametric uncertainties. We identify the following possible four causes of the discrepancy, none of which is fully satisfactory on its own:

  1. 1.

    The current measurements of the absolute branching fractions for the modes considered could have a systematic bias in form of a downward shift.

    This is unlikely to be the case, since the modes we discuss have large branching fractions, and have only charged particles in the final state, rendering them experimentally well accessible. A systematic bias of the order of \(\sim -30\%\) would lead to questioning the validity of all measurements of B meson branching fractions.

  2. 2.

    The theoretical results within the framework of QCDF could miss a large contribution of \(\sim -20\%\) at the amplitude level in color-allowed tree topologies.

    This is unlikely to be case, since QCDF predictions for color-allowed tree decays into light mesons are in reasonable agreement with the measurements. If such a shift in heavy-light final states is necessary it is hard to understand why a similarly sized shift in light-light final states is absent.

  3. 3.

    The hadronic matrix elements for the non-local soft gluon terms could be underestimated, and cause the observed shift.

    To increase the \(\bar{B}\rightarrow D^{(*)}\) matrix elements to the size needed for agreement with the measurements, we would need an enhancement by a factor of \(\sim 50\). This is also quite unlikely to be the case, since we have already enlarged the possible range of our estimates by multiplying our nominal results by a factor of 10 to obtain conservative estimates.

  4. 4.

    The discrepancy could be caused by contributions from physics beyond the SM.

    Also this option seems unlikely, given that the partonic transition is generated by a tree-level W exchange, and it would be surprising if a correction of \(\sim -20\%\) had eluded attention so far. On the other hand, the same pattern exists for \(b\rightarrow c\bar{u}(d/s)\) transitions that are not quantitatively discussed in this work. The interpretation is not excluded either by a number of inclusive observables like the flavor-specific CP asymmetry \(a_{fs}^d\), the total width of the B meson \(\Gamma _d\), or the ratio of lifetimes \(\tau _s/\tau _d\).

Given the significance of this puzzle and the potential impact of any of the potential explanations discussed above, each of them should be studied in more detail. Doing so exhaustively requires dedicated efforts on both the experimental and the theoretical side. We look forward to further study these options in detail in a forthcoming publication.