Paper The following article is Open access

Low-cost limit of classical communication with restricted quantum measurements

, , and

Published 16 April 2020 © 2020 The Author(s). Published by IOP Publishing Ltd on behalf of the Institute of Physics and Deutsche Physikalische Gesellschaft
, , Citation Ludwig Kunz et al 2020 New J. Phys. 22 043010 DOI 10.1088/1367-2630/ab7bd3

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

1367-2630/22/4/043010

Abstract

We consider a communication scenario where classical information is encoded in an ensemble of quantum states that admit a power series expansion in a cost parameter and converge to a single zero cost state with vanishing cost. For a given measurement scheme, we derive an approximate expression for mutual information in the leading order of the cost parameter. The general results are applied to selected problems in optical communication, where coherent states of light are used as input symbols and the cost is quantified as the average number of photons per symbol. We show that for an arbitrary individual measurement on phase shift keyed (PSK) symbols, the photon information efficiency is upper bounded by 2 nats of information per photon in the low-cost limit, which coincides with the conventional homodyne detection bound. The presented low-cost approximation facilitates a systematic analysis of few-symbol measurements that exhibit superadditivity of accessible information. For the binary PSK alphabet of coherent states, we present designs for two- and three-symbol measurement schemes based on linear optics, homodyning, and single photon detection that offer respectively 2.49% and 3.40% enhancement relative to individual measurements. We also show how designs for scalable superadditive measurement schemes emerge from the introduced low-cost formalism.

Export citation and abstract BibTeX RIS

Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

The celebrated Holevo quantity [1] provides a powerful tool to identify how much classical information can be transmitted using symbols drawn from a given ensemble of quantum states. It specifies an upper bound on the mutual information that can be achieved with any quantum measurement satisfying physical constraints. In general, attaining the Holevo quantity requires a collective measurement on an arbitrary number of symbols [25]. When the available class of measurements is restricted, for example to individual or few-symbol detection, optimization of the mutual information with respect to the measurement strategy often becomes a rather challenging task for which universal methods are missing. Interestingly, the maximum mutual information that can be obtained using measurements on a restricted number of symbols, known as accessible information, exhibits superadditive behavior with the number of symbols [6, 7]. This is intimately related to the fact that quantum measurement reveals in general only partial knowledge about the measured states and more relevant information can be retrieved through collective detection strategies. Theoretical analysis of superadditivity of accessible information is highly nontrivial even for binary ensembles of elementary input states [810]. Another problem is the actual physical implementation of optimal measurements, e.g. in the optical domain.

The purpose of this paper is to present a systematic expansion of the mutual information in a cost parameter which characterizes the ensemble of quantum states used for communication. It is assumed that the quantum states admit a power series expansion in the cost parameter and converge to the same zero-cost state when the value of the cost parameter tends to zero. This allows us to derive an asymptotic bound on the mutual information in the leading order of the cost parameter for a given measurement scheme. An essential advantage of the formalism developed here is that the characteristics of the ensemble enters the asymptotic expression through a set of state vectors that are independent of the cost parameter. As we demonstrate here, this greatly simplifies identification of bounds on mutual information attainable with few-symbol measurements and provides a systematic method to construct experimental schemes that demonstrate superadditivity of accessible information.

The usefulness of the presented approach is illustrated with a number of examples motivated by optical communication. In this context, a natural measure for the communication cost is the mean photon number per symbol. Our expansion provides non-trivial bounds on the photon information efficiency (PIE), which specifies the amount of information that can be transmitted in one photon. In particular, we show that when information is encoded in the phase of coherent states, the maximum attainable PIE is 2 nats per photon in the low-cost limit when arbitrary individual measurements are permitted. This coincides with the commonly used in optical communications reference value derived from the Shannon–Hartley theorem, which assumes the specific case of homodyne detection [11]. We also provide a relation between the attainable PIE and the peak-to-average power ratio of a general coherent state ensemble used for communication. Further, we investigate superadditivity of accessible information in the case of few-symbol measurements performed on the binary ensemble of coherent states with equal mean photon number and opposite phases. We show that linear optical setups combining photon counting and homodyne detection can be used to demonstrate superadditivity for coherent states carrying on average less than approx. 0.01 photon. We also discuss how scalable communication schemes exhibiting the superadditivity effect [10, 12] emerge from the presented formalism.

This paper is organized as follows. Section 2 presents the mathematical formulation of the problem. The asymptotic expansion of mutual information in the limit of the vanishing cost is carried out in section 3. The results are applied to communication with an ensemble of coherent states and individual measurements in section 4. The case of few-symbol measurements on the binary ensemble of coherent states is discussed in section 5. Finally, section 6 concludes the paper.

2. Problem formulation

A communication scheme can be regarded as encoding messages onto quantum states of certain physical carriers which after transmission are subsequently measured at the receiver. The measurement results are processed to decode the input message. We shall consider classical information encoded in a finite ensemble of quantum states, labeled using an index j and described, after transmission, by density operators ${\hat{\varrho }}_{j}(\zeta )$, with respective input probabilities pj. The states depend on a real nonnegative parameter ζ characterizing their cost. Physically, the cost may correspond e.g. to the mean photon number per symbol in the case of optical communication. We will assume that each ${\hat{\varrho }}_{j}(\zeta )$ is an analytical operator-valued function of ζ and that in the limit $\zeta \to 0$ all states ${\hat{\varrho }}_{j}(\zeta )$ converge to the same state ${\hat{\varrho }}^{(0)}$ which we shall call the zero-cost state. Furthermore, we will assume that the states ${\hat{\varrho }}_{j}(\zeta )$ admit a power series expansion in the cost parameter around ζ = 0. The asymptotic analysis will be based on an expansion up to the second order:

Equation (1)

where

Equation (2)

Note that the zeroth-order term is independent of j as it is given by the zero-cost state. The above expansion is assumed to be valid for sufficiently small values of the cost parameter ζ.

The most general measurement that can be performed on a quantum system is described by a positive operator-valued measure (POVM) with elements ${\hat{Q}}_{r}$ that are positive semidefinite and add up to the identity operator, ${\sum }_{r}{\hat{Q}}_{r}=\hat{{\mathbb{1}}}$. The index r specifies the measurement result. For simplicity we consider here a finite set of possible measurement outcomes. The conditional probability ${p}_{r| j}(\zeta )$ of obtaining an outcome r given the jth input state is defined by Born's rule

Equation (3)

The marginal probability of the rth outcome for the ensemble is ${p}_{r}(\zeta )={\sum }_{j}{p}_{j}{p}_{r| j}(\zeta )$.

The maximum amount of classical information that can be transmitted using the input ensemble of quantum states ${\hat{\varrho }}_{j}$ with respective probabilities pj and a measurement ${\hat{Q}}_{r}$ is given by mutual information ${\mathsf{I}}$ calculated for the joint probability distribution ${p}_{j}{p}_{r| j}(\zeta )$, where the conditional probabilities ${p}_{r| j}(\zeta )$ are given by equation (3). It will be convenient to decompose mutual information into a sum of contributions from individual measurement results,

Equation (4)

with ${{\mathsf{I}}}_{r}$ given by

Equation (5)

Throughout this paper we use natural logarithms and the information is measured in nats, $1\,{\rm{nat}}={\mathrm{log}}_{2}e\,{\rm{bits}}$.

Our overall objective will be to optimize mutual information over measurement operators chosen from a restricted class and input probabilities for a given set (constellation) of symbols ${\varrho }_{j}(\zeta )$ in the regime of a small cost parameter. The first step, presented in the next section, will be a systematic expansion of the mutual information ${\mathsf{I}}$ in the cost parameter ζ that will provide an upper bound on the leading-order term. From now on, it is assumed that the measurement operators ${\hat{Q}}_{r}$ do not depend on the cost parameter. This restricted scenario is motivated by practical optical communication that uses direct detection or homodyne detection. We will see that even under this restriction the low-cost analysis of mutual information yields non-trivial results regarding the superadditivity of accessible information. In section 5.1 we will present an example of further enhancement when the measurement can be adjusted to the value of the cost parameter.

3. Asymptotic expansion

In the limit of the vanishing cost parameter, equation (5) can be expressed through expansion of the conditional probabilities ${p}_{r| j}(\zeta )$ in ζ, which also determines marginal probabilities pr (ζ). As for $\zeta \to 0$ all the states ${\hat{\varrho }}_{j}(\zeta )$ converge to the zero-cost state ${\hat{\varrho }}^{(0)}$, it will be convenient to introduce a shorthand notation

Equation (6)

The expansion given in equation (1) implies the following approximate expressions for conditional probabilities

Equation (7)

where ${p}_{r| j}^{(k)}={\rm{Tr}}[{\hat{Q}}_{r}{\hat{\varrho }}_{j}^{(k)}],k=1,2$. The marginal probabilities are correspondingly approximated by

Equation (8)

with ${p}_{r}^{(k)}={\sum }_{j}{p}_{j}{p}_{r| j}^{(k)}$, where k = 1, 2. Equivalently, one can write ${p}_{r}^{(k)}={\rm{Tr}}[{\hat{Q}}_{r}{\hat{\varrho }}_{{\rm{ens}}}^{(k)}]$, where the operators ${\hat{\varrho }}_{{\rm{ens}}}^{(k)}$ are given by convex combinations

Equation (9)

of derivatives defined in equation (2). It should be noted that the zeroth-order term for both conditional probabilities in equation (7) and marginal probabilities in equation (8) is identical.

Further steps for a given measurement operator ${\hat{Q}}_{r}$ depend on whether the outcome r can be generated by the zero-cost state or not. In the former case, corresponding to the mathematical condition ${p}_{r}^{(0)}={\rm{Tr}}[{\hat{Q}}_{r}{\hat{\varrho }}^{(0)}]\gt 0$, we will call ${\hat{Q}}_{r}$ a type-${\mathscr{Z}}$ operator and use ${\mathscr{Z}}$ to denote the corresponding set of measurement outcomes. Measurement operators for which ${p}_{r}^{(0)}={\rm{Tr}}[{\hat{Q}}_{r}{\hat{\varrho }}^{(0)}]=0$ will be referred to as type-${{\mathscr{Z}}}^{\perp }$ operators and the set of measurement outcomes produced by these operators will be denoted by ${{\mathscr{Z}}}^{\perp }$. We will decompose

Equation (10)

and analyze separately contributions to mutual information from the two sets of measurement outcomes.

In the case of type-${\mathscr{Z}}$ operators, the logarithm in equation (5) can be expanded into a power series in the cost parameter ζ, which yields in the leading order

Equation (11)

The difference ${p}_{r| j}^{(1)}-{p}_{r}^{(1)}$ appearing under summation can be written as

Equation (12)

We will transform this expression further using techniques borrowed from quantum estimation theory [1316]. Let us recall the notion of the symmetric logarithmic derivative (SLD), defined implicitly at ζ = 0 for individual and ensemble density operators by respective equations

Equation (13)

Equation (14)

that are assumed to be well defined [17, 18] for physical scenarios considered here. Subtracting the expression in equation (14) from equation (13) and inserting the difference into equation (12) yields

Equation (15)

where

Equation (16)

Using the Schwarz inequality

Equation (17)

applied to the operators

Equation (18)

and their hermitian conjugates ${\hat{A}}^{\dagger }$ and ${\hat{B}}^{\dagger }$ and inserting the result into equation (11) yields the following upper bound on the mutual information contribution from the measurement outcome $r\in {\mathscr{Z}}$:

Equation (19)

Note that the obtained bound is quadratic in the cost parameter ζ.

In the case of type-${{\mathscr{Z}}}^{\perp }$ measurement operators, the condition ${\rm{Tr}}[{\hat{Q}}_{r}{\hat{\varrho }}^{(0)}]=0$ combined with positive semidefiniteness of the operators ${\hat{Q}}_{r}$ and ${\hat{\varrho }}^{(0)}$ implies that ${\hat{Q}}_{r}{\hat{\varrho }}^{(0)}={\hat{\varrho }}^{(0)}{\hat{Q}}_{r}=0$. Therefore, a calculation of ${\rm{Tr}}[{\hat{Q}}_{r}{\hat{\varrho }}_{j}^{(1)}]$ using the SLD defined in equation (13) yields zero and the leading-order term in the power series expansion of the conditional probability ${p}_{r| j}(\zeta )={\rm{Tr}}[{\hat{Q}}_{r}{\hat{\varrho }}_{j}(\zeta )]$ is quadratic in ζ. We will write it in the form

Equation (20)

where

Equation (21)

with ${\hat{\varrho }}_{j}^{(2)}$ defined in equation (2) and $\hat{{\rm{\Pi }}}$ being the projection onto the kernel of the zero-cost state ${\hat{\varrho }}^{(0)}$. This follows from the fact that for type-${{\mathscr{Z}}}^{\perp }$ operators one has $\hat{{\rm{\Pi }}}{\hat{Q}}_{r}\hat{{\rm{\Pi }}}={\hat{Q}}_{r}$.

As a result, the contribution to mutual information generated by a measurement outcome $r\in {{\mathscr{Z}}}^{\perp }$ can be written in the form:

Equation (22)

Interestingly, although type-${{\mathscr{Z}}}^{\perp }$ operators generate events with vanishing probability in the limit $\zeta \to 0$, their contribution to mutual information is of the same order in ζ as from type-${\mathscr{Z}}$ operators. A similar behavior was observed for mutual information in a somewhat different context in [19].

Inserting equations (19) and (22) into (10), the leading-order expansion of mutual information in the cost parameter ζ is upper bounded by the following expression:

Equation (23)

The sum over ${\mathscr{Z}}$-type measurement operators appearing in the second line of the above formula can be expressed in terms of ${{\mathscr{Z}}}^{\perp }$-type operators as ${\sum }_{r\in {\mathscr{Z}}}\,\,{\hat{Q}}_{r}=\hat{{\mathbb{1}}}-{\sum }_{r\in {{\mathscr{Z}}}^{\perp }}{\hat{Q}}_{r}$. This allows us to write the derived bound solely in terms of ${{\mathscr{Z}}}^{\perp }$-type operators as

Equation (24)

As we will see in the following sections, the above expression, despite its seemingly complicated form, can help substantially in the analysis of specific communication scenarios in the low-cost limit.

4. Individual measurements on single-mode coherent states

The asymptotic analysis presented in section 3 can be applied to a variety of physical situations. In the following we will consider a scenario motivated by optical communication, when information is transmitted using an alphabet of coherent states of a single light mode [20, 21]

Equation (25)

Here $| n{\rangle }_{}$ denotes the n-photon Fock state. The complex amplitudes ζγj of coherent states are written as products of complex coefficients γj and the nonnegative cost parameter ζ that can be viewed as the scaling factor for the entire ensemble represented as a constellation in the complex plane. Physically, it can describe the attenuation of an optical channel over which information transfer takes place. When $\zeta \to 0$, all the states converge to the vacuum state $| 0{\rangle }_{}$ which plays the role of the zero-cost state ${\hat{\varrho }}^{(0)}=| 0{{\rangle }_{}}_{}{}_{}\langle 0| $ introduced in section 2. If individual coherent states $| \zeta {\gamma }_{j}{\rangle }_{}$ are used with respective probabilities pj, the average optical energy for the ensemble, measured in the photon number units, reads

Equation (26)

and is proportional to the square of the cost parameter. As a simple example, m-ary phase shift keying (PSK) corresponds to taking ${\gamma }_{j}={{\rm{e}}}^{2\pi {ij}/m}$ and ${p}_{j}=1/m$ with $j=0,1,\,\ldots ,m-1$. With this parametrization, all the input states have the optical energy equal to the average $\bar{n}={\zeta }^{2}$.

In this section we consider a scenario when information is retrieved by measuring individually symbols in the form of single-mode coherent states defined in equation (25). It is straightforward to obtain explicit expressions for the two operators that enter the asymptotic expression for the mutual information given in equation (23). The SLD difference defined in equation (16) reads

Equation (27)

where ${\gamma }_{{\rm{ens}}}={\sum }_{j}{p}_{j}{\gamma }_{j}$, while

Equation (28)

Consequently, it is sufficient to consider the action of measurement operators ${\hat{Q}}_{r}$ only in the two-dimensional subspace spanned by the zero- and the one-photon Fock states $| 0{\rangle }_{}$ and $| 1{\rangle }_{}$.

4.1. Photon information efficiency bounds

Suppose first that the measurement is composed only of type-${\mathscr{Z}}$ operators. Inserting equation (27) into equation (19) and recalling that ${\hat{\varrho }}^{(0)}=| 0{{\rangle }_{}}_{}{}_{}\langle 0| $ yields for $r\in {\mathscr{Z}}$:

Equation (29)

The sum over j in the above formula gives the variance of the complex coefficients γj, which is upper bounded by the mean square ${\sum }_{j}{p}_{j}| {\gamma }_{j}{| }^{2}$. If all the measurement operators are type-${\mathscr{Z}}$, one has ${\sum }_{r}{}_{}\langle 1| {\hat{Q}}_{r}| 1{\rangle }_{}=1$ which yields:

Equation (30)

where in the second step we have used the calculation of the ensemble average photon number carried out in equation (26). In optical communication, the ratio ${\mathsf{I}}/\bar{n}$ is known as the photon information efficiency (PIE) specifying the maximum amount of information that can be carried by one received photon. Therefore equation (30) can be viewed as an upper bound of 2 nats per photon on the PIE in the asymptotic limit $\bar{n}\to 0$, which holds for measurements composed from type-${\mathscr{Z}}$ operators ${\hat{Q}}_{r}$ satisfying the condition ${}_{}\langle 0| {\hat{Q}}_{r}| 0{\rangle }_{}\gt 0$.

Interestingly, the asymptotic bound of 2 nats per photon on the PIE in the limit $\bar{n}\to 0$ appears also in a scenario when information is encoded in a single quadrature of the optical mode measured by shot-noise limited homodyne detection [11]. In this case the so-called Shannon–Hartley limit for the mutual information reads [2226]

Equation (31)

and is attained by using an input ensemble of coherent states with the Gaussian distribution for the measured quadrature and the conjugate quadrature set to zero. A semi-qualitative connection between this scenario and the bound derived in equation (30) can be made as follows. As it is well known, decomposing POVM elements into positive rank-1 operators can make mutual information only equal or greater [27, 28]. Type-${\mathscr{Z}}$ and rank-1 POVM elements ${\hat{Q}}_{r}$ with non-zero matrix elements ${}_{}\langle 1| {\hat{Q}}_{r}| 1{\rangle }_{}$ which non-trivially contribute to equation (30) must detect coherence between zero- and one-photon Fock states and consequently are sensitive to the phase of the input coherent state. Homodyne detection is an example of such a phase-sensitive measurement, although in the case of the homodyne POVM the classification criterion based on whether ${\rm{Tr}}[{\hat{Q}}_{r}{\hat{\varrho }}^{(0)}]=0$ or not used in section 3 becomes problematic. This is because the outcome of the homodyne measurement has the continuous form corresponding to the field quadrature which can take any real value and the conditional probabilities for homodyne POVM elements on the zero-cost vacuum state become arbitrarily small with the increasing absolute value of the quadrature. More generally, the low-cost limit of PIE attainable with individual phase-sensitive measurements coincides with that derived from the Shannon–Hartley theorem also when the input coherent states are transmitted over a phase-invariant Gaussian channel with excess noise, mapping them onto displaced thermal states. This case is discussed in appendix A.

Let us now turn our attention to type-${{\mathscr{Z}}}^{\perp }$ measurement operators ${\hat{Q}}_{r}$. The condition ${\rm{Tr}}[{\hat{Q}}_{r}{\hat{\varrho }}^{(0)}]=0$ defining type-${{\mathscr{Z}}}^{\perp }$ operators implies that in the zero- and one-photon subspace ${\hat{Q}}_{r}$ must be proportional to a projection $| 1{{\rangle }_{}}_{}{}_{}\langle 1| $ onto the one-photon state. Therefore in optical scenarios type-${{\mathscr{Z}}}^{\perp }$ operators correspond to direct photon counting. Inserting equation (28) into (22) yields the contribution to mutual information from a measurement outcome $r\in {{\mathscr{Z}}}^{\perp }$ in the form

Equation (32)

A frequently encountered constraint in optical communication systems is an upper limit on the peak-to-average power ratio ${ \mathcal P }$ of the transmitted signal. In our case ${ \mathcal P }$ is given by the ratio of the maximum mean photon number carried by one symbol to the average photon number in the input ensemble:

Equation (33)

If all the measurement operators are type-${{\mathscr{Z}}}^{\perp }$, equation (32) summed over r and combined with equation (33) provides an upper bound on mutual information in the form:

Equation (34)

where in the second step we have used equation (26). The above inequality implies that when photon counting detection is used, the PIE is upper bounded by the logarithm of the peak-to-average power ratio ${ \mathcal P }$. It is easy to verify that the maximum value $\mathrm{log}{ \mathcal P }$ can be achieved using a two-symbol ensemble consisting of a coherent state $| \sqrt{{ \mathcal P }\bar{n}}{\rangle }_{}$ sent with a probability $1/{ \mathcal P }$ and the vacuum state $| 0{\rangle }_{}$ sent with the probability $1-1/{ \mathcal P }$, detected by Geiger-mode photon counting which discriminates between zero photons and at least one photon. Such a strategy is customarily used in deep-space optical communication, where photon information efficiency is of primary importance [29, 30]. The practical implementation is based on the pulse position modulation (PPM) format which uses frames composed of an an integer number of ${ \mathcal P }$ temporal bins. Each frame contains one pulse with the mean photon number ${ \mathcal P }\bar{n}$ located in one of the bins that are otherwise left empty.

Note that equation (32) implies that for PSK alphabets, when all γj are equal in their magnitude, the contribution to mutual information from type-${{\mathscr{Z}}}^{\perp }$ operators is zero, ${\tilde{{\mathsf{I}}}}_{r}=0$ when $r\in {{\mathscr{Z}}}^{\perp }$. This is consistent with the simple fact that direct photon counting cannot reveal information encoded in the phase of the electromagnetic field. Consequently, a general detection strategy combining type-${\mathscr{Z}}$ and type-${{\mathscr{Z}}}^{\perp }$ operators for a PSK ensemble has the PIE upper bounded by 2 nats per photon in the limit $\bar{n}\to 0$.

4.2. Individual measurements on BPSK symbols

Let us now discuss the case of individual measurements on the binary phase shift keying (BPSK) alphabet of two coherent states $| \zeta {\rangle }_{}$ and $| -\zeta {\rangle }_{}$ with opposite phases and the same mean photon number $\bar{n}={\zeta }^{2}$. The maximum mutual information attainable in this scenario can be written in a closed analytical form for a finite $\bar{n}$. We will see that the BPSK alphabet is sufficient to saturate the 2 nats per photon bound on the PIE in the low-cost limit. The exact expression for the mutual information attainable for a given $\bar{n}$ will be used in section 5 as a benchmark when discussing superadditive measurements on multiple BPSK symbols.

It has been shown [31, 32] that when a binary ensemble of equiprobable pure states is used for classical information transmission and only individual detection is permitted, mutual information is maximized by the minimum-error measurement described by Helstrom [13]. The symmetric probability of a wrong identification of the input BPSK state can be obtained from the Helstrom bound

Equation (35)

which yields mutual information expressed in nats:

Equation (36)

Here

Equation (37)

is the entropy of a binary random variable specified in nats. The resulting exact expression for the PIE attainable with individual measurements on the BPSK alphabet, equal to ${{\mathsf{I}}}_{{\rm{Hel}}}/\bar{n}$, is depicted in figure 1 as a function of $\bar{n}$. In the limit $\bar{n}\to 0$ the PIE reaches 2 nats. It is worth noting that for $\bar{n}\lesssim 0.4$ the obtained result exceeds the PIE ${{\mathsf{I}}}_{{\rm{SH}}}/\bar{n}$ obtained from the Shannon–Hartley expression given by equation (31), also plotted in figure 1. This highlights the specificity of the Shannon–Hartley limit, used as a canonical reference in optical communication, to the quadrature measurement assumed in its derivation. The lower value of ${{\mathsf{I}}}_{{\rm{SH}}}/\bar{n}$ compared to ${{\mathsf{I}}}_{{\rm{Hel}}}/\bar{n}$ for $\bar{n}\ll 1$ can be verified by inspecting the second-to-leading terms in the asymptotic expansions of the respective expressions given in equations (31) and (36):

Equation (38)

As also shown in figure 1, the PIE of 2 nats can be attained in the limit $\bar{n}\to 0$ with a homodyne measurement on the BPSK alphabet, provided that the information is retrieved from the measured continuous quadrature value. For completeness, the mathematical description of this scenario is presented in appendix B. Of course, in this case the PIE does not exceed ${{\mathsf{I}}}_{{\rm{SH}}}/\bar{n}$ for $\bar{n}\gt 0$.

Figure 1.

Figure 1. The PIE for the BPSK alphabet detected using the minimum error measurement ${{\mathsf{I}}}_{{\rm{Hel}}}/\bar{n}$ (dotted line) compared to the Shannon–Hartley limit ${{\mathsf{I}}}_{{\rm{SH}}}/\bar{n}$ (grey solid line), which assumes a Gaussian ensemble of coherent states. The black solid line depicts the case of the homodyne-detected BPSK alphabet ${{\mathsf{I}}}_{{\rm{BPSK/hom}}}/\bar{n}$ calculated in appendix B.

Standard image High-resolution image

The physical realization of the minimum error measurement for the BPSK alphabet is known to be the Dolinar receiver based on photon counting of the displaced input signal with a fast feedback loop [33]. Unfortunately, due to its complexity, realizations of the Dolinar receiver have so far been confined to the proof-of-principle stage [34]. In search for technically less challenging implementations, alternative nearly-optimal measurements have been investigated, including a feed-forward scheme based on signal splitting and optical delay lines [35]. A considerable amount of work has been devoted also to displacement receivers without feed-back or feed-forward mechanisms [3642] which show a pretty good performance. Additionally, a non-demolition measurement based on an atom-light interaction that comes close to the Helstrom bound has been recently proposed [43]. Furthermore, derivatives of the Dolinar receiver have been demonstrated for alphabets consisting of more than two coherent states like for example commonly considered quaternary phase shift keying or higher PSK formats [4450]. In addition to the minimum-error criterion, one can consider other strategies to distinguish between non-orthogonal quantum states, such as unambiguous state discrimination [5153]. Time-resolved photon counting of the PPM format mentioned in section 4.1 can be viewed as a realization of the unambiguous strategy for quantum states of light prepared within PPM frames: registering a photocount in a given time bin identifies unambiguously the location of the pulse, i.e. the input state, whereas absence of any photocount over the entire frame yields an inconclusive result.

5. Collective measurements on BPSK words

We will now use the upper bound on the mutual information expanded in the cost parameter as derived in section 3 to identify collective measurement schemes on multiple BPSK symbols that beat the performance of individual detection. While the designs for collective schemes will be motivated by the analysis of the mutual information in the asymptotic limit of the vanishing cost, the actual figure for the mutual information, or equivalently the photon information efficiency, attainable using the designed measurement will be calculated exactly for a given finite $\bar{n}$ and compared against the individual detection benchmark given in equation (36).

When collective measurements can be performed on a sequence of individual quantum systems, for example a train of single-mode light pulses, one needs to consider an input ensemble in the form of words composed from the alphabet of elementary symbols, such as coherent states introduced in equation (25). The input probabilities are then defined for entire words. In the remainder of the paper, we will restrict our attention to words constructed from the BPSK alphabet of coherent states $| \zeta {\rangle }_{}$ and $| -\zeta {\rangle }_{}$. Words of length M can be then conveniently labelled with M-bit strings ${\bf{j}}={j}_{1}{j}_{2}\ldots {j}_{M}$. The input states are pure, ${\hat{\varrho }}_{{\bf{j}}}=| {\psi }_{{\bf{j}}}{{\rangle }_{}}_{}{}_{}\langle {\psi }_{{\bf{j}}}| $, and have the form of M-mode coherent states,

Equation (39)

The label ${\bf{j}}$ needs now to be used as an ensemble index in lieu of j when applying general results derived in section 3. For a fair comparison with the individual measurement scenario, the cost should be quantified with the mean photon number per elementary symbol, given in the present case by $\bar{n}={\zeta }^{2}$. The entire optical energy of a BPSK word of length M is $M{\zeta }^{2}=M\bar{n}$.

Our goal will be to identify the probability distribution ${p}_{{\bf{j}}}$ for input words $| {\psi }_{{\bf{j}}}{\rangle }_{}$ and the few-symbol collective measurement strategy that maximizes mutual information in the low-cost limit. Let us first specialize equation (24) to the scenario considered here. It will be convenient to denote a normalized superposition state of one photon in M modes as:

Equation (40)

and use $| {\rm{v}}{\rangle }_{}=| 0{\rangle }_{}| 0{\rangle }_{}\ldots | 0{\rangle }_{}$ for the M-mode vacuum state. With this notation, the SLD difference for the state ${\hat{\varrho }}_{{\bf{j}}}$ reads

Equation (41)

and respectively

Equation (42)

After inserting these expressions into equation (24) it is easy to notice that pairs of terms corresponding to ${\bf{j}}$ and its bitwise negation $\neg {\bf{j}}$ can be combined together because the corresponding one-photon states $| {1}_{{\bf{j}}}{\rangle }_{}$ and $| {1}_{\neg {\bf{j}}}{\rangle }_{}$ differ only by an irrelevant global minus sign. We will select one of the ${\bf{j}}$'s as a representative for each pair and use the equivalence class $[{\bf{j}}]$ as an index for the two-fold reduced sum over the input ensemble. The result reads

Equation (43)

where ${p}_{[{\bf{j}}]}={p}_{{\bf{j}}}+{p}_{\neg {\bf{j}}}$ is the total probability of using either word from a given pair. The above expression can be equivalently written as

Equation (44)

Equation (45)

where

Equation (46)

Since for any ${\bf{j}}$ and r the argument of the logarithm in equation (44) does not exceeds one, the second line of that expression containing logarithms is nonpositive therefore by neglecting it one arrives at a weaker upper bound of the form:

Equation (47)

The function f(u) defined in equation (46) will play an important role in further analysis. Its graph is shown in figure 2. It is easy to verify by an elementary calculation that on the interval 0 ≤ u ≤ 1 the function f(u) is concave, has a single maximum at ${u}^{* }=1/{e}^{3}\approx 0.0498$ equal to $f({u}^{* })={u}^{* }$ and becomes negative for arguments $u\gt 1/{e}^{2}$.

Figure 2.

Figure 2. The function f(u) defined in equation (46). The function is concave and has a single maximum at ${u}^{* }=1/{e}^{3}$. For $u\geqslant 1/{e}^{2}$ the function becomes negative.

Standard image High-resolution image

5.1. Two-symbol measurements

In the case of two-symbol measurements, M = 2, we have only two non-equivalent one-photon states that can be compactly denoted as

Equation (48)

Equation (49)

We will also use + and − symbols to label the two respective equivalence classes $+\equiv [00]=[11]$ and $+\equiv [01]=[10]$ of the input words with corresponding probabilities ${p}_{+}={p}_{00}+{p}_{11}$ and ${p}_{-}={p}_{10}+{p}_{01}$. Equation (47) can be then simplified to the form

Equation (50)

where we have denoted

Equation (51)

Because Q± are the overall probabilities of obtaining an $r\in {{\mathscr{Z}}}^{\perp }$ outcome on states $| \pm {\rangle }_{}$, they are constrained by $0\leqslant {Q}^{\pm }\leqslant 1$. The goal now is to maximize the right hand side of equation (50). Given the properties of the function f(u) discussed earlier and taking into account that ${p}_{+}+{p}_{-}=1$, the right hand side of equation (50) is optimized by selecting ${p}_{-}={u}^{* }=1/{e}^{3},{Q}^{-}=1$, and ${Q}^{+}=0$. An equivalent solution has exchanged labels + and −. This finally yields:

Equation (52)

Given that the mean photon number in an input word is $2\bar{n}$, the asymptotic value of the PIE is upper bounded by $\tilde{{\mathsf{I}}}/(2\bar{n})\leqslant 2+1/{e}^{3}$, which presents a relative increase of $1/(2{e}^{3})\approx 2.49 \% $ compared to the individual measurement scenario. Note that the value derived in equation (52) maximizes also the full expression given in equation (44), as for the chosen p± and Q± the second logarithmic term in equation (44) vanishes.

The optimum identified above can be translated into an optical measurement scheme that asymptotically saturates the bound derived in equation (52). The condition ${Q}^{-}=1$ implies that input states $| {\psi }_{01}{\rangle }_{}$ and $| {\psi }_{10}{\rangle }_{}$ containing the $| -{\rangle }_{}$ component should be detected by type-${{\mathscr{Z}}}^{\perp }$ measurement operators, which as discussed in section 4 correspond in the optical domain to single photon detection. Since photon detection is phase-insensitive, the optimum ${p}_{-}={p}_{01}+{p}_{10}$ specifies only the overall probability of using the two input states $| {\psi }_{01}{\rangle }_{}$ and $| {\psi }_{10}{\rangle }_{}$. For concreteness, we will take only $| {\psi }_{01}{\rangle }_{}$. On the other hand, the requirement ${Q}^{+}=0$ means that input states $| {\psi }_{00}{\rangle }_{}$ and $| {\psi }_{11}{\rangle }_{}$ containing the one-photon component $| +{\rangle }_{}$ are measured only by type-${\mathscr{Z}}$ operators, which corresponds to phase-sensitive detection. We have seen in section 4 that such a measurement can be realized by the Dolinar receiver or homodyning. In order to maximize the contribution to mutual information, one needs to use the states $| {\psi }_{00}{\rangle }_{}$ and $| {\psi }_{11}{\rangle }_{}$ with the same input probabilities ${p}_{00}={p}_{11}$.

A straightforward way to subject the input states to the required type of detection is to interfere the two component coherent states in each two-symbol word on a balanced 50/50 beam splitter, as shown in figure 3. This results in the transformation

Equation (53)

where the two kets on the right hand side refer to the two output ports of the beam splitter. Note that when a Dolinar receiver is used as a phase-sensitive measurement, this reproduces the two-symbol measurement strategy described by Guha [10]. In the following, we consider homodyne detection as a more standard phase-sensitive measurement technique. For a finite $\bar{n}={\zeta }^{2}$ we have optimized numerically the PIE ${{\mathsf{I}}}_{2}/(2\bar{n})$ given by the ratio of the actual mutual information ${{\mathsf{I}}}_{2}$ calculated using equation (B6) in appendix B and the mean photon number over input probabilities parameterized with ${p}_{01}+{p}_{10}=u,{p}_{00}={p}_{11}=(1-u)/2$ in a scenario when a combination of homodyne detection and single photon detection (SPD) is implemented on the two output beam splitter ports. The results are shown in figure 4. It is seen that in the limit $\bar{n}\to 0$ the PIE indeed approaches the value derived in equation (52) which is also shown analytically in appendix B. Superadditivity of accessible information for the BPSK alphabet, when the amount of transmitted information exceeds that attainable with individual measurements, occurs for $\bar{n}\lt 0.0117$.

Figure 3.

Figure 3. A communication scheme providing superadditive advantage for two-symbol BPSK words. Input states $| {\psi }_{00}{\rangle }_{},| {\psi }_{11}{\rangle }_{},| {\psi }_{01}{\rangle }_{}$ are interfered on a 50:50 beamsplitter resulting in the transformation of equation (53). The first output mode undergoes homodyne detection whereas the second one is measured by a single photon detector (SPD). The optional displacement operation preceding the SPD can slightly enhance mutual information.

Standard image High-resolution image
Figure 4.

Figure 4. (a) The PIE defined as ${{\mathsf{I}}}_{2}/(2\bar{n})$ for the two-symbol measurement shown schematically in figure 3 (solid line, without displacement preceding SPD; dashed line, with displacement) compared to the optimal individual measurement (dotted line) as a function of the mean photon number $\bar{n}$. Without a displacement operation the PIE tends to 2.0498 (an advantage of 2.49%) as derived in equation (52). With displacement the PIE reaches at 2.0564 (an advantage of 2.82%). (b) The probability u of sending the state $| {\psi }_{01}{\rangle }_{}$ in the scheme without (solid line) or with (dashed line) displacement as a function of the mean photon number $\bar{n}$.

Standard image High-resolution image

The PIE attainable for two-symbol measurements can be further improved by introducing a displacement operation before the single photon detector (SPD), shown within a dashed box in figure 3. This operation adds a coherent amplitude β to the field detected by the SPD. Such an arrangement is known as the generalized Kennedy receiver [38, 4042]. As seen in figure 4, optimizing β for a given $\bar{n}$ yields a slightly higher value of PIE, approaching 2.0564 in the limit $\bar{n}\to 0$, which corresponds to $2.82 \% $ enhancement relative to individual measurements. Noticeably, the displacement value β tends to zero with diminishing $\bar{n}$, as seen in figure 5. This result indicates that the asymptotic analysis of mutual information in the low-cost limit carried out in section 3 may yield a different bound if explicit dependence of the measurement operators on the cost parameter is allowed. A similar effect occurs also in the analysis of superadditivity of accessible information in the case of a collective measurement on qubit pairs [9].

Figure 5.

Figure 5. A contour plot of the PIE ${\mathsf{I}}/(2\bar{n})$ for the two-symbol measurement depicted in figure 3 as a function of the mean photon number $\bar{n}$ and the displacement β. The dashed line indicates the optimal displacement value for a given mean photon number $\bar{n}$.

Standard image High-resolution image

5.2. Three-symbol measurements

In the case of three-symbol BPSK words, the one-photon sector relevant to the evaluation of mutual information in the low-cost limit is spanned by four states:

Equation (54)

An essential difference with the two-symbol case is that the four single-photon states listed above are mutually nonorthogonal. This makes it problematic to use equation (47) for optimization as the matrix elements ${}_{}\langle {1}_{[{\bf{j}}]}| \left({\sum }_{r\in {{\mathscr{Z}}}^{\perp }}{\hat{Q}}_{r}\right)| {1}_{[{\bf{j}}]}{\rangle }_{}$ cannot be set independently for each one photon state $| {1}_{[{\bf{j}}]}{\rangle }_{}$. Consequently, one needs to revert back to equation (43) for further analysis. We will denote the four equivalence classes with indices $++\equiv [000]=[111],+-\,\equiv \,[001]=[110],-+\,\equiv \,[010]=[101]$, and $--\,\equiv \,[011]=[100]$.

We have performed a numerical search for input probabilities ${p}_{[{\bf{j}}]}$ and type-${{\mathscr{Z}}}^{\perp }$ POVM elements of rank-1 form in the one-photon sector that maximize the right hand side of equation (43). Up to a permutation of the modes, the numerical results suggest an optimum characterized by a high degree of symmetry with

Equation (55)

and two type-${{\mathscr{Z}}}^{\perp }$ projective operators ${\hat{Q}}_{r}=| {q}_{r}{{\rangle }_{}}_{}{}_{}\langle {q}_{r}| ,r=1,2$, which are given in the orthonormal basis $| 100{\rangle }_{},| 010{\rangle }_{},| 001{\rangle }_{}$ by

Equation (56)

It is straightforward to verify that ${}_{}\langle {q}_{1}| ++{\rangle }_{}={}_{}\langle {q}_{2}| ++{\rangle }_{}=0$. The geometry of the measurement is visualized in figure 6 with a three-dimensional diagram using the fact that the relevant state vectors have real components. The four one-photon states specified in equation (54) are located in vertices of a cube centered at the origin of the coordinate system. The two mutually orthogonal states $| {q}_{1}{\rangle }_{}$ and $| {q}_{2}{\rangle }_{}$ corresponding to projective type-${{\mathscr{Z}}}^{\perp }$ operators lie in the plane perpendicular to $| ++{\rangle }_{}$. This means that the vector $| ++{\rangle }_{}$ defines a subspace on which a type-${\mathscr{Z}}$ measurement needs to be carried out. The vectors $| {q}_{1}{\rangle }_{}$ and $| {q}_{2}{\rangle }_{}$ can be viewed as an implementation of a minimum-error measurement for inputs $| +-{\rangle }_{}$ and $| -+{\rangle }_{}$ projected onto the plane perpendicular to $| ++{\rangle }_{}$.

Figure 6.

Figure 6. Graphic representation of the optimal measurement attaining superadditive advantage for three-symbol BPSK words. Input states $| {\psi }_{000}{\rangle }_{},| {\psi }_{001}{\rangle }_{},| {\psi }_{010}{\rangle }_{}$ corresponding to single-photon states $| ++{\rangle }_{},| +-{\rangle }_{},| -+{\rangle }_{}$ represented by black arrows are measured using a projective POVM with one element parallel to $| ++{\rangle }_{}$ and two others being $| {q}_{0}{\rangle }_{}$ and $| {q}_{1}{\rangle }_{}$, indicated by red arrows. The latter part of the POVM is the minimum error measurement between projections of $| +-{\rangle }_{},| -+\rangle $ onto the plane orthogonal to $| ++{\rangle }_{}$, indicated by dashed arrows.

Standard image High-resolution image

Inserting equation (55) and type-${{\mathscr{Z}}}^{\perp }$ operators specified in equation (56) into (44) yields

Equation (57)

The right hand side reaches maximum at ${v}^{* }\approx 0.0375$ resulting in a PIE bound ${\tilde{{\mathsf{I}}}}_{3}/(3\bar{n})\leqslant 2.0679$, which represents relative $3.40 \% $ improvement compared to individual measurements.

Analogously to the two-symbol case, the above solution suggests an optical implementation shown in figure 7. A single three-port linear optical circuit is constructed such that photons prepared in one of three orthogonal superoposition states $| ++{\rangle }_{},| {q}_{1}{\rangle }_{}$, and $| {q}_{2}{\rangle }_{}$ are routed to different output ports. Homodyne detection is implemented on the port corresponding to the $| ++{\rangle }_{}$ input, while two other ports are monitored by SPDs. The input probabilities are parameterized as ${p}_{000}={p}_{111}=1/2-v$ and ${p}_{001}={p}_{010}=v$. Maximization over v for a given $\bar{n}={\zeta }^{2}$ yields the actual photon information efficiency ${{\mathsf{I}}}_{3}/(3\bar{n})$ shown in figure 8(a) with the optimal value of v depicted in figure 8(b). It is seen that the PIE and and the probability v attain respectively the $3.40 \% $ enhancement and the optimal value implied by equation (57). in the asymptotic limit $\bar{n}\to 0$.

Figure 7.

Figure 7. A schematic representation of a communication protocol attaining superadditive advantage for three-symbol BPSK words. Input states $| {\psi }_{000}{\rangle }_{},| {\psi }_{111}{\rangle }_{},| {\psi }_{001}{\rangle }_{},| {\psi }_{010}{\rangle }_{}$ are first mixed by a set of beam splitters with respective transmissivities 50% and 66% resulting in respective states. The first output mode undergoes a homodyne detection whereas the second and the third one are measured by SPDs.

Standard image High-resolution image
Figure 8.

Figure 8. (a) The PIE defined as ${\mathsf{I}}/(3\bar{n})$ of the three-symbol measurement depicted in figure 7 (solid line) compared to the optimal individual measurement (dotted line) as a function of the mean photon number $\bar{n}$. (b) The probability v of sending either the state $| {\psi }_{001}{\rangle }_{}$ or $| {\psi }_{010}{\rangle }_{}$ which are detected by a SPD as a function of the mean photon number $\bar{n}$.

Standard image High-resolution image

5.3. Orthogonal words

Although for BPSK words longer than two symbols full optimization of the asymptotic expression for mutual information requires resorting to numerical means, some interesting observations can be made when the ensemble of input words is a priori restricted to a subset for which all the one-photon states defined in equation (40) are mutually orthogonal, apart from pairs that differ only by a global minus sign. Such subsets can be systematically constructed using Hadamard matrices, which are real orthogonal matrices with entries ±1 defined in dimensions that are integer powers of 2 and conjectured to exist for any dimension that is a multiple of 4 [54]. The orthogonal words are chosen such that sequences of signs for individual coherent states specified in equation (39) are given by rows of a Hadamard matrix, up to a global phase flip. The orthogonality property of Hadamard matrices mean that for such a reduced subset one-photon states $| {1}_{{\bf{j}}}{\rangle }_{}$ and $| {1}_{{\bf{j}}^{\prime} }{\rangle }_{}$ defined in equation (40) are mutually orthogonal when ${\bf{j}}$ and ${\bf{j}}^{\prime} $ correspond to different rows of a Hadamard matrix.

The orthogonality of the states $| {1}_{[{\bf{j}}]}{\rangle }_{}$ implies that the value of each of the matrix elements ${Q}^{[{\bf{j}}]}={}_{}\langle {1}_{[{\bf{j}}]}| \left({\sum }_{r\in {{\mathscr{Z}}}^{\perp }}{\hat{Q}}_{r}\right)| {1}_{[{\bf{j}}]}{\rangle }_{}$ in equation (47) can be set independently within physical constraints $0\leqslant {Q}^{[{\bf{j}}]}\leqslant 1$. The obvious way to maximize equation (47) is then to take ${Q}^{[{\bf{j}}]}=1$ if $f({p}_{[{\bf{j}}]})\gt 0$ and ${Q}^{[{\bf{j}}]}=0$ otherwise. Recalling that f(t) is positive for arguments $0\lt t\lt 1/{e}^{2}$, this reduces equation (47) to

Equation (58)

Importantly, by taking measurement operators ${\hat{Q}}_{r}$ in the form of projections onto one-photon states $| {1}_{[{\bf{j}}]}{{\rangle }_{}}_{}{}_{}\langle {1}_{[{\bf{j}}]}| $, the second line of equation (44) can be made identically equal to zero, which implies that the bound (58) is equivalent to equation (43).

Suppose now that exactly $M^{\prime} $ indices $[{\bf{j}}]$ satisfy the condition $0\lt {p}_{[{\bf{j}}]}\lt 1/{e}^{2}$. The concavity of f(t) on the interval $0\lt t\lt 1/{e}^{2}$ implies that

Equation (59)

This means that it is always beneficial to make all the probabilities ${p}_{[{\bf{j}}]}$ from the interval $0\lt {p}_{[{\bf{j}}]}\lt 1/{e}^{2}$ identical. Consequently, the sum appearing on the right hand side of equation (58) is maximized by an expression of the form $M^{\prime} f(t)$ under constraints $0\lt t\lt 1/{e}^{2}$ and $M^{\prime} t\leqslant 1$ to guarantee that the sum of the probabilities does not exceed one.

In the case of orthogonal words based on the Hadamard construction one has $M^{\prime} \leqslant M$. In this case a straightforward calculation shows that the mutual information is maximized by one of two strategies. The first one uses $M^{\prime} =M$ and $t=1/M$. The resulting bound on the PIE has the simple form $\mathrm{log}M$. This value can be achieved by feeding Hadamard words into a linear optical circuit which maps them onto the PPM format subsequently detected by single photon detection [10, 12]. The second strategy is to use $M^{\prime} =M-1$ Hadamard sequences with probabilities equal to ${t}^{* }=1/{e}^{3}$ which maximize individual terms in equation (58). The residual probability $1-(M-1){t}^{* }$ should be divided equally between the remaining Hamadard sequence and its phase flipped version that need to be detected by a type-${\mathscr{Z}}$ measurement. The resulting PIE is $2+(M-1)/{e}^{3}$. The first strategy is optimal for $M\geqslant 16$, while the second one offers higher PIE for $M=2,4,8,12$. Its special case for M = 2 is the measurement described in section 5.1. These observations are consistent with the results presented in [55]. It should be kept in mind that the limiting value of PIE is attained when $M\bar{n}\ll 1$ and terms of the order ${(M\bar{n})}^{2}$ and higher can be neglected when calculating the photocount probability. When selecting the communication protocol for a given $\bar{n}$ it may be optimal to choose a finite M [56].

6. Conclusions

We presented a systematic expansion of mutual information for a given ensemble of quantum states and a fixed measurement in the cost parameter characterizing the input ensemble, assuming that with vanishing cost all states converge to one zero-cost state. In the case of optical communication with coherent states, the cost is naturally quantified in terms of the average photon number and the zero-cost state is the vacuum state. Mutual information can be written as a sum of contributions produced by individual measurement outcomes. The expansion depends on whether the measurement operator corresponding to a given outcome detects the zero-cost state or not. In the optical domain, these two classes of operators can be interpreted as photon counting and phase-sensitive detection such as homodyning. The developed formalism substantially helps in deriving bounds on the attainable mutual information in the low-cost limit, reducing the dimensionality of the relevant Hilbert space to zero- and one-photon Fock states and mapping the geometry of the ensemble onto a set of fixed state vectors that are independent of the cost parameter.

In the case of individual measurements on a coherent state ensemble with a vanishing mean photon number, we derived bounds on the attainable photon information efficiency that go beyond those obtained assuming conventional detection techniques. Furthermore, superadditivity of accessible information has been analysed for words composed of the BPSK alphabet of coherent states. The mathematical structure of the derived general bound has been used to identify collective measurements that offer performance beyond individual detection. In the case when single photon states characterizing the words in the low-cost limit are orthogonal one can calculate an analytical value for the asymptotic maximum PIE and the design of the collective measurement is apparent. The remaining cases in which one cannot assume orthogonality are much more involved and in general one need to resort to numerical optimization. We presented experimental schemes based on linear optics, homodyne detection, and photon counting which in the low-cost limit offer superadditive enhancement over the single-symbol measurements equal to 2.49% and 3.40% respectively for two- and three-symbol measurements. Their performance for finite mean photon number has been analyzed using as a benchmark the minimum-error measurement on single symbols, which is known to maximize the mutual information for individual detection. Interestingly, further optimization of collective detection requires a careful adjustment of the measurement to the value of cost parameter.

The developed mathematical formalism can find applications not only in standard classical communication but also in private communication, where the private communication rate can be expressed by the difference of the mutual informations between sender and receiver and the eavesdropper [57, 58]. An extension of our methods in line with [19] to tackle von Neumann or Renyi entropies may allow to analyze low-cost limits of various quantum information quantities like Holevo information [1], coherent information [59, 60] or other quantities, relevant in quantum key distribution.

Acknowledgments

We acknowledge insightful discussions with F E Becerra, R García-Patrón, S Guha and M G A Paris. This work is part of the project 'Quantum Optical Communication Systems' carried out within the TEAM programme of the Foundation for Polish Science co-financed by the European Union under the European Regional Development Fund.

Appendix A.: Photon information efficiency for displaced thermal states

The treatment of individual measurements performed on coherent states presented in section 4 can be generalized to the case of displaced thermal states

Equation (A1)

where $\hat{D}({\gamma }_{j}\zeta )$ is the displacement operator and

Equation (A2)

is a thermal state with the mean photon number nb, which plays the role of the zero-cost state ${\hat{\varrho }}^{(0)}={\hat{\varrho }}_{{\rm{th}}}$. Such states are produced e.g. as a result of propagation of coherent states through a phase-invariant Gaussian channel with excess noise.

In the present case the zero-cost state is mixed and has support on the entire bosonic Hilbert space. Thus all POVM operators ${\hat{Q}}_{r}$ need to be type-${\mathscr{Z}}$. For a displaced thermal state, the SLD difference ${\hat{D}}_{j}$ at $\zeta =0$ defined in equation (16) reads

Equation (A3)

where ${\hat{a}}^{\dagger }$ and $\hat{a}$ denote respectively the creation and the annilation operators of the bosonic mode.

Inserting equation (A3) into (19) yields an upper bound on mutual information ${\mathsf{I}}$ in the low cost limit in the form

Equation (A4)

where we have used ${\rm{Tr}}[{({\hat{a}}^{\dagger })}^{2}{\rho }^{(0)}]=0$ and ${\rm{Tr}}[{\hat{a}}^{2}{\rho }^{(0)}]=0$ as well as the definition of the mean photon number from equation (26). Note that equation (A4) is identical with the first order expansion of the Shannon–Hartley formula [11] for the scenario with excess noise

Equation (A5)

Appendix B.: Homodyne measurement schemes

The statistics of outcomes for shot-noise limited homodyne detection of the real quadrature x on a coherent state with a complex amplitude ζ is given by a Gaussian probability distribution

Equation (B1)

Mutual information for homodyne detection of the BPSK ensemble of coherent states with real amplitudes $| \zeta {\rangle }_{}$ and $| -\zeta {\rangle }_{}$ can be obtained by plugging equation (B1) into (4) and (5)

Equation (B2)

with the summation over results exchanged to integration. Expanding all the functions in the second term up to the fourth order in ζ yields

Equation (B3)

The rightmost expression coincides with the power series expansion of the Shannon–Hartley limit given in equation (38).

The statistics outcomes on the single photon detector (SPD) deployed in the measurement scheme discussed in section 5.1 and depicted in figure 3 with an optional displacement β is given by

Equation (B4)

where k = 0, 1 indicates respectively if any photons were registered or not and $\zeta ^{\prime} $ is the field amplitude before the displacement operation. The probabilities of possible outcomes in the two-symbol measurement design of figure 3 are then given by

Equation (B5)

The mutual information can be written as

Equation (B6)

where H(X, K) denotes the joint entropy of the outcomes at the homodyne detector and the SPD, ${\mathsf{H}}$ is the entropy of a binary random variable defined in equation (37) and $h[p(x)]=-{\int }_{-\infty }^{\infty }{\rm{d}}{x}\,p(x)\mathrm{log}p(x)$.

In the case of a pure SPD measurement without displacement (β = 0) one can calculate mutual information by expanding $\mathrm{log}{p}_{x,0}$ appearing in the expression for $H(X,K)$ up the second order in ζ. Taking the limit of ${\zeta }^{2}=\bar{n}\to 0$ in the resulting expression yields

Equation (B7)

The derived expression is equal to the mutual information of the two-symbol scenario of equation (52) and is maximized by $u=1/{e}^{3}$. The advantage is 2.49% compared to the individual measurement.

Please wait… references are loading.