Asynchronous Chirp Slope Keying for Underwater Acoustic Communication

Schott, Dominik Jan; Gabbrielli, Andrea; Xiong, Wenxin; Fischer, Georg; Höflinger, Fabian; Wendeberg, Johannes; Schindelhauer, Christian; Rupitsch, Stefan Johann

doi:10.3390/s21093282

Open AccessArticle

Asynchronous Chirp Slope Keying for Underwater Acoustic Communication

¹

Department of Microsystems Engineering (IMTEK), University of Freiburg, 79110 Freiburg, Germany

²

Department of Computer Science (IIF), University of Freiburg, 79110 Freiburg, Germany

³

Fraunhofer EMI, 79588 Efringen-Kirchen, Germany

^*

Author to whom correspondence should be addressed.

Sensors 2021, 21(9), 3282; https://doi.org/10.3390/s21093282

Submission received: 29 March 2021 / Revised: 3 May 2021 / Accepted: 3 May 2021 / Published: 10 May 2021

(This article belongs to the Special Issue Applications of Ultrasonic Sensors)

Download

Browse Figures

Versions Notes

Abstract

:

We propose an asynchronous acoustic chirp slope keying to map short bit sequences on single or multiple bands without preamble or error correction coding on the physical layer. We introduce a symbol detection scheme in the demodulator that uses the superposed matched filter results of up and down chirp references to estimate the symbol timing, which removes the requirement of a preamble for symbol synchronization. Details of the implementation are disclosed and discussed, and the performance is verified in a pool measurement on laboratory scale, as well as the simulation for a channel containing Rayleigh fading and Additive White Gaussian Noise. For time-bandwidth products (TB) of 50 in single band mode, a raw data rate of 100 bit/s is simulated to achieve bit error rates (BER) below 0.001 for signal-to-noise ratios above −6 dB. In dual-band mode, for TB of 25 and a data rate of 200 bit/s, the same bit error level was achieved for signal-to-noise ratios above 0 dB. The simulated packet error rates (PER) follow the general behavior of the BER, but with a higher error probability, which increases with the length of bits in each packet.

Keywords:

underwater communication; wireless communication; acoustic communication; ultrasound acoustics; digital signal processing; chirp modulation; chirp slope keying; chirp spread spectrum

1. Introduction

In air, there is a plenitude of electromagnetic wave communications available to connect devices, but due to their strong fading in the underwater channel, acoustic communication has shown beneficial results. While there are several communications systems for deep open water communications available, where offshore industries and naval warfare have accelerated technological advancement, shallow water still challenges communication attempts after over a hundred years of research [1,2,3]. This may stem from the strongly selective frequency fading, high phase noise and fast echoes and for moving nodes, due to a strong Doppler effect which characterizes the instability of the acoustic underwater channel [4,5,6,7,8,9]. The shallower the channel is, the more pronounced this inhibitions become. Previous investigations into the field of acoustic underwater communications have shown promising results [10,11,12], but concentrated on deeper bodies of water over longer distances of several kilometer in audible or sub-audible frequencies [13,14]. While high-bandwidth communication with large spectral efficiencies will be prone to upset the natural habitat if performed in the audible range of the maritime fauna [15,16], narrow-band methods as commonly found in frequency-shift keying (FSK) modems [17,18] are vulnerable to the fading effects discussed before.

The ongoing interest is a result of the strong attenuation of radio signals underwater and the long distances that require to be covered. Application examples, where underwater acoustic communication is crucial are diver tracking [19,20], robot/autonomous underwater vehicle (AUV) telemetry [21,22,23,24], and underwater sensor networks [25,26].

1.1. Historical Overview

Evidence suggests that the general idea to sweep the frequency of a carrier to transmit information may be as old as (breathing) life on earth [27], but the first modern record we found is Hüttmann et al.’s patent for a distance measurement method from 1940 [28]. The idea to use chirps for modulating a signal, e.g., as Chirp Slope Keying (CSK) is often attributed to Winkler et al.’s work in the early 1960s [29], and also sometimes referred to as linear frequency modulation (LFM) [30]. The concept although was patented as early as 1949 by Darlington et al. [31]. After the application in satellite radio transmission during the Cold War era [32,33], the interest in chirp modulations mostly vanished due to the low achievable data rates. For radio communications, this changed in 2013 with the establishment of the LoRa protocol as a low-power long range radio modulation for small consumer applications [34,35].

1.2. Research Problem

We investigate a modulation scheme that is intended for short messages, which in our application of diver status and localization packages are intended to carry an unique ID and a short amount of status information, as well as limited telemetric data, similar to the protocol of the AHOI modem [18]. We found in our previous investigations [36] that conventional chirp slope keying demodulation approaches, e.g., as proposed in the works of [8,23,37,38], show a critical flaw for the application in the transmission of very short multi-band packets, where the probability that a single channel is comprised entirely of one symbol state is high. This leads to one of the matched filter outputs having only the low-valued crosscorrelation of up vs. down chirp to estimate the symbol timing. One common solution is in the use of well known preambles, that feature an adequate amount of all states in all bands, to ensure symbol synchronization, with the draw-back that precious time slots in the channel are occupied by the overheads. Another solution is to implement a coding scheme, which avoids patterns that lead to single channel featuring only a single symbol state. Furthermore, the target application of our communication scheme in diver or UAV communication implicates an underwater environment, where reverberating surfaces are close-by, e.g., the water surface as well as a ship’s hull in case of maintenance divers or the ground for scientific UAVs. Therefore, the assumption of semi-infinite bodies of water does not hold. We emulate this environment by using a small pool for our measurements, as shown in Section 2.5, which introduces strong reverberations from multi-path propagation. Finally, we simulate the overall BER for an idealized channel in Section 4 to provide performance estimators, which can be compared to other systems, along with the PER that is often neglected in other works.

1.3. Related Work

Chirp modulation of acoustic signals under water have gathered attention since the early 2000s [39], and is researched continuously by groups around the world ever since. While the achievable data rate is severely limited compared to narrow-band schemes [5], CSK will be especially of interest, when the data amount is low and the channel inhibitions are strong. Kaminsky and Simanjuntak [37] present a performance evaluation of CSK in additive white Gaussian noise (AWGN) environments and for more realistic underwater models. Lee et al. propose similar methods for acoustic aerial communications (AAC). A low-cost underwater acoustic modem is proposed by Benson et al. [17], which uses FSK instead of CSK. Symbol synchronization is a large issue underwater due to Multi-path and Doppler effects, He et al. [40] therefore propose a self-synchronization method for CSK-type communication. Demirors and Melodia [41] subjoin methods of code division multiplexing to chirp communication to enhance stealthiness. The Fractional Fourier Transform (FrFT) is employed by Yuan et al. [42] to enable a multiuser communication system. Khyam et al. [20] iterate on the multiuser possibility by proposing several sets of orthogonal chirp waveforms. Interference cancellation is crucial for underwater communication, this topic has been addressed by Diamant [43]. In the more recent work of Lee et al. [44], the authors explore the parameter space of chirp spread spectrum (CSS) methods in context of long range communication.

The performance of our approach in this work has partially been reported in [36]. The scope of this contribution is to report on the details of our system’s inner structure and underlying algorithms. This investigation zooms in on the modulation and demodulation part of typical basic elements of digital communication systems [45], hence, expects coded input and will return still coded output. Consequently, additional forward error correction coding is likely to improve the estimation of the overall system [46], but is not part of this investigation.

2. Materials and Methods

We propose a different approach and overcome the demodulators synchronization vulnerability described in Section 1.2 by analyzing the sum of both matched filter outputs for up and down chirps. The resulting superposed signal always features an autocorrelation peak for each symbol, which is then used to retrieve the signal differences between the matched filter outputs to determine the symbol state. We discuss this aspect in detail in Section 2.4. In the following we describe the structure of our communication signal chain in detail, to embed our contribution to the demodulation process properly and fully disclose our method for ease of comparison.

2.1. Basic System Structure

Our system is divided into functional blocks with the modulation and demodulation in focus, as shown in Figure 1; more details about the structure inside those two blocks is discussed in Section 2.2 and Section 2.4.

The data d of length N is modulated into the digital sequence of linear up and down chirps

y_{tx}

, now of length

N_{tx}

, as illustrated in Figure 1. The DAC then converts this into the the output

s_{tx}

, which is an analog continuous real-valued signal of length

T_{tx}

that is boosted by a power amplifier (PA) before it is turned into an acoustic wave by a piezoelectric transducer.

The received signal is bandpass-filtered and amplified by an analog active filter (AF) into the continuous real-valued and band-limited received signal

s_{rx}

of length

T_{rx}

. The ADC samples the received signal into the sequence

y_{rx}

of length

N_{rx}

. The demodulation step estimates the originally sent sequence as

d_{est}

given a small set of prior information about the original signal, e.g., the ideal chirp parameters.

We consider exclusively time discrete signals throughout this work, sampled at points

n = ⌊t f_{s}⌋ \in Z,

(1)

where t is the time of observation and

f_{s}

the sampling frequency. For simplicity we assume the sampling frequency of transmitter and receiver to be equal, save an oscillator frequency mismatch of

Δ f_{s}

and phase offset of

Δ ϕ_{s}

. In practical application, this is not required, but only the parameters that describe the used waveform sufficiently.

2.2. Modulation

Before transmission, data d is multiplexed onto

N_{lo}

sub-bands of information. The

N_{lo}

sub-bands are then modulated through the Chirp Slope Keying (CSK) block. The slope sign of the reference up and down chirps are up-converted by the Digital Up-Converter (DUC) into the transmission bands, see Figure 2. The DUC is often omitted in acoustic communication due to the relatively low frequency of the transmission bands compared to radio communication, but allows for a more efficient use of storage and reduces the computational effort, which is why we include it.

2.2.1. Linear Chirp Creation

Initially, a reference chirp

y_{rbb}

is generated through

\begin{matrix} y_{rbb} [n] & = \{\begin{matrix} w [n] sin (φ [n]), & for 0 < n \leq N_{ref} \\ 0, & else . \end{matrix} \end{matrix}

(2)

For simplicity of calculation we normalize (note that implementations in Matlab often use the Nyquist frequency

f_{nyq} = f_{s} / 2

for normalization instead) the angular frequencies

\begin{matrix} ω_{0} & = 2 π f_{0} / f_{s}, and \\ ω_{1} & = 2 π f_{1} / f_{s} \end{matrix}

(3)

with the start frequency

f_{0}

and the stop frequency

f_{1}

. This allows the definition of the argument

φ [n]

for a linear chirp as

ω [n] = ω_{0} + n \frac{ω_{1} - ω_{0}}{N_{ref}},

(4)

and therefore

φ [n] = φ_{0} + \int_{0}^{n} ω [ν] d ν .

(5)

With (4) and (5), we can calculate the instantaneous phase for a linear sinusoidal chirp according to

φ [n] = φ_{0} + n ω_{0} + \frac{1}{2} n^{2} \frac{ω_{1} - ω_{0}}{N_{ref}} .

(6)

The modulation will become clearer if we substitute

ω_{c} = \frac{ω_{1} + ω_{0}}{2},

(7)

B_{ω} = | ω_{1} - ω_{0} |,

(8)

and introduce

ζ = sign (ω_{1} - ω_{0}) .

(9)

The instantaneous phase of the chirp from (6) then takes the form of

\begin{matrix} φ [n] = & φ_{0} + n ω_{c} + ζ \frac{B_{ω}}{2} n (1 + \frac{n}{N_{ref}}), \end{matrix}

(10)

where each bit of the data is mapped onto the sign

ζ

. The resulting chirp sequence is generated in the base-band frequencies according to (2) through Algorithm A1 and up-converted to each channel through Algorithm A3. In Figure 3 (leftmost), an example for such a base-band chirp is shown and the result of the up-conversion in Figure 3 (center left).

For a fixed transmission channel communication, the up-conversion through a DUC can be omitted and the reference chirps directly be calculated in the transmission band, but for the sake of flexibility, we added the up-conversion as a separate block. The resulting single chirp sequences

y_{ref}

for all

N_{lo}

transmission channels can be stored permanently and only requires to be recalculated, if the parameters, e.g., sampling frequency

f_{s}

, chirp length

N_{ref}

, side-band center frequency

f_{ch}

or bandwidth B change. While intuitively both chirp slope sequences may be pre-generated, we omit this redundancy on implementation as the inverse slope sign is equivalent to a time reversal of the entire chirp sequence.

2.2.2. Shaping

The amplitude shaping window

w [n]

restricts the sequence to be non-zero in the interval between 0 and

N_{ref}

only. While this can be achieved through different window functions, the tapered cosine, i.e., Tukey window is used in this work, because it can be varied easily between the rectangular, i.e., Dirichlet window and a sine, i.e., Hann window, by changing the single tuning factor

a_{t}

to 0 or 1, respectively [47]. This sets the main lobe width between

4 π / N_{ref}

and

8 π / N_{ref}

, as well as the peak sidelobe between

- 13

d

B

and

- 31

d

B

[48]. The Tukey window is mathematically defined as [48]

w [n] = \{\begin{matrix} \frac{1}{2} [1 - cos (\frac{π}{N_{tk}} n)], & for 0 < n \leq N_{tk} \\ 1, & for N_{tk} < n \leq (N_{ref} - N_{tk}) \\ \frac{1}{2} [1 - cos (\frac{π}{N_{tk}} (n - (N_{ref} - N_{tk})))], & for (N_{ref} - N_{tk}) < n \leq N_{ref} \\ 0, & otherwise, \end{matrix},

(11)

where the threshold of the taper is set by

N_{tk} = \frac{a_{t} N_{ref}}{2} .

The amplitude shape of the chirp can thus be adapted to the channel, depending on the application, i.e., if a narrow autocorrelation peak width is required for spatial distinction of two close reverberations or wide smooth peaks are desired for more robust communication. The implementation we used in this work is described in Algorithm A2. A simplified comparison of a selection of window functions for a close echo is shown in Figure 4 and Figure 5.

2.2.3. Input Multiplexing

When data d of length N is submitted for transmission, the 1-D binary sequence is first multiplexed onto

N_{lo}

channels through

\begin{matrix} d_{mux} [k, l] & = d [n], where \\ n & = (k - 1) N_{lo} + l, \\ k & \in [1, N_{sym}], \\ l & \in [1, N_{lo}] . \end{matrix}

(12)

The multiplexed sequence length

N_{sym}

is

N_{sym} = ⌊\frac{N}{N_{lo}}⌋,

(13)

therefore N needs to be an integer multiple of

N_{lo}

. To assert this, we use a simple zero-padding algorithm. If bits are added, they will remain in the data on reception and need to be removed on a higher level later on. This can be avoided, by matching the data length to the desired number of channels in advance, but for a more flexible and general approach, we implement the zero-padding approach and truncate to byte-sizes of 8.

2.2.4. Chirp Slope Keying

A fast way to modulate the binary sequence is to have a simple decision block, that will output a chirp sequence of either upward or downward slope according to the binary value at the input (see Algorithm A5). The segments of the ouput sequence

y_{tx}

are assembled by filling each interval of length

N_{ref}

with the normalized sum of the superposed channels’ sequences to

y_{tx} [n] = \frac{1}{N_{lo}} \sum_{l}^{N_{lo}} y_{ref} [n, l, d_{mux} [k, l]] .

(14)

The output sequence is transferred into an analog signal

s_{tx}

, e.g., by a DAC, amplified by the PA and transmitted. Alternatively, this modulation can be calculated by zero-padding the multiplexed bit sequences

d_{mux}

by length

N_{ref}

and convolving the resulting sequence with the reference signal

y_{ref}

, but this approach is neither efficient in memory usage, nor the calculation steps required [49].

2.3. Channel Model

We adapted the simulation approach from [37] to estimate the performance of the modulation and demodulation modules in a controlled fashion. The channel model includes simplified Rayleigh fading that multiplies the signal amplitude by the magnitude of two independent, but identically distributed random processes

\begin{matrix} A_{r} & = | (randn \{N_{rx}\} + i \cdot randn \{N_{rx}\}) σ_{r} |, where \\ i & = \sqrt{- 1}, \end{matrix}

(15)

with a distribution parameter

σ_{r} = 1

. The random sequences are generated through the randn function of Matlab that generates a normally distributed random value. An AWGN in the form of

ϵ_{n} = σ_{g} \cdot randn \{N_{rx}\}

(16)

is used to model the thermal noise of the receiver.

For simulations, we assume the receiver samples a combination (15) and (16) with the transmitted signal, at a random packet reception time offset

n_{τ}

, without additional reverberations from multiple paths

y_{rx} [n] = A_{r} y_{tx} [n + n_{τ}] + ϵ_{n} .

(17)

2.4. Demodulation

The digitized signal

y_{rx}

is translated into the baseband for each of the

N_{lo}

channels in the Digital Down-Converter (DDC) block, as shown in Figure 6. The Fast Hilbert Cross-Correlator (FHX) block compresses the signal further into arrays

y_{fhx}

for additional dimensions for each of the reference chirps of both slope signs. The block Join & Downsample (JDS) attempts coherent addition and subtraction of the 2 signal arrays for each channel. The resulting sum and difference signals in in

y_{jds}

are analyzed by the Frame Detect & Downsample (FDDS) block and the input signal divided into separate frames

y_{sym}

, now at symbol rate. The final decision block translates the symbols into binary values d and estimates the demodulation performance. Each block is described below in detail.

2.4.1. Digital Down-Converter

Before the signal is fed into the resource intensive compression algorithm, we exploit the bandlimited nature of the signal and bring it down into the baseband, by calculating

\begin{matrix} y_{tb} [n, c] = & BPF \{y_{rx} [n]\}, \\ y_{ib} [n, c] = & y_{tb} [n] y_{lo}, \\ y_{bb} [n, c] = & LPF \{y_{ib} [n, c]\}, \end{matrix}

(18)

where the functions

LPF

denotes an arbitrary lowpass filter, and

BPF

any suitable bandpass filter. The implementation is attached in Algorithm A4. The signal content outside of the band is suppressed by the analog bandpass filter of the receivers signal conditioning before the sampling. This is especially important for undersampling a signal to limit the aliasing effect of noise. To achieve the downconversion we first multiply the bandpass filtered raw signal

y_{tb}

of each transmission band with sine waves

y_{lo}

of frequency

f_{lo}

to create the intermediate signal

y_{ib}

. This operation shifts the content of each of the

N_{lo}

channels into the baseband, where a lowpass filter removes the higher harmonics and produces the baseband sequence

y_{bb}

. In doing so, the memory consumption increases by the number of channels

N_{lo}

, a one-dimensional real-valued input sequence of

[N_{rx}, 1]

gets mapped onto an

[N_{rx}, N_{lo}]

output array. The sequence can be truncated in frequency domain to an interval around the center, since most of the frequency bands ideally contain no information about the signal and one loses only information about noise and interference. We effectively resample the sequence to

\begin{matrix} y_{ddc} = & resample \{y_{bb}, f_{s}, f_{s 1}\}, where \\ f_{s 1} = & \frac{f_{s}}{N_{res 1}} . \end{matrix}

(19)

As we use the single sideband approach, a minimal interval is limited by the center frequency

f_{c} = \frac{1}{2} (f_{1} + f_{0})

(20)

and half the bandwidth

B_{f} = (f_{1} - f_{0}) .

(21)

Assuming a sampling rate of, e.g.,

f_{s} = 88 kHz

, a bandwidth of

B_{f} = 2.5 kHz

and a sub-band center frequency of

f_{c} = 3.0 kHz

as used in the dual-band case, the minimal one-sided base band is

B_{fbbm} = f_{c} + \frac{1}{2} B_{f},

(22)

which is for the given example

B_{fbbm} = 4.25 kHz

. Considering the original sample bandwidth and unchanged frequency bin width, the computation is reduced to

B_{fbbm} / \frac{1}{2} f_{s}

, here by about 90% at most. The minimal interval truncation also removes information about the noise, so a trade-off is feasible that implements a larger interval of several bandwidths. Moreover, the whole band-shifting and resampling can effectively be done in the frequency domain with a shift and truncate operation, as described in detail in Algorithm A4. An example for a result of this operation is shown in Figure 3.

2.4.2. Pulse Compression by Fast Hilbert Cross-Correlation

If time and magnitude of a received chirp are of interest, the calculation of the analytic signal after pulse compression through a matched filter is convenient. Hence, the next signal processing step is to convolve (operator ⊛) the received signal with the matched filter for both chirp slope signs

\begin{matrix} y_{mf ↑} = y_{ddc} ⊛ y_{rbb ↑}, \\ y_{mf ↓} = y_{ddc} ⊛ y_{rbb ↓}, \end{matrix}

(23)

This increases the memory allocation to

[N_{rx}, N_{lo}, 2]

samples, as the downsampled sequences are compressed by both, up and down chirps. In case more different chirps are used, this increases the added dimension accordingly. The compressed pulse’s envelope is then calculated as the analytic signal through the norm of the signal and its Hilbert transform

y_{fhx} = \sqrt{y_{mf}^{2} + H {\{y_{mf}\}}^{2}} .

(24)

The calculations, both, matched filtering and envelope extraction, are performed in the frequency domain for convenience. After the Fourier transformation of the raw signal, we perform a bin-wise multiplication against the complex conjugated reference signals to obtain the compressed signals for both up and down chirps.

2.4.3. Join & Downsample

Frame detection and symbol decision require information about the compressed pulse peak positions in time, which are difficult to establish in one matched filter branch, e.g., only the up chirp compression result, as there may be no peaks present, if the signal hypothetically only consists of down chirps. The JDS block first resamples the sequence to an integer fraction by

N_{res 2}

to the sample rate

\begin{matrix} y_{res 2} = & resample \{y_{fhx}, f_{s 1}, f_{s 2}\}, where \\ f_{s 2} = & \frac{f_{s 1}}{N_{res 2}}, \\ N_{jds} = & ⌊\frac{N_{fhx}}{N_{res 2}}⌋, \end{matrix}

(25)

then creates the sum and difference

\begin{matrix} y_{sum} [n, c] & = y_{fhx ↑} [n, c] + y_{fhx ↓} [n, c], \\ y_{dif} [n, c] & = y_{fhx ↑} [n, c] - y_{fhx ↓} [n, c] . \end{matrix}

(26)

This operation requires coherence, since a phase difference between the up and down chirp compressed sequences leads to sub-optimal symbol detection. This condition will be fulfilled only if no Doppler shift is present, so sender and receiver do not move relative to each other [50]. For this work, we exclusively considered stationary conditions. The sum and difference sequences are stored in a joint array

y_{jds}

of size

\{N_{jds}, N_{lo}, 2\}

.

2.4.4. Frame Detect & Downsample

The FDDS block first estimates the frame positions in half symbol space, then uses this information to estimate the symbol phase of each frame and downsample it to full symbol space. First, we assume a known symbol length

N_{ch 2}

from the reference chirp sequence and estimate it simply to

N_{ch 2} = ⌊T f_{s 2}⌋ = ⌊N_{ref} \frac{f_{s 2}}{f_{s 1}}⌋ .

(27)

The mean magnitude of each of the

M_{H}

half symbol frames of length

N_{H}

, where

N_{H} = ⌊\frac{N_{ch 2}}{2}⌋,

(28)

is then calculated by only regarding the superposed pulses of both channels, which guarantees the presence of an autocorrelation peak in each symbol. Therefore, we calculate

\begin{matrix} y_{ms} [m] = & \sum_{n}^{N_{H}} y_{sum} [n + (m - 1)] N_{H}, where \\ m \in & [1, M_{H}], \\ M_{H} = & ⌊\frac{N_{jds}}{N_{H}}⌋, \end{matrix}

(29)

which reduces strong magnitude fluctuations before the data frame detection and resamples the sequence to half symbol space. As the envelope detection is very sensitive to non-steady slopes, we apply an additional 10th order lowpass filter

y_{mLP} = LPF \{y_{ms}\},

(30)

with an estimated cutoff frequency

\begin{matrix} ω_{MLP} = & \frac{m_{MLP}}{M_{H}}, where \\ m_{MLP} = & arg max \{| FFT \{y_{ms}\} |\} . \end{matrix}

(31)

The frame detection algorithm has two parts. Initally, a threshold is calculated for the whole received sequence of each channel, then a state machine iterates through it and extracts frame start and end times. We estimate the threshold

y_{th}

by a simple clustering, that first calculates the mean amplitude of the lowpass filtered half symbol magnitude

{\bar{y}}_{mLP} = mean \{y_{mLP}\},

(32)

then calculates the cluster means for both sides of the mean level,

\begin{matrix} {\bar{y}}_{mS} = mean \{y_{mLP} [y_{mLP} > {\bar{y}}_{mLP}]\}, \\ {\bar{y}}_{mN} = mean \{y_{mLP} [y_{mLP} < {\bar{y}}_{mLP}]\}, \end{matrix}

(33)

where the lower mean value

{\bar{y}}_{mN}

is considered the noise level and the upper mean

{\bar{y}}_{mS}

the signal level. The threshold is then simply the arithmetic mean of those two levels

y_{mTh} = \frac{({\bar{y}}_{mS} + {\bar{y}}_{mN})}{2} .

(34)

Subsequently, the state machine iterates through the sequence

y_{mLP}

and records an upwards slope if there are

M_{HL}

of samples below the threshold

y_{mTh}

followed by

M_{HH}

samples above it. We set both intervals as

M_{HL} = M_{HH} = 2

, limiting the miminal frame size to

M_{HL} + M_{HH} - 1 = 3 samples

. A state variable will keep track if the iteration is inside a frame and stores start index

m_{0} [p]

and end index

m_{1} [p]

of each pth frame. The frame limits are then reconstructed in sample space through scaling the indice by

M_{H}

,

\begin{matrix} n_{0} [p] & = m_{0} [p] N_{H}, and \\ n_{1} [p] & = m_{1} [p] N_{H} . \end{matrix}

(35)

The single frames in sample space are then defined as

\begin{matrix} y_{fSum} [p, n] = & y_{sum} [n_{0} [p] + n], where \\ y_{fDif} [p, n] = & y_{dif} [n_{0} [p] + n], where \\ n \in & [1, N_{frm} [p]], \end{matrix}

(36)

where

y_{sum}

and

y_{div}

are the two sub-arrays of

y_{jds}

and include all

N_{lo}

channels as an additional dimension, respectively. The indexing of the channel dimensions has been omitted for ease of reading. The number of samples in each frame is

N_{frm} [p] = n_{1} [p] - n_{0} [p] .

(37)

The last part of the block selects each data frame in the sample space, searches for the optimal sample offset

n_{off}

to maximize the symbol power and assembles a frame in symbol space accordingly. We assemble the power matrix for each frame p and each channel by iterating through the phase sample by sample

\begin{matrix} A_{y} [p, n] = & \sum_{k}^{K [n]} {(y_{fSum} [p, n + (k - 1) N_{ch 2}])}^{2}, where \\ K [n] = & ⌊\frac{N_{jds} - n}{N_{ch 2}}⌋ . \end{matrix}

(38)

The optimal sample offset

n_{off}

is then estimated to

n_{off} [p] = arg max_{n} \{A_{y} [p, n]\} .

(39)

We use this to assemble the block’s final three-dimensional output sequences

y_{sym}

, that span the number of detected frames, in each of which the number of symbols, and a constant number of channels. Hence, occupy memory is of size

[N_{frm}, N_{sym}, N_{lo}]

as we decimate

y_{sym} [p, k] = y_{fDif} [p, n_{off} [p] + (k - 1) N_{ch 2}],

(40)

again the indexing for all channels is omitted.

2.4.5. Symbol Decision

The symbol decision iterates through each frame’s symbol space difference sequence

y_{sym}

similarly to (32) to (34) of Section 2.4.4, by separating each frame in two clusters split by the mean symbol amplitude, and estimates the half distance between both clusters’ means as a threshold

y_{fTh}

for symbol decision for each channel. The decision equation is, therefore, simply

d_{rmx} [k] = \{\begin{matrix} 1, & for y_{sym} [k] > y_{fTh} \\ 0, & otherwise, \end{matrix}

(41)

for the

k th

symbol of each channel and frame.

2.4.6. De-Multiplexing

The last block of the demodulation chain re-assembles the

N_{lo}

-dimensional symbol sequences of each frame into a one-dimensional bit sequence. The length of the received bit sequence

N_{est}

is first truncated to multiples of 8, as the application is meant to send and receive data bytewise, hence

N_{est} = 8 ⌊\frac{N_{lo} N_{sym}}{8}⌋ .

(42)

The data is then de-multiplexed by reshaping the sequences

d_{rmx}

with n in the range

[1, N_{est}]

to

\begin{matrix} d_{est} [n] & = d_{rmx} [k, l], where \\ k & = \frac{n}{N_{lo}}, \\ l & = n \mod N_{lo} . \end{matrix}

(43)

2.5. Experimental Set-Up

We conducted two experimental runs to verify our approach. One of a single band transmission, the other of a dual-band transmission. The experiments were performed in a steel-walled pool as shown in Figure 7, which was assembled temporarily inside a building. The transmitter and receiver hardware is a modified version of the indoor localization system [51], as we published before [19,36].

2.5.1. Frequency Band Considerations

Acoustic underwater communication influences the maritime habitat, hence system and acoustic experiment designs have to minimize the interruption of natural communication [52] and ideally avoid mimicking animal calls (compare [53]). Fish and sharks have no hearing sensitivity for frequencies far above 10

k

Hz

[15,52]. Sea mammals, e.g., dolphins, seals, and whales on the other hand are highly sensitive to frequencies of up to 150

k

Hz

[54,55,56,57,58]. The general structure of sea mammal’s sound creation and hearing is known from anatomic considerations and follows similar mechanisms, with some whales having acoustic matching melons in their frontal part of the head that also serve as an acoustic bandpass and lens [57,59]. From a behavioral point of view, seals and dolphins are highly relevant, as they are well researched and commonly found near harbors and shores. As a simplified design rule, we regard that seals’ hearing is sensitive to sound frequencies below 80

k

Hz

, with already increased sound pressure level thresholds, i.e., decreased hearing sensitivity above 60

k

Hz

[60]. For dolphins, this hearing threshold is approximately 200

k

Hz

, with decreased sensitivities above 150

k

Hz

[54,55,56,57]. For all practical purposes, it has to be assumed that every transmission will be audible to sea mammals in the vicinity and can cause potential harm or changes in behavior. Additionally, the attenuation of acoustic underwater waves exceeds 20

d

B

km⁻¹ for those frequencies, limiting the spatial sphere of influence. A limiting factor for coastal applications is natural and artificial noise, e.g., from the surf and ship traffic, which we regard as Brownian noise decaying at about 18

d

B

per decade [16,61]. For our system, we limit the communication band therefore as in Table 1.

2.5.2. Experiment Parameters

The sampling rate of our acquisition unit is limited to

f_{s} = 88 kHz

. As a result, the received signal is undersampled, i.e., the Nyquist frequeny is below the transmission band. While this mixing operation generally results in a leakage of signal power, the band-limited nature of the chirp sequences and the low noise environment limit this aliasing effect. This band-limitation is ensured by an additional analog bandpass-filter. The chirp parameters are listed in Table 2 for the single band and dual-band transmission.

The symbol rate and occupied bandwidth of both transmissions is kept constant. Therefore, the single band signal has twice the TB compared to the dual-band one. This implicates the ratio of symbol energy to noise energy to double as well [62],

\frac{E_{sym}}{E_{n}} = T B γ,

(44)

where

γ

is the signal-to-noise ratio of the received signal. The expected data rate on the other hand is halved, as each symbol only contains half the bits.

3. Results

Channel Frequency Response

The transmitted and received signals are shown in Figure 8 as spectrograms over frequency and time. The undersampling introduces harmonic interference outside of the transmission band of the recorded signal, which are not physically present in the medium itself. Those phantom bands are removed on downsampling, by narrow bandpass filters.

The power levels are more clearly visible in the averaged plots of Figure 9. The noise floor confirms the assumption of AWGN outside of the transmission, with an approximate SNR of 65

d

B

. The interference caused by the transmission itself raises the average power outside of the transmission band for about 30

d

B

.

4. Bit Error Rate and Packet Error Rate Simulations

The bit error rate (BER) and packet error rate (PER) of the proposed algorithms are estimated through simulation for an idealized channel as described in Section 2.3. We define the BER in two ways: By comparing each bit in the order of demodulation through the exclusive or (XOR) operation

r_{BE} = \sum_{n} (d_{est} \oplus d) + | N_{est} - N |,

(45)

and by cross-correlation (XCorr), which returns the maximum match between the transmitted sequence d and demodulated sequence

d_{est}

r_{BExc} = max_{n} \{d_{est} ⊛ d\} + | N_{est} - N |,

(46)

both of which include differences in the number of bits to account for additional or missing bits. The former (XOR) we regard for data transmission, where the content of the sequence is not known at the receiver, while the latter (XCorr) indicates the performance, if a known set of codes is expected. The PER is defined through the relative number of erroneous packets compared to the total number of sent packets, where a packet error is any packet that includes at least one bit error. For the PER we consider bit errors according to (45).

The probability of errors approximately follows the error function (erfc) over the SNR [63]. While there are closed-form approximations for LoRa [62,64], to ease comparisons we approximate those through superposed error functions

P_{be | pe} (γ) = \sum_{q} (A_{q} erfc \{{(B_{q} 10^{γ / 10})}^{1 / 2^{q}}\}),

(47)

which were fitted manually for the coefficients in Table 3 and Table 4 for the BER and PER simulation as shown in Figure 11, respectively.

5. Discussion

The transmissions in both scenarios (see Figure 10) were demodulated without error in the single and dual-band verification runs. The verification only cover a small range and is not meant to be exhaustive for a characterization of the system. The first point to notice is the relative lack of noise in the signal, which is illustrated by the blue background in Figure 8 over 60

d

B

below the highest signal levels. If we closely inspect the right edge of the plots in the lower row Figure 8c,d, the long ring-down of the signal spanning over more than 100

m

s

is visible as a brighter colored leg smearing the stronger power bands in time. This reminiscence of the signal in the channel affects the transmission as inter-signal interference and is the strongest cause of error in our transmission. The channel impulse response itself depends on the geometry of the body of water and the environment conditions, which are not covered by this investigation. However, the general behavior of the proposed communication scheme will hold in similar environments, and improve for less challenging conditions. The phantom bands that appear for higher and lower frequencies around the transmission are caused by the undersampling on reception and are not present in the physical channel. They are removed in the DDC by a narrow bandpass-filter and only will cause the signal to be corrupted, if one of the phantom bands overlap with the intermediate band where most of the signal power is shifted to. When we compare the frequency spectrum in and outside of symbols in Figure 9, the high signal-to-noise ratio becomes more obvious: The noise floor is at approximately 65

d

B

below the highest signal levels (blue lines), while the interference raises the floor to about

- 40

d

B

(red line, outside the intermediate bands).

If we regard the distributions of the symbols in Figure 10 around the decision threshold, which is at

y_{dif} = 0

, the single band levels show a multi-modal behavior. In the dual-band case, the distributions in all channels are much closer to uni-modal distributions, which is the ideal case, but are still far from the optimal. Optimal symbol levels would be achieved, if there would be two symmetrical probability bands at the minimal and maximal edge of the differential signal levels. This effect is due to the symbol peaks of up and down chirps not aligning in phase, but are shifted to each other. The symbol synchronization in the superposed signal locks-in on the matched filter output

y_{fhx}

that shows the strongest mean correlation, therefore pushing the other symbols out of phase. An additional phase offset correction before the summation of both matched filter outputs, i.e., before the JDS block, would move the secondary symbol levels closer to their ideal state and increase the overall performance, but is not included at this point. The general feasibility of our approach is thereby shown, but further practical verification is required.

If we regard the results in Figure 11, the simulation of the transmission error probabilities through BER and PER does not include the multi-path response and, therefore, describes the idealized performance of the algorithm at that point. The approximate far field fading of acoustic signals in open water of approximately 20

d

B

km⁻¹ in seawater can be used to design the acoustic output to suitable levels, e.g., for a BER of up to

0.1

% over a range of 2000

m

, the overall channel losses of approximately 40

d

B

have to be countered by about a combined gain of transmitter and receiver of 33

d

B

in the single band case and 39

d

B

in the dual-band case. More generally, the dual-band transmission is shifted by approximately 6

d

B

, which confirms the assumptions from (44) that a reduction of the bandwidth B to half the single band value also halves the symbol power

E_{sym}

. The addition of a channel impulse response with additional echoes is expected to increase the error probability for all scenarios, as it introduces additional interference.

The PER follows the behavior of the BER, which is due to the definition that a packet is considered erroneous as soon as a single bit is demodulated wrongly. The error curves’ behavior for lower SNR to approach values over

10^{1}

may be counter-intuitive for synchronized transmissions or when the number of symbols in a packet is known. Since we allow for arbitrary lengths and have no additional synchronization, for those conditions more packets are detected than were initially sent. This implies very short or fragmented packets, which can be disregarded on a higher level, if an additional protocol is implemented, e.g., that restricts the syntax of valid packages as in the NMEA 2000 standard of the National Marine Electronics Association [65].

If we regard the initially set application of acoustic communication in reverberating environments, the validation runs have shown that we can successfully transmit data even inside a very small body of water, while the simulation hints at the performance for larger ones, especially lakes and harbor areas, where there is little surf and most interference stems from ship engines. The change of acoustic properties in salt-water compared to fresh water is proportional to the length of the signal propagation, therefore for close proximity communications negligible, but requires consideration for long distance links. Hence, we assume the results of the pool runs can be extended to application at sea, lakes or oceans, albeit not as an accurate performance indicator for open water communication, but rather a worst-case scenario, in a highly reverberating environment, e.g., if two maintenance divers try to communicate in bad sight conditions near the ground or close to a ship’s hull.

6. Conclusions and Future Works

The proposed demodulator for preamble-free Chirp Slope Keying was implemented and the complete signal chain simulated and tested inside a measurement pool in a laboratory scale experiment for transmission rates of 100 bit/s for single band communications, as well as 200 bit/s for dual-band communication. The available bandwidth and symbol length was kept constant. The achieved BER estimated through the bitwise xor operation was simulated to drop below 0.001, i.e.,

0.1

% for SNR above

- 6

dB for a TB of 50 in the single band mode and for SNR above 0 dB for a TB of 25 in dual-band mode. The correct detection of packages and the demodulation was successfully implemented, verified and simulated as well. The PER follows the BER with an SNR offset of approximately 1 dB. The simulated channel contained Rayleigh fading and set the SNR through Additive White Gaussian Noise. A model for fitting the simulation results and parameters were disclosed, and required extensions for a more realistic simulation model discussed. The approach removes the necessity of preambles for multi-band communication that consume the limited available time slots. While the achieved data rates are low compared to narrow band communication schemes, the feasibility in a highly reverberating water tank has been shown, where those schemes will tend to exhibit high error rates if the preamble is not found. Equalization techniques can be employed to achieve better bit and packet error rate for lower signal-to-noise ratio, together with the design of specific RAKE devices to mitigate the multi-path characteristic of the underwater environment. We also strongly believe that investigations on the animals’ underwater hearing behaviour have to be done in order to better estimate the impact of artificial noise on the underwater environment. Further investigations can include the issues arising from the multiple access to the medium and in situations, where oxygen bubbles coming from the divers equipment are disturbing the channel.

Author Contributions

Conceptualization, D.J.S. and A.G.; methodology, D.J.S.; software, D.J.S.; validation, D.J.S., A.G., W.X. and G.F.; formal analysis, D.J.S.; investigation, D.J.S.; resources, D.J.S.; data curation, D.J.S.; writing–original draft preparation, D.J.S.; writing–review and editing, D.J.S., A.G., W.X., G.F. and S.J.R.; visualization, D.J.S.; supervision, F.H., J.W., C.S. and S.J.R.; project administration, F.H., J.W., C.S. and S.J.R. funding acquisition, F.H., J.W., C.S. and S.J.R. All authors have read and agreed to the published version of the manuscript.

Funding

This work has been supported financially under the KMU innovativ initiative of the German Federal Ministry of Education and Research (BMBF), funding number (FKZ) 01IS16010A “ULTa” and 01IS18011C “Smart Diver 4.0”, as well as by the state of Baden-Württemberg in the framework of the MERLIN project.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:

AF	Active Filter
AWGN	Additive White Gaussian Noise
BB	Baseband
BER	Bit Error Rate
BPF	Bandpass Filter
CSS	Chirp-Spread Spectrum
CSK	Chirp Slope Keying
DDC	Digital Down-Converter
DUC	Digital Up-Converter
FDDS	Frame Detect & Downsample
FrFT	Fractional Fourier Transform
FHX	Fast Hilbert Cross-Correlator
FSK	Frequency Shift Keying
JDS	Join & Downsample
LFM	Linear Frequency Modulation
LPF	Lowpass Filter
MUX	Multiplexer
PA	Power Amplifier
PER	Packet Error Rate
RX	Received or Receiver
SNR	Signal-to-Noise Ratio
TB	Time-Bandwidth Product
TX	Transmitted or Transmitter
UAV	Underwater Autonomous Vehicle
XOR	Exclusive Or Operation
XCorr	Cross-Correlation

Appendix A

Algorithm A1: Linear Chirp Generation.

input :

f_{0}

start frequency,
T length of chirp in seconds,

f_{1}

end frequency,

f_{s}

sampling frequency.
output: y 1-D vector containing a real-valued linear chirp sequence
// Translate frequency input into relative frequencies:
1

ω_{0} = 2 \cdot π \cdot f_{0} / f_{s}

;
2

ω_{1} = 2 \cdot π \cdot f_{1} / f_{s}

;
// Prepare sample time:
3

N = ⌊T \cdot f_{s}⌋

;
4

n = 0 : (N - 1)

;
// Calculate the shaping window
5

y_{w} = tukey (N, 0.5)

;
// The discrete sampled time limited chirp signal is calculated as follows:
6

k = (ω_{1} - ω_{0}) / 2 \cdot N_{ch}

;
7

ω = ω_{0} + k \cdot n

;
// Furthermore, lastly the shaping function is simply superposed:
8

y = y_{w} \cdot s i n (ω \cdot n)

;

Algorithm A2: Partially Constructed Tukey Window.

input : N length of window in samples,
a taper fraction.
output: y 1-D vector containing a real-valued window sequence
// Prepare taper thresholds:
1

N_{th} = ⌊a \cdot (N / 2 - 1)⌋ + 1

;
// Calculate 1st leg: Up slope
2

n_{I} = 0 : (N_{th} - 1)

;
3

y_{I} = (1 - cos (n_{I} \cdot π / (N_{th} - 1))) / 2

;
// Calculate 2nd leg: Flat top
4

n_{II} = N_{th} : (N - N_{th} - 1)

;
5

y_{II} = ones (1, length (n_{II}))

;
// Calculate 3rd leg: Down slope
6

N_{off} = (N - 1) \cdot (1 - a / 2)

;
7

n_{III} = (N - N_{th}) : N

;
8

y_{III} = (1 - cos ((n_{III} - N_{off}) \cdot π / (N_{th} - 1))) / 2

;
// Join legs:
9

y = [y_{I}, y_{II}, y_{III}]

;

Algorithm A3: Digital Up-Conversion.

Algorithm A4: Digital Down-Conversion.

Algorithm A5: Chirp Slope Keying Modulator.

References

Batchelder, J.M. Submarine Signal. U.S. Patent 368272, 16 August 1887. [Google Scholar]
Gray, E.; Mundy, A.J. Transmission of Sound. U.S. Patent 636519, 7 November 1899. [Google Scholar]
Mundy, A.J.; Gale, H.B. Apparatus for Producing Submarine Sound-Signals. U.S. Patent 842327, 29 January 1907. [Google Scholar]
Stojanovic, M.; Catipovic, J.; Proakis, J.G. Adaptive multichannel combining and equalization for underwater acoustic communications. J. Acoust. Soc. Am. 1993, 94, 1621–1631. [Google Scholar] [CrossRef]
Stojanovic, M. Recent advances in high-speed underwater acoustic communications. IEEE J. Ocean. Eng. 1996, 21, 125–136. [Google Scholar] [CrossRef]
Sharif, B.S.; Neasham, J.; Hinton, O.R.; Adams, A.E. A computationally efficient Doppler compensation system for underwater acoustic communications. IEEE J. Ocean. Eng. 2000, 25, 52–61. [Google Scholar] [CrossRef]
Akyildiz, I.F.; Pompili, D.; Melodia, T. Challenges for efficient communication in underwater acoustic sensor networks. ACM Sigbed Rev. 2004, 1, 3–8. [Google Scholar] [CrossRef]
Steinmetz, F.; Heitmann, J.; Renner, C. A Practical Guide to Chirp Spread Spectrum for Acoustic Underwater Communication in Shallow Waters. In Proceedings of the Thirteenth ACM International Conference on Underwater Networks & Systems, Shenzhen, China, 3–5 December 2018; Association for Computing Machinery: New York, NY, USA, 2018. [Google Scholar] [CrossRef]
Steinmetz, F.; Renner, C. Resilience against Shipping Noise and Interference in Low-Power Acoustic Underwater Communication. In Proceedings of the OCEANS 2019 MTS/IEEE SEATTLE, Seattle, WA, USA, 27–31 October 2019; pp. 1–10. [Google Scholar] [CrossRef]
Zhang, G.; Hovem, J.M.; Dong, H.; Zhou, S.; Du, S. An efficient spread spectrum pulse position modulation scheme for point-to-point underwater acoustic communication. Appl. Acoust. 2010, 71, 11–16. [Google Scholar] [CrossRef]
Xu, J.; Li, K.; Min, G. Asymmetric multi-path division communications in underwater acoustic networks with fading channels. J. Comput. Syst. Sci. 2013, 79, 269–278. [Google Scholar] [CrossRef]
Xing, S.; Qiao, G.; Tsimenidis, C. A novel two-step Doppler compensation scheme for coded OFDM underwater acoustic communication systems. Procceedings of the 4th Underwater Acoustics Conference and Exhibition UACE2017, Skiathos, Greece, 3–8 September 2017; pp. 351–356. [Google Scholar]
Domingo, M.C. Overview of channel models for underwater wireless communication networks. Phys. Commun. 2008, 1, 163–182. [Google Scholar] [CrossRef]
Hovem, J.M. Underwater acoustics: Propagation, devices and systems. J. Electroceramics 2007, 19, 339–347. [Google Scholar] [CrossRef]
Popper, A.; Hastings, M. The effects of anthropogenic sources of sound on fishes. J. Fish Biol. 2009, 75, 455–489. [Google Scholar] [CrossRef]
Radford, C.A.; Stanley, J.A.; Tindle, C.T.; Montgomery, J.C.; Jeffs, A.G. Localised coastal habitats have distinct underwater sound signatures. Mar. Ecol. Prog. Ser. 2010, 401, 21–29. [Google Scholar] [CrossRef]
Benson, B.; Li, Y.; Faunce, B.; Domond, K.; Kimball, D.; Schurgers, C.; Kastner, R. Design of a Low-Cost Underwater Acoustic Modem. IEEE Embed. Syst. Lett. 2010, 2, 58–61. [Google Scholar] [CrossRef] [Green Version]
Renner, B.C.; Heitmann, J.; Steinmetz, F. Ahoi: Inexpensive, Low-Power Communication and Localization for Underwater Sensor Networks and μAUVs. ACM Trans. Sen. Netw. 2020, 16. [Google Scholar] [CrossRef] [Green Version]
Schott, D.J.; Faisal, M.; Höeflinger, F.; Reindl, L.M.; Bordoy Andreú, J.; Schindelhauer, C. Underwater localization utilizing a modified acoustic indoor tracking system. In Proceedings of the 2017 IEEE 7th International Conference on Underwater System Technology: Theory and Applications (USYS), Kuala Lumpur, Malaysia, 18–20 December 2017; pp. 1–5. [Google Scholar] [CrossRef]
Khyam, M.O.; Xinde, L.; Ge, S.S.; Pickering, M.R. Multiple Access Chirp-Based Ultrasonic Positioning. IEEE Trans. Instrum. Meas. 2017, 66, 3126–3137. [Google Scholar] [CrossRef]
Renner, C.; Golkowski, A.J. Acoustic Modem for Micro AUVs: Design and Practical Evaluation. In Proceedings of the 11th ACM International Conference on Underwater Networks & Systems, Shanghai, China, 24–26 October 2016; Association for Computing Machinery: New York, NY, USA, 2016. [Google Scholar] [CrossRef]
Zhao, Z.; Zhao, A.; Hui, J.; Hou, B.; Sotudeh, R.; Niu, F. A Frequency-Domain Adaptive Matched Filter for Active Sonar Detection. Sensors 2017, 17, 1565. [Google Scholar] [CrossRef] [Green Version]
Bernard, C.; Bouvet, P.J.; Pottier, A.; Forjonel, P. Multiuser Chirp Spread Spectrum Transmission in an Underwater Acoustic Channel Applied to an AUV Fleet. Sensors 2020, 20, 1527. [Google Scholar] [CrossRef] [Green Version]
Danckaers, A.; Seto, M.L. Transmission of images by unmanned underwater vehicles. Auton. Robot. 2020, 44, 24–44. [Google Scholar] [CrossRef]
Syed, A.A. Understanding and Exploiting the Acoustic Propagation Delay in Underwater Sensor Networks. Ph.D. Thesis, Faculty of the Graduate School, Los Angeles, CA, USA, 2009. [Google Scholar]
Fattah, S.; Gani, A.; Ahmedy, I.; Idris, M.Y.I.; Targio Hashem, I.A. A Survey on Underwater Wireless Sensor Networks: Requirements, Taxonomy, Recent Advances, and Open Research Challenges. Sensors 2020, 20, 5393. [Google Scholar] [CrossRef]
Senter, P. Voices of the past: A review of Paleozoic and Mesozoic animal sounds. Hist. Biol. 2008, 20, 255–287. [Google Scholar] [CrossRef] [Green Version]
Hüttmann, E. Distance Measurement Method. DE 768068C, 10 June 1955. [Google Scholar]
Price, R.; Turin, G. Communication and radar—Section A. IEEE Trans. Inf. Theory 1963, 9, 240–246. [Google Scholar] [CrossRef]
Cook, C.E. Linear FM Signal Formats for Beacon and Communication Systems. IEEE Trans. Aerosp. Electron. Syst. 1974, AES-10, 471–478. [Google Scholar] [CrossRef]
Darlington, S. Pulse Transmission. U.S. Patent 2678997A, 18 May 1954. [Google Scholar]
Burnsweig, J.; Wooldridge, J. Ranging and Data Transmission Using Digital Encoded FM-“Chirp” Surface Acoustic Wave Filters. IEEE Trans. Microw. Theory Tech. 1973, 21, 272–279. [Google Scholar] [CrossRef]
Kim, J.; Pratt, T.; Ha, T. Coded multiple chirp spread spectrum system and overlay service. In The Twentieth Southeastern Symposium on System Theory; IEEE Computer Society: Los Alamitos, CA, USA, 1988; pp. 336–341. [Google Scholar] [CrossRef]
TCo. SX1272/3/6/7/8: LoRa Modem Designer’s Guide, Technical Report AN1200.13; Semtech Corporation: Camarillo, CA, USA, 2013; Revision 1. [Google Scholar]
Ferré, G.; Giremus, A. LoRa Physical Layer Principle and Performance Analysis. In Proceedings of the 2018 25th IEEE International Conference on Electronics, Circuits and Systems (ICECS), Bordeaux, France, 9–12 December 2018; pp. 65–68. [Google Scholar] [CrossRef] [Green Version]
Schott, D.J.; Faisal, M.; Höflinger, F.; Reindl, L.M. A Multichannel Acoustic Chirp-Spread Modulation Approach Towards Diver-to-Diver Communication. In Proceedings of the 2018 15th International Multi-Conference on Systems, Signals Devices (SSD), Yasmine Hammamet, Tunisia, 19–22 March 2018; pp. 475–479. [Google Scholar] [CrossRef]
Kaminsky, E.J.; Simanjuntak, L. Chirp slope keying for underwater communications. In Sensors, and Command, Control, Communications, and Intelligence (C3I) Technologies for Homeland Security and Homeland Defense IV; Carapezza, E.M., Ed.; International Society for Optics and Photonics, SPIE: Bellingham, WA, USA, 2005; Volume 5778, pp. 894–905. [Google Scholar] [CrossRef] [Green Version]
Lee, H.; Kim, T.H.; Choi, J.W.; Choi, S. Chirp signal-based aerial acoustic communication for smart devices. In Proceedings of the 2015 IEEE Conference on Computer Communications (INFOCOM), Hong Kong, China, 26 April–1 May 2015; pp. 2407–2415. [Google Scholar] [CrossRef]
Kebkal, K.G.; Bannasch, R. Sweep-spread carrier for underwater communication over acoustic channels with strong multipath propagation. J. Acoust. Soc. Am. 2002, 112, 2043–2052. [Google Scholar] [CrossRef]
He, C.; Huang, J.; Zhang, Q.; Lei, K. Reliable Mobile Underwater Wireless Communication Using Wideband Chirp Signal. In Proceedings of the 2009 WRI International Conference on Communications and Mobile Computing, Kunming, China, 6–8 January 2009; Volume 1, pp. 146–150. [Google Scholar] [CrossRef]
Demirors, E.; Melodia, T. Chirp-Based LPD/LPI Underwater Acoustic Communications with Code-Time-Frequency Multidimensional Spreading. In Proceedings of the 11th ACM International Conference on Underwater Networks & Systems, Shanghai, China, 24–26 October 2016; Association for Computing Machinery: New York, NY, USA, 2016. [Google Scholar] [CrossRef] [Green Version]
Yuan, F.; Wei, Q.; Cheng, E. Multiuser chirp modulation for underwater acoustic channel based on VTRM. Int. J. Nav. Archit. Ocean. Eng. 2017, 9, 256–265. [Google Scholar] [CrossRef] [Green Version]
Diamant, R. Robust Interference Cancellation of Chirp and CW Signals for Underwater Acoustics Applications. IEEE Access 2018, 6, 4405–4415. [Google Scholar] [CrossRef]
Lee, J.; An, J.; Ra, H.I.; Kim, K. Long-Range Acoustic Communication Using Differential Chirp Spread Spectrum. Appl. Sci. 2020, 10, 8835. [Google Scholar] [CrossRef]
Proakis, J.G.; Salehi, M. Elements of a Digital Communication System. In Digital Communications; Chapter 1.1; McGraw Hill: New York, NY, USA, 2008; p. 2. [Google Scholar]
Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef] [Green Version]
Behar, V.; Adam, D. Parameter optimization of pulse compression in ultrasound imaging systems with coded excitation. Ultrasonics 2004, 42, 1101–1109. [Google Scholar] [CrossRef]
Proakis, J.G.; Manolakis, D.G. Design of Linear-Phase FIR Filters Using Windows. In Digital Signal Processing; Chapter 10.2.2; Pearson Prentice Hall: Upper Saddle River, NJ, USA, 2008; pp. 664–670. [Google Scholar]
Cariow, A.; Paplinski, J.P. Some Algorithms for Computing Short-Length Linear Convolution. Electronics 2020, 9, 2115. [Google Scholar] [CrossRef]
Aguilera, T.; Álvarez, F.J.; Paredes, J.A.; Moreno, J.A. Doppler compensation algorithm for chirp-based acoustic local positioning systems. Digit. Signal Process. 2020, 100, 102704. [Google Scholar] [CrossRef]
Hoeflinger, F.; Hoppe, J.; Zhang, R.; Ens, A.; Reindl, L.; Wendeberg, J.; Schindelhauer, C. Acoustic indoor-localization system for smart phones. In Proceedings of the 2014 IEEE 11th International Multi-Conference on Systems, Signals Devices (SSD14), Barcelona, Spain, 11–14 February 2014; pp. 1–4. [Google Scholar] [CrossRef]
Chapuis, L.; Collin, S.P.; Yopak, K.E.; McCauley, R.D.; Kempster, R.M.; Ryan, L.A.; Schmidt, C.; Kerr, C.C.; Gennari, E.; Egeberg, C.A.; et al. The effect of underwater sounds on shark behaviour. Sci. Rep. 2019, 9, 6924. [Google Scholar] [CrossRef] [Green Version]
Ahn, J.; Lee, H.; Kim, Y.; Lee, S.; Chung, J. Mimicking dolphin whistles with continuously varying carrier frequency modulation for covert underwater acoustic communication. Jpn. J. Appl. Phys. 2019, 58, SGGF05. [Google Scholar] [CrossRef]
Ketten, D.R. The Marine Mammal Ear: Specializations for Aquatic Audition and Echolocation. In The Evolutionary Biology of Hearing; Webster, D.B., Popper, A.N., Fay, R.R., Eds.; Springer: New York, NY, USA, 1992; pp. 717–750. [Google Scholar] [CrossRef]
Ketten, D.R. Functional analyses of whale ears: Adaptations for underwater hearing. In Proceedings of the OCEANS’94, Brest, France, 13–16 September 1994; Volume 1, pp. I/264–I/270. [Google Scholar] [CrossRef]
Houser, D.S.; Finneran, J.J. A comparison of underwater hearing sensitivity in bottlenose dolphins (Tursiops truncatus) determined by electrophysiological and behavioral methods. J. Acoust. Soc. Am. 2006, 120, 1713–1722. [Google Scholar] [CrossRef] [PubMed]
Hemilä, S.; Nummela, S.; Reuter, T. Anatomy and physics of the exceptional sensitivity of dolphin hearing (Odontoceti: Cetacea). J. Comp. Physiol. A 2010, 196, 165–179. [Google Scholar] [CrossRef] [PubMed]
Nummela, S.; Thewissen, J.; Bajpai, S.; Hussain, T.; Kumar, K. Sound transmission in archaic and modern whales: Anatomical adaptations for underwater hearing. Anat. Rec. 2007, 290, 716–733. [Google Scholar] [CrossRef]
Boisvert, C.A.; Johnston, P.; Trinajstic, K.; Johanson, Z. Chondrichthyan Evolution, Diversity, and Senses. In Heads, Jaws, and Muscles: Anatomical, Functional, and Developmental Diversity in Chordate Evolution; Ziermann, J.M., Diaz, R.E., Jr., Diogo, R., Eds.; Springer International Publishing: Cham, Switzerland, 2019; pp. 65–91. [Google Scholar] [CrossRef]
Kastelein, R.A.; Wensveen, P.; Hoek, L.; Terhune, J.M. Underwater hearing sensitivity of harbor seals (Phoca vitulina) for narrow noise bands between 0.2 and 80 kHz. J. Acoust. Soc. Am. 2009, 126, 476–483. [Google Scholar] [CrossRef]
Stojanovic, M.; Preisig, J. Underwater acoustic communication channels: Propagation models and statistical characterization. IEEE Commun. Mag. 2009, 47, 84–89. [Google Scholar] [CrossRef]
Ferreira Dias, C.; Rodrigues de Lima, E.; Fraidenraich, G. Bit Error Rate Closed-Form Expressions for LoRa Systems under Nakagami and Rice Fading Channels. Sensors 2019, 19, 4412. [Google Scholar] [CrossRef] [Green Version]
Proakis, J.G.; Salehi, M. Deterministic and Random Signal Analysis. In Digital Communications; Chapter 2; McGraw Hill: New York, NY, USA, 2008; p. 42. [Google Scholar]
Elshabrawy, T.; Robert, J. Closed-Form Approximation of LoRa Modulation BER Performance. IEEE Commun. Lett. 2018, 22, 1778–1781. [Google Scholar] [CrossRef]
National Marine Electronics Association. NMEA 2000(c) Interface Standard. Available online: https://www.nmea.org/content/STANDARDS/NMEA_2000 (accessed on 8 May 2021).

Figure 1. Flow diagram of the basic communication chain: The data d is modulated, amplified before transmission, filtered and amplified at reception, and demodulated as

d_{est}

. The entire analog domain is regarded as part of the communication channel.

Figure 1. Flow diagram of the basic communication chain: The data d is modulated, amplified before transmission, filtered and amplified at reception, and demodulated as

d_{est}

. The entire analog domain is regarded as part of the communication channel.

Figure 2. Modulator block in detail: The data input d is mapped onto

N_{lo}

sub-bands through a multiplexer (MUX) and modulated by the up-converted chirped symbols from the DUC. The transmission sequence

y_{tx}

is assembled by the CSK block, already in the transmission band. Simple arrow lines indicate vectors, double lines arrays.

Figure 2. Modulator block in detail: The data input d is mapped onto

N_{lo}

sub-bands through a multiplexer (MUX) and modulated by the up-converted chirped symbols from the DUC. The transmission sequence

y_{tx}

is assembled by the CSK block, already in the transmission band. Simple arrow lines indicate vectors, double lines arrays.

Figure 3. Resampling example for a linear chirp with

f_{c} = 3 kHz

,

B_{f} = 5 kHz

, and

T = 10 ms

. leftmost: Base band signal

y_{tb}

at the transmitter, center left: Transmission band

y_{ib}

, center right: Undersampled signal on reception, rightmost: Down-converted base band signal

y_{bb}

at the receiver where the originally transmitted signal is overlayed in gray. The transmission band’s center frequency is at

67.5

kHz

. In the experiments, the intermediate band on reception is around

20.5

kHz

due to undersampling with only

f_{s} = 88 kHz

. Note the changed frequency scale for the Bode plots in the two columns on the right.

Figure 3. Resampling example for a linear chirp with

f_{c} = 3 kHz

,

B_{f} = 5 kHz

, and

T = 10 ms

. leftmost: Base band signal

y_{tb}

at the transmitter, center left: Transmission band

y_{ib}

, center right: Undersampled signal on reception, rightmost: Down-converted base band signal

y_{bb}

at the receiver where the originally transmitted signal is overlayed in gray. The transmission band’s center frequency is at

67.5

kHz

. In the experiments, the intermediate band on reception is around

20.5

kHz

due to undersampling with only

f_{s} = 88 kHz

. Note the changed frequency scale for the Bode plots in the two columns on the right.

Figure 4. Autocorrelation magnitude comparison of a selection of shaping window functions. All magnitudes are normalized by the Dirichlet shaped chirp power for comparison. Gaussian noise was added to a signal-to-noise ratio

S N R = 0 dB

, as well as two echoes at

n = 9

and

n = 19

. To the left the spatial resolution and peak power is higher, to the right the inter-signal interference and spectral leakage are reduced.

Figure 4. Autocorrelation magnitude comparison of a selection of shaping window functions. All magnitudes are normalized by the Dirichlet shaped chirp power for comparison. Gaussian noise was added to a signal-to-noise ratio

S N R = 0 dB

, as well as two echoes at

n = 9

and

n = 19

. To the left the spatial resolution and peak power is higher, to the right the inter-signal interference and spectral leakage are reduced.

Figure 5. Simulated spectrograms of the autocorrelations of a selection of shaping window functions. All magnitudes are normalized by the Dirichlet shaped chirp power for comparison. Gaussian noise was added to a signal-to-noise ratio of 0

d

B

, as well as two echoes at

n = 9

and

n = 19

.

Figure 5. Simulated spectrograms of the autocorrelations of a selection of shaping window functions. All magnitudes are normalized by the Dirichlet shaped chirp power for comparison. Gaussian noise was added to a signal-to-noise ratio of 0

d

B

, as well as two echoes at

n = 9

and

n = 19

.

Figure 6. Comparison of the conventional and proposed demodulation as a block diagram in detail. (a) In the former case, the received sequence y_rx is processed by Digital Down-Coverter (DDC), compressed through a Fast Hilbert Cross-Correlator (FHX), converted into symbol space through Frame Detect & Downsample (FDDS), which is interpreted by a binary decision (Decide) block, and finally assembled into the estimated data output d_est through a reverting multiplexer (De-MUX). (b) We propose the insertion of a superposition in the compressed sample space through the Join & Downsample (JDS) block that creates a sum signal for symbol timing and a difference signal for symbol extraction.

Figure 7. Schematic experimental set-up in for the acoustic transmission inside a water tank. The tank is filled with fresh water and located inside a closed building. A comparable scenario would be two divers working on a ship’s hull or an UAV inspecting a lake harbor’s foundations.

Figure 8. Spectrograms showing the intermediate frequency over time of parts of the signal to illustrate the effects of the channel and undersampling. (a) Single band signal after up-conversion before transmission; (b) Multi-band signal after up-conversion before transmission; (c) Single band after reception before down-conversion; (d) Multi-band after reception before down-conversion. Each package is transmitted three times in the experiment.

Figure 9. Averaged spectral power plots of the raw received signals. (a) Single band communication; (b) Dual-band communication. The colored area marks the ± 1 σ region of each frequency bin.

Figure 10. Time domain plot of the detected frames (red) and estimated symbols (blue dots). (a) Single band communication; (b) Dual-band communication. The symbol difference is not optimally detected, as the amplitude of the signal exceeds the amplitude of the estimated symbols. The histograms to the right of each time plot are normalized by the total number of samples (red) and symbols (blue) in each frame.

Figure 11. Plots of the simulated bit error rate (BER) and packet error rate (PER) for both single band (a) and dual-band transmission (b). The markers indicate each simulated SNR condition, the lines are manually fitted curves.

Table 1. Transmission Band Parameters.

Parameter	Value	Description
$f_{c}$	67.5 kHz	Center frequency
$\hat{B}$	5.0 kHz	Maximal available bandwidth
$f_{s}$	88.0 kHz	Receiver sampling frequency
$N_{res}$	4	Resampling factor after down-mixing
$N_{res 2}$	2	Resampling factor after signal merge
$f_{s 1}$	22 kHz	Sampling frequency after 1st downsampling
$f_{s 2}$	11 kHz	Sampling frequency after 2nd downsampling

Table 2. Experiment Waveform Parameters.

Parameter	Value		Description
	Single	Dual
N	96	64	Transmitted bits
$N_{lo}$	1	2	Number of sub-channels
$N_{frm}$	3	3	Number of packages sent
B	5.0 kHz	2.50 kHz	Bandwidth per channel
T	10 ms	10 ms	Length of a single chirp in time
$f_{c}$	67.5 kHz	[66.25, 68.75] kHz	Frequency offset to band center
$T B$	50	25	Time-bandwidth product

Table 3. Single band BER fit coefficients.

q		$- 1$	0	1	2
$r_{BE}$	$A_{q}$	0.95	$- 0.85$	0.8	0.5
$r_{BE}$	$B_{q}$	26	50	22	140
$r_{BExc}$	$A_{q}$	1.3	$- 0.5$	0.8	0.02
$r_{BExc}$	$B_{q}$	27	50	22	140
$r_{PE}$	$A_{q}$	0	24	2.1	2.0
$r_{PE}$	$B_{q}$	0	50	25	140

Table 4. Dual-band fit coefficients.

q		$- 1$	0	1	2
$r_{BE}$	$A_{q}$	0.60	$- 0.65$	0.65	0.65
$r_{BE}$	$B_{q}$	9.5	50	6.8	43
$r_{BExc}$	$A_{q}$	1.00	$- 0.65$	0.15	1.00
$r_{BExc}$	$B_{q}$	10	22	10	65
$r_{PE}$	$A_{q}$	4.0	6.0	1.8	1.2
$r_{PE}$	$B_{q}$	25	13	7.1	43

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Schott, D.J.; Gabbrielli, A.; Xiong, W.; Fischer, G.; Höflinger, F.; Wendeberg, J.; Schindelhauer, C.; Rupitsch, S.J. Asynchronous Chirp Slope Keying for Underwater Acoustic Communication. Sensors 2021, 21, 3282. https://doi.org/10.3390/s21093282

AMA Style

Schott DJ, Gabbrielli A, Xiong W, Fischer G, Höflinger F, Wendeberg J, Schindelhauer C, Rupitsch SJ. Asynchronous Chirp Slope Keying for Underwater Acoustic Communication. Sensors. 2021; 21(9):3282. https://doi.org/10.3390/s21093282

Chicago/Turabian Style

Schott, Dominik Jan, Andrea Gabbrielli, Wenxin Xiong, Georg Fischer, Fabian Höflinger, Johannes Wendeberg, Christian Schindelhauer, and Stefan Johann Rupitsch. 2021. "Asynchronous Chirp Slope Keying for Underwater Acoustic Communication" Sensors 21, no. 9: 3282. https://doi.org/10.3390/s21093282

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Asynchronous Chirp Slope Keying for Underwater Acoustic Communication

Abstract

1. Introduction

1.1. Historical Overview

1.2. Research Problem

1.3. Related Work

2. Materials and Methods

2.1. Basic System Structure

2.2. Modulation

2.2.1. Linear Chirp Creation

2.2.2. Shaping

2.2.3. Input Multiplexing

2.2.4. Chirp Slope Keying

2.3. Channel Model

2.4. Demodulation

2.4.1. Digital Down-Converter

2.4.2. Pulse Compression by Fast Hilbert Cross-Correlation

2.4.3. Join & Downsample

2.4.4. Frame Detect & Downsample

2.4.5. Symbol Decision

2.4.6. De-Multiplexing

2.5. Experimental Set-Up

2.5.1. Frequency Band Considerations

2.5.2. Experiment Parameters

3. Results

Channel Frequency Response

4. Bit Error Rate and Packet Error Rate Simulations

5. Discussion

6. Conclusions and Future Works

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Abbreviations

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI