1 Introduction

Underwater communications have attracted great research interest and are expected to enable a wide range of internet of underwater things (IoUT)s based applications such as pollution monitoring, oceanographic data collection, disaster prevention, tactical surveillance, underwater exploration, and port security [1,2,3,4,5]. The practical realization of these cooperative IoUT applications depends on several factors such as reliability, real-time long-range communication, throughput, security, etc. [1,2,3,4,5,6,7,8,9,10,11], and references therein. The peculiar nature of the underwater channel makes it very difficult to satisfy the above-mentioned IoUT application requirements [12, 13].

The underwater communication poses adverse channel conditions due to high attenuation, a large number of multipath, variable propagation delay spread, and susceptibility to Doppler shift between the source and the destination [5]. It is also very difficult to maintain line of sight (LOS) communication which results in significant signal attenuation [12, 13]. Furthermore, the inter-symbol interference (ISI) span is very different in shallow and deep water [14]. A possible solution to address underwater propagation channel challenges is to increase the transmit power and/or a high complexity transceiver design [15, 16]. However, this is not a feasible option due to limited available power at underwater communicating nodes [17]. To address above mentioned issues, a multi-hop network with the advantages of cooperative communication is an essential requirement for reliable communication over long distances [7]. Cooperative communication allows multiple sensors to form a distributed cooperative sensor network, enabling them to achieve spatial diversity which helps to save transmission power by combating the severe signal attenuation encountered over long distances [18].

Several works have highlighted the advantages of cooperative communication in IoUTs. In [19], to alter the multi-path effect at the receiver, an amplify-and-forward (AF) based cooperative transmission scheme has been proposed. Particularly, a relay node amplifies the incoming acoustic signal to provide additional multi-paths to the receiver. However, in this work, only a single relay node is considered. In [20], an energy-efficient cooperative acoustic communication protocol to improve network lifetime, end-to-end delay, and reduce energy consumption has been proposed. The algorithm selects the relay and the destination nodes based on the signal-to-noise ratio (SNR) and distance among the communicating nodes. However, the proposed scheme requires SNR calculation at each link which results in high computational complexity. In [7], proposed a dynamic cooperative communication scheme for underwater acoustic networks where the cooperative node transmits blocks with limited redundancy to improve transmission efficiency and to reduce end-to-end delay. In [21], the authors proposed a cooperative communication-based solution where the cluster nodes are selected using a K-mean algorithm and cluster heads are selected using conditional probability to combat the packet loss problem and to improve the lifetime of IoUTs.

Despite the advantages of employing cooperative communication in IoUTs, the main challenge is to resolve the number of multipath [22, 23]. This problem is further exacerbated by a large number of relays that act as sensors in IoUTs. At the receiver, this increases the complexity of the time-domain equalization (TDE) as additional RAKE fingers are required to resolve these multipath components [22, 24, 25]. As a result, the TDE for IoUTs becomes unattractive as compared to the frequency-domain methods [26,27,28,29,30]. Frequency-domain methods employ the advantages of the Fast Fourier Transform (FFT) algorithm such as orthogonal frequency division multiplexing (OFDM) or frequency domain equalization (FDE) [26,27,28,29,30]. In [28], authors presented the advantages of FDE with a single carrier (SC) for a multiple-input multiple-output (MIMO) underwater communication system. In [29], authors investigated the performance of FDE and the hybrid time-frequency-domain equalizer (HTFDE) with SC for synthetic aperture underwater acoustic communications. In [10], authors investigated the optimal relay selection and power control in an OFDM-based cooperative communication. When compared to OFDM, the SC-FDE can achieve a similar bit error rate (BER) performance, but with a lower peak-to-average power ratio (PAPR). Furthermore, FFT is carried out at the receiver side in SC-FDE, therefore, the transmitter used in IoUTs will have a very basic structure [26].

It is highlighted that in existing literature FDE and AF relaying have only been discussed separately. Therefore, the key novelty of this work is to analyze and evaluate the performance of combined SC-FDE and cooperative communication with AF protocol. This paper addresses the problem of reliable communication between the source and the destination in an IoUTs scenario. Particularly, the data is relayed through the intermediate sensory nodes that are placed between the source and the destination. As the main motivation is to keep the design of the transmitter and sensors simple, only a low-complexity AF cooperative diversity protocol that relies on noise variance is considered. In this work, it is assumed that the sensor node would only amplify the received signal and transmit it whereas channel estimation is only performed at the destination node. For AF networks, there exist two choices for estimating the weights, (1) the source-sensor and sensor-destination weights are estimated separately, or (2) the overall source-sensor-destination weight is estimated at the destination. The latter approach achieves higher spectral efficiency and therefore has been adopted in this work. Furthermore, to reduce the computational complexity FDE is implemented that will exploit the diagonal structure of the channel matrices.

The main novelties of the present work are as follows:

  1. 1.

    The extension of the SC-FDE using AF relaying for IoUTs with a goal to provide a simplified and low-complexity multi-hop underwater communication architecture.

  2. 2.

    The proposal of adaptive implementation of FDE to combat the difficulty faced in acquiring channel knowledge in underwater communications. Particularly, two types of adaptive detectors based on the principles of least mean square (LMS) and recursive least square (RLS) are proposed in this work. Furthermore, the considered FFT structure for the received data only requires diagonal elements that reduce the computational complexity significantly as compared to the available LMS and RLS algorithms in the literature.

The simulation results show that the SC-FDE by using AF relaying for IoUTs, which has lower complexity, wherein simplicity of transmitter and sensors is maintained. In addition the simulations results, shown that the adaptive performance of these two detectors is clsoe to the ideal minimum mean square error (MMSE) detector.

The rest of the paper is organized as: Sect. 2 describes the system model for SC-FDE underwater cooperative communication systems using the AF protocol. The structure of the transmitter, sensor and receiver is discussed in detail in this section. In Sect. 3, the optimal and MMSE detectors are discussed. In Sect. 4, LMS and RLS adaptive detectors are discussed at length. The difference between these detectors and the LMS and RLS algorithms in the literature is also highlighted. The simulation results are provided in Sect. 5, and finally conclusions are given in Sect. 6.

Throughout this paper, the following notations are used: Bold upper-case letters denote matrices, while bold lower-case letters denote vectors. For arbitrary matrices, \(A, A^*, A^T, A^H\) and \(A^{-1}\) denote the complex, conjugate, transpose, Hermitian and inverse of the matrix A, respectively. \(E [\cdot ]\) denotes expectation, \(diag(\cdot )\) stands for diagonal matrix, \(|\cdot |\) denotes the absolute value and \(Tr (\cdot )\) represents the trace of a matrix.

2 System Model

We consider the scenario shown in Fig. 1, where a source (S) sends relevant data to the destination (D) with the help of U sensors, denoted by \({R}_1,{R}_2,\dots {R}_{U}\), respectively. The sensors are assumed to possess the capability of relaying data therefore, the terms relay and sensor are used interchangeably. The source is assumed to be far from the destination and therefore no direct communication path exists between the source and the destination. The data transmission is assumed to occur over two separate phases. In phase I, the source broadcasts data to the sensors; while in phase II, the sensors amplify-and-forward the data to the destination.

Fig. 1
figure 1

Data acquisition scheme of a underwater cooperative communication system with the assistance of U AF-based sensors

2.1 Underwater Channel Model

The sound propagation speed is one of the most impactful variables on the performance of IoUTs [11, 22]. The propagation speed of terrestrial radio networks can by contrast reach speeds of \(3 \times 10^8\) m/s. Thus, its underwater counterpart is slower by a factor of \(2 \times 10^5\) m/s respectively [22]. Moreover, the complexity and spatial variability of the underwater physical environment create difficulties for IoUTs channels.

The underwater acoustic channel has been shown to be composed of several distinct paths referred to as the eigenpaths [11, 22]. These eigenpaths are composed of a dominant and stable component as well as several smaller random components. The latter are also known as eigenarrays. Due to these characteristics a modified Saleh Valenzuela (SV) model was proposed for an underwater acoustic channel [11, 22, 31]. Based on this model, the channel between any two nodes is expressed as

$$\begin{aligned} ~ h^{k}_{u}(t)= & {} \sum _{a=0}^{\alpha - 1} \sum _{b=0}^{\beta - 1} q^{k}_{u,b,a}\delta (t - T^{k}_{u,b,a} - \tau ^{k}_{u,b,a}) \nonumber \\= & {} \sum _{l=0}^{L - 1} q^{k}_{u,l}\delta (t - \tau ^{k}_{u,l}), \end{aligned}$$
(1)

where \(k\in \{SR,RD\}\) denotes the source-relay and relay-destination hop, respectively, \(\alpha\) denotes the number of eigenpaths and \(\beta\) represents the number of eigenarrays for each eigenpath. For the b-th eigenray of the a-th eigenpath of the u-th relay in the k-th hop, \(q^{k}_{u,b,a}\) represents the fading coefficient, \(T^{k}_{u,b,a}\) is the arrival time of the a-th path/cluster, \(\tau ^{k}_{u,b,a}\) is the delay of the b-th ray in the a-th path/cluster, respectively. Equivalently, for the l-th eigenray of the u-th relay in the k-th hop \(\tau ^{k}_{u,l}\) represents the delay of the l-th ray, \(q^{k}_{u,l}\) denotes the fading coefficient, respectively, and \(L = \alpha \beta\) denotes the number of possible resolvable multipaths.

The distribution of the cluster arrival time and the ray arrival time are modeled by a Poisson distribution and are calculated through the following [11, 22]

$$\begin{aligned} ~ p(T^{k}_{u,b,a}|T^{k}_{u,b,a-1})= & {} \varLambda \exp [-\varLambda (T^{k}_{u,b,a}-T^{k}_{u,b,a-1})],\quad a>0 \end{aligned}$$
(2)
$$\begin{aligned} p(\tau ^{k}_{u,b,a}|\tau ^{k}_{u,b-1,a})= & {} \lambda \exp [-\lambda (\tau ^{k}_{u,b,a}-\tau ^{k}_{u,b-1,a})], \quad b>0 \end{aligned}$$
(3)

where \(\varLambda\) is the arrival rate of the cluster and \(\lambda\) is the arrival rate of the ray within each cluster. Both arrival rates are assumed to be same for each hop. Typically, the underwater acoustic channel is quasi-static [32, 33]. This means that the fading coefficient, \(q^{k}_{u,l}\), and the delay, \(\tau ^{k}_{u,l}\), remain the same during one transmission burst, but then may change between bursts. The amplitude of the fading coefficient is assumed to follow independent Nakagami-m distribution. The Nakagami-m distribution is a generalized model using which various propagation scenarios can be modeled by the changing of the value of parameter m [34, 35]. The probability density function (PDF) of Nakagami-m distribution is given as

$$\begin{aligned} ~ p_{|q^{k}_{u,l}|}(r) = \frac{2m_{l}^{m_l}r^{2m_l-1}}{\varGamma (m_{l})\varOmega _{l}^{m_l}} \exp \left( -\frac{m_{l}}{\varOmega _{l}} \right) r^2,\quad r > 0, \end{aligned}$$
(4)

where \(\varGamma (\cdot )\) is the gamma function, \(m_l\) is the fading parameter corresponding to the l-th multipath component and the parameter \(\varOmega _l=E[|h_{l}|^2] = 1\). It is assumed that the \(m_l\) and \(\varOmega _l\) are the same for each hop [35]. Furthermore, it is assumed that the fading channel-induced phase rotation is uniformly distributed in \([0,2 \pi ]\) [36].

2.2 Data Transmission

Figure 2 shows the block diagram of the cooperative communication for IoUTs. The communication occurring in each phase is discussed in detail below:

Fig. 2
figure 2

Block diagram of a cooperative communication for IoUTs

2.3 Phase-I: Transmission from Source

The source desires to transmit the data vector \({\pmb {x}}=[x(0),x(1),\cdots , x(N-1)]^T\) of length N where x(i) denotes the i-th data symbol chosen from a M-ary constellation and each symbol is assumed to be equally likely. In order to ensure circular convolution between the channel and transmitted data, so that the inter-block interference is removed through FFT-based demodulation at the receiver, a cyclic prefix (CP) of length L is appended to the data vector \({\pmb {x}}\). This CP is required to have a length greater than the maximum delay profile of the channel [30]. The transmitted data vector with the CP is represented as

$$\begin{aligned} ~ \pmb {x}_{CP}= [ \underbrace{x(N-L+1),\dots ,x(N-1)}_{CP} \underbrace{x(0),x(1),\dots ,x(N-1)}_{\pmb {x}}]^T. \end{aligned}$$
(5)

The data vector \({\pmb {x}}_{CP}\) is converted from parallel-to-serial (P/S) prior to transmission.

2.4 Phase-II: Transmission from the Relays to the Destination

The signal transmitted from the source is received at the u-th relay after being convolved with the respective channel, \(h_u^{SR}(t)\), where \(u=\{1,2,...U\}\). Each relay first converts the signal from serial-to-parallel (S/P) and discards the first \(({L - 1})\) CP samples. The signal after the CP has been removed at the u-th relay, is expressed as

$$\begin{aligned} ~ {\pmb {y}}_{u} = {\pmb {H}}_{u}{\pmb {x}} + {\pmb {n}}_{u}, \end{aligned}$$
(6)

where \({\pmb {n}}_u=[n_u(0),n_u(1),\cdots ,n_u(N-1)]^T\) denotes the noise vector at the u-th relay, \(n_u(i)\) is modeled as complex additive white Gaussian noise (AWGN) with mean zero and variance \(\sigma ^{2}_{n_{u}}/2\) per dimension and \({\pmb {H}}_u\) is a \(N \times N\) channel matrix for the source to the u-th relay and is given in (7).

$$\begin{aligned} {\pmb {H}} _u=\left[ \begin{array}{cccccccc} h^{SR}_u(0)&{}0&{}\dots &{}\dots &{}0&{}h^{SR}_u(L- 1)&{}\dots &{}h^{SR}_u(1)\\ h^{SR}_u(1)&{}\ddots &{}\ddots &{}\ddots &{}\ddots &{}\ddots &{}\ddots &{}\vdots \\ \vdots &{}\ddots &{}h^{SR}_u(0)&{}\ddots &{}\ddots &{}\ddots &{}\ddots &{}h^{SR}_u(L-1)\\ h^{SR}_u(L-1)&{}\vdots &{}h^{SR}_u(1)&{}\ddots &{}\ddots &{}\ddots &{}\ddots &{}0\\ 0&{}\ddots &{}\vdots &{}\ddots &{}\ddots &{}\ddots &{}\ddots &{}\vdots \\ \vdots &{}\ddots &{}h^{SR}_u(L-1)&{}\vdots &{}\ddots &{}h^{SR}_u(0)&{}\ddots &{}\vdots \\ \vdots &{}\ddots &{}\ddots &{}\ddots &{}\vdots &{}h^{SR}_u(1)&{}\ddots &{}0\\ 0&{}\dots &{}\dots &{}0&{}\ddots &{}\ddots &{}0&{}h^{SR}_u(0) \end{array}\right] . \end{aligned}$$
(7)

It then amplifies the remaining samples with a fixed gain \(\zeta _{u}\), adds another CP and forwards the signal to the destination after P/S conversion. Without a loss of generality, the CP length at the transmitter and the relay/sensor are assumed to be same and is selected based on the channel with the maximum delay profile. As the relay node is assumed to have a low complexity, fixed gain amplifying is performed at the relay. The fixed gain \(\zeta _{u}\) is expressed as

$$\begin{aligned} ~ \zeta _{u} = \sqrt{\frac{1}{ \sigma _{h^{SR}_{u}}^{2} + \sigma _{n_{u}}^{2}}}. \end{aligned}$$
(8)

This fixed-gain ensures the maintenance of the average or long-term power constraint. However, it also allows the instantaneous transmit power to be much larger than the average when necessary [18]. The amplified received signal at the u-th relay after the appending the CP is given as

$$\begin{aligned} &~ {\pmb {y}}_{u,CP}= \zeta _u \left[ \underbrace{y_u(N-L+1),\dots ,y_u(N-1)}_{CP_u}\right. \\&\quad \left. \underbrace{y_u(0),y_u(1),\dots ,y_u(N-1)}_{\pmb {y}_u}\right] ^T. \end{aligned}$$
(9)

2.5 Receiver Structure

In Fig. 2c the receiver converts the signal for the last time to parallel and applies the FFT transform, convolves the signal with the weight matrix \(\pmb {W}\), and applies the IFFT to detect the transmitted signals. From this figure it can be observed that the transmitter only requires a buffer to store N symbols and requires a P/S convertor to carry out the transmission. Furthermore, the sensors will require S/P and P/S convertors and an amplifying and forwarding block, as well as the maximal delay profile of the channel so that an appropriate CP length can be added. Therefore, by employing an SC-FDE system, use of an inexpensive transmitter and sensors for underwater communication is possible. Each relay forwards the signal to the destination in different time slots and the signal transmitted from the u-th relay is received at the destination after being convolved with the respective channel, \(h_u^{RD}(t)\), where \(u=\{1,2,...U\}\). The destination performs S/P conversion and then discards the first \(({L - 1})\) CP samples. The received signal at the destination from the u-th relay, after removing the CP, is given as

$$\begin{aligned} ~ \pmb {r}_u\,=\, & {} \pmb {G}_{u}\pmb {y}_{u} +\pmb {n} _{D,u}, \nonumber\, \\=\, & {} \underbrace{\pmb {G}_{u}\zeta _{u}\pmb {H}_{u}}_{C{u}}\pmb {x} + \underbrace{\pmb {G}_{u}\zeta _{u}\pmb {n}_{u}+\pmb {n}_{D,u}}_{\pmb {v}_{u}} \end{aligned}$$
(10)

where \({\pmb {n}}_{D,u}=[n_{D,u}(0),n_{D,u}(1),\cdots ,n_{D,u}(N-1)]^T\) denotes the noise vector at the destination during the u-th relay transmission, \(n_{D,u}(i)\) is modeled as complex AWGN with mean zero and variance \(\sigma ^{2}_{{D}}/2\) per dimension and \({\pmb {G}}_u\) is a \(N \times N\) channel matrix for the u-th relay to destination channel and is given in (11).

$$\begin{aligned} {\pmb {G}}_u=\left[ \begin{array}{cccccccc} h^{RD}_u(0)&{}0&{}\dots &{}\dots &{}0&{}h^{RD}_u(L- 1)&{}\dots &{}h^{RD}_u(1)\\ h^{RD}_u(1)&{}\ddots &{}\ddots &{}\ddots &{}\ddots &{}\ddots &{}\ddots &{}\vdots \\ \vdots &{}\ddots &{}h^{RD}_u(0)&{}\ddots &{}\ddots &{}\ddots &{}\ddots &{}h^{RD}_u(L-1)\\ h^{RD}_u(L-1)&{}\vdots &{}h^{RD}_u(1)&{}\ddots &{}\ddots &{}\ddots &{}\ddots &{}0\\ 0&{}\ddots &{}\vdots &{}\ddots &{}\ddots &{}\ddots &{}\ddots &{}\vdots \\ \vdots &{}\ddots &{}h^{RD}_u(L-1)&{}\vdots &{}\ddots &{}h^{RD}_u(0)&{}\ddots &{}\vdots \\ \vdots &{}\ddots &{}\ddots &{}\ddots &{}\vdots &{}h^{RD}_u(1)&{}\ddots &{}0\\ 0&{}\dots &{}\dots &{}0&{}\ddots &{}\ddots &{}0&{}h^{RD}_u(0) \end{array}\right] . \end{aligned}$$
(11)

It can be noted that \({\pmb {C}}_{u}\) is a circulant matrix as it is a product of two circulant matrices. Acknowledging this property helps in determining the modified LMS and RLS adaptive algorithms.

The overall noise at the destination, \(\pmb {v}_{u}\), can be approximated as complex Gaussian noise with zero mean and covariance matrix [37, 38]

$$\begin{aligned} ~ \varSigma v_{u} = \zeta _{u}^2 {\pmb {G}}_{u}{\pmb {G}}_{u}^H\sigma _{u}^2 + \sigma _{D}^2{\pmb {I}}_N, \end{aligned}$$
(12)

where \(\pmb {I}_N\) denotes and identity matrix of size \(N\times N\). Finally, the combined received signals from all the relays can be expressed as

$$\begin{aligned} ~ {\pmb {r}}_D= & {} \sum _{u=1}^{U}{\pmb {r}}_{u}\nonumber \\= & {} \sum _{u=1}^{U}{\pmb {C}}_{u}{\pmb {x}}+\sum _{u=1}^{U}{\pmb {v}}_{u}\nonumber\, \\=\, & {} {\pmb {Dx}}+{\pmb {n}}. \end{aligned}$$
(13)

It is worth noting here that \(\mathbf{D}\) will also be circulant. This is because the summation of circulant matrices is also itself circulant. The FFT of the received signal yield can be calculated as

$$\begin{aligned} {\pmb {r}}(f)&= {\pmb {F}} {\pmb {r}}_D = {\pmb {FDx}} + {\pmb {Fn}}\nonumber \\&=\underbrace{{\pmb {FDF}}^{H}}_{\Xi }{\pmb {s}}(f)+{\pmb {Fn}}, \end{aligned}$$
(14)

where \({\pmb {F}}\) represents the FFT and \({\pmb {s}}(f)={\pmb {F}}{\pmb {x}}\) represents transmitted signal \({\pmb {x}}\) in the frequency domain. To detect the \({\pmb {s}}(f)\), a FDE is utilized at the receiver. After obtaining the estimate of \({\pmb {s}}(f)\), an inverse FFT is carried out to obtain the estimate of the transmitted signal \({\pmb {x}}\). In the following section, the design of the FDE for the detection of \(\pmb {s}(f)\) is presented.

3 Frequency Domain Detectors

3.1 Optimal Detector

The maximal likelihood (ML) detector is an optimal detector using which \(\pmb {s}(f)\) can be detected as

$$\begin{aligned} \hat{\pmb {s}}(f)= & {} \arg \min _{\pmb {s}(f)} \Vert \varSigma ^{-1/2}(\pmb {r}(f) - \Xi \pmb {s}(f))\Vert ^2\nonumber \\= & {} \arg \min _{\pmb {s}(f)} \left( \pmb {r}(f)- \Xi \pmb {s}(f)\right) ^H\varSigma ^{-1}\left( \pmb {r}(f)- \Xi \pmb {s}(f)\right) , \end{aligned}$$
(15)

where \(\varSigma = \pmb {F} (\sum _{u=1}^{U} \varSigma _{v_{u}}) \pmb {F}^H\). The process of minimization is conducted over all the possible combinations of the constellation symbols. For larger constellations, the ML Detector is considered to be intractable. Furthermore the complexity of the ML detector has been proven to be the equivalent of \(O(L^{6.5})\), wherein L is the number of relays [38]. The complexity increases in cooperative systems which include more relays, thus rendering the ML detector impractical.

Linear suboptimal detectors are employed to mitigate this issue. These detectors have a lower computational complexity than their ML counterparts.

3.2 MMSE Detector

A linear transformation is applied to the received signal \(\pmb {r}(f)\) by multiplying it by a weight matrix \(\pmb {W}^H(f)\), which minimizes the mean-square error (MSE) between the transmitted symbol vector, \(\pmb {s}(f)\), and the detected symbol vector, \(\hat{\pmb {s}}(f)\) i.e.

$$\begin{aligned} \hat{\pmb {s}}(f) = \pmb {W}^H(f)\pmb {r}(f), \end{aligned}$$
(16)

where the optimal weight matrix \(\pmb {W}(f)\) is obtained as [39]

$$\begin{aligned} \pmb {W}(f) = \arg \min _{\pmb {W}} E[\Vert \pmb {s}(f) - \pmb {W}^H(f)\pmb {r}(f)\Vert ^2], \end{aligned}$$
(17)

Equation (17) can be solved to yield the optimum weight matrix \(\pmb {W}(f)\) as

$$\begin{aligned} \pmb {W}(f) = \pmb {R}^{-1}(f)\pmb {\varPsi }(f), \end{aligned}$$
(18)

where \(\pmb {\varPsi }(f)=E[\pmb {r}(f)\pmb {s}(f)^H]\) represents the cross-correlation between \(\pmb {r}(f)\) and \(\pmb {s}(f)\), while \(\pmb {R}(f)=E[\pmb {r}(f)\pmb {r}(f)^H]\) is the auto-correlation of \(\pmb {r}(f)\). These can be represented as

$$\begin{aligned}&\pmb {\varPsi }(f)= \Xi = {\pmb {FDF}}^{H}, \end{aligned}$$
(19)
$$\begin{aligned}&\pmb {R}(f) = \Xi \Xi ^H + \varSigma = \pmb {FD} \pmb {D}^H \pmb {F}^H+\varSigma . \end{aligned}$$
(20)

Equation (18) illustrates that the MMSE detector’s complexity is determined by the calculation of inverse of matrix \(\pmb {R}(f)\). \(\pmb {R}(f)\) is a \(N \times N\) matrix, therefore the computational complexity for calculating its inverse is of \(O(N^3)\) [40]. However, with the introduction of CP and FFT, \(\pmb {\varPsi }(f)\) is now a diagonal matrix. The inversion of a diagonal matrix has a significantly lower complexity of O(N), resulting the MMSE detector having the complexity level of a maximal ratio combiner (MRC) [18]. Furthermore, it can be noted from (18), (19) and (20), that channel state information (CSI) is required to compute \(\pmb {W}(f)\). Determining this accurately is difficult in IoUTs for several reasons. Firstly, the received underwater signal is made up of numerous multipath components, and each of these components has very low energy. Secondly, for AF based relaying the received channel is a cascade model, consisting of the source to the sensor and the sensor to the destination channel. This further complicates the (CSI) estimation [41, 42]. Thirdly, the noise propagation from each cooperating relay to the destination is different, as each relay experiences different channel gain. In response to these observations, adaptive algorithms are used to establish a sub-optimal solution to the optimum weight matrix \(\pmb {W}\) in (18), and thus, arrive at the optimum MMSE solution achievable through iterative computing [39].

4 Adaptive Detection

In this section, two different approaches for constructing adaptive detectors are presented. The first is a stochastic gradient based algorithm, also termed the LMS adaptive algorithm [36, 39]. The second approach is based on the least square (LS) adaptive algorithm, also called the recursive least squares (RLS) algorithm [39]. The LMS algorithm is low complexity and uses stochastic gradient technique to find the suboptimal weight matrix. On the contrary, the RLS algorithm gives better estimate of the matrix, as will be shown in the simulation results, but this comes at a cost of higher processing as RLS has more calculation steps compared to the LMS algorithm. As FFT and IFFT introduces a structure to the received signal, \(\pmb {F}^H \pmb {WF}\) becomes diagonal and \(\pmb {W}\) becomes circulant. Thus, the complete matrix can be reconstructed using only a row or a column of \(\pmb {W}\) which reduces the computations significantly. The properties of \(\pmb {W}\) can be understood as following:

$$\begin{aligned} ~ \pmb {W} = diag[w_{0},w_{1},\dots ,w_{(N-1)}], \end{aligned}$$
(21)

where “diag” stands for diagonal matrix. Thus, only the N diagonal elements of the matrix \(\pmb {W}\) need to be computed, while the LMS and RLS of the considered literature required \((N \times N)\) elements. This further reduces the computational complexity in comparison with the algorithmic counterparts proposed in the literature.

4.1 LMS Algorithm

As discussed previously, using stochastic gradient technique, the LMS algorithm calculates a sub-optimal weight matrix for \({\pmb {W}}\) in (18), and ensures that the mean-square error (MSE) is similar to the MSE for the ideal MMSE detector. The steps of the algorithm are outlined below.

  1. 1.

    Initialize the weights of the filter \(\pmb {w}_0 = \pmb {0}\), when lacking any priori information.

  2. 2.

    Choose a suitable step-size \(\mu\), such as \(\mathbf{0}\mathbf \le \mu \le \frac{2}{E||\mathbf{r}\mathbf (f)||^2}\).

  3. 3.

    Calculate the estimated error between the transmitted and detected symbols using

    $$\begin{aligned} {\pmb {e}}_{i}(f) = {\pmb {s}}_{i}(f) - {\pmb {w}}_{i}^{H}{\pmb {r}}_{i}(f), \end{aligned}$$
    (22)

    where i represents the frame index.

  4. 4.

    Update the weight vector as

    $$\begin{aligned} {\pmb {w}}_{(i+1)} = {\pmb {w}}_{i} - \mu {\pmb {r}} _{i}(f) \odot {\pmb {e}}_{i}^H(f), \end{aligned}$$
    (23)

    where \(\odot\) denotes the Hadamard product.

4.2 RLS Algorithm

The RLS adaptive algorithm is capable of significantly faster convergence speed compared to the LMS adaptive algorithm. However, this comes at the cost of higher computational complexity. It uses the detected data to improve the convergence speed as well as the detection performance. The steps of the algorithm are outlined below.

  1. 1.

    Initialize the weights of the filter \(\pmb {w}_{0}=\pmb {0}\), when lacking any priori information.

  2. 2.

    Initialize the inverse of the autocorrelation matrix \(\pmb {P}_{0}(f)=\pmb {R}^{-1}(f)=\pmb {I}\).

  3. 3.

    Choose a suitable forgetting factor \(\lambda _{RLS}\) such that \(0 \le \lambda _{RLS} \le 1\).

  4. 4.

    Calculate the gain vector \(\pmb {k}_i(f)\) as

    $$\begin{aligned} {\pmb {k}}_{i}(f) = \frac{\lambda _{RLS}^{-1} {\pmb {P}}_{(i-1)}(f){\pmb {r}}_{i}(f)}{1 + \lambda _{RLS}^{-1}{\pmb {r}}_{i}^{H}(f){\pmb {P}}_{(i-1)}(f){\pmb {r}}_{i}(f)}, \end{aligned}$$
    (24)

    where i represents the frame index.

  5. 5.

    Determine the estimated error between the transmitted and detected symbols as

    $$\begin{aligned} {\pmb {e}}_{i}(f) = {\pmb {s}}_{i}(f) - {\pmb {w}}_{i}^{H}{\pmb {r}}_{i}(f). \end{aligned}$$
    (25)
  6. 6.

    Update the weight vector using

    $$\begin{aligned} {\pmb {w}}_{i} = {\pmb {w}}_{(i-1)} + {\pmb {k}}_{i}(f) \odot {\pmb {e}}_{i}^{H}(f). \end{aligned}$$
    (26)
  7. 7.

    Update the inverse of the autocorrelation matrix as

    $$\begin{aligned} {\pmb {P}}_{i}(f) = \lambda _{RLS}^{-1}{\pmb {P}}_{(i-1)}(f) - \lambda _{RLS}^{-1}{\pmb {k}}_{i}(f){\pmb {r}}_{i}^{H}(f){\pmb {P}}_{(i-1)}(f). \end{aligned}$$
    (27)

5 Simulation Results and Discussions

In this section, numerical simulation results are presented to compare the performance of the proposed LMS and RLS algorithms with that of the MMSE and ML detection algorithms. In the simulations, the SV channel model using the following parameters was simulated; \(1/\varLambda = 14.99\)ns, \(1/\lambda = 0.476\)ns, \(\varGamma =0.024\)ns and \(\gamma = 0.12\)ns. \(\varGamma\) and \(\gamma\) are the power decay coefficients for clusters and multipath, respectively. The number of multipaths for the source to sensor and sensor to destination were assumed to be the same, and the Nakagami-m parameter was fixed to \(m_l = 1.3\).

Fig. 3
figure 3

BER versus SNR performance of an AF underwater communication system when using a single relay with different detectors

For a network consisting of only one relay/sensor, Fig. 3 shows the BER performance of the MRC, MMSE and ML detectors. Complete CSI is assumed to be available to the ML detectors. The number of multipaths considered are \(L = 4\) and \(L = 15\). It can be observed, that as the number of multipaths increases, the performance of the MRC detector degrades as the MRC does not have the ability to mitigate multipaths. The The BER performance of the MMSE receiver is not affected by an increased number of multipaths. It is worth noting that the complexity of MMSE and MRC is of the order O(N). Its performance is also similar to that of the ML detector, which has a complexity of \(O({N^{6.5}})\). Thus, it could be argued that employing SC-FDMA with the MMSE detector could resolve the multipaths issue in underwater communications.

Fig. 4
figure 4

Convergence performance of LMS and RLS adaptive detector for underwater communication system

Figure 4 illustrates the convergence of the proposed LMS and RLS detectors for a single relay network. During the simulations, the ensemble-average squared error was obtained by averaging over 100,000 independent channel realizations at \(E_b/N_0= 5\) dB. The forgetting factor of RLS was fixed to \(\lambda _{RLS} = 0.995\) while the step size of the LMS algorithm was fixed to \(\mu = 0.05\). It can be noted that convergence of the proposed RLS adaptive detector is superior to that of the proposed LMS based adaptive detector. Moreover, training length of 50 symbols is sufficient for algorithm convergence.

Fig. 5
figure 5

BER versus SNR performance of an AF underwater communication system when using a single relay with different doppler frequency

Figure 5 compares the BER performance of both the LMS and RLS adaptive detectors when the relay is moving at different velocities. The multipath numbers were fixed to \(L = 15\). It can be observed that, for 10 km/hour, the RLS detector gives lower BER compared to the the LMS detector and is just 1 dB away from the ML detector which has complete CSI knowledge. This shows that the low complexity RLS detector is ideal for practical deployment. In addition, the other obvious trend which can be observed is that as the velocity increases, the BER increases.

Fig. 6
figure 6

BER of the AF scheme as a function of distance for underwater communication system

Figure 6 shows the effect of sensor placement when the source and sensor are transmit with the same power. The normalized doppler frequency was fixed to 0.001. The figure reports an SNR = 0, 10, 20 and 30 dB respectively. The SNR decreases from bottom to top. For ease, the distance between the source and relay has been kept at \(\delta\), while that between the relay and destination was \((1-\delta )\). The best position seems here to have been achieved when the relay is equidistant between the source and destination. It can also be observed that for higher SNRs, the RLS algorithm always perform better compared to the LMS detector.

Fig. 7
figure 7

BER versus SNR performance of an AF underwater communication system when using multiple relays with different doppler frequency

Figure 7 shows the impact of multiple relays on the underwater communication system. The BER is plotted for the RLS adaptive algorithm. It can be observed that as the number of relays increases, the BER performance of the detector improves. Furthermore, the RLS adaptive detector exhibits good performance even for different normalized doppler frequencies, wherein 0 MHz corresponds to the stationary relay and 50 MHZ corresponds to a movement of approximately 90 km/hr.

6 Conclusion

In this contribution, the use of SC-FDE detectors for underwater communication systems alongside the AF protocol has been discussed. Specifically, the ML, the ideal MMSE and two types of adaptive detectors—namely the LMS and RLS. Furthermore, the approach of measuring both the source sensor and sensor destination weights together to improve the spectral efficiency of these detectors was adopted. It can be concluded from the results herein that the considered adaptive detectors could inform numerous novel detection schemes for SC-FDE underwater cooperative communication systems. The simulation results suggest that the considered adaptive detection schemes can achieve BER performance similar to that of the ideal ML detector, which requires ideal channel knowledge. The RLS adaptive detector can achieve the best BER performance, but with a higher detection complexity than that of the LMS adaptive detector.