Proof of Theorem 4.1
From matrix inversion lemma, we know
$$\displaystyle\begin{array}{rcl} & & tr\left ((\mathbf{D}^{H}\mathbf{D})^{-1}\right ) = tr((\mathbf{D}_{ 1}^{H}\mathbf{D}_{ 1} -\mathbf{D}_{1}^{H}\mathbf{D}_{ 2}(\mathbf{D}_{2}^{H}\mathbf{D}_{ 2})^{-1}\mathbf{D}_{ 2}^{H}\mathbf{D}_{ 1})^{-1}) \\ & & \qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad + tr((\mathbf{D}_{2}^{H}\mathbf{D}_{ 2} -\mathbf{D}_{2}^{H}\mathbf{D}_{ 1}(\mathbf{D}_{1}^{H}\mathbf{D}_{ 1})^{-1}\mathbf{D}_{ 1}^{H}\mathbf{D}_{ 2})^{-1}) \\ & & = tr((\mathbf{D}_{1}^{H}P_{\mathbf{ D}_{2}}^{\perp }\mathbf{D}_{ 1})^{-1}) + tr((\mathbf{D}_{ 2}^{H}P_{\mathbf{ D}_{1}}^{\perp }\mathbf{D}_{ 2})^{-1}) \geq tr((\mathbf{D}_{ 1}^{H}\mathbf{D}_{ 1})^{-1}) + tr((\mathbf{D}_{ 2}^{H}\mathbf{D}_{ 2})^{-1}),{}\end{array}$$
(4.32)
and the equality holds if and only if
\(\mathbf{D}_{1}^{H}\mathbf{D}_{2} = {\boldsymbol 0}\). However, we cannot directly claim that the optimal
\(\mathbf{D}_{1}\) and
\(\mathbf{D}_{2}\) are orthogonal since
\(\mathbf{D}_{1}\) and
\(\mathbf{D}_{2}\) have to follow the circulant structure.
Nonetheless, we first take look at the following optimization:
$$\displaystyle\begin{array}{rcl} \min _{\mathbf{D}_{j}}& & \ tr((\mathbf{D}_{j}^{H}\mathbf{D}_{ j})^{-1}) \\ \mathrm{s.t.}& & \ \|\mathbf{d}_{j}\|^{2} \leq NP_{ j},{}\end{array}$$
(4.33)
for
j = 1, 2. Note that (
4.33) is the standard optimal channel estimation in single-input single-output (SISO) OFDM [
9], whose solutions are
$$\displaystyle{ \vert \tilde{d}_{j,i}\vert ^{2} = P_{ j},\ \ \forall j = 1,2,\ i = 0,\ldots,N - 1. }$$
(4.34)
Meanwhile, the minimum value of
\(tr((\mathbf{D}_{1}^{H}\mathbf{D}_{1})^{-1})\) is
\(\frac{2L_{1}-1} {NP_{1}}\) and the minimum value of
\(tr((\mathbf{D}_{2}^{H}\mathbf{D}_{2})^{-1})\) is
\(\frac{L_{1}+L_{2}-1} {NP_{2}}\).
Lastly, if we can find a pair of \(\mathbf{D}_{1}\) and \(\mathbf{D}_{2}\) from (4.34) that satisfies \(\mathbf{D}_{1}^{H}\mathbf{D}_{2} = {\boldsymbol 0}\), then these \(\mathbf{D}_{1}\) and \(\mathbf{D}_{2}\) make the equality in (4.32) hold, and must be the optimal solution to (4.15).
Therefore, the training satisfying both (4.34) and the following equation
$$\displaystyle\begin{array}{rcl} \sum _{i=0}^{N-1}\tilde{d}_{ 1,i}^{{\ast}}\tilde{d}_{ 2,i}e^{-\text{j}2\pi ki/N} = 0,\quad \forall k \in \{-(L_{ 1} + L_{2} - 2),-(L_{1}+L_{2} - 3),\ldots,(2L_{1} - 2)\}& &{}\end{array}$$
(4.35)
is the optimal solution to (
4.15).
Proof of Theorem 4.2
The N-point DFT of \(\mathbf{b}\) can be expressed as \(\tilde{\mathbf{b}} = [\tilde{b}_{0},\tilde{b}_{1},\ldots,\tilde{b}_{N-1}]^{T}\), where \(\tilde{b}_{i} = \tilde{h}_{i}^{2}\). Then, only \(\tilde{b}_{i}^{1/2} = \tilde{\gamma }_{i}\tilde{h}_{i}\) can be obtained by the rooting operation, where \(\tilde{\gamma }_{i} = \pm 1\) contains the channel uncertainty in each carrier. Define \(\tilde{\boldsymbol{\gamma }} = [\tilde{\gamma }_{0},\tilde{\gamma }_{1},\ldots,\tilde{\gamma }_{N-1}]^{T}\) and \(\tilde{\mathbf{t}} = [\tilde{b}_{0}^{1/2},\tilde{b}_{1}^{1/2},\ldots,\tilde{b}_{N-1}^{1/2}]^{T} = \tilde{\mathbf{h}} \odot \tilde{\boldsymbol{\gamma }}\). Construct an auxiliary vector \(\tilde{\boldsymbol{\kappa }} = [\tilde{\kappa }_{0},\tilde{\kappa }_{1},\ldots,\tilde{\kappa }_{N-1}]^{T}\), where \(\tilde{\kappa }_{i} \in \{ +1,-1\}\), and define \(\tilde{\boldsymbol{\zeta }} = \tilde{\boldsymbol{\gamma }} \odot \tilde{\boldsymbol{\kappa }} = [\tilde{\zeta }_{0},\tilde{\zeta }_{1},\ldots,\tilde{\zeta }_{N-1}]^{T}\), where \(\tilde{\zeta }_{i} = \tilde{\gamma }_{i}\tilde{\kappa }_{i}\) belongs to \(\{+1,-1\}\). The IDFT of \(\tilde{\mathbf{t}} \odot \tilde{\boldsymbol{\kappa }}\) can then be expressed as \(\mathbf{t} \otimes \boldsymbol{\kappa } = \mathbf{h} \otimes \boldsymbol{\zeta }\) where \(\boldsymbol{\kappa }\) and \(\boldsymbol{\zeta }\) are the IDFTs of \(\tilde{\boldsymbol{\kappa }}\) and \(\tilde{\boldsymbol{\zeta }}\), respectively. Suppose \(\boldsymbol{\zeta }\) has length L ζ . Then, the length of \(\mathbf{h}\otimes \boldsymbol{\zeta }\) is \(\min \{L_{1} + L_{\zeta } - 1,N\}\).6 If L 1 = N, then the length of \(\mathbf{t}\) is N regardless of \(\boldsymbol{\zeta }\). However, if L 1 < N, then the length of \(\mathbf{h}\otimes \boldsymbol{\zeta }\) is L 1 only when L ζ = 1. In this case, \(\boldsymbol{\zeta }\) has only one non-zero value at its first element and is, actually, a scale of the discrete delta function. Therefore, all \(\tilde{\zeta }_{i}\)’s are equal. Considering the range of \(\tilde{\zeta }_{i}\), \(\tilde{\boldsymbol{\zeta }}\) can only be \(\pm {\boldsymbol 1}\), which corresponds to \(\tilde{\boldsymbol{\kappa }} = \pm \tilde{\boldsymbol{\gamma }}\). The so derived \(\mathbf{t} \otimes \boldsymbol{\kappa }\) is then \(\pm [\mathbf{h},{\boldsymbol 0}_{1\times (N-L_{1})}]^{T}\) whose first L 1 entries give \(\pm \mathbf{h}\).
Proof of Theorem 4.3
After dividing the i th element from \(\tilde{\mathbf{c}}\) by \(\tilde{b}_{i}^{1/2}\), we obtain \(\tilde{\mathbf{z}} = \tilde{\mathbf{c}} \oslash \tilde{\mathbf{t}} = \tilde{\mathbf{g}} \odot \tilde{\boldsymbol{\gamma }}\), where ⊘ denotes the element-wise division. If L 2 is known and L 2 < N, from a similar process in Appendix “Proof of Theorem 4.2”, we can obtain \(\tilde{\mathbf{g}}\) up to a sign ambiguity by restricting the length of \(\mathbf{z}\otimes \boldsymbol{\kappa }\) as L 2, where \(\mathbf{z}\) is the IDFT of \(\tilde{\mathbf{z}}\). The effect of \(\tilde{\boldsymbol{\kappa }} = \pm \tilde{\boldsymbol{\gamma }}\) can then be removed from \(\tilde{\mathbf{z}}\), which results in \(\pm \mathbf{g}\).
Proof of Theorem 4.4
From Appendix “Proof of Theorem 4.2”, we know that non-trivial ambiguity happens if there exists a \(\boldsymbol{\zeta }\) with length \(L_{\zeta } \leq \hat{L}_{1} - L_{1} + 1\) such that the length of \(\mathbf{t}\otimes \boldsymbol{\kappa } = \mathbf{h}\otimes \boldsymbol{\zeta }\) is still no greater than \(\hat{L}_{1}\).
Define \(\bar{\mathbf{F}}_{x}\) as the matrix that contains the last x columns of \(\mathbf{F}\). We only need to find the maximum \(\hat{L}_{1}\), such that \(\bar{\mathbf{F}}_{N-(\hat{L}_{1}-L_{1}+1)}\tilde{\boldsymbol{\zeta }}\neq {\boldsymbol 0}\) for all \(\tilde{\boldsymbol{\zeta }}\)’s except for \(\tilde{\boldsymbol{\zeta }} = \pm {\boldsymbol 1}\). We first take a look into the following lemma.
From Lemma 4.1, we know the nontrivial ambiguity does not exist for \(\hat{L}_{1} \leq L_{1} + \frac{N} {2} - 1\). In fact, \(\hat{L}_{1} = L_{1} + \frac{N} {2} - 1\) is also the maximum allowed value since \(\bar{\mathbf{F}}_{N/2-1}^{H}\tilde{\boldsymbol{\zeta }} = {\boldsymbol 0}\) when \(\tilde{\boldsymbol{\zeta }}_{i} = [+1,-1,+1,-1,\ldots,+1,-1]^{T}\).
Noting from Theorem 4.2 that \(\hat{L}_{1}\) should be smaller than N, we complete the proof.
Proof of Theorem 4.5
For odd N, the maximum \(\hat{L}_{1}\) depends on the factorization of N and we do not have a general conclusion, yet. However from Lemma 4.1, we know \(\hat{L}_{1} \leq L_{1} + \frac{N-1} {2}\) can guarantee only the sign ambiguity, although it may not be the maximum value.
Algorithm to Extract \(\mathbf{h}\) and \(\mathbf{g}\)
Based on the discussions from Appendix “Proof of Theorem
4.2” to Appendix “Proof of Theorem
4.5”,
\(\tilde{\boldsymbol{\kappa }} = \pm \tilde{\boldsymbol{\gamma }}\) should be found from
$$\displaystyle\begin{array}{rcl} \|\bar{\mathbf{F}}_{N-\hat{L}_{1}}^{H}(\tilde{\mathbf{t}} \odot \tilde{\boldsymbol{\kappa }})\|^{2} +\| \bar{\mathbf{F}}_{ N-\hat{L}_{2}}^{H}(\tilde{\mathbf{z}} \odot \tilde{\boldsymbol{\kappa }})\|^{2} =\| \bar{\mathbf{F}}_{ N-\hat{L}_{1}}^{H}\tilde{\mathbf{T}}\tilde{\boldsymbol{\kappa }})\|^{2} +\| \bar{\mathbf{F}}_{ N-\hat{L}_{2}}^{H}\tilde{\mathbf{Z}}\tilde{\boldsymbol{\kappa }})\|^{2} = 0,& &{}\end{array}$$
(4.36)
where
\(\tilde{\mathbf{T}} \triangleq \text{diag}\{\tilde{\mathbf{t}}\}\) and
\(\tilde{\mathbf{Z}} \triangleq \text{diag}\{\tilde{\mathbf{z}}\}\) are two diagonal matrices.
Due to the noise in practical transmission, the equality in (
4.36) cannot hold. Then,
\(\tilde{\boldsymbol{\kappa }}\) should be found from
$$\displaystyle{ \tilde{\boldsymbol{\kappa }} =\arg \min _{x_{i}\in \{+1,-1\}}\|\boldsymbol{\varXi }\mathbf{x}\|^{2}, }$$
(4.37)
where
\(\mathbf{x} = [x_{0},x_{1},\ldots,x_{N-1}]^{T}\) is the trial variable and
\(\boldsymbol{\varXi } \triangleq \left [(\bar{\mathbf{F}}_{N-\hat{L}_{1}}^{H}\tilde{\mathbf{T}})^{T}\right.,\left.(\bar{\mathbf{F}}_{N-\hat{L}_{2}}^{H}\tilde{\mathbf{Z}})^{T}\right ]^{T}\) is a
\((2N -\hat{L}_{1} -\hat{L}_{2}) \times N\) matrix. Since the sign ambiguity always exists, we can fix
x 0 = 1. One way is to apply the exhaustive search over 2
N−1 possible candidates. Considering the range of
x i , we can also apply the efficient sphere decoding (SD) algorithm [
12,
13] to reduce the complexity.