A Bayesian MMSE Turbo Receiver for Coded MIMO Systems

1 downloads 0 Views 1MB Size Report
Abstract—In conventional interference cancellation (IC)-based turbo receivers, extrinsic information (EXT) from the soft-input–soft-output. (SISO) decoders is ...
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 57, NO. 5, SEPTEMBER 2008

A Bayesian MMSE Turbo Receiver for Coded MIMO Systems Sumei Sun, Yan Wu, Yuan Li, and Tjeng Thiang Tjhung Abstract—In conventional interference cancellation (IC)-based turbo receivers, extrinsic information (EXT) from the soft-input–soft-output (SISO) decoders is used to compute the statistical mean (SM) of the interfering signals. In this paper, we present a class of Bayesian minimum mean-square-error (MMSE) turbo receivers for coded multiple-input– multiple-output (MIMO) systems. Instead of using the EXT to estimate the prior SM, we use it in the posterior Bayesian MMSE estimation of the interfering signals. The estimation accuracy is, therefore, improved, leading to better bit-error-rate (BER) performance from our proposed Bayesian receivers. For the cases that we have studied, the proposed receiver with two iterations has a lower BER than the conventional one with five iterations, thus significantly reducing the receiver processing delay and the implementation complexity. Its superior performance is also demonstrated by the higher detector output mutual information and fewer iterations to achieve convergence, which is obtained from the extrinsic information transfer (EXIT) chart analysis. Index Terms—Bayesian estimation (BE), expectation maximization (EM), extrinsic information transfer (EXIT) chart, multiple-input– multiple-output (MIMO), turbo receiver.

I. I NTRODUCTION Multiple-input–multiple-output (MIMO) wireless communication systems have attracted a lot of academic and industrial interest over the last decade due to their great potential to increase spectral efficiency and diversity gains. Many schemes have been proposed to realize these gains, for example, the various Bell Laboratories Layered Space–Time (BLAST) architectures [1], [2] to exploit multiplexing gain, space–time block codes (STBCs) [3], [4] to exploit diversity gain, and the simple hybrid schemes of STBCs and spatial multiplexing such as groupwise STBCs (GSTBCs) [5] to exploit multiplexing and diversity gains. In this paper, we focus on iterative turbo receiver techniques for forward error control-coded MIMO systems. In this context, many works have been published. For example, Lu and Wang [6] proposed a linear minimum mean-square-error (LMMSE) interference cancellation (IC) based turbo receiver for multiuser space–time coded systems; Sellathurai and Haykin [7] applied the same receiver to coded BLAST systems and referred to it as the turbo BLAST (T-BLAST). The biterror-rate (BER) performance of the T-BLAST in correlated Rayleigh fading channels was then evaluated in [8], and its practical virtue was verified in experiments in [9]. In a semitutorial paper, Biglieri et al. [10] analyzed the performance of the IC-based turbo receivers using the extrinsic information transfer (EXIT) chart. Boutros and Caire [11] developed a more generalized framework on iterative receivers for code-division multiple-access (CDMA) systems, which is also applicable to MIMO. Based on the factor-graph representation and the sumproduct algorithm [12], they showed that the estimated interference at each iteration should be a function of the decoders’ extrinsic information (EXT) rather than the a posteriori probability. The EXT Manuscript received June 2, 2006; revised November 5, 2006, April 30, 2007, October 24, 2007, and December 3, 2007. The review of this paper was coordinated by Dr. A. Ghrayeb. The authors are with the Institute for Infocomm Research, Singapore 119613 (e-mail: [email protected]; [email protected]; [email protected]. edu.sg; [email protected]). Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org. Digital Object Identifier 10.1109/TVT.2008.917230

3283

is used to obtain the a priori probability of the coded bits from which the statistical mean (SM), or the prior estimate [13], of the interference is calculated. IC is then performed, and soft-input–softoutput (SISO) decoding is implemented. Different filtering schemes, such as a matched filter (MF) or the LMMSE, can be applied in the IC step. In MIMO, we interchangeably use the terms maximal ratio combining (MRC) and MF. The turbo receivers considered in [6]–[11] are referred to as conventional turbo receivers in this paper. One drawback to the conventional schemes, as we will show in this paper, lies in the poor accuracy of the SM estimate at low-to-medium SNR values or when a high rate code is used. To improve receiver performance, we propose a novel Bayesian minimum mean-squareerror (BMMSE) turbo receiver that exploits the EXT in the Bayesian estimation (BE) [13, ch. 10 and 11], [14, Lesson 13] of the interference signals. We show that the BE conditioned on the received signal and the estimated interference (from the previous iteration) is a function of both the EXT and the decision statistic of the IC-MRC. The iterative IC receiver based on BE is referred to as the Bayesian IC-MRC turbo receiver in this paper to differentiate it from the conventional design in [11]. We show through EXIT chart analysis and BER simulations that the posterior BE results in much higher output mutual information at the SISO detector, fewer iterations to achieve convergence to the lower bound, and a lower BER than the conventional prior SM estimate. Using the Gaussian model that is developed in [15] for the conventional LMMSE-IC detectors, we further apply the BE scheme to LMMSE-IC receivers and develop a class of Bayesian LMMSE-IC turbo receivers. It can have accurate interference estimation by BE and effective interference suppression by LMMSE filtering; therefore, it has superior performance over the conventional LMMSE-IC receivers. This will be demonstrated in the EXIT chart analysis and the BER simulation results. The application of Bayesian expectation maximization (EM)– maximum a posteriori (MAP) symbol estimation in a turbo receiver was previously considered in [16], in which the frequency flat-fading channel coefficients and the transmitted symbols were estimated iteratively. However, no detailed derivation and analysis were given in [16]. In this paper, we consider both narrow-band signals in flat-fading channels and wideband signals in frequency-selective channels. In the latter case, the multipath-induced intersymbol interference (ISI) is mitigated by orthogonal frequency division multiplexing (OFDM) [17]. MIMO-OFDM is of high practical interest, which has been selected as the enabling technology for the next-generation high-throughput wireless local area networks (WLANs) in IEEE 802.11n [18]. We will establish a common signal model for the frequency-domain (FD) signals in MIMO-OFDM and the time-domain signals in narrow-band MIMO systems. For the applications, we consider the vertical BLAST (VBLAST) and GSTBC OFDM systems, both of which have been adopted in IEEE 802.11n [18]. The proposed receiver design, however, can be applied to multiuser CDMA systems in a straightforward manner. Throughout this paper, vectors and matrices are represented by boldfaced letters. The symbols (·)T , (·)H , and (·)−1 denote matrix transposition, Hermitian, and inversion, respectively. All vectors are defined as column vectors, with row vectors represented by transpo˘ X, ˆ and X ˜ to, respectively, denote the BMMSE sition. We use X, estimate, the SM estimate, and the detector decision statistic for the signal X. II. S IGNAL M ODEL We consider the wideband signals in frequency-selective channels and the narrow-band signals in flat-fading channels. For the former

0018-9545/$25.00 © 2008 IEEE

3284

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 57, NO. 5, SEPTEMBER 2008

Fig. 1. Coded MIMO-OFDM transmitter.



denotes the bit interleaver.

case, the multipath induced ISI can be mitigated by OFDM modulation. The coded MIMO-OFDM system that is considered in this paper is depicted in Fig. 1. The random information bits are first demultiplexed into parallel streams (layers) by the “Spatial DeMux” unit. In each parallel stream, information bits are encoded, bit-interleaved, and mapped to constellation points of the adopted modulation scheme based on the bit-interleaved coded modulation (BICM) principle [19]. This is referred to as a “horizontal coding architecture” (HCA) [20]. Alternately, a “vertical coding architecture” (VCA) [20] can be adopted. In the VCA, a single encoder is used. The encoded bits are demultiplexed into parallel streams, and the bit-interleaving and symbol-mapping functions are implemented in each stream. Both coding architectures have been adopted in IEEE 802.11n [18]. The signal model that is established in this section and the BMMSE turbo receivers that are proposed in this paper can be applied to both architectures. The BICM outputs from the parallel streams are further processed by the “space–time–frequency precoding” (STFP) unit. Depending on the system requirement and the available transmit and receive antennas, different STFP schemes can be adopted. For example, if maximum multiplexing gain is desired and if no channel state information (CSI) is available at the transmitter, VBLAST can be used. If both transmit diversity and multiplexing gains are desired, GSTBCs [5] can be applied. The STFP output is then OFDM-modulated, consisting of the inverse fast Fourier transform and cyclic prefix (CP) insertion. The CP length should be no shorter than the number of multipaths in the channel to completely remove the ISI [17]. The OFDM-modulated signals are then transmitted by different antennas. Assuming perfect timing and frequency synchronization, we can remove the CP part from the received data before converting them to the FD by fast Fourier transform (FFT), as shown in Fig. 2. The FD data at each receive antenna are then multiplexed into one signal vector R for each subcarrier, which is written as follows: R = HX + V

(1)

where H, X, and V denote the channel, the transmitted signal, and the additive white Gaussian noise (AWGN), respectively. For a VBLAST system with nT transmit and nR receive antennas (nT × nR ), we have def

R = [ R1

R2

· · · RnR ]TnR ×1

def

H = [Hi,j ]nR ×nT def

X = [ X1 def

V = [ V1

X2 V2

· · · XnT ]TnT ×1 · · · VnR ]TnR ×1

where the subscripts for the elements in R and V denote the receive antenna index, the subscript in X denotes the transmit antenna index, and the subscript in H denotes the receive–transmit antenna pair index. For the complex AWGN, we have V ∼ CN (0, 2σ 2 InR ).

For an nT × nR GSTBC system, we have def

R = [ R1,1 def

X = [ X1 def

V = [ V1,1

∗ −R1,2

X2

· · · RnR ,1

∗ −Rn ]T R ,2 2nR ×1

· · · XnT ]TnT ×1

∗ −V1,2

· · · VnR ,1

−Vn∗R ,2 ]TnR ×1

def

H = [(Hi,j )2×2 ]2nR ×nT where





H Hi,2j , (Hi,j )2×2 = i,2j−1 ∗ ∗ −Hi,2j Hi,2j−1

i = 1, . . . , nR , j = 1, . . . ,

nT 2

and the subscripts of R and V denote the receive antenna and OFDM symbol indexes in each Alamouti STBC codeword [5]. The signal model in (1) is applicable to single-carrier systems in narrow-band, flat-fading channels. In this case, no OFDM modulation is needed, and the STFP block is replaced by a space–time precoding block. Accordingly, (1) becomes a time-domain signal model. Other than that, the rest of the blocks remain the same as for the MIMOOFDM systems. III. B AYESIAN IC-MRC T URBO R ECEIVER The proposed Bayesian turbo receiver for MIMO-OFDM systems with an HCA is depicted in Fig. 2. When we remove the OFDM demodulation blocks, the same receiver can be directly applied to narrow-band, single-carrier systems. Besides the OFDM demodulation and spatial multiplexing units, the rest of the receiver can be divided into two parts—the SISO detector and the SISO decoders. The EXT is exchanged between the two parts, as we will show in detail here and in Section IV. At each iteration, the SISO decoders take in the deinterleaved extrinsic bit metric values from the SISO detector and compute the updated extrinsic log-likelihood ratio (LLR), i.e., the EXT, for the coded bits. The updated EXT is then interleaved and sent back to the SISO detector. The SISO detector consists of three blocks—the BMMSE estimators, the STFP detector, and the soft demodulator. In the BMMSE estimators, the BE of the transmitted signals is calculated using the EXT of the SISO decoder and the decision statistic of the STFP detector, both from the previous iteration. Their outputs are used by the STFP detector to implement either the IC-MRC or the LMMSE-IC scheme. The detector output, i.e., the decision statistic of the transmitted signals, is then delivered to the soft demodulator to calculate the extrinsic bit metric values. A. STFP Detector The STFP detector calculates the decision statistic of the transmitted signals. For the low-complexity, IC-based linear detectors, the decision statistic is calculated as ˜ ˜ k,i = FH X k,i Rk,i

(2)

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 57, NO. 5, SEPTEMBER 2008

Fig. 2.

3285

Bayesian turbo receiver.

where the subscripts k and i represent the signal and iteration indexes, ˜ denotes the IC output, which is given as respectively, and R ˜ k,i = R − R

nT 

˜ k,i Hp Xp,i−1,est = Hk Xk + V

Hp (Xp − Xp,i−1,est ) + V.

p=1 p=k

F denotes the linear filter for which two schemes can be implemented, namely, an MF (or MRC) with Fk,i = Hk

(4)

and the LMMSE filter to be discussed in detail in Section IV. In conventional turbo receivers, the SM is used for Xp,i−1,est , which is computed as follows [10], [11]: ˆ p,i−1 = X





pi−1 (Xm ) =



pi−1 clm



˘ BMMSE = E{X|R} X



(6)

under the assumption that Xm is mapped from bits [c1m , c2m , . . . , log M cm 2 ], which are uncorrelated with sufficient interleaving. pi−1 (clm )

Xj p(Xj |R)

Xj ∈ΩnT



l=1

(7)

Assuming perfect CSI at the receiver, we obtain the BMMSE estimate of the transmitted signals as [13, ch. 10 and 11]

Xm ∈Ω

log2 M



1 + exp λai−1 (k, l)

B. Optimal BMMSE Estimate

(5)

where Ω is the modulation set of the transmitted signals with dim(Ω) = M , and pi−1 (Xm ) is the probability mass function (pmf) that is calculated at iteration (i − 1), i.e.,



with λai−1 (k, l) being the interleaved EXT for clk at iteration (i − 1) and the superscript “a” denoting a priori. For notational simplicity, we will omit the iteration index in the pmf for the rest of this paper. In this paper, we take a different detection approach. We start from the optimal BMMSE estimate of the transmitted signals and show that when EM algorithm [21] is used to reduce the computational complexity, we obtain the same linear IC-MRC detector in (2) and (4). ˘ p,i−1 }, However, in the IC process [see (3)], the BMMSE estimates {X ˆ instead of the prior SM estimate {Xp,i−1 }, of the interference signals will be used. This is one major difference of our proposed BMMSE turbo receiver from its conventional counterpart.

= Xm pi−1 (Xm )

λai−1 (k, l)clk



= exp

(3)

with Hp denoting the pth column of H, Xp,i−1,est denoting the ˜ k,i denoting the composite estimate of Xp at iteration (i − 1), and V residual interference and AWGN, i.e., nT 

pi−1 (clk )

p=1 p=k

˜ k,i = V

denotes the a priori pmf estimate for bit clm [6], i.e.,

=

Xj p(R|Xj )p(Xj )

Xj ∈ΩnT



Xi

p(R|Xi )p(Xi )

(8)

∈ΩnT

where the last equality is obtained by using Bayes’ rule. The computation of the numerator and the denominator of (8) incurs the complexity of M nT . Simplification is, thus, desired. To reduce the computational complexity, we adopt the expectation step of the spacealternating missing parameter EM algorithm that is outlined in [21]. In this approach, the transmitted signal is estimated by treating the interfering signals as “missing” data.

3286

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 57, NO. 5, SEPTEMBER 2008

For the modulation schemes that are considered here, we further have

C. BEME The Bayesian EM MMSE estimate (BEME) of Xk at iteration i, ˘ k,i , is derived based on the received signal R and the BEME of i.e., X ˘ p,i−1 , p = k, i.e., the interfering signals at iteration i − 1, i.e., X



˘ k,i−1 ˘ k,i def = E Xk |R, X X

(9)

˘ k,i−1 = [X ˘ 1,i−1 , . . . , X ˘ k−1,i−1 , X ˘ k+1,i−1 , . . . , X ˘ n ,i−1 ]T . where X T Using Bayes’ rule, we can further write (9) as follows: ˘ k,i = X

 

Sm

˘ k,i−1 ) p(Xk = Sm , R, X ˘ k,i−1 ) p(R, X

Sm ∈Ω

=



Sm

Sm ∈Ω

=



=  + Hk Xk 2 − 2Re ⎝Xk∗ HH k R−



(10)

˘ k,i−1 )p(Xk = Sn ) p(R|Xk = Sn , X

Sm ∈Ω



˘ k,i−1 ) p(R|Xk = Sn , X

.

(11)

In this paper, we propose to use the estimated pmf in (6) in (10). This distinguishes our approach from the conventional one in (11). In terms of the application of the turbo principle, using the a priori information in the posterior BEME is different from using it in the prior SM estimation in the conventional turbo receivers. ˘ k,i is unbiased. Theorem 1: The BEME X Proof: From (9), we have





˜ k,i =  + Hk Xk 2 − 2Re Xk∗ X

=E

 

˘ k,i−1 ) Xk p(Xk |R, X

Xk ∈Ω

˘ k,i−1 )p(Xk |R, X ˘ k,i−1 )dR p(R|X

Xk

Xk ∈Ω

=

˘ k,i−1 ) Xk p(Xk |R, X



Xk

˘ k,i−1 )dR p(Xk , R|X

Xk ∈Ω

=



˘ k,i−1 ) Xk p(Xk |X

Xk ∈Ω

=



Xk ∈Ω

Xk p(Xk ) = E{Xk }.

nT nT  

˘∗ ˘ HH p Hq Xp,i−1 Xq,i−1

p=1 q=1 p=k q=k

is a constant term for all the hypotheses of Xk , and



⎞ nT 



˘ p,i−1 ⎠ Hp X

p=1 p=k

is the IC-MRC decision statistic in (2). Equation (12) is, therefore, simplified to





˜ k,i H X 2 −2Re Xk∗ X ˘ k,i−1 ) = C exp − k k p(R|Xk , X 2σ 2

 (13)

where C = α exp(−/2σ 2 ), and the BEME in (10) becomes





p=1 p=k



+

˜ k,i = HH ⎜ X k ⎝R −

Xk ∈Ω



⎟⎥

˘ p,i−1 ⎠⎦ Hp X

p=1 p=k



˘ k,i−1 )dR p(R|X

⎞⎤

nT 

⎞ nT  ⎜ ⎟ ∗ ˘ p,i−1 X  = RH R − 2Re ⎝ HH p R⎠

˘ k,i−1 } ˘ k,i } = E E{Xk |R, X E{X





˘ p,i−1 ⎠ HkH Hp Xk∗ X







where “Re” denotes the real part of the signal

˘ k,i−1 ) Sm p(R|Xk = Sm , X

Sn ∈Ω

=



=  + Hk Xk 2 − 2Re ⎣Xk∗ HH k ⎝R −

˘ k,i−1 ) = p(Xk )p(X ˘ k,i−1 ). under the assumption that p(Xk , X The conventional EM receivers in [21] and [22] assume an equal pmf for the transmitted signals; hence, (10) becomes



nT  p=1 p=k





Sn ∈Ω

˘ k,i = X

(12)



˘ k,i−1 )p(Xk = Sm ) Sm p(R|Xk = Sm , X





= RH R − 2Re(XH HH R) + XH HH HX

˘ k,i−1 ) p(R, Xk = Sn , X

Sm ∈Ω

=

(R − HX)H (R − HX) 2σ 2

˘ k,i−1 ] X=[Xk ,X

Sn ∈Ω



 ˘ k,i−1 ) = α exp − p(R|Xk , X

κ = (R − HX)H (R − HX)|

˘ k,i−1 )p(Xk = Sm , X ˘ k,i−1 ) p(R|Xk = Sm , X



 ˘ k,i . We first assume perfect Now, we proceed to derive BEME X interference estimation and cancellation. In this case, we have

˘ k,i−1 ], and α is a system-dependent constant, e.g., with X = [Xk , X α = 1/(2πσ 2 )nR for the VBLAST, and α = 1/(2πσ 2 )2nR for the GSTBC. To simplify the derivation, we define

˘ k,i−1 ) Sm p(Xk = Sm |R, X

Sm ∈Ω

=

˘ k,i } = 0. E{X

Sm exp

˘ k,i = Sm ∈Ω X Sn ∈Ω

 −H

 exp

∗ ˜ Xk,i ) (Sm

2 k Sm  +2Re 2σ 2

∗X ˜ k,i ) −Hk Sn 2 +2Re(Sn

2σ 2

p(Xk = Sm ) . p(Xk = Sn ) (14)

˜ k,i and p(Xk ). ˘ k,i , we need X From (14), we can see that to obtain X Whereas p(Xk ) can be obtained from the SISO decoders, as shown in ˜ k,i . Fig. 2, an IC-MRC detector needs to be implemented to provide X

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 57, NO. 5, SEPTEMBER 2008

For constant modulus modulation signals, (14) can be further simplified to

˘ k,i = X



!

Sm exp Re

Sm ∈Ω





!

exp Re

Sn ∈Ω

∗ ˜ Xk,i Sm σ2

∗X ˜ k,i Sn σ2

3287

TABLE I BAYESIAN IC-MRC TURBO RECEIVER ALGORITHM

"

p(Xk = Sm )

"

p(Xk = Sn ).

The assumption of perfect IC, however, is unrealistic. On the other hand, obtaining an accurate estimate of the multivariate conditional ˘ k,i−1 ) is very complex. As an alternative, we proceed pdf p(R|Xk , X to analyze the statistical properties of the decision statistic of the IC˜ k,i |Xk ), as suggested ˜ k,i and obtain its conditional pdf p(X MRC X ˜ k,i as the by (13) and (14). We will then obtain the BEME by using X observation signal. 1) Statistics of the IC-MRC Signal: From (2) and (4), we can express the decision statistic of IC-MRC as ˜ k,i = gk Xk + v˜k,i X

(M -QAM), can be derived following a similar procedure. We give the expression for 8PSK signals in Appendix B.

(15) D. Soft Demodulator

where

The soft demodulator calculates the updated LLR of the coded bits, ˜ k,i , i.e., i.e., the extrinsic metric values, using X

gk = HH k Hk ˜ v˜k,i = HH k Vk,i =

nT 

H ˘ HH k Hp (Xp − Xp,i−1 ) + Hk V.

λei (k, l) = Λi (k, l) − λai−1 (k, l)



p=1 p=k

When nT is reasonably large, v˜k,i can be approximated as a Gaussian random variable from the central limit theorem. As proven in Appendix A, it has mean zero and variance

 1  nR  ˘ p,i−1 |2 + gk σ 2 . 1 − |X 2 2 p=1 nT

ς2 =



= log

nT

#

2 ˜ ˜ k,i |Xk ) = 1 exp − |Xk,i − gk Xk | p(X 2 2 2πς 2ς

.

(16)

 Sm ∈Ω

˜ k,i |Xk = Sm )p(Xk = Sm ) p(X . Sm ˜ k,i |Xk = Sn )p(Xk = Sn ) p(X Sn ∈Ω

(17) For BPSK signals, we have

# ˘ k,i,BPSK,IC-MRC = tanh X

λai (k) 2

˜ k,i gk X + ς2

$ .

(18)

√ For QPSK signals with a Gray mapping rule of Sk = (1/ 2) × [(2c1k − 1) + j(2c2k − 1)], we use the a priori pmf in (6) and obtain



˘ k,i,QPSK,IC-MRC = √1 tanh X 2

#

+j tanh

#

Sk ∈Ω−

˘ k,i,IC−MRC = E{Xk |X ˜ k,i } X

˜ k,i ) Re(gk X λai (k, 1) √ + 2 2ς 2

˜ k,i ) Im(gk X λai (k, 2) √ + 2 2ς 2



˜ k,i |Sk )p(Sk ) p(X ˜ k,i |Sk )p(Sk ) p(X ˜ k,i |Sk ) p(X

Sk ∈Ω+

= log

2) IC-MRC BEME: With (16), we are ready to derive the IC-MRC ˜ k,i as the observed signal, i.e., BEME using X

=



$



Sk ∈Ω+ Sk ∈Ω−

p=k

We, therefore, have



˜ k,i p cl = 1 p clk = 1|X   − log  kl = log  l ˜ k,i p ck = 0 p ck = 0|X



log2 M

− log

p(clk = 1) p(clk = 0)

 

p cjk

j=1 j=l

˜ k,i |Sk ) p(X



log2 M

 

(20)

p cjk

j=1 j=l

where λE , Λ, and λa , respectively, denote the extrinsic, a posteriori, and a priori LLRs, and p(clk ) is the estimated a priori pmf of clk given in (7). At the initialization (iteration 0), p(clk ) = 1/2, and λa−1 (k, l) = 0. Ω+ = {Sk : clk = 1} denotes the subset of modulation symbols whose lth bit is 1, and Ω− = {Sk : clk = 0} denotes the signal subset whose lth bit is 0. E. Summary of the Complete Receiver Signal Processing Steps We summarize the signal processing algorithms of the Bayesian IC-MRC turbo receiver in Table I. For comparison, the operation of the conventional receiver is also included in the same table. It can be seen that the major difference between the two receivers lies in the signal estimation step. The rest of the processing, such as IC, linear filtering, and soft demodulation, are the same.

$

F. Proposed Receiver and the Sum–Product Principle

$ . (19)

The BEME for other modulation schemes, e.g., eight-phase shift keying (8PSK) and M -ary quadrature amplitude modulation

The difference between our BEME in (17) and that in [21] and [22] is that we have improved the original EM estimate by using the estimated a priori probability from the SISO decoders rather than simply assuming an equal a priori probability. As pointed in [13, ch. 10 and 11], the use of prior information will always improve the estimation accuracy.

3288

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 57, NO. 5, SEPTEMBER 2008

On the other hand, not only the EXT from the decoder but also the decision statistics from the detector is needed to compute the BEME. Therefore, the sum–product principle has not been followed strictly. Nonetheless, performance improvement can be achieved from the BEME, as we will demonstrate in Section V. This could be due to the fact that the sum–product principle assumes Gaussian interfering signals in the IC-based linear detector. As the assumption is not true, the IC detection could be improved by not exactly following the sum–product principle. The BEME, in general, cannot be obtained by directly replacing the EXT in the SM estimate with the a posteriori LLR. However, for BPSK and QPSK signals, only one bit is mapped to the real and imaginary parts, and the residual interference and noise are circularly symmetric complex Gaussian; we, therefore, have the following: ˜ k,i } = p{ck |X ˜ k,i } p{Xk |X

where Γk,i is a diagonal matrix with elements

⎧ 1, ⎪ ⎨ γp,i =

˘ p,i−1 |2 , p{Xp }|Xp |2 + |X

Xp ∈Ω ⎪ ⎩ ˆ ˆ∗ ˘∗ ˘ −X p,i−1 Xp,i − Xp,i−1 Xp,i

p=k p = k

(22)

˘ p,i being the SM and the BEME of Xp , respectively. ˆ p,i and X with X Note that due to the use of the Bayesian estimate in the IC process, the computation of γp=k,i here is different from that in the conventional LMMSE-IC receivers. The LMMSE-IC decision statistic is given by ˜ ˜ k,i = FH R X k,i k,i = μk,i Xk + ηk,i

(23)

which is an equivalent AWGN model with [15]

for BPSK, and

%

˜ k,i } = p{c1k , c2k |X ˜ k,i,Re , X ˜ k,i,Im } p{Xk |X

˜ k,i Xk∗ } = HH [HΓk,i HH + 2σ 2 I]−1 H μk,i = E{X

˜ k,i,Re , X ˜ k,i,Im } = p{c1k |c2k , X ×p

%

%

˜ k,i,Re , X ˜ k,i,Im c2k |X

& %



log2 M

%

˜ k,i p clk |X

&

and the BEME has to be computed following (17) instead of replacing the EXT with the posterior LLR in the SM estimation in (5), although by doing the latter, some performance gain may also be obtained, as reported in [23]. Therefore, the sum–product principle is followed strictly. The BEME, in general, requires higher computational complexity than the SM estimation. This complexity increase, as demonstrated in Section V, leads to even better performance at the second iteration than the conventional receiver at the fifth iteration. Considering the amount of computation in each iteration, which includes IC, linear filtering, interference estimation, soft demodulation, and SISO decoding, we can easily conclude that great complexity saving is achieved by using the Bayesian receiver.

#

˜ k,i |Xk ) = p(X

Similar to the conventional LMMSE-IC turbo receivers [6], [24], LMMSE filtering can be applied in (2) to further suppress the residual interference in a Bayesian turbo receiver. As proven in [15], the LMMSE filter can be modeled by an equivalent AWGN channel. We can, therefore, obtain the BEME of the signals for the LMMSE-IC turbo receiver in a very straightforward manner. The LMMSE filter Fk,i minimizes the mean square error between the transmitted signal ˜ ˜ k,i = FH R Xk and the filter output X k,i k,i and is obtained from the Wiener–Hopf equation as follows:

&−1

%

˜ k,i E Xk∗ R

= (HΓk,i HH + 2σ 2 I)−1 Hk

˜ k,i − μk,i Xk |2 |X 1 exp − 2 2 2πνk,i 2νk,i

$ .

(26)

˜ k,i as the With (26), we can derive the LMMSE-IC BEME using X observed data, i.e.,

=

& (21)

 Sm

˜ k,i |Xk = Sm )p(Xk = Sm ) p(X Sm . ˜ k,i |Xk=Sn )p(Xk=Sn ) p(X ∈Ω Sn ∈Ω

(27) Specifically, we have

# ˘ k,i,BPSK,LMMSE-IC = tanh X

˜ k,i μk,i X λai (k) + 2 2 νk,i

$ (28)

for BPSK signals and

 ˘ k,i,QPSK,LMMSE-IC X

IV. B AYESIAN LMMSE-IC T URBO R ECEIVER

%

(25)

˜ k,i } ˘ k,i,LMMSE-IC = E{Xk |X X

l=1

˜ k,i R ˜H Fk,i = E R k,i

(24)

and {·}ij denotes the element at the ith row and the jth column. We, therefore, have

&

for QPSK. Hence, their BEME can be obtained by directly replacing the EXT with the posterior LLR in the SM estimate, as suggested in (18) and (19). For higher order modulation signals, e.g., 8PSK and 16-QAM, more than one bit are mapped to the real and imaginary parts. Hence ˜ k,i } = p{Xk |X

kk

2 ˜ k,i } = μk,i − μ2k,i = Var{X νk,i

&

˜ k,i,Re p c2k |X ˜ k,i,Im = p c1k X

&



1 = √ tanh 2

+ j tanh

˜ k,i ) Re(μk,i X λai (k, 1) √ 2 + 2 2νk,i

˜ k,i ) Im(μk,i X λai (k, 2) √ 2 + 2 2νk,i





(29)

for QPSK signals. The BEME for other modulation schemes can be derived in a similar manner. We give the BEME of 8PSK signals in Appendix B. The Bayesian LMMSE-IC turbo receiver operation is summarized in Table II. For comparison, the conventional LMMSE-IC receiver algorithms are also included in Table II. The major differences between the proposed and conventional LMMSE-IC turbo receivers lie in the signal estimation and LMMSE filter coefficient calculation; however, the remaining steps are the same.

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 57, NO. 5, SEPTEMBER 2008

3289

TABLE II BAYESIAN LMMSE-IC TURBO RECEIVER ALGORITHM

Fig. 4. BER performance comparison of the Bayesian IC-MRC and conventional IC-MRC receivers, ZFIS, LMMSE IS, 8 × 4 GSTBC, QPSK, Rc = 3/4, and K = 3.

Fig. 3. BER performance of the Bayesian IC-MRC receiver, 8 × 4 GSTBC, QPSK, Rc = 1/2, and K = 3.

V. N UMERICAL R ESULTS In this section, we present the numerical results and demonstrate the superior performance of the proposed Bayesian turbo receivers by using the EXIT chart and the BER. In all the simulations, the rate Rc = 1/2, constraint length K = 3 convolutional code (CC) with generation function g = (5, 7)octal is used as the mother code. The puncturing pattern in [25] and [26] is followed to generate the higher rate code. A log-MAP decoding algorithm is used in the SISO decoders. The HCA is adopted for the BER simulations, and the VCA is used for the EXIT chart analysis. For OFDM, we set the FFT size to 64 and assign 48 subcarriers for data transmission following IEEE 802.11a [27]. The channel is assumed quasi-static within each packet but randomly and independently varies between the packets. Uniform power delay profiles with 16 independent identically distributed (i.i.d.) complex Gaussian taps are used for the uncorrelated spatial channels corresponding to each transmit–receive antenna pair. A. BER Performance We first present in Fig. 3 the BER performance of the IC-MRC Bayesian MMSE receiver at iterations i = 1, . . . , 5 for an 8 × 4 GSTBC system with Rc = 1/2 and QPSK modulation. Zero-forcing

interference suppression (ZFIS) is used for the initialization. For comparison, the conventional IC-MRC turbo receiver performance and the lower bound of the perfect IC are also depicted. Several observations can be made from Fig. 3. First, the Bayesian receiver has superior performance over the conventional counterpart. Its second iteration has better performance than the conventional receiver’s fifth iteration at low-to-medium SNR values and the same performance at a high SNR. This is because of the improved accuracy in the BMMSE interference estimation, as shown in Section III. With the same number of iterations, an SNR gain of 1.2 dB can be achieved from the Bayesian receiver at iteration five, and BER = 10−5 . The convergence of the Bayesian receiver to the lower bound appears at SNR = 4.5 dB, corresponding to BER = 6 × 10−4 , whereas the conventional receiver cannot converge to the lower bound in the SNR range in our simulation setups. The performance advantage of the Bayesian receiver is more obvious when a punctured code is used, as exhibited in the BER simulation results in Fig. 4 for an Rc = 3/4 QPSK 8 × 4 GSTBC system. BER versus Eb /N0 curves at iterations 1, 3, and 5 are presented for four receivers, namely, conventional IC-MRC receivers with both ZFIS and LMMSE IS initialization and Bayesian IC-MRC receivers with ZFIS and LMMSE IS initialization. From Fig. 4, it is clear that the BER of the conventional receivers gets worse with an increasing number of iterations. We believe that this is due to puncturing in the CC that severely degrades the accuracy in the EXT. The Bayesian receivers, with the compensation of the detector’s decision statistic, have a decreasing BER with an increasing number of iterations. A 5-dB gain can be achieved over the conventional receivers with both ZFIS and LMMSE IS initialization, and more importantly, the Bayesian receiver with LMMSE IS initialization reaches the lower bound at BER = 10−3 and SNR = 6 dB. The BER performance of the Bayesian LMMSE-IC receiver is depicted in Fig. 5 for an 8 × 8 VBLAST system with Rc = 3/4 8PSK. Similar to the IC-MRC Bayesian receivers, its second-iteration performance is better than that of the fifth iteration in the conventional receiver at low-to-medium SNR values due to the improved accuracy in the interference estimation, and at a high SNR, the performance approaches that of the conventional receiver due to the dominance of the a priori information in the Bayesian estimate. For all five iterations, the Bayesian receiver achieves at least a 1-dB gain over

3290

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 57, NO. 5, SEPTEMBER 2008

Fig. 5. BER performance comparison of the Bayesian LMMSE-IC and conventional LMMSE-IC receivers, 8 × 8 VBLAST, 8PSK, Rc = 3/4, and K = 3.

the conventional one. When compared with the lower bound, however, there is still an approximately 2-dB gap to bridge. B. EXIT Chart Analysis We now demonstrate the superior performance of the Bayesian receivers through the EXIT chart analysis. The 4 × 4 fixed channel matrix that is given in [28] is used. The a priori information input to the detector is assumed to be Gaussian distributed, with λai |Ci ∼ N (Ci σa2 /2, σa2 ), as given in [29]. The corresponding input mutual information I a is calculated from σa2 using the following mapping function [29]:

!  

∞

I a = J σa2 = 1 −

z−

+

−∞



1 2πσa2

e

2 σa 2

"2

2 2σa

× log2 [1 + exp(−z)] dz. For the conventional receivers, we use the a priori information, i.e., the EXT, for the interference estimation. For the proposed Bayesian receivers, we use the EXT and the detector decision statistic for the interference estimation. The Gaussian-distributed EXT is generated the same way as in the conventional detectors. The decision statistic is generated from the EXT following the steps outlined as follows. 1) Obtain the decoder output mutual information from the EXT e = J(σe2 ). using Idec 2) Obtain the decoder’s input mutual information from the inverse −1 e a = Tdec (Idec ), where Tdec of the decoder EXIT function as Idec denotes the EXIT function of the decoder. 3) Obtain the decoder’s input a priori information as σa2 = a ). J −1 (Idec 4) Obtain the decision statistic from the decoder’s input a priori information. IC is then performed, and the detector output mutual information is calculated from the output extrinsic LLR as [30] I e (Le ; C) = 1 −

N 1  log2 [1 + exp (−Cn λen )] . N n=1

Fig. 6. Mutual information transfer function comparison of the conventional and Bayesian receivers and the decoding path for the turbo receivers with Rc = 1/2 generation function g = (5, 7)octal CC. Static channel, QPSK modulation, and σ 2 = 0.1256.

Fig. 6 depicts the EXIT chart of the proposed Bayesian and conventional turbo receivers with QPSK modulated signals and the AWGN power of σ 2 = 0.1256. As expected, the Bayesian receiver outperforms its conventional counterpart, and the Bayesian LMMSE-IC receiver has the best performance. From Fig. 6, we also observe that the EXIT functions of the conventional and Bayesian receivers merge at the starting point of a a a = 0 and the ending point of Idet = 1. For 0 < Idet < 1, the Idet a , the larger the difference between the Bayesian smaller the input Idet e values. This is expected, and the conventional detectors’ output Idet a as when Idet = 0, there is no a priori information available, and the a ’s, the two receivers have the same performance; at small input Idet BMMSE interference estimation accuracy is greatly improved by usa increases, ing the additional detector decision statistic, and when Idet the a priori probability becomes increasingly dominant in the BMMSE estimate, making it increasingly close to the a priori estimate. We also illustrate the convergence behavior of the proposed and conventional IC-MRC turbo receivers in Fig. 6. From the trajectories of the conventional and Bayesian receivers, we can see that fewer iterations are needed by the Bayesian receiver to achieve the same performance. Furthermore, performance improvement from the initial iterations in the Bayesian receiver is substantial, which is consistent with the BER results in Section V-A. VI. C ONCLUSION A class of Bayesian MMSE turbo receivers has been presented for coded MIMO systems. Using the EM algorithm, we have proposed a novel application of the turbo principle by making use of the EXT in the posterior Bayesian MMSE signal estimation. We have shown that the Bayesian MMSE estimate of the transmitted signals is a function of the linear detector decision statistic and the EXT from the SISO decoder. The Bayesian MMSE estimate effectively compensates for the inaccuracy that is experienced by the prior SM estimation using only the EXT in conventional turbo receivers. This contributes to the SNR gains at the same BER performance and the fewer number of iterations that are needed to achieve convergence. The EXIT chart analysis has further proved that a significant improvement can be obtained in the detector output mutual information over the conventional receiver.

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 57, NO. 5, SEPTEMBER 2008

When k = p, we have

A PPENDIX A M EAN AND V ARIANCE OF THE I NTERFERENCE AT THE IC-MRC O UTPUT Defining ρkp = HH k Hp , we have

E{˜ vk } = E

⎧ ⎪ nT ⎨ ⎪ ⎩ p=1

nT 

%

E ρkp ρ∗kq

&

=

nR nR 1  δ(g − n)δ(p − q) = nR nT 2 δ(p − q). n2T g=1 n=1

˘ p,i−1 ) + HH ρkp (Xp − X k V

p=k

=

3291

⎫ ⎪ ⎬

For OFDM systems, we have the following FD channel coefficients:

⎪ ⎭

and

⎧/ /2 ⎫ /n / ⎪ ⎪ T / / ⎬ ⎨  % 2& / / ˘ p,i−1 ) + HH vk | = E / ρkp (Xp − X E |˜ k V/ / / ⎪ ⎪ ⎩/ p=1 / ⎭ p=k ⎧/ /2⎫ /n /⎪ ⎪ T /⎬ ⎨/  / /2 / / ˘ / =E / ρkp (Xp − Xp,i−1 )/ + E /HH k V /⎪ ⎪ ⎩// p=1 ⎭ / p=k

⎧/ /2 ⎫ /n / ⎪ ⎪ T / ⎬ ⎨/  / ˘ p,i−1 )// ρkp (Xp − X E / / ⎪ ⎪ ⎩// p=1 / ⎭ p=k ⎧ ⎪ nT nT ⎨ 

E

%

ρkp ρ∗kq

&

=E

n R 

ρkp ρ∗kq (Xp

⎪ ⎩ p=1

=

=

E

ρkp ρ∗kq

&

nR nR  



∗ ˘ p,i−1 ) Xq∗ − X ˘ q,i−1 −X

0



=

nR nR  

∗ ∗ E Hm,g,k Hm,g,p Hm,n,k Hm,n,q

E

L−1 

g=1 n=1

× ×

˘ p,i−1 , and is independent of Xp and X

×

For single-carrier systems in flat-fading channels, assuming that the spatial domain channels are perfectly uncorrelated, we have =E

n R 

∗ Hg,k Hg,p

g=1

=

nR nR  

nR 

! 2π

× exp j =

n=1

E

nR nR  

∗ ∗ Hg,k Hg,p Hn,k Hn,q

%

N

=

%

%

∗ E Hg,k Hn,k nR nR

1  n2T g=1 n=1

&

E

%

∗ Hn,q Hg,p

2π mt N

"

2π mi N

! 2π

exp j

"

N

"



ms

%

E h∗l,g,k hi,n,k ht,g,p h∗s,n,q

&

"

m(l − i − t + s)

nR nR L−1L−1L−1L−1      0 %

E h∗l,g,k hi,n,k

g=1 n=1 l=0 i=0 t=0 s=0

&

∗ ∗ E Hg,k Hg,p E Hn,k Hn,q

g=1 n=1

+

&

h∗s,n,q

    

&

% & % ∗ & +E hs,n,q hi,n,k % &1 × E h∗l,g,k ht,g,p " × E ht,g,p h∗s,n,q

g=1 n=1

=

!

hi,n,k exp −j

g=1 n=1 l=0 i=0 t=0 s=0

 ∗ Hn,k Hn,q

%



"

!

s=0 nR nR L−1L−1L−1L−1

=

E



&

ml

ht,g,p exp −j

t=0 L−1

1

/ /2 / = 2σ 2 gk . E /HH k V

&



N

L−1

i=0 L−1

under the assumption that ρkp

ρkp ρ∗kq

! 2π

h∗l,g,k exp j

l=0

p=1 q=1, p=k q=k

%

 ∗ Hm,n,k Hm,n,q

n=1

⎪ ⎭

∗ ˘ p,i−1 ) Xq∗ − X ˘ q,i−1 E (Xp − X

nR 

%

g=1 n=1

q=1, p=k q=k

%

∗ Hm,g,k Hm,g,p

g=1

⎫ ⎪ ⎬

2π ml) N

where m = 0, 1, 2, . . . , N − 1 is the subcarrier index, N is the FFT size, n = 1, 2, . . . , nR and k = 1, 2, . . . , nT are the receive and transmit antenna indexes, respectively, L is the number of multipaths in the channel corresponding to the transmit–receive antenna pair (k, n), and hl,n,k is the time-domain multipath coefficient. Here, we consider the uniform power delay profile in which each multipath is i.i.d. complex Gaussian with zero mean and variance 1/LnT so that the average signal power at each receive antenna remains the same as the mapper output. We, hence, have, for k = p

with

nT nT  

hl,n,k exp(−

l=0

˘ p,i−1 } + HH ρkp E{Xp − X k E{V} = 0

p=1 p=k

=E



L−1

Hm,n,k =

&

! 2π

× exp j

&

δ(k − p)δ(k − q) + δ(g − n)δ(p − q)

=

1

N

m(l − i − t + s)

    

nR nR L−1L−1L−1L−1

n2T L2 g=1 n=1 l=0 i=0 t=0 s=0

by using the result from [31] that E {Z1∗ Z2∗ Z3 Z4 } = E {Z1∗ Z3 } E {Z2∗ Z4 } + E {Z1∗ Z4 } E {Z2∗ Z3 } if Z1 , Z2 , Z3 , and Z4 are zero-mean stationary complex Gaussian. Here, δ(·) is the Dirac delta function.

!

[δ(l − i)δ(g − n) × δ(t − s)δ(p − q) × +δ(s − i)δ(k − q) × δ(l − t)δ(k − p)]

"

2π × exp j m(l − i − t + s) N nR = 2 δ(p − q). nT

3292

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 57, NO. 5, SEPTEMBER 2008

TABLE III 8PSK GRAY MAPPING TABLE

R EFERENCES

We then obtain the composite interference and the AWGN noise power for the narrow-band single-carrier systems and the wideband OFDM systems as follows: ς2 =

nT ! / / " & 1 nR 1 % ˘ p,i−1 /2 + gk σ 2 . (30) E |˜ μk,i |2 = 1 − /X 2 2 2 p=1 nT p=k

A PPENDIX B BEME FOR 8PSK S IGNALS For 8PSK signals following the Gray mapping rule in Table III, we obtain the BEME in the IC-MRC receiver as follows: ˘ k,i,8PSK,IC-MRC X a x3 ) + b cosh(´ x2 ) sinh(´ x4 )eλi (k,3) a cosh(´ x1 ) sinh(´ = a cosh(´ x1 ) cosh(´ x3 ) + cosh(´ x2 ) cosh(´ x4 )eλi (k,3) λa (k,3) x2 )e i + b cosh(´ x3 ) sinh(´ x1 ) a cosh(´ x4 ) sinh(´ +j i−1 λ (k,3) a cosh(´ x1 ) cosh(´ x3 ) + cosh(´ x2 ) cosh(´ x4 )e

(31)

where ˜ k,i,Im bgk X λa (k, 1) − i 2 ς 2 ˜ k,i,Im agk X λai (k, 1) − x ´2 = ς2 2 ˜ k,i,Re agk X λai (k, 2) − x ´3 = ς2 2 ˜ bgk Xk,i,Re λai (k, 2) − x ´4 = ς2 2 + √ + √ with a = 2+ 2/2 and b = 2− 2/2, and ς 2 is defined in (30). For the LMMSE-IC receiver, the BEME of the 8PSK signal is calculated following (31) as well, but with x ´i , i = 1, . . . , 4, which is defined as follows: x ´1 =

˜ k,i,Im bμk,i X λa (k, 1) − i 2 νk,i 2 a ˜ aμk,i Xk,i,Im λ (k, 1) x ´2 = − i 2 νk,i 2 ˜ k,i,Re aμk,i X λai (k, 2) x ´3 = − 2 νk,i 2 a ˜ bμk,i Xk,i,Re λ (k, 2) x ´4 = − i 2 νk,i 2 x ´1 =

and μk,i and νk,i are defined in (24) and (25), respectively. ACKNOWLEDGMENT The authors would like to thank the reviewers for their valuable comments and Prof. L. K. Rasmussen for discussions.

[1] G. J. Foschini, “Layered space–time architecture for wireless communications in a fading environment when using multi-element antennas,” Bell Labs Tech. J., vol. 1, no. 2, pp. 41–59, 1996. [2] G. J. Foschini, G. D. Golden, R. A. Valenzuela, and P. W. Wolnianski, “Simplified processing for high spectral efficiency wireless communication employing multi-element arrays,” IEEE J. Sel. Areas Commun., vol. 17, no. 11, pp. 1841–1852, Nov. 1999. [3] S. M. Alamouti, “A simple transmit diversity technique for wireless communications,” IEEE J. Sel. Areas Commun., vol. 16, no. 8, pp. 1451–1458, Oct. 1998. [4] V. Tarokh, H. Jafarkhani, and A. R. Calderbank, “Space–time block codes from orthogonal designs,” IEEE Trans. Inf. Theory, vol. 45, no. 5, pp. 1456–1467, Jul. 1999. [5] S. Sun, Y. Wu, Y. Li, and T. T. Tjhung, “A novel iterative receiver for coded MIMO OFDM systems,” in Proc. IEEE Int. Conf. Commun., Jun. 2004, vol. 4, pp. 2473–2477. [6] B. Lu and X. Wang, “Iterative receivers for multiuser space–time coding systems,” IEEE J. Sel. Areas Commun., vol. 18, no. 11, pp. 2322–2335, Nov. 2000. [7] M. Sellathurai and S. Haykin, “Turbo-BLAST for wireless communications: Theory and experiments,” IEEE Trans. Signal Process., vol. 50, no. 10, pp. 2538–2546, Oct. 2002. [8] M. Sellathurai and S. Haykin, “Turbo-BLAST: Performance evaluation in correlated Rayleigh-fading environment,” IEEE J. Sel. Areas Commun., vol. 21, no. 3, pp. 340–349, Apr. 2003. [9] M. Sellathurai and S. Haykin, “T-BLAST for wireless communications: First experimental results,” IEEE Trans. Veh. Technol., vol. 52, no. 3, pp. 530–535, May 2003. [10] E. Biglieri, A. Nordio, and G. Taricco, “Iterative receiver interfaces for coded multiple-antenna signaling,” in Proc. 2nd Int. Workshop Signal Process. Wireless Commun., Jun. 2004, pp. 1–12. [11] J. Boutros and G. Caire, “Iterative multiuser joint decoding: Unified framework and asymptotic analysis,” IEEE Trans. Inf. Theory, vol. 48, no. 7, pp. 1772–1793, Jul. 2002. [12] F. Kschischang, B. Frey, and H.-A. Loeliger, “Factor graphs and the sum– product algorithm,” IEEE Trans. Inf. Theory, vol. 47, no. 2, pp. 498–519, Feb. 2001. [13] S. M. Kay, Fundamentals of Statistical Signal Processing, vol. I. Englewood Cliffs, NJ: Prentice-Hall, 1993. [14] J. M. Mendel, Lessons in Estimation Theory for Signal Processing, Communications, and Control. Englewood Cliffs, NJ: Prentice-Hall, 1995. [15] H. V. Poor and S. Verdú, “Probability of error in MMSE multiuser detection,” IEEE Trans. Inf. Theory, vol. 43, no. 3, pp. 858–871, May 1997. [16] E. Chiavaccini and G. M. Vitetta, “MAP symbol estimation on frequencyflat Rayleigh fading channels via a Bayesian EM algorithm,” IEEE Trans. Commun., vol. 49, no. 11, pp. 1869–1872, Nov. 2001. [17] W. Y. Zou and Y. Wu, “COFDM: An overview,” IEEE Trans. Broadcast., vol. 41, no. 1, pp. 1–8, Mar. 1995. [18] IEEE 802.11 TGn D2.00, Feb. 2007. [Online]. Available: http://www. ieee802.org/11/ [19] G. Caire, G. Taricoo, and E. Biglieri, “Bit-interleaved coded modulation,” IEEE Trans. Inf. Theory, vol. 44, no. 3, pp. 927–946, May 1998. [20] X. Li, H. C. Huang, G. J. Foschini, and R. A. Velenzuela, “Effects of iterative detection and decoding on the performance of BLAST,” in Proc. IEEE Global Telecommun. Conf., 2000, vol. 2, pp. 1061–1066. [21] L. B. Nelson and H. V. Poor, “Iterative multiuser receivers for CDMA channels: An EM-based approach,” IEEE Trans. Commun., vol. 44, no. 12, pp. 1700–1710, Dec. 1996. [22] S. Sun, T. T. Tjhung, and P. H. W. Fung, “Soft-decision based iterative interference cancellation (IIC) for group-wise STBC MIMO systems,” in Proc. IEEE Veh. Technol. Conf.—Spring, Apr. 2003, vol. 2, pp. 984–988. [23] M. Witzke, S. Bäro, F. Schreckenbach, and J. Hagenauer, “Iterative detection of MIMO signals with linear detectors,” in Proc. 36th Asilomar Conf. Signals, Syst., Comput., Nov. 2002, vol. 1, pp. 289–293. [24] X. Wang and H. V. Poor, “Iterative (turbo) soft interference cancellation and decoding for coded CDMA,” IEEE Trans. Commun., vol. 47, no. 7, pp. 1046–1061, Jul. 1999. [25] G. Begin and D. Haccoun, “High-rate punctured convolutional codes: Structure properties and construction technique,” IEEE Trans. Commun., vol. 37, no. 12, pp. 1381–1385, Dec. 1989. [26] G. Begin, D. Haccoun, and C. Paquin, “Further results on high-rate punctured convolutional codes for Viterbi and sequential decoding,” IEEE Trans. Commun., vol. 38, no. 11, pp. 1922–1928, Nov. 1990.

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 57, NO. 5, SEPTEMBER 2008

3293

[27] Part 11: Wireless LAN Medium Access Control (MAC) and Physical Layer (PHY) Specifications: High-Speed Physical Layer in the 5 GHz Band, Sep. 1999. IEEE Std. 802.11a-1999: Supplement to IEEE Std. 802.11-1999. IEEE 802. [28] C. Hermosilla and L. Szczeci´nski, “EXIT charts for turbo receivers in MIMO systems,” in Proc. 7th ISSPA, Jul. 2003, pp. 209–212. [29] S. ten Brink, “Convergence behavior of iteratively decoded parallel concatenated codes,” IEEE Trans. Commun., vol. 49, no. 10, pp. 1727–1737, Oct. 2001. [30] M. Tüchler and J. Hagenauer, “EXIT charts of irregular codes,” in Proc. Conf. Inf. Sci. Syst., Mar. 2002, pp. 748–753. [31] I. S. Reed, “On a moment theorem for complex Gaussian processes,” IEEE Trans. Inf. Theory, vol. IT-8, no. 3, pp. 194–195, Apr. 1962.

A Modified Decorrelated Delay Lock Loop for Synchronous DS-CDMA Systems W. K. Wong, Y. T. Wu, S. H. Leung, and Y. S. Zhu Abstract—This paper presents a modified decorrelated delay lock loop (MD-DLL) for synchronous direct-sequence code-division multiple-access (DS-CDMA) systems under multiuser environments. The structure of the MD-DLL is similar to that of a traditional delay lock loop (T-DLL), aside from the fact that the despreading signals in T-DLL are replaced by a linear combination of all the users’ code sequences with various weightings and timing offsets. The basic principles of MD-DLL are given as follows: 1) to use the decorrelating technique of a multiuser detector to remove the multiple-access interference at the on-time code position instant and 2) to choose a proper timing offset for the code sequences to reduce the weighting imbalance between the two arms of the tracking loop and suppress noise enhancement. The proposed MD-DLL scheme solves the tracking bias problems in a decorrelated delay lock loop under a low-signal-to-noise-ratio (SNR) condition. Computer simulations based on Gold code sequences are used to verify the findings. It is shown that MD-DLL achieves a better and stable S-curve with negligible tracking bias, better rms tracking error and mean-time-to-lose-lock performance under high-SNR conditions, and stronger near–far resistance than T-DLL. Index Terms—Delay lock loop, direct-sequence code-division multiple access (DS-CDMA), multiple-access interference (MAI), synchronization.

I. I NTRODUCTION The direct-sequence code-division multiple-access (DS-CDMA) technique is widely used in present-day communication systems for its well-known advantages, such as antijam and multipath rejection. An important issue for the successful operation of a DS-CDMA system is the code synchronization between the locally generated despreading sequence and the incoming sequence. The synchronization process consists of two stages: 1) code acquisition, which performs the coarse code phase alignment, and 2) code tracking, which maintains fine phase alignment. In this paper, we focus on the code tracking part. Manuscript received April 11, 2007; revised September 3, 2007 and December 4, 2007. This work was supported by the City University of Hong Kong under Project 7001855-570. The review of this paper was coordinated by Dr. X. Wang. W. K. Wong and S. H. Leung are with the Department of Electronic Engineering, City University of Hong Kong, Kowloon, Hong Kong (e-mail: [email protected]; [email protected]). Y. T. Wu is with the Department of Electronic Engineering, City University of Hong Kong, Kowloon, Hong Kong, and also with The Key Labs of Integrated Microsystems, Shenzhen Graduate School, Peking University, Shenzhen 518055, China (e-mail: [email protected]). Y. S. Zhu is with The Key Labs of Integrated Microsystems, Shenzhen Graduate School, Peking University, Shenzhen 518055, China (e-mail: [email protected]). Digital Object Identifier 10.1109/TVT.2008.915516

Fig. 1. Block diagram of (a) T-DLL and (b) MD-DLL.

The traditional delay lock loop (T-DLL) shown in Fig. 1(a) has been a well-proven technique for code tracking in DS-CDMA systems since its introduction in the 1960s [1]. T-DLL requires simple hardware implementation, and its performance is close to that of a maximumlikelihood estimator under an additive white Gaussian noise channel and single-user environment [2]. In a multiuser environment, the use of nonorthogonal code sequences causes multiple-access interference (MAI). Hong et al. [3] and Caffery and Stüber [4] have analyzed the effects of MAI on T-DLL with a Gaussian approximation for MAI and a simple approximation for the average aperiodic cross-correlation function between two sequences, respectively. Both have shown that the performance of T-DLL is highly degraded under strong MAI. It has also been demonstrated that the S-curve (i.e., the discriminator characteristic) is severely distorted, and the loop may become unstable for certain combinations of sequences [5], [6]. In addition, Wong [6] pointed out that a noticeable tracking bias may be created under imbalanced MAI. It has been shown that a 0.2-chip tracking bias would cause noticeable degradation in most multiuser receivers [7], [8]. Interference cancellation techniques, such as Wang et al.’s successive interference cancellation [9] and Latva-aho and Lilleberg’s parallel interference cancellation [10], have been applied to T-DLL to combat MAI. Both require accurate estimation before synchronization, and the hardware complexity increases with the number of interferers. A decorrelated delay lock loop (D-DLL) with simple hardware complexity has been proposed in [6]. It uses the decorrelating concept of the multiuser detector to solve the tracking bias and the stability problems under a high-signal-to-noise ratio (SNR) environment. However, a noticeable tracking bias may occur in D-DLL under low-SNR conditions. In this paper, we propose a modified D-DLL (MD-DLL) to combat the effects of MAI and achieve a negligible tracking bias over the SNR range of practical interest. The structure of MD-DLL is similar to that of T-DLL, aside from the fact that the despreading signals in T-DLL are replaced by a linear combination of weighted and shifted versions of all the users’ code sequences. The hardware complexity is comparable to that of T-DLL and is independent of the number of interferers. Here, we focus on the S-curve, rms tracking error, and mean-timeto-lose-lock (MTLL) performance of a first-order loop. The analysis

0018-9545/$25.00 © 2008 IEEE