Machine Learning Based Models For Spectrum Sensing in Cooperative Radio Networks
Machine Learning Based Models For Spectrum Sensing in Cooperative Radio Networks
Caio Henrique Azolini Tavares1, Jose Carlos Marinello1, Mario Lemes Proenca Jr1, Taufik Abrao1
1Department of Electrical Engineering, Londrina State University, Rod. Celso Garcia Cid - PR445, PO Box 10.011, CEP: 86057-970, Londrina,
PR, Brazil
E-mail: [email protected]
Abstract: In this study, the authors consider the application of machine learning (ML) models in cooperative spectrum sensing
of cognitive radio networks (CRNs). Based on a statistical analysis of the classic energy detection scheme, the probability of
detection and false alarm is derived, which depends solely on the number of samples and signal-to-noise ratio of the secondary
users. The channel occupancy detection obtained from the established analytical techniques such as maximum ratio combining
and AND/OR rules is compared to different ML techniques, including multilayer perceptron (MLP), support vector machine, and
Naive Bayes, based on receiver operating characteristic and area under the curve metrics. By using standard profiling tools,
they obtain the computational performance of the analysed models during the training phase, a critical step for operating in
CRNs. Ultimately, the results demonstrate that the MLP ML technique presents a better trade-off between training time and
channel detection performance.
where hi is the channel coefficient from the PU to the ith SU, x(k) K
λ = 2Γ−1 (9)
∗
u Pfa,
is the transmitted PU signal, and ni(k) is the noise at the ith SU 2
receiver considered to be a zero-mean Gaussian random variable
with variance σn2 . where Γu(x, n) is the upper incomplete Gamma function, defined as
The channel coefficient hi is described by the path-loss and ∞
fading components Γu(x, n) =
1
Γ(x) ∫ n
t x − 1e− tdt (10)
α
hi = giDi
−
2 (2)
∞
Pdi = ∫λ
f (yi H1) dy ≜ QΓ λ; K /2, 2(1 + γi) (11)
λ K
Pdi = Γu ,
2(1 + γi) 2 (12)
3 Conventional SS–CRN techniques Fig. 1 Example of a CR scenario training phase. The SUs provide the
fusion center with y ∈ ℝM × 1 energy samples, while the PU provides the
So far, the discussed methodology for the ith SU to assess the
corresponding channel status vector d
presence or absence of a PU involves only the estimated energy
level on the ith SU. Now, we shall enumerate and briefly analyse
classical analytical methods of cooperative SS, where the estimated
energy level on all SUs are transmitted through a service layer to a
fusion centre, which will apply deterministic decision rules aiming
at deciding cooperatively whether a PU is present or not by
combining the SU individual decisions. In the following, the three
main deterministic decision rules for SS cooperative networks,
namely AND, OR, and maximum ratio combining (MRC) rules are
revisited.
4 ML techniques
3104 IET Commun., 2020, Vol. 14 Iss. 18, pp. 3102-3109
© The Institution of Engineering and Technology 2020
17518636, 2020, 18, Downloaded from https://ietresearch.onlinelibrary.wiley.com/doi/10.1049/iet-com.2019.0941 by UFF - Universidade Federal Fluminense, Wiley Online Library on [10/09/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
where w ∈ ℝN × 1 optimally separates the two classes or channel
status by minimising an error function E(h(Ytrain), d). The decision
region of (16) can also be interpreted as the hyperplane h(y) = 0.
With this in mind, we can write down the general goal of a
supervised ML problem:
M
1
M i∑
minimise E(h(yi), di) (17)
w, b =1
N δ j = σ ′ a(1)
j ∑ wθ j δθ
(2)
(27)
(1)
oj = σ ∑ wi j yi(1)
(21) θ
i=0
where σ ′( ⋅ ) is the derivative of the activation function σ( ⋅ ).
and similarly for the output neurons Once we have the errors for every neuron, we can easily obtain
the derivative
N
oθ(2) = σ ∑ w jθ o j
(2) (1)
(22) ∂Em
j=0 = δ joi (28)
∂wi j
in both cases, y0 and o0(1) are known as bias inputs and are equal to Finally, once the neural network is trained, the channel status
1. They are necessary to shift the activation function away from the inference for an unseen example is made based on
origin.
By considering a training vector of desired binary outputs H1, if o ≥ 1 − Pfa∗
^
d ∈ ℝM × 1 with dm ∈ {0; 1} for M input vectors y, we can interpret Smlp = (29)
H0, otherwise
the output o of the neural network (22) as the conditional
M
where
∂ℒ
∂b
=0→ ∑ αmdm = 0 (42)
1 dm + 1
m=1 pm = , tm = .
1+e Ah( ym) + B 2
∂ℒ
= 0 → αm = ξ − μm (43)
∂δm Finally, the channel status inference using the SVM approach
results
From (40)–(43) we have the following set of Karush–Kuhn–
Tucker conditions:
^
^ H1, if P(h(y)) ≥ 1 − Pfa∗
Ssvm = (50)
H0, otherwise
αm ≥ 0 (44a)
Table 2 AUC results for AWGN and Rayleigh channels Fig. 7 ROC curves for the different techniques under Rayleigh fading
Technique AUC AWGN AUC Rayl. channel
(a) ROC curve, (b) Zoom into the ROC 0.9Pd / 0.1Pfa interest region
AND 0.7186 0.7370
OR 0.9302 0.9256
MRC 0.9616 0.9240
NB 0.9594 0.9244
SVM-linear 0.9613 0.9360
SVM-Gaussian 0.9604 0.9327
MLP 0.9609 0.9359
Bold values indicate the high performance for the SS method, i.e. AUC > 0.96
(AWGN channels) or AUC > 0.93 (Rayleigh channels).
8 References
[1] Chen, Y., Oh, H.-S.: ‘A survey of measurement-based spectrum occupancy
modeling for cognitive radios’, IEEE Commun. Surv. Tutor., 2014, 18, (1), pp.
848–859
[2] Sun, Z., Bradford, G.J., Laneman, J.N.: ‘Sequence detection algorithms for
PHY-layer sensing in dynamic spectrum access networks’, IEEE J. Sel. Top.
Signal Process., 2010, 5, (1), pp. 97–109
[3] FCC: ‘Spectrum policy task force’. ET Docket No. 02-155, November 2002
[4] Mitola, J., Maguire, G.Q.: ‘Cognitive radio: making software radios more
personal’, IEEE Pers. Commun., 1999, 6, (4), pp. 13–18
[5] Tseng, F.-h., Chou, L.-d., Chao, H.-c., et al.: ‘Ultra-dense small cell planning
using cognitive radio network toward 5G’, IEEE Wirel. Commun., 2015, 22,
(6), pp. 76–83
[6] Jia, M., Gu, X., Guo, Q., et al.: ‘Broadband hybrid satellite-terrestrial
communication systems based on cognitive radio toward 5G’, IEEE Wirel.
Commun., 2016, 23, (6), pp. 96–106
[7] Chae, S.H., Jeong, C., Lee, K.: ‘Cooperative communication for cognitive
Fig. 9 Time spent during training phase for different sizes of training sets satellite networks’, IEEE Trans. Commun., 2018, 66, (11), pp. 5140–5154
[8] Li, B., Fei, Z., Chu, Z., et al.: ‘Robust chance-constrained secure transmission
The computational metrics were evaluated on a consumer for cognitive satellite–terrestrial networks’, IEEE Trans. Veh. Technol., 2018,
67, (5), pp. 4208–4219
laptop with Intel i7 5500U @2.4 GHz and 16 GB DDR3 @1600 [9] Alpaydin, E.: ‘Introduction to machine learning’ (The MIT Press, USA, 2014,
MHz RAM. Based solely on the results from Table 3 and the AUC 3rd edn.)
metrics from Table 2, the MLP achieves the best performance- [10] Boutaba, R., Salahuddin, M.A., Limam, N., et al.: ‘A comprehensive survey
computational complexity trade-off operating in both AWGN or on machine learning for networking: evolution, applications and research
opportunities’, J. Internet Serv. Appl., 2018, 9, (1), p. 16. Available at https://
Rayleigh channels when compared to the other ML models. doi.org/10.1186/s13174-018-0087-2
Nonetheless, it is worth noting that besides time spent training, [11] Arjoune, Y., Kaabouch, N.: ‘A comprehensive survey on spectrum sensing in
the ML techniques have the requirement of cooperation from the cognitive radio networks: recent advances, new challenges, and future
PU to provide the fusion centre with the channel status vector, research directions’, Sensors, 2019, 19, (126), pp. 1–32. Available at https://
doi.org/10.3390/s19010126
which must be taken into account, a system complexity of [12] Thilina, K.M., Choi, K.W., Saquib, N., et al.: ‘Machine learning techniques
implementing such models. for cooperative spectrum sensing in cognitive radio networks’, IEEE J. Sel.
On the other hand, while the MRC technique provides almost Areas Commun., 2013, 31, (11), pp. 2209–2221
instantaneous channel status inference based on energy samples, it [13] Azmat, F., Chen, Y., Stocks, N.: ‘Analysis of spectrum occupancy using
machine learning algorithms’, IEEE Trans. Veh. Technol., 2015, 65, (9), pp.
requires the SUs transmit to the fusion centre in a completely 6853–6860
centralised way their estimated SNR, which can also pose a further [14] Li, Z., Wu, W., Liu, X., et al.: ‘Improved cooperative spectrum sensing model
challenge for implementation while reducing the overall spectral based on machine learning for cognitive radio networks’, IET Commun.,
efficiency of the CRN. 2018, 12, (19), pp. 2485–2492
[15] Bkassiny, M., Li, Y., Jayaweera, S.K.: ‘A survey on machine-learning
techniques in cognitive radios’, IEEE Commun. Surv. Tutor., 2012, 15, (3), pp.
6 Discussion and final remarks 1136–1159
[16] Shah, H.A., Koo, I.: ‘Reliable machine learning based spectrum sensing in
In this study, we compared the use of ML models, namely the cognitive radio networks’, Wirel. Commun. Mob. Comput., 2018, 2018, p. 17,
SVM with linear and Gaussian kernel functions, a feed-forward no. ID 5906097. Available at https://doi.org/10.1155/2018/5906097
[17] Lu, Y., Zhu, P., Wang, D., et al.: ‘Machine learning techniques with
neural network with one hidden layer and NB method with those probability vector for cooperative spectrum sensing in cognitive radio
well-established analytical models in the context of SS problem for networks’. 2016 IEEE Wireless Communications and Networking Conf.,
CRs. Doha, Qatar, April 2016, pp. 1–6
Numerical results demonstrated that the studied models are [18] Umar, R., Sheikh, A.U.H., Deriche, M.: ‘Unveiling the hidden assumptions of
energy detector based spectrum sensing for cognitive radios’, IEEE Commun.
proved suitable for the task of channel status inference from energy Surv. Tutor., 2013, 16, (2), pp. 713–728
samples obtained from multiple SUs. Based on the ROC curve and [19] Bishop, C.M.: ‘Pattern recognition and machine learning’ (Springer, USA,
AUC metrics, we conclude that all ML models performed closely to 2006)
the optimum MRC analytical technique under AWGN channels. [20] Rosenblatt, F.: ‘The perceptron: a probabilistic model for information storage
and organization in the brain’, Psychol. Rev., 1958, 65, (6), pp. 386–408
Interestingly enough, under Rayleigh flat fading channels, all [21] Hastie, T., Tibshirani, R., Friedman, J.: ‘The elements of statistical learning’
ML techniques outperformed the MRC due to varying SNR levels (Springer Inc., New York, 2001, 2nd edn.)
on SUs over each sensing period. It becomes clear that MRC needs [22] Shawe-Taylor, J., Cristianini, N.: ‘Kernel methods for pattern analysis’
to perform channel response estimation in order to be able to (Cambridge University Press, New York, NY, USA, 2004)
[23] Haykin, S.S.: ‘Neural networks and learning machines’ (Pearson Education,
perform well on fading channels, which could be a challenge under USA, 2009, 3rd edn.)
CRNs. [24] Platt, J.C.: ‘Probabilistic outputs for support vector machines and
By using standard profiling tools, we were able to obtain comparisons to regularized likelihood methods’, in Alexander, J., Smola, P.B.,
computational performance metrics for each ML model evaluated Bernhard, S., et al. (Eds.): ‘Advances in large margin classifiers’ (MIT Press,
USA, 1999), pp. 61–74
during the training and inference phases. For small CRNs context,
the results demonstrated an advantage of MLP technique followed