Robust Channel Invariant Deep Noncooperative Spectrum Sensing
Robust Channel Invariant Deep Noncooperative Spectrum Sensing
3, MARCH 2023
Abstract—Deep learning (DL) has been introduced to cogni- Recently, deep learning (DL) technology has been widely
tive radio network to solve the problem of spectrum scarcity adopted in communications and signal processing as it can
and further enhance the spectrum utilization. However, many extract higher levels of representations of the received signals.
DL-based spectrum sensing methods are sensitive to the environ-
ment, which means the sensing model needs to be re-trained with
For example, in [7], a two-step DL-based spectrum sensing
a large number of labelled samples in a new environment. In this framework was introduced, which is applicable to several types
letter, we propose a novel DL-based channel environment-robust of modulations.
spectrum sensing network named ER-SNet, which contains the Generally, most of DL-based spectrum sensing methods out-
encoder part extracting channel invariant features and the clas- perform traditional feature-based methods in scenarios where
sifier part for true hypothesis prediction. Extensive simulations they are trained. However, the environment of electromag-
have been conducted to show the performance improvement and
robustness of the proposed algorithm in sensing weak signals
netic wave propagation is considerably diverse, which means
over different channel conditions. the received signal distribution is highly impacted. Hence, the
sensing model trained well under one environment may work
Index Terms—Deep learning, spectrum sensing, channel
poorly in another one. One approach to address the challenge
robustness, cognitive radio network.
of the environment variations is to gather sufficient labeled
samples in various contexts, which is difficult to realize in
I. I NTRODUCTION practice. In [8], the authors exploited the cross-correlation
OWADAYS, the enormous growth of mobile terminal property of the cyclic prefix of orthogonal frequency-division
N device connectivity has led to increasing demand for
spectrum resources and a severe shortage of spectrum. The
multiplexing (OFDM) signals to sense the full-duplex OFDM
system via matching network (MN). However, the proposed
cognitive radio network (CRN) has shown tremendous poten- scheme is only applicable to a certain type of signal. In [9],
tial in tackling new requirements and challenges brought by the authors introduced adversarial learning strategy for deal-
the fifth generation (5G) mobile communication system [1]. ing with performance degradation with untrained testing SNR
Spectrum sensing performs critical task in cognitive radio sets, but it requires training SNR labels to extract the SNR-
(CR) technology, allowing a secondary user (SU) to detect invariant features. In [10], the authors proposed a deep transfer
and access idle spectrum opportunistically [2]. cooperative sensing (DTCS) approach to enhance the adap-
Energy detection (ED) is a traditional approach with the tation ability, but it could only obtain good performance on
lowest computational complexity, but it requires the estimation source and target domains rather than unknown domains.
of the noise power, and performs poorly at low signal-to- In [11], the authors investigated the phase difference (PD)
noise ratio (SNR) levels [3]. In [4], the matched-filter detection distribution of the received samples and proposed a deep
method has been adopted to achieve better performance, but it neural network (DNN)-based spectrum sensing approach that
requires prior information of primary user (PU) which is usu- is robust to noise uncertainty. However, the computation of
ally not available. In [5], a cyclo-stationary feature detection PD distribution would be highly affected by the sampling
method was proposed by leveraging the intrinsic periodicity of length.
the modulated signal. However, it has high computational com- Motivated by the aforementioned obstacles, we propose
plexity. As compared to the traditional approaches, machine an end-to-end spectrum sensing network named ER-SNet
learning (ML)-based spectrum sensing methods could obtain to improve the model generalization ability for untrained
higher detection accuracy [6], but it requires huge dataset, channel conditions. The contributions are summarized as
and the detection performance is affected by feature selection. follows:
• We propose a non-cooperative spectrum sensing scheme
Manuscript received 8 November 2022; accepted 9 December 2022. Date of ER-SNet, which is applicable for detecting the presence
publication 15 December 2022; date of current version 9 March 2023. This of unknown PU signals in new propagation environments.
work was supported in part by the Temasek Laboratories and Rapid-Rich
Object Search (ROSE) Lab, NTU, Singapore. The associate editor coordinat- • ER-SNet operates on complexed received signals, with
ing the review of this article and approving it for publication was R. Wang. advantage of extracting underlying features than manual
(Corresponding author: Kah Chan Teh.) extraction. The core idea is to extract the channel-
Zhengyang Su, Kah Chan Teh, and Alex C. Kot are with the School
of Electrical and Electronic Engineering, Nanyang Technological University,
insensitive features that are discriminative for PU state
Singapore 639798 (e-mail: [email protected]; [email protected]; but robust to channel variation to some extent.
[email protected]). • The effectiveness of ER-SNet is demonstrated via sim-
Sirajudeen Gulam Razul is with the Temasek Laboratories, Nanyang ulations. It can achieve better sensing performance as
Technological University, Singapore 637553 (e-mail: esirajudeen@
ntu.edu.sg). compared to other baseline schemes including DTCS,
Digital Object Identifier 10.1109/LWC.2022.3229491 DetectNet, TF-ResNet, etc.
2162-2345
c 2022 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See https://www.ieee.org/publications/rights/index.html for more information.
thorized licensed use limited to: AMRITA VISHWA VIDYAPEETHAM AMRITA SCHOOL OF ENGINEERING. Downloaded on November 26,2023 at 22:20:06 UTC from IEEE Xplore. Restrictions app
SU et al.: ROBUST CHANNEL INVARIANT DEEP NONCOOPERATIVE SPECTRUM SENSING 437
thorized licensed use limited to: AMRITA VISHWA VIDYAPEETHAM AMRITA SCHOOL OF ENGINEERING. Downloaded on November 26,2023 at 22:20:06 UTC from IEEE Xplore. Restrictions app
438 IEEE WIRELESS COMMUNICATIONS LETTERS, VOL. 12, NO. 3, MARCH 2023
feature space. In addition, B stands for the batch size, and m Algorithm 1 Environment-Robust Spectrum Sensing Scheme
denotes the margin value that specifies the separability of each (ER-SNet)
layer’s embedding features. Pre-processing: Collect training samples and constitute the dataset
Ds = {X , Ψ, D}
The classifier H is constituted with transposed CNN layers Initialize: Set θe , θh with random initial weights
and a dense layer. Represented by Hθh , where θh denotes 1: Repeat
the parameters of layers in the classifier H, the output of the 2: Compute the total loss J (θe , θh ) with (8);
3: Compute the backward gradients;
network can be expressed as 4: Update the network’s weights θe , θh by employing
⎡ ⎤ gradient descent method Adam;
Hθh Eθe |H0 (x k ) 5: Until accuracy with no improvement;
Hθh Eθe (x k ) = ⎣ ⎦ 6: Return the probability of each hypothesis, i.e., fθ|H0 (x ), fθ|H1 (x ),
Hθh Eθe |H1 (x k ) and the well-trained optimal parameters θe∗ , θh∗ .
Prediction: Evaluation of samples under untrained channel conditions
fθ|H0 (x k ) utilizing the decision threshold η with (12).
= , (5)
fθ|H1 (x k )
with
Thus, the reordered sequence with V corresponding classi-
fθ|H0 (x k ) + fθ|H1 (x k ) = 1, (6) fication probabilities of hypothesis H1 is given by:
where fθ|Hi (x k )(i = 0, 1) is the output probability of each fθ∗ |H1 (x̂ 1 ), fθ∗ |H1 (x̂ 2 ), . . . , fθ∗ |H1 (x̂ V ) . (11)
hypothesis with hyper-parameters θ of the whole network. We Lastly, the decision threshold can be obtained by
adopt the CE loss function for classification according to the
maximum likelihood criterion, as given by η = fθ∗ |H1 x̂ floor (V ∗(1−Pfa )) , (12)
1
B where the function floor (·) rounds down to the nearest integer.
Jcross (θe , θh ) = − bk · log fθ|H1 (x k ) + Thus, we can regulate Pfa to be our desired value, and the PU
B
k =1
state will be determined by equations (2) and (12).
(1 − bk ) · log 1 − fθ|H1 (x k ) , (7) Hence, the proposed scheme with training and testing
procedures is summarized in Algorithm 1.
where bk represents the class label of the k th sample x k . To
balance the portion of the contrastive loss and classification IV. S IMULATION R ESULTS AND A NALYSIS
loss, equations (4) and (7) are normalized and weighted by γ In this section, we will present the parameters of the dataset
and 1 − γ, respectively. Hence, the total loss of the ER-SNet in detail, and analyze the complexity of our proposed scheme.
can be expressed as: Following that, simulation results and analysis including
robustness to channel variations will be presented.
J (θe , θh ) = γJcon (θe ) + (1 − γ)Jcross (θe , θh ). (8)
thorized licensed use limited to: AMRITA VISHWA VIDYAPEETHAM AMRITA SCHOOL OF ENGINEERING. Downloaded on November 26,2023 at 22:20:06 UTC from IEEE Xplore. Restrictions app
SU et al.: ROBUST CHANNEL INVARIANT DEEP NONCOOPERATIVE SPECTRUM SENSING 439
TABLE I
C HANNEL S PECIFICATIONS
B. Complexity Analysis
In this section, we use a measurement known as multiply-
accumulate operations (MACC) [15] to analyze the complexity
of the proposed approach. Specifically, the ER-SNet mainly
consists of multiple convolutional, deconvolutional, and dense
layers that may significantly increase the system complexity.
The complexity of 1D-CNN layer is O(Cin · Cout · ks · Lout ),
where Cin and Cout represent the number of input chan-
nel and output channel, respectively. ks stands for the kernel
size, and Lout denotes the output length. Besides, deconvolu-
tional layers contribute similar complexity expression as that
of 1D-CNN layers, namely, O(Cin · Cout · ks · Lin ), where Fig. 3. Comparisons of detection performance between training and
Lin is the input length of the layer. Lastly, fully connected prediction at Pfa = 0.1.
layers incur the computational complexity of O(Min · Md ),
where Min denotes the input dimension of the dense layer,
and Md represents the number of neurons. Overall, the MACC
those of other methods since the encoder E in ER-SNet can
of the proposed ER-SNet in one iteration is approximately
be viewed as a domain augmentation step, which has the
6.58 million for one sample, which is about 17% more than
ability to deal with the channel variations and help reduc-
that of DetectNet [16]. More intuitively, it takes 0.056 ms for
ing the channel-dependency of the extracted features. Hence,
ER-SNet to make a decision for one sample. As compared
for prediction purposes, it will have a relatively small accu-
to DTCS, ER-SNet can obtain the Pd improvement and the
racy drop for Rician testing channels. Although DTCS could
robustness to the communication environments at the expense
perform relatively better than TF-ResNet and DetectNet, it
of around 30% more prediction time per sample.
requires re-training the model to set the specified target domain
in order to obtain good adaptation performance to unknown
C. Performance Evaluation channel conditions. Similarly, utilizing the strategy of transfer
In the simulations, we compare the performance of learning, TF-ResNet needs a small portion of labeled samples
our proposed ER-SNet to that of deep learning-based or under the new channel condition for re-training to obtain sim-
other conventional spectrum sensing approaches, includ- ilar level of probability of detection Pd as training stage. By
ing DetectNet [16], transfer learning-based ResNet (TF- contrast, the channel variations will cause around 25% drop
ResNet) [17], DTCS [10], and ED [18]. To ensure a fair in detection performance of DetectNet over Rician channel
performance evaluation of each scheme, the training dataset model. Besides, even within the same Rayleigh channel model
is maintained consistent. It should be noted that the optimal as that of training dataset, the minor changes such as multi-
hyper-parameters of each network have been obtained through path delays and gains in the channel profile will also cause an
extensive numerical simulations. accuracy drop of around 10%. In general, the proposed ER-
As illustrated in Fig. 2, the accuracy drop at SNR=−12dB SNet can perform better in untrained environments without
caused by channel model variations can be clearly observed. re-training the model.
The ED scheme will be excluded for comparisons in Fig. 2 In Fig. 3, the probabilities of detection of various spec-
since the conventional approach does not require offline train- trum sensing approaches under different channel conditions are
ing phase. Note that the DetectNet and ResNet are typical plotted against different SNR levels at Pfa = 0.1. Assuming
DL-based methods, and it is difficult to derive a general {Dt_Rayleigh} is the target domain for transfer learning-based
function that can extract the common feature of training sam- methods, and {Dt_Rician (K = 4)} is the unknown domain.
ples with different channel models. The training accuracy of It is shown that the proposed ER-SNet performs the best for
the proposed ER-SNet is around 7% to 14% higher than target Rayleigh channel regardless of the SNR, indicating the
thorized licensed use limited to: AMRITA VISHWA VIDYAPEETHAM AMRITA SCHOOL OF ENGINEERING. Downloaded on November 26,2023 at 22:20:06 UTC from IEEE Xplore. Restrictions app
440 IEEE WIRELESS COMMUNICATIONS LETTERS, VOL. 12, NO. 3, MARCH 2023
R EFERENCES
Fig. 4. ROC curves at SNR = −12dB for prediction over Rician channel: [1] J. Lunden, V. Koivunen, and H. V. Poor, “Spectrum exploration and
{Dt_Rician (K = 4)}. exploitation for cognitive radio: Recent advances,” IEEE Signal Process.
Mag., vol. 32, no. 3, pp. 123–140, May 2015.
[2] Y. Chen and H.-S. Oh, “A survey of measurement-based spectrum occu-
pancy modeling for cognitive radios,” IEEE Commun. Surveys Tuts.,
efficiency of the proposed approach. Under low SNR levels, vol. 18, no. 1, pp. 848–859, 1st Quart., 2016.
Pd of ER-SNet is around 8% to 13% higher than that of other [3] H. Urkowitz, “Energy detection of unknown deterministic signals,” Proc.
IEEE, vol. 55, no. 4, pp. 523–531, Apr. 1967.
DL techniques. Besides, without re-training the experiment [4] F. Salahdine, H. E. Ghazi, N. Kaabouch, and W. F. Fihri, “Matched
using large amounts of labelled samples, DTCS and DetectNet filter detection with dynamic threshold for cognitive radio networks,” in
will suffer a significant degradation of Pd when predicting Proc. Int. Conf. Wireless Netw. Mobile Commun. (WINCOM), Oct. 2015,
under the Rician channel model. If a small portion of labelled pp. 286–291.
[5] K. Sherbin and V. Sindhu, “Cyclostationary feature detection for spec-
samples in the unknown domain are considered, TF-ResNet trum sensing in cognitive radio network,” in Proc. Int. Conf. Intell.
could maintain comparable performance. Although the prob- Comput. Control Syst. (ICCS), May 2019, pp. 1250–1254.
ability of detection of ED is not affected by prediction, it [6] K. M. Thilina, K. W. Choi, N. Saquib, and E. Hossain, “Machine
learning techniques for cooperative spectrum sensing in cognitive radio
performs the poorest in general over both Rayleigh and Rician networks,” IEEE J. Sel. Areas Commun., vol. 31, no. 11, pp. 2209–2221,
channel. Nov. 2013.
Lastly, Fig. 4 shows the receiver operating characteristic [7] Z. Su, K. C. Teh, S. G. Razul, and A. C. Kot, “Deep non-cooperative
(ROC) curve, which is a graph that depicts the performance spectrum sensing over Rayleigh fading channel,” IEEE Trans. Veh.
Technol., vol. 71, no. 4, pp. 4460–4464, Apr. 2022.
of the binary classification model for unknown Rician channel [8] Q. Cheng, Z. Shi, and J. Yuan, “Spectrum sensing in full-duplex OFDM
model at a specific SNR level with two metrics, Pd and Pfa . systems using one-shot learning,” in Proc. Int. Conf. Commn. (ICC),
By adjusting Pfa along x-axis in log scale, we can visualize Jun. 2021, pp. 1–6.
Pd along y-axis at SNR of −12dB. It is observed that the [9] C. Wang, Y. Xu, Z. Chen, J. Tian, P. Cheng, and M. Li, “Adversarial
learning-based spectrum sensing in cognitive radio,” IEEE Wireless
proposed ER-SNet outperforms other schemes at any level of Commun. Lett., vol. 11, no. 3, pp. 498–502, Mar. 2022.
Pfa . When Pfa = 0.1, the Pd of ER-SNet can reach over [10] L. Li, H. Jiang, and H. He, “Deep transfer cooperative sensing
70%, whereas the Pd of other schemes range between 22% in cognitive radio,” IEEE Wireless Commun. Lett., vol. 10, no. 6,
pp. 1354–1358, Jun. 2021.
to 60%. In addition, we consider the imperfect situation with [11] Y. Wang et al., “Deep neural network-based robust spectrum sensing:
noise uncertainty (NU), which may be inevitable in a real- Exploiting phase difference distribution,” in Proc. Int. Conf. Commn.
world scenario. In the case of NU, the noise power varies over (ICC), Jun. 2021, pp. 1–7.
time, and the estimated noise power σ̂n2 follows a uniform dis- [12] X. Fu, Z. Feng, Y. Zhang, Q. Zhang, and W. Xu, “Phase difference
tribution over [ 1ε σn2 , εσn2 ], where ε denotes the NU factor and
variance based low complexity spectrum sensing scheme,” in Proc. IEEE
Global Commun. Conf. (GLOBECOM), Dec. 2015, pp. 1–6.
σn2 represents the actual noise power. In our simulations, ε is [13] T. J. O’Shea and N. West, “Radio machine learning dataset generation
set to 2dB, namely, ε = 1.57, which is denoted as “NU-2dB” with GNU radio,” in Proc. GNU Radio Conf., Jan. 2016, pp. 1–6.
in Fig. 4. It is observed that the ED-based scheme is sensitive [14] E. Perenda, S. Rajendran, G. Bovet, S. Pollin, and M. Zheleva, “Learning
the unknown: Improving modulation classification performance in
to NU because it implements the blind detection through the unseen scenarios,” in Proc. IEEE INFOCOM Conf. Comput. Commun.,
estimation of noise power. Contrarily, other schemes do not May 2021, pp. 1–10.
require prior knowledge about noise power for blind sensing. [15] V. Sze, Y.-H. Chen, T.-J. Yang, and J. S. Emer, “Efficient processing
of deep neural networks: A tutorial and survey,” Proc. IEEE, vol. 105,
These results have validated the detection reliability and effi- no. 12, pp. 2295–2329, Dec. 2017.
ciency of our proposed ER-SNet regardless of the presences [16] J. Gao, X. Yi, C. Zhong, X. Chen, and Z. Zhang, “Deep learning
of NU. Additionally, the proposed ER-SNet could be extended for spectrum sensing,” IEEE Wireless Commun. Lett., vol. 8, no. 6,
to the generalization of SNR and signal types. pp. 1727–1730, Dec. 2019.
[17] S. Zheng, S. Chen, P. Qi, H. Zhou, and X. Yang, “Spectrum sens-
ing based on deep learning classification for cognitive radios,” China
V. C ONCLUSION Commun., vol. 17, no. 2, pp. 138–148, Feb. 2020.
[18] Y.-C. Liang, Y. Zeng, E. C. Y. Peh, and A. T. Hoang, “Sensing-
In this letter, a novel network architecture and the train- throughput tradeoff for cognitive radio networks,” IEEE Trans. Wireless
ing strategy have been designed to improve the detection Commun., vol. 7, no. 4, pp. 1326–1337, Apr. 2008.
thorized licensed use limited to: AMRITA VISHWA VIDYAPEETHAM AMRITA SCHOOL OF ENGINEERING. Downloaded on November 26,2023 at 22:20:06 UTC from IEEE Xplore. Restrictions app