MLC_TPP2
IEEE COMMUNICATIONS LETTERS, VOL. 27, NO. 12, DECEMBER 2023

Abstract: In this letter, for the first time, we propose an automatic modulation classification (AMC) method for orthogonal time-frequency space (OTFS) signal modulation using a hybrid convolutional neural network and long short-term memory (CNN-LSTM) network with a residual stack. The proposed method uses the in-phase and quadrature-phase (IQ) samples of the received OTFS-modulated signal to classify the received modulation accurately. We consider six digital modulation schemes for OTFS in the delay-Doppler domain: binary phase shift keying (BPSK), quadrature PSK (QPSK), minimum-shift keying (MSK), on-off keying (OOK), 4-amplitude shift keying (4ASK), and 8ASK. Results show that the proposed method achieves a high classification performance even at a low signal-to-noise ratio (SNR).

Index Terms: Deep learning, automatic modulation classification, OTFS, CNN, LSTM, residual learning.

Manuscript received 2 October 2023; accepted 23 October 2023. Date of publication 27 October 2023; date of current version 12 December 2023. The associate editor coordinating the review of this letter and approving it for publication was W. Yuan. (Corresponding author: Udit Satija.)
The authors are with the Department of Electrical Engineering, Indian Institute of Technology Patna, Dayalpur Daulatpur 801106, India (e-mail: [email protected]; [email protected]; [email protected]).
Digital Object Identifier 10.1109/LCOMM.2023.3328011
1558-2558 © 2023 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See https://www.ieee.org/publications/rights/index.html for more information.
Authorized licensed use limited to: Indian Institute of Technology - Jodhpur. Downloaded on March 25,2025 at 17:29:56 UTC from IEEE Xplore. Restrictions apply.

I. INTRODUCTION

Communication systems that operate at high frequencies under high mobility face a significant challenge known as Doppler spread, which arises from the movement of the transmitter, receiver, and scatterers. In such scenarios, conventional orthogonal frequency-division multiplexing (OFDM) modulation, commonly used in 4G, 5G, and WiFi networks, encounters limitations and difficulties [1], [2]. The performance of the OFDM waveform is degraded by inter-carrier interference (ICI), which is further exacerbated by the notable difference in normalized Doppler between the highest and lowest subcarriers. Achieving synchronization is therefore also a significant hurdle. A promising solution has emerged in the form of a two-dimensional (2D) modulation scheme known as orthogonal time-frequency space (OTFS), which has recently been put forward as a potential choice for high-mobility communications [3]. In OTFS modulation, every transmitted symbol experiences a nearly constant channel gain, even in challenging conditions such as high Doppler, large antenna arrays in massive multiple-input multiple-output (massive MIMO) systems, or high-frequency environments such as millimeter waves. Specifically, the OTFS modulation technique maps each information symbol to a 2D orthogonal basis function from a predefined set [4].

The identification of modulation techniques in signals, known as automatic modulation classification (AMC), is a challenging task of great importance in a wide array of real-world applications, particularly those characterized by high mobility and operation at high frequencies [5]. This is especially relevant in domains such as next-generation wireless communication systems, unmanned aerial vehicles (UAVs), satellite communications, and the Internet of Things (IoT), where rapid movement and the use of high-frequency bands introduce unique modulation challenges [6].

In general, AMC methods can be categorized into three main groups: likelihood-based AMC (LB-AMC), feature-based AMC (FB-AMC), and distribution test-based AMC (DB-AMC). In LB-AMC, the likelihood ratios between the desired received signals and known signals are computed, and the modulation class is chosen based on these likelihood ratios and a predetermined threshold [7]. LB-AMC provides optimal performance, but because of its enormous complexity, it can be difficult to carry the theoretical system model over into real-world applications. In contrast, FB-AMC stands out as the best option due to its robustness, its capacity to simplify difficult problems, and its adaptability in processing different signals. FB-AMC usually includes both feature extraction and classification steps. Hence, the efficiency and robustness of AMC can be significantly improved by a reliable feature extraction process and reliable classification algorithms [8], [9].

In recent times, deep learning (DL) techniques have been used in FB-AMC to automatically extract relevant features from received signals. This approach has shown promising results in achieving accurate classification by training the models with appropriate datasets. Hong et al. [10] employed a deep convolutional neural network (CNN) to address signal modulation identification (SMI). This network featured a simple structure comprising consecutive convolutional layers and fully connected layers. However, despite their attempts to classify modulations under challenging channel circumstances, they encountered limitations in effectively capturing correlations between distinct OFDM symbols using straightforward architectures. In [11], the authors use CNNs with residual learning to tackle issues such as sub-carrier fluctuations and fading channels. The technique demonstrates superior performance compared to established methods in real-time trials using a radio frequency (RF) dataset. However, OFDM loses subcarrier orthogonality at high Doppler shifts due to mobility, rendering it unsuitable for high delay and Doppler conditions, whereas OTFS modulation offers better adaptability in dynamic scenarios. Therefore, AMC for OTFS systems will play a crucial role in future wireless communication, where high data rates and reliable communication are critical. Several studies focus on channel state information estimation to enhance the performance of signal detection in OTFS systems using deep learning [12], [13]. However, to the best of our knowledge, no work on AMC for the OTFS system is available in the literature.

In this work, we present a novel method for AMC in the OTFS framework. To accomplish this task, we employ a CNN-LSTM with residual learning. While the CNN is a simpler neural network architecture that extracts relevant features in a few epochs, the inclusion of the LSTM is of paramount importance due to its capacity to capture long-term dependencies, which has been explored in OFDM and other modulation classification techniques. To achieve robustness and reliability in modulation classification, we combine the CNN-LSTM with a residual stack, which is able to extract spatial and temporal features while mitigating the effect of the vanishing gradient problem that is caused by random noise introduced due to high mobility. We consider a set of six digitally modulated baseband signals: binary phase shift keying (BPSK), quadrature phase shift keying (QPSK), minimum shift keying (MSK), on-off keying (OOK), 4-amplitude shift keying (4ASK), and 8-amplitude shift keying (8ASK). On the basis of the baseband IQ samples alone, the proposed approach can correctly classify the modulation format for the OTFS system. The remaining sections of the letter are organized as follows: Section II outlines the OTFS system model. In Section III, we elaborate on the proposed neural network for AMC, encompassing the dataset generation procedure and training methodology. Moving for-

through learned representations. Fig. 1 presents the block diagram of the proposed method designed for recognizing modulations in OTFS systems. The method involves generating the modulation signal using OTFS modulation and subsequently transforming it into an OTFS signal for further analysis and classification.

Fig. 1. Block diagram of the proposed hybrid CNN-LSTM method with a residual stack.

A. OTFS Modulation Procedure

The OTFS modulation scheme employs two successive 2D transforms at the transmitter. The block diagram of the basic OTFS modulation scheme is shown in Fig. 1. At the transmitter, the inverse symplectic finite Fourier transform (ISFFT) maps the information symbols from the delay-Doppler (DD) domain to the time-frequency (TF) domain, after which windowing is applied. Prior to transmission through the channel, the TF symbols are further transformed to the time domain using the Heisenberg transform [14].

The N × M DD grid carrying the information symbols x[k, l] is given by [14]

$$\Gamma = \left\{ \left( \frac{k}{NT}, \frac{l}{M\Delta f} \right) \;\middle|\; k = 0, \ldots, N-1,\ l = 0, \ldots, M-1 \right\}, \tag{1}$$

where N and M are the numbers of Doppler and delay bins, respectively, and $1/(NT)$ and $1/(M\Delta f)$ are the bin sizes in the Doppler and delay domains, respectively. The DD domain symbols x[k, l] are mapped to the TF domain symbols X[n, m] using the ISFFT. With rectangular windowing, the TF signal can be expressed as

$$X[n, m] = \frac{1}{\sqrt{NM}} \sum_{k=0}^{N-1} \sum_{l=0}^{M-1} x[k, l]\, e^{j 2\pi \left( \frac{nk}{N} - \frac{ml}{M} \right)}. \tag{2}$$
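The ISFFT mapping from the DD symbols x[k, l] to the TF symbols X[n, m] can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `isfft` and `sfft` are hypothetical, the 1/sqrt(NM) normalization is assumed, and the double sum is evaluated directly for readability rather than with a fast Fourier transform.

```python
import cmath
import math


def isfft(x_dd):
    """Map DD symbols x[k][l] to TF symbols X[n][m] by direct summation.

    x_dd is an N x M nested list: k indexes the N Doppler bins and
    l indexes the M delay bins. The 1/sqrt(N*M) scaling is an assumption.
    """
    n_doppler = len(x_dd)
    n_delay = len(x_dd[0])
    scale = 1.0 / math.sqrt(n_doppler * n_delay)
    x_tf = [[0j] * n_delay for _ in range(n_doppler)]
    for n in range(n_doppler):
        for m in range(n_delay):
            acc = 0j
            for k in range(n_doppler):
                for l in range(n_delay):
                    phase = 2.0 * math.pi * (n * k / n_doppler - m * l / n_delay)
                    acc += x_dd[k][l] * cmath.exp(1j * phase)
            x_tf[n][m] = scale * acc
    return x_tf


def sfft(x_tf):
    """Inverse of isfft (opposite exponent sign), used as a round-trip check."""
    n_doppler = len(x_tf)
    n_delay = len(x_tf[0])
    scale = 1.0 / math.sqrt(n_doppler * n_delay)
    x_dd = [[0j] * n_delay for _ in range(n_doppler)]
    for k in range(n_doppler):
        for l in range(n_delay):
            acc = 0j
            for n in range(n_doppler):
                for m in range(n_delay):
                    phase = 2.0 * math.pi * (n * k / n_doppler - m * l / n_delay)
                    acc += x_tf[n][m] * cmath.exp(-1j * phase)
            x_dd[k][l] = scale * acc
    return x_dd


# Toy 2 x 4 grid of BPSK symbols (illustrative values, not data from the letter).
x_dd = [[1, -1, 1, 1], [-1, 1, 1, -1]]
x_tf = isfft(x_dd)
x_back = sfft(x_tf)
max_error = max(abs(x_back[k][l] - x_dd[k][l]) for k in range(2) for l in range(4))
```

With this scaling the transform preserves signal energy, so `max_error` should sit at floating-point rounding level. For a grid such as N = 32 and M = 64, an FFT-based implementation would be the practical choice.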
KUMAR et al.: RESIDUAL STACK-AIDED HYBRID CNN-LSTM-BASED AMC FOR OTFS SYSTEM 3257
$$h(\tau, \nu) = \sum_{i=1}^{Z} h_i\, \delta(\tau - \tau_i)\, \delta(\nu - \nu_i), \tag{4}$$

where Z represents the number of paths in the DD domain. The variables $h_i$, $\tau_i$, and $\nu_i$ refer to the channel gain, delay, and Doppler shift of the i-th path, respectively. The signal received at the receiver, r(t), is the time-domain representation of the transmitted signal after passing through the channel, given as

$$r(t) = \int_{-\infty}^{\infty} \int_{-\infty}^{\infty} h(\tau, \nu)\, x(t - \tau)\, e^{j 2\pi \nu (t - \tau)}\, d\tau\, d\nu + v(t), \tag{5}$$

where v(t) denotes the additive white Gaussian noise.

In this letter, the input to the proposed network for modulation classification is represented by the in-phase component (I) and quadrature component (Q) of r[k], the discretely sampled version of r(t). These components are commonly referred to as IQ signals and can be expressed as

$$I = \{\mathrm{real}[r[k]]\}_{k=0}^{K}, \quad \text{and} \quad Q = \{\mathrm{imag}[r[k]]\}_{k=0}^{K}, \tag{6}$$

where K is the number of training examples.

III. PROPOSED METHODOLOGY

In this section, we present a detailed explanation of the proposed CNN-LSTM with residual stack-based AMC for the OTFS system.

A. Proposed DL-Based AMC Architecture

The neural network architecture begins with an input shape of (2048, 2), representing the IQ samples of the received signals. These samples are passed through two cascaded CNN layers, each with 64 kernels of size 1 × 7. Each CNN layer employs the scaled exponential linear unit (SELU) activation function, followed by batch normalization and a dropout of 0.3 for regularization. The output of the CNN layers is then fed into a max-pooling layer with a pool size of 2 × 2 and strides of 2.

Subsequently, the pooled output enters a triple-stack residual network, following the design specified in Fig. 2. After this, the result undergoes batch normalization and the SELU activation function. The processed data is then passed through two cascaded LSTM layers to track the long-term dependency of the extracted pattern from the input data [16]. Each LSTM has a memory size of 48. Batch normalization is applied again

We generate a set of six digitally modulated baseband signals in OTFS, where the considered modulation schemes include BPSK, QPSK, MSK, OOK, 4ASK, and 8ASK. We analyze the OTFS signal consisting of M = 64 subcarriers in the frequency domain and N = 32 symbols in the time domain. We gather a dataset by capturing the received signals of the six modulated signals. Each signal in the dataset covers a signal-to-noise ratio (SNR) range from −10 dB to +20 dB, with increments of 5 dB. For each modulation type at a specific SNR level, we generate a total of 4096 examples, each consisting of 2048 samples. In total, we have 172,032 examples of data. During the neural network training, we employ one-hot encoding to label the collected data.

The received signal can be written in the form of a 2048 × 2 matrix, such as

$$D = \left\{ [I \;\; Q]_{2048 \times 2} \right\}_{i=1}^{K}, \tag{7}$$

where K is the number of training examples.

C. Training Process

To ensure effective training and evaluation, we divide the generated dataset into two sets: 70 percent of the data is used for training the neural network, while the remaining 30 percent is reserved for validation. The weights are learned using the Adam optimizer with a fixed learning rate of 0.01, and the objective is to minimize the categorical cross-entropy loss function. Throughout the training process, which spans 100 epochs, we employ a batch size of 512 examples. This setup allows us to assess the performance and generalization capabilities of the trained networks on unseen data.

IV. SIMULATION RESULTS AND DISCUSSION

In this section, we simulate the classification performance of the proposed hybrid CNN-LSTM-based AMC for OTFS systems over delay-Doppler channels. In particular, we investigate the performance of OTFS-modulated signals in terms of classification accuracy in percentage. All relevant simulation parameters are listed in Table I. We consider the Extended Vehicular A (EVA) model as the channel model for path delays (or delay taps) and generate a single Doppler shift for each delay tap using Jakes' formula [14].

Fig. 3 (a) illustrates the training loss and validation loss of the proposed model. It can be observed from the figure that the model is effectively learning with rising epochs from the training data and generalizing well to new unseen data as
TABLE I. SIMULATION PARAMETERS
TABLE II. CLASSIFICATION ACCURACY COMPARISON
Fig. 3. (a) Training and validation curve, (b) Classification accuracy vs. SNR plot for different activation functions.
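The dataset and training figures quoted in Section III can be cross-checked with a short bookkeeping sketch. The names below (`one_hot`, `iq_matrix`, and the constants) are hypothetical. Rounding the 70 percent training share down and counting a final partial batch are assumptions, because the letter does not say how fractional counts are handled.

```python
import math

MODULATIONS = ["BPSK", "QPSK", "MSK", "OOK", "4ASK", "8ASK"]
SNRS_DB = list(range(-10, 21, 5))   # -10 dB to +20 dB in 5 dB steps
EXAMPLES_PER_PAIR = 4096            # examples per (modulation, SNR) pair
SAMPLES_PER_EXAMPLE = 2048          # complex samples per example
TRAIN_PERCENT = 70
BATCH_SIZE = 512


def one_hot(label, classes=MODULATIONS):
    """One-hot label vector for a modulation name."""
    return [1 if name == label else 0 for name in classes]


def iq_matrix(samples):
    """Stack complex samples into rows [I, Q]; 2048 samples give a 2048 x 2 matrix."""
    return [[s.real, s.imag] for s in samples]


total_examples = len(MODULATIONS) * len(SNRS_DB) * EXAMPLES_PER_PAIR  # 172032
# Assumption: the training share is rounded down to a whole example.
train_examples = total_examples * TRAIN_PERCENT // 100
val_examples = total_examples - train_examples
# Assumption: the last, partially filled batch is still processed each epoch.
batches_per_epoch = math.ceil(train_examples / BATCH_SIZE)
```

Under these assumptions, the 172,032 examples split into 120,422 for training and 51,610 for validation, and one epoch covers 236 batches of up to 512 examples.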
Fig. 5. (a) Confusion matrix at 0 dB SNR, (b) Confusion matrix at 5 dB SNR, (c) Confusion matrix at 10 dB SNR.
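As a reading aid for the confusion matrices, the sketch below tallies one from true and predicted labels and derives the classification accuracy in percent. The function names are hypothetical, the layout (rows for the true class, columns for the predicted class) is an assumption, and the labels are toy values, not results from the letter.

```python
CLASSES = ["BPSK", "QPSK", "MSK", "OOK", "4ASK", "8ASK"]


def confusion_matrix(true_labels, predicted_labels, classes=CLASSES):
    """Count matrix with rows = true class and columns = predicted class."""
    index = {name: i for i, name in enumerate(classes)}
    matrix = [[0] * len(classes) for _ in classes]
    for truth, guess in zip(true_labels, predicted_labels):
        matrix[index[truth]][index[guess]] += 1
    return matrix


def accuracy_percent(matrix):
    """Share of examples on the main diagonal, in percent."""
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    return 100.0 * correct / total


# Toy labels for illustration only: one 4ASK example is mistaken for 8ASK.
true_labels = ["BPSK", "QPSK", "4ASK", "8ASK"]
predicted_labels = ["BPSK", "QPSK", "8ASK", "8ASK"]
cm = confusion_matrix(true_labels, predicted_labels)
acc = accuracy_percent(cm)
```

Off-diagonal counts show which pairs of modulation formats the classifier confuses at a given SNR.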