
Received 17 August 2022, accepted 9 October 2022, date of publication 17 October 2022, date of current version 27 October 2022.

Digital Object Identifier 10.1109/ACCESS.2022.3215264

Deep Learning Models Performance Evaluations for Remote Sensed Image Classification

ABEBAW ALEM 1,2 AND SHAILENDER KUMAR 2, (Member, IEEE)
1 IT Department, Debre Tabor University, Debre Tabor 251272, Ethiopia
2 Department of Computer Science and Engineering, Delhi Technological University, Delhi 110042, India

Corresponding author: Abebaw Alem ([email protected])


This work of Abebaw Alem was supported by the Scholarship from the Ethiopian Ministry of Education (MoE).

ABSTRACT Deep learning-based land cover and land use (LCLU) classification systems are a significant aspiration for the remote sensing community. Remote sensing images have many properties that need to be analyzed, and analyzing and interpreting them is difficult because of the nature of the images, the capability of the sensor technology, and other determinant variables such as season and weather conditions. Solving this problem with the support of deep learning systems is essential for environmental monitoring, agricultural decision-making, and urban planning. Therefore, deep learning approaches are proposed to analyze and interpret remote sensing images quickly and classify LCLU. Deep learning methods can be designed from scratch or built on pre-trained networks; however, there are few comparisons between the two. Thus, we propose evaluating and comparing three deep learning models for LCLU classification of remote sensed images: a convolutional neural network feature extractor (CNN-FE) developed from scratch, transfer learning (TL), and fine-tuning. Using the CNN-FE, TL, and fine-tuning models as examples, this paper compares and analyzes deep learning algorithms for remote sensed image classification. After developing and training each model on the UCM dataset, we evaluated and compared their performances using the metrics accuracy, precision, recall, f1-score, and confusion matrix. The proposed deep learning algorithms can adapt to and learn the features of remote sensing images, and the TL and fine-tuning classification performances are significantly improved. Considering the time required to train the models, this paper found that the fine-tuned deep learning model achieved the best accuracy results on the UCM dataset.

INDEX TERMS Convolutional neural network, deep learning, fine-tuning, performance comparisons,
remote sensed image classification, transfer learning.

The associate editor coordinating the review of this manuscript and approving it for publication was Giacomo Fiumara.

I. INTRODUCTION
Environment monitoring, agricultural decision-making, and urban planning in today's fast-paced world all rely primarily on LCLU classification learning systems. LCLU classification using remote sensed (RS) images is a critical issue in managing natural resources and human-made activities affecting natural phenomena in the earth's environment. RS image classification is currently a major focus for the RS community within computer vision and image processing research. The world's population is increasing dramatically, and with it the demand for land use; a learning system could be applied in this domain to utilize the land properly.

Thus, LCLU classification is a recent, hot, and challenging task in RS [1], [2], [3], [4]. With advanced sensor technologies, RS images are satellite data collected from the earth's environment. The deep learning (DL) method could be applied to solve this challenge.

DL is a recent, specialized machine learning (ML) approach that can automatically extract image features from large datasets with admirable performance improvements.


Thus, DL is a recently focused research area applied in various domains, such as classification [1], [5], [6], [7], [8], recognition [9], and object detection [10]. It is also potentially challenging in many other domains [11].

The DL techniques proposed in this paper are convolutional neural networks (CNNs), transfer learning (TL), and fine-tuning, which make the classification task more attractive. The CNN is one of computer vision's most common DL methods [18] for feature extraction and LCLU modelling using RS images. A CNN is a feedforward and backward neural network consisting of convolutional calculations and deep structures. Therefore, CNN models have powerful feature extraction capabilities for improving classification performance on RS images [12].

To train classification models from scratch, DL approaches such as CNNs require a large amount of data and a tremendous amount of processing power [13]. However, the classification problem can be solved with less training time and smaller training samples using TL [14] and fine-tuning [4], [15]. Thus, the main problem of DL models is that training a deep CNN from scratch requires a large dataset and takes longer to train. To solve this kind of DL problem, we adopted the TL and fine-tuning approaches and compared how well they worked against the convolutional neural network feature extractor (CNN-FE) model, which was built from scratch.

TL is another recent DL technique used to train a DL model by reusing pre-trained networks. TL and fine-tuning are suited to smaller datasets and can be designed from pre-trained networks by reworking their top fully connected layers to reuse the learned features. The training time for TL and fine-tuning can be much less than that of deep CNN models trained from scratch. Therefore, TL can solve the problems of building DL models from scratch by training the models in less time on smaller datasets while freezing the already-trained network.

TL adopts the features of the pre-trained network to train the new models. Moreover, fine-tuning is a DL technique used to train the model by unfreezing the pre-trained network; this technique is vital to increasing the performance of the model. TL adopts the properties of the pre-trained layers, excluding the last fully connected layer, i.e., the dense layer is replaced by our classifier with 21 neurons and a softmax activation function.

This paper designed and evaluated the DL models CNN, TL, and fine-tuning. The CNN-FE has been developed from scratch and has four CNN blocks. Using Keras applications, the deep CNN-based TL and fine-tuning models have been developed on the pre-trained model EfficientNetB7 [16].

To date, very few studies have compared the capabilities of DL models developed from scratch with those that reuse a pre-trained network. For instance, [17] applied the TL and fine-tuning methods on the ResNet50 pre-trained network and compared their performances with other pre-trained networks for scene image classification. However, the evaluation of models developed from scratch and their comparison with models built on pre-trained networks have not been widely researched. For designing the TL and fine-tuning models, we preferred a recent pre-trained network, EfficientNetB7, which was trained on the large "ImageNet" dataset. Recently, [16] achieved 84.4% top-1 accuracy on "ImageNet" with the state-of-the-art EfficientNetB7. According to [16], a series of eight scaled-up EfficientNet pre-trained models, EfficientNetB0 through EfficientNetB7, were designed on the large "ImageNet" dataset, with the performance of each successive version improving. According to [18], who applied EfficientNetB3, the larger EfficientNet models achieve better accuracy than the smaller ones. Thus, we proposed the EfficientNetB7 pre-trained network to design TL and fine-tuning models for LCLU classification using RS images, to evaluate their performances, and to compare them with the CNN-FE developed from scratch. We selected the UCM dataset to assess and compare the DL models.

Therefore, this paper aims to design the DL models and evaluate their performance with various performance measurement metrics. Our main contributions are as follows.
• We developed the CNN model from scratch and compared its performance with the deep TL and fine-tuning models for LCLU classification using the RS UCM dataset.
• We applied the recent advanced EfficientNetB7 pre-trained network to design TL and fine-tuning DL models for LCLU classification in RS images.
• We evaluated the models, compared their performance using different measurement metrics, and concluded that the fine-tuning model performed better with less training time.

II. MATERIALS AND PROPOSED METHODS
A. DATASETS AND TOOLS
We used the publicly available University of California Merced (UCM) dataset for modelling the CNN, the CNN-based TL, and fine-tuning. The UCM dataset is an LCLU dataset collected from the earth, labeled manually, and introduced by [19] at the University of California Merced. It contains twenty-one classes, and each class includes 100 images with a resolution of 256 × 256 pixels and a spatial resolution of about 30 centimeters per pixel. However, the UCM dataset is inconsistent, as about 44 images have different pixel shapes. The dataset is available at http://weegee.vision.ucmerced.edu/datasets/landuse.html.

Python is the high-level programming language used for model development; it is versatile, user-friendly, and offers many libraries for DL and ML models. TensorFlow and Keras are the DL tools used with Python in this work.
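To make the dataset setup concrete, the sketch below shows one way the 21-class UCM images could be loaded in Keras; the directory path, batch size, seed, and the image-format conversion are illustrative assumptions rather than the paper's exact pipeline.

```python
# A minimal loading sketch, assuming the UCM archive has been extracted so
# that each of the 21 LCLU classes sits in its own sub-folder and the images
# are in a format Keras can decode (the original UCM .tif files would first
# need converting to PNG/JPEG). Path, batch size, and seed are assumptions.
import tensorflow as tf

IMG_SIZE = (256, 256)     # UCM image resolution stated above
BATCH_SIZE = 32           # assumed batch size

ucm_ds = tf.keras.utils.image_dataset_from_directory(
    "UCMerced_LandUse/Images",     # hypothetical local path to the dataset
    image_size=IMG_SIZE,           # inconsistently sized images are resized on load
    batch_size=BATCH_SIZE,
    label_mode="categorical",      # one-hot labels for categorical cross-entropy
    shuffle=True,
    seed=42,
)
print(ucm_ds.class_names)          # the 21 LCLU class labels
```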


B. PROPOSED DL METHODS
Previous works have studied DL methods for classification problems on various RS image datasets. However, evaluating and comparing DL models developed from scratch against those built on pre-trained networks has not been investigated widely. Further investigation is still needed to design and assess current DL techniques for LCLU classification using RS datasets.

Thus, to evaluate and compare the performances of different DL models applied to LCLU classification problems on the UCM RS dataset, we designed the CNN model from scratch and the TL and fine-tuning models on the EfficientNet pre-trained network. EfficientNet was trained on the large "ImageNet" dataset; ImageNet is the most significant benchmark dataset, introduced by [20], for designing DL models.

The DL hyperparameters influence CNN performance [21]. For instance, according to [22], different dropout values produce different performance results. We also showed that a dropout value of 0.25 generated an accuracy of 84.76%, which differs from our previous work, where a dropout value of 0.50 produced an accuracy of 89.76%. Therefore, considering their effects, we set the same hyperparameters for all three DL models on the given dataset to evaluate the models' performances, as shown in Table 1.

TABLE 1. The DL hyperparameter settings for training the datasets.

1) THE CONVOLUTIONAL NEURAL NETWORK (CNN) ALGORITHM
The CNN algorithm is the most critical DL technique that can extract and automatically learn features from the data. From the input images, features are extracted and the pixel weights are learned as new (usually reduced) values. The CNN method consists of several series of connected layers. These layers share weights throughout the process, i.e., from the first layer to the final classifier layer, as depicted in Figure 1. This process creates the feature maps for the model's layers as well as the class prediction at the output layer. The feature map of the model is built by pixel-wise multiplication of the input image pixels and the provided weight (kernel) pixels with learnable parameters [23], [24].

FIGURE 1. Layers of the CNN-FE model (Conv2D, Batch Normalization, MaxPooling2D, Flatten, Dense, and Dropout) with the input sample images and the output classes. The convolution operation uses a filter size of 3 × 3.

CNNs are capable of spatial feature representation for RS image classification using the convolution technique at the pixel level [25]. The convolution process updates weights with each layer's non-linear activation function. The input data types and the weight calculations in the convolution method make CNNs different from other conventional ML approaches [26].
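As a concrete illustration of this description, the following Keras sketch builds a CNN-FE-style network with four Conv2D blocks (3 × 3 filters, relu, batch normalization, max pooling) followed by Flatten, Dense, Dropout, and a 21-way softmax classifier, as listed in the Figure 1 caption. The filter counts, dense width, dropout rate, and input size are assumptions for illustration, not the paper's exact configuration.

```python
# A minimal CNN-FE-style sketch under assumed layer sizes.
import tensorflow as tf
from tensorflow.keras import layers

inputs = tf.keras.Input(shape=(256, 256, 3))       # assumed input size
x = inputs
for filters in (32, 64, 128, 128):                  # assumed filter counts per block
    x = layers.Conv2D(filters, (3, 3), padding="same", activation="relu")(x)
    x = layers.BatchNormalization()(x)
    x = layers.MaxPooling2D((2, 2))(x)               # no learnable parameters
x = layers.Flatten()(x)                              # no learnable parameters
x = layers.Dense(128, activation="relu")(x)          # assumed dense width
x = layers.Dropout(0.25)(x)                          # assumed dropout rate
outputs = layers.Dense(21, activation="softmax")(x)  # one neuron per UCM class
model = tf.keras.Model(inputs, outputs, name="cnn_fe")

model.compile(optimizer="adam",
              loss="categorical_crossentropy",
              metrics=["accuracy"])
model.summary()   # reports the per-layer parameter counts discussed below
```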
In DL model training, relu and softmax non-linear activation functions are the most relevant functions for updating the weights in the convolution process. We used relu in every convolutional layer to activate the weights in each convolution and softmax at the output layer, since softmax is reliable for multiclass classification problems. The softmax function is a feature classifier that produces a probability score for each class.

In the convolution process, the number of parameters (params) can be calculated using (1) and (2). The total number of model parameters is the sum of the results computed for the Conv2D and dense layers. The CNN-FE model was built with four Conv2D layers, whose parameter counts all follow (1), and two dense layers, which follow (2); the calculation formula for dense parameters differs from that for Conv2D, as given in (2). The added 1 represents the bias associated with each filter.

C2DP# = #OC × (#IC × FH × FW + 1)    (1)
DP# = #OC × (#IC + 1)    (2)

where C2DP# is the number of convolutional parameters, #OC the number of output channels, DP# the number of dense parameters, #IC the number of input channels, FH the filter height, and FW the filter width.

The total parameter count follows from (1) and (2). However, the number of parameters for all MaxPooling2D and Flatten layers is zero because these layers learn no weights (or filters) in the built model. As a result, 1.68 million parameters were found and learned in the CNN-FE model, while 18.88 million parameters were found and learned in the TL and fine-tuning models.
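As a worked check of (1) and (2), the helper functions below compute the two counts for illustrative layer sizes; the channel and unit numbers are examples, not the actual dimensions of the CNN-FE layers.

```python
# Worked example of equations (1) and (2) with illustrative layer sizes.
def conv2d_params(in_channels, out_channels, fh, fw):
    # Eq. (1): each of the #OC filters has #IC*FH*FW weights plus one bias.
    return out_channels * (in_channels * fh * fw + 1)

def dense_params(in_units, out_units):
    # Eq. (2): each output unit has #IC weights plus one bias.
    return out_units * (in_units + 1)

# e.g., a 3x3 Conv2D mapping 32 channels to 64 channels:
print(conv2d_params(32, 64, 3, 3))   # 64 * (32*3*3 + 1) = 18496
# e.g., a dense layer mapping 256 inputs to the 21 output classes:
print(dense_params(256, 21))         # 21 * (256 + 1) = 5397
```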
2) THE TRANSFER LEARNING (TL) METHOD
TL is a method of training a DL model by replacing the input layer with an image embedding: EfficientNet transfers the knowledge learned from the much larger "ImageNet" dataset to our classification problem. The TL model has been trained by making the EfficientNet layers trained on ImageNet non-trainable (pre_trained_model.trainable = False). We trained only the last flatten (1D vector form) layer and two dense layers, together with the relu and softmax activation functions and dropout, on the 21 classes of the LCLU RS UCM dataset.

Therefore, a classification head with dense layers can be appended to handle our new classification problem. TL is an efficient, reliable DL technique used in various domains, especially for the image classification problem in this paper. Recently, TL has been applied to LCLU classification in RS images [27], [28]. TL can train DL models in a short amount of time with improved results [29], [30]. However, deep TL is used for limited training samples, while a deep CNN from scratch is used for large training samples. In this paper, we used the TL model to compare how well it works with the other DL techniques for classifying LCLU in RS images.
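A minimal Keras sketch of this TL setup follows, assuming EfficientNetB7 with ImageNet weights is frozen and only a new flatten/dense head is trained for the 21 UCM classes; the input size, dense width, and dropout rate are illustrative assumptions.

```python
# TL sketch: frozen EfficientNetB7 backbone plus a small trainable head.
import tensorflow as tf
from tensorflow.keras import layers
from tensorflow.keras.applications import EfficientNetB7

base = EfficientNetB7(include_top=False,            # drop the ImageNet classifier
                      weights="imagenet",
                      input_shape=(256, 256, 3))    # assumed input size
base.trainable = False                              # pre_trained_model.trainable = False

inputs = tf.keras.Input(shape=(256, 256, 3))
x = base(inputs, training=False)                    # keep frozen batch-norm statistics
x = layers.Flatten()(x)                             # 1D vector form of the features
x = layers.Dense(256, activation="relu")(x)         # assumed dense width
x = layers.Dropout(0.25)(x)                         # assumed dropout rate
outputs = layers.Dense(21, activation="softmax")(x) # one neuron per UCM class
tl_model = tf.keras.Model(inputs, outputs, name="tl_efficientnetb7")

tl_model.compile(optimizer="adam",
                 loss="categorical_crossentropy",
                 metrics=["accuracy"])
```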


3) THE FINE-TUNING ON EFFICIENTNET
Fine-tuning is a DL technique used to train a model by allowing the EfficientNet layers pre-trained on the large ImageNet dataset to be trainable and to adapt (pre_trained_model.trainable = True). EfficientNet is a recent, advanced CNN-based network that can be applied to classification tasks on ImageNet. To obtain improved performance, EfficientNet has been fine-tuned, and the final fully connected layer is treated as the output classifier layer, as in TL, except that the layers are allowed to be trained. As stated by [4] and [15], fine-tuning a pre-trained network is the optimal solution for a limited number of training samples. The EfficientNet pre-trained network was introduced by [16] for rethinking model scaling for CNNs. We chose the EfficientNet pre-trained network because it is a new, advanced network that has not yet been applied to the LCLU classification problem.

The fine-tuning was trained on the last three fully connected layers, using the EfficientNet network pre-trained on ImageNet. These final fully connected layers include a flatten layer that transforms the input into vector form, two dense layers, dropout, and the relu and softmax activation functions. Accordingly, the pre-trained weights are used as the initial weights for our fine-tuned neural network. Fine-tuning is used to compare the results of the fully connected layers and the convolutional layers. Thus, we proposed the fine-tuning technique to compare its performance with the convolutional layer-based CNN-FE model and the fully connected layer-based TL model.
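A minimal sketch of the fine-tuning variant is shown below, assuming the same backbone and head as in the TL sketch but with the pre-trained layers set trainable; the layer sizes and the reduced learning rate are illustrative assumptions rather than the paper's exact settings.

```python
# Fine-tuning sketch: the EfficientNetB7 weights are the starting point and
# are updated during training (pre_trained_model.trainable = True).
import tensorflow as tf
from tensorflow.keras import layers
from tensorflow.keras.applications import EfficientNetB7

base = EfficientNetB7(include_top=False, weights="imagenet",
                      input_shape=(256, 256, 3))    # assumed input size
base.trainable = True                               # unfreeze the pre-trained layers

inputs = tf.keras.Input(shape=(256, 256, 3))
x = base(inputs)
x = layers.Flatten()(x)
x = layers.Dense(256, activation="relu")(x)         # assumed dense width
x = layers.Dropout(0.25)(x)                         # assumed dropout rate
outputs = layers.Dense(21, activation="softmax")(x)
ft_model = tf.keras.Model(inputs, outputs, name="finetuned_efficientnetb7")

# A smaller learning rate is a common (assumed) choice when all layers train.
ft_model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=1e-4),
                 loss="categorical_crossentropy",
                 metrics=["accuracy"])
```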
III. EXPERIMENTAL RESULTS AND DISCUSSIONS
Experiments were done on a laptop computer with an Intel Core i3-4000M CPU running at 2.40 GHz and 4 GB of RAM, connected to Google Colaboratory with its Tesla K80 GPU. Keras and TensorFlow, open-source DL software packages, were used for this experiment in Python. In addition, we used the scikit-learn statistical package for computing the performance measurement metrics.

A. EXPERIMENTAL SETTING AND RESULT
We used the UCM dataset in the experiments to design and evaluate the DL models for LCLU classification problems. We split the dataset into train, validation, and test samples at 60%, 20%, and 20%, respectively, and set the DL hyperparameters as indicated in Table 1. Then, we trained the models, validated them with the validation dataset during training, and evaluated their performances with the test dataset.

After the experimental parameters were set, we evaluated the models during and after the experiments with the validation and test datasets, respectively. We evaluated the models using accuracy, precision, recall, f1-score, and confusion matrix (CM) metrics. The CM assesses class performance, showing whether samples are classified correctly or incorrectly at the row-column intersections. We used the categorical cross-entropy loss function to measure the errors, in addition to the accuracy. The training and validation losses are expected to decrease as the epochs increase, as shown in Figures 2, 3, and 4 (on the right).

Therefore, we evaluated the models with 420 test (support) images, as shown in Tables 2, 3, and 4, using the UCM dataset. The UCM is an imbalanced RS dataset, and the accuracy metric, the percentage of correctly classified images, may not be suitable for an imbalanced dataset. Thus, each class's performance is evaluated using the precision, recall, f1-score, and CM metrics in addition to the accuracy metric. The f1-score is the harmonic mean of precision and recall, and it summarizes the performance of each class and the average performance of the DL models built. If both precision and recall achieve the best result for a category, then the f1-score is also the best for that class; whereas if either precision or recall is 0, then the f1-score is 0, meaning the model predicts nothing for that class.
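A minimal evaluation sketch along these lines is given below, assuming a trained Keras model `model` and a batched test set `test_ds` with one-hot labels (both names are placeholders); it produces the same kinds of metrics reported in Tables 2 through 5 using scikit-learn.

```python
# Evaluation sketch: collect test predictions, then report accuracy,
# per-class precision/recall/f1-score, and the confusion matrix.
import numpy as np
from sklearn.metrics import classification_report, confusion_matrix

y_true, y_pred = [], []
for images, labels in test_ds:                     # iterate over the test batches
    probs = model.predict(images, verbose=0)       # softmax scores per class
    y_pred.extend(np.argmax(probs, axis=1))        # predicted class indices
    y_true.extend(np.argmax(labels.numpy(), axis=1))

print(classification_report(y_true, y_pred, digits=2))  # precision, recall, f1, support
print(confusion_matrix(y_true, y_pred))                  # rows: true, cols: predicted
```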


FIGURE 2. The training and validation accuracies and losses in the CNN-FE model.

FIGURE 3. The training and validation accuracies and losses in the TL model.

FIGURE 4. The training and validation accuracies and losses in the fine-tuning DL model.

Accordingly, the categories that scored best (100%) on the f1-score metric are agricultural and chaparral in CNN-FE (Table 2); chaparral, parkinglot, and storagetanks in the TL model (Table 3); and agricultural, airplane, chaparral, freeway, and runway in the fine-tuning model (Table 4), respectively.


TABLE 2. CNN-FE classification performances in precision, recall, and F1-score on 420 support images.

TABLE 3. TL classification performance in precision, recall, and F1-score on 420 support images.

TABLE 4. Fine-tuning classification performance in precision, recall, and F1-score on 420 support images.

The accuracy of the DL models is also shown graphically in terms of accuracy and loss curves in Figures 2, 3, and 4 for the CNN-FE, TL, and fine-tuning models, respectively. The training accuracies (blue curves) increase smoothly, while the validation accuracies (red curves) fluctuate somewhat as they increase in all models, especially in fine-tuning, as depicted in Figures 2, 3, and 4 (on the left). We used the cross-entropy loss function to reduce the errors in model performance. As shown in Figures 2, 3, and 4 (on the right), the training losses (blue curves) decrease smoothly, while the validation losses (red curves) decrease unevenly as the models' errors are reduced.

In addition to precision, recall, and f1-score, we used the CM to evaluate class performance in each DL model. As with the f1-score, better class performance is observed for most classes in the CM metric. The CM measures class performance, showing whether each sample is classified correctly or incorrectly. The CM places each class label in the rows (true labels) and columns (predicted labels), as depicted in Figures 5 through 7. The scores on the diagonal intersections correspond to correctly classified samples, whereas the results in the other row-column cells are misclassified predictions.

For evaluating the models on the test set images, we used the argmax function to predict the class with the maximum probability score. For instance, our classification problem has twenty-one possible classes in the UCM dataset. If the output probabilities are [0.0, 0.0, 0.0, 0.0, 0.05, 0.0, 0.55, 0.0, 0.0, 0.0, 0.05, 0.0, 0.30, 0.05, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0], the argmax (maximum-argument) probability is 0.55, and it corresponds to the denseresidential class predicted by the CNN-FE model, as shown in Figure 5. In the same way, the argmax probability corresponds to each class prediction in the CM metric. The output probabilities of each prediction sum to 1.00.
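A tiny NumPy illustration of this argmax step uses the example probability vector above; index 6, which holds the 0.55 score, is the position the text associates with the denseresidential class.

```python
# The argmax step described above, with the example probability vector.
import numpy as np

probs = np.array([0.0, 0.0, 0.0, 0.0, 0.05, 0.0, 0.55, 0.0, 0.0, 0.0, 0.05,
                  0.0, 0.30, 0.05, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0])
pred_index = int(np.argmax(probs))                  # index 6, whose score is 0.55
print(pred_index, probs[pred_index], probs.sum())   # 6 0.55 1.0
```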
When evaluating class performance with the CM metric, the lowest results are obtained for denseresidential (55%) and golfcourse (60%) in CNN-FE; denseresidential (50%), mobilehomepark (50%), and golfcourse (55%) in TL; and mobilehomepark (55%) and golfcourse (60%) in fine-tuning, as shown in Figures 5, 6, and 7, respectively. The lower results show that these classes share properties mainly associated with other classes. For instance, denseresidential has more features in common with the mediumresidential class. The class performance in the CM metric is generally best in the fine-tuning model.


FIGURE 5. CM performance results for the CNN-FE model in the UCM dataset.

FIGURE 6. CM performance results for the TL model in the UCM dataset.

B. DISCUSSIONS
In this paper, we utilized DL models for LCLU classification in RS images. The performance of these models has been assessed using a variety of measurement metrics. The results showed good performance on the classification problem, as shown in Table 5. With the Adam (adaptive moment estimation) optimizer controlling the learning rate, the experimental results showed that the proposed DL algorithms could adapt to and learn the features of remote sensing images.


FIGURE 7. CM performance results for the fine-tuning model in the UCM dataset.

TABLE 5. The DL model performance evaluations using performance measurement metrics in the UCM dataset and the time (in seconds) consumed for
training each DL model.

The TL and fine-tuning performances are significantly improved over the scratch-developed CNN-FE.

To achieve the stated goal of this paper, Tables 2 through 4 and Figures 2 through 7 compare the DL model performances on the UCM dataset. From the results, good class performance has been achieved in precision, recall, f1-score, and CM, although some classes scored lower values. The training accuracy increases smoothly in the CNN-FE, TL, and fine-tuning DL models, as shown in Figures 2, 3, and 4, respectively.

The overall accuracies for each model are summarized in Table 5. According to Table 5, the fine-tuning model achieved the best performance in accuracy (88%), precision (89%), recall (88%), and f1-score (89%) with efficient training time, whereas the CNN-FE model performed lower on each metric than the other two models, which could be because the dataset used was small. Moreover, the CNN-FE spent much more time training than the other two models did.

The maximum parameter capacity of EfficientNet is 64 million parameters. The same number of parameters, 18.88 million, was found and learned in both the TL and fine-tuning models. This is about 11 times more than the 1.68 million parameters found in the CNN-FE model, because the convolution-only design used in CNN-FE keeps the number of parameters small.

The fine-tuning technique is used to compare the performance of the DL models designed with the convolutional method and with fully connected layers. As a result, improved performance has been achieved by the fine-tuned model, with efficient training time, compared to the other DL techniques used in this paper.

Designing a CNN model from scratch is essential for identifying the correct properties of the categories in large datasets, and it is usually recommended when a dataset exceeds about 5000 images per class. However, it can consume much training time and is subject to over-fitting.


DL techniques that consume less training time, such as TL and fine-tuning, can overcome this limitation. The TL and fine-tuning DL techniques are efficient in training time and produce improved performance results. Nevertheless, we recommend applying TL and fine-tuning to small datasets, with perhaps fewer than 5,000 images per class. Therefore, as observed in Table 5, we can conclude that the TL and fine-tuning DL techniques are economical in saving time and essential for performance improvement.

IV. CONCLUSION
This paper designed three DL models, CNN-FE, TL, and fine-tuning, for LCLU classification problems using RS images. The TL and fine-tuning models have been trained on the recent EfficientNetB7 pre-trained baseline network, whereas the CNN-FE has been trained from scratch, using the UCM dataset. The models' performances were evaluated using the accuracy, precision, recall, f1-score, and CM metrics. The fine-tuned model achieved profound accuracy on the UCM dataset. We observed that the nature of the denseresidential class is largely similar to the properties of the mediumresidential category; thus, its precision, recall, f1-score, and CM results are the worst compared with the other class categories. In addition to those metrics, the training time is another critical evaluation metric used to compare the economic advantages of the TL and fine-tuning models over DL models developed from scratch. We found that the TL and fine-tuning DL models are efficient in saving time and essential for improving performance.

Since the TL and fine-tuning models are reliable for small datasets, all the experiments were performed using a small LCLU dataset in Google Colab. Thus, we would like to recommend developing a DL model from scratch and comparing its performance with other pre-trained DL models using a powerful GPU and other, larger RS datasets to improve the performance of the DL models. Like the dataset, varying the DL hyperparameters could also affect model performance. Therefore, our future studies will focus on DL optimization methods for LCLU classification in RS images.

ACKNOWLEDGMENT
Reviewing an article for publication is a challenging task. The editors (such as Prof. Giacomo Fiumara), the administrator (Ritika Gupta), and the reviewers have devoted their time to sharing their deep and experienced knowledge with authors and readers worldwide. Thus, the authors would like to thank the anonymous editors, administrators, and reviewers of the IEEE Access international journal for their constructive reviews and comments and for devoting their precious time to improving the quality of this article.
REFERENCES
[1] S. Li, W. Song, L. Fang, Y. Chen, P. Ghamisi, and J. Benediktsson, "Deep learning for hyperspectral image classification: An overview," IEEE Trans. Geosci. Remote Sens., vol. 57, no. 9, pp. 6690-6709, Sep. 2019.
[2] B. Liu, X. Yu, P. Zhang, A. Yu, Q. Fu, and X. Wei, "Supervised deep feature extraction for hyperspectral image classification," IEEE Trans. Geosci. Remote Sens., vol. 56, no. 4, pp. 1909-1921, Apr. 2018.
[3] G. Cheng, Z. Li, J. Han, X. Yao, and L. Guo, "Exploring hierarchical convolutional features for hyperspectral image classification," IEEE Trans. Geosci. Remote Sens., vol. 56, no. 11, pp. 6712-6722, Nov. 2018.
[4] M. Mahdianpari, B. Salehi, M. Rezaee, F. Mohammadimanesh, and Y. Zhang, "Very deep convolutional neural networks for complex land cover mapping using multispectral remote sensing imagery," Remote Sens., vol. 10, no. 7, p. 1119, 2018.
[5] M. A. Shafaey, M. A. Salem, H. M. Ebied, M. N. Al-Berry, and M. F. Tolba, "Deep learning for satellite image classification," in Proc. Int. Conf. Adv. Intell. Syst. Inform., vol. 845, 2019, pp. 383-391.
[6] G. J. Scott, M. R. England, W. A. Starms, R. A. Marcum, and C. H. Davis, "Training deep convolutional neural networks for land-cover classification of high-resolution imagery," IEEE Geosci. Remote Sens. Lett., vol. 14, no. 4, pp. 549-553, Apr. 2017.
[7] A. B. Hamida, A. Benoit, P. Lambert, and C. B. Amar, "3-D deep learning approach for remote sensing image classification," IEEE Trans. Geosci. Remote Sens., vol. 56, no. 8, pp. 4420-4434, Aug. 2018.
[8] C. Deng, Y. Xue, X. Liu, C. Li, and D. Tao, "Active transfer learning network: A unified deep joint spectral-spatial feature learning model for hyperspectral image classification," IEEE Trans. Geosci. Remote Sens., vol. 57, no. 3, pp. 1741-1754, Mar. 2019.
[9] F. Özyurt, "Efficient deep feature selection for remote sensing image recognition with fused deep learning architectures," J. Supercomput., vol. 76, no. 11, pp. 8413-8431, Dec. 2019.
[10] J. Han, D. Zhang, G. Cheng, L. Guo, and J. Ren, "Object detection in optical remote sensing images based on weakly supervised learning and high-level feature learning," IEEE Trans. Geosci. Remote Sens., vol. 53, no. 6, pp. 3325-3337, Jun. 2015.
[11] O. Fink, Q. Wang, M. Svensén, P. Dersin, W.-J. Lee, and M. Ducoffe, "Potential, challenges and future directions for deep learning in prognostics and health management applications," Eng. Appl. Artif. Intell., vol. 92, Jun. 2020, Art. no. 103678.
[12] X. Liu, Y. Zhou, J. Zhao, R. Yao, B. Liu, and Y. Zheng, "Siamese convolutional neural networks for remote sensing scene classification," IEEE Geosci. Remote Sens. Lett., vol. 16, no. 8, pp. 1200-1204, Aug. 2019.
[13] U. Zahid, I. Ashraf, M. A. Khan, M. Alhaisoni, K. M. Yahya, H. S. Hussein, and H. Alshazly, "BrainNet: Optimal deep learning feature fusion for brain tumor classification," Comput. Intell. Neurosci., vol. 2022, pp. 1-13, Aug. 2022.
[14] B. Yang, S. Hu, Q. Guo, and D. Hong, "Multisource domain transfer learning based on spectral projections for hyperspectral image classification," IEEE J. Sel. Topics Appl. Earth Observ. Remote Sens., vol. 15, pp. 3730-3739, May 2022.
[15] E. Maggiori, Y. Tarabalka, G. Charpiat, and P. Alliez, "Convolutional neural networks for large-scale remote-sensing image classification," IEEE Trans. Geosci. Remote Sens., vol. 55, no. 2, pp. 645-657, Feb. 2017.
[16] M. Tan and Q. V. Le, "EfficientNet: Rethinking model scaling for convolutional neural networks," 2019, arXiv:1905.11946.
[17] A. Shabbir, N. Ali, J. Ahmed, B. Zafar, A. Rasheed, M. Sajid, A. Ahmed, and S. H. Dar, "Satellite and scene image classification based on transfer learning and fine tuning of ResNet50," Math. Problems Eng., vol. 2021, pp. 1-18, Jul. 2021.
[18] H. S. Alhichri, A. S. Alswayed, Y. Bazi, N. Ammour, and N. A. Ajlan, "Classification of remote sensing images using EfficientNet-B3 CNN model with attention," IEEE Access, vol. 9, pp. 14078-14094, 2021.
[19] Y. Yang and S. Newsam, "Bag-of-visual-words and spatial extensions for land-use classification," in Proc. 18th SIGSPATIAL Int. Conf. Adv. Geographic Inf. Syst. (GIS), 2010, pp. 270-279.
[20] O. Russakovsky, "ImageNet large scale visual recognition challenge," Int. J. Comput. Vis., vol. 115, no. 3, pp. 211-252, Dec. 2015.
[21] R. P. de Lima and K. Marfurt, "Convolutional neural network for remote-sensing scene classification: Transfer learning analysis," Remote Sens., vol. 12, no. 1, p. 86, Dec. 2019.
[22] R. Stivaktakis, G. Tsagkatakis, and P. Tsakalides, "Deep learning for multilabel land cover scene categorization using data augmentation," IEEE Geosci. Remote Sens. Lett., vol. 16, no. 7, pp. 1031-1035, Jul. 2019.
[23] C. Peng, Y. Li, L. Jiao, Y. Chen, and R. Shang, "Densely based multi-scale and multi-modal fully convolutional networks for high-resolution remote-sensing image semantic segmentation," IEEE J. Sel. Topics Appl. Earth Observ. Remote Sens., vol. 12, no. 8, pp. 2612-2626, Aug. 2019.


[24] E. Maggiori, Y. Tarabalka, G. Charpiat, and P. Alliez, "Fully convolutional neural networks for remote sensing image classification," in Proc. IEEE Int. Geosci. Remote Sens. Symp. (IGARSS), Jul. 2016, pp. 5071-5074.
[25] Z. Zeng, X. Chen, and Z. Song, "MGFN: A multi-granularity fusion convolutional neural network for remote sensing scene classification," IEEE Access, vol. 9, pp. 76038-76046, 2021.
[26] M. Kim, "Convolutional neural network-based land cover classification using 2-D spectral reflectance curve graphs with multitemporal satellite imagery," IEEE J. Sel. Topics Appl. Earth Observ. Remote Sens., vol. 11, no. 12, pp. 4604-4617, Dec. 2018.
[27] R. Naushad, T. Kaur, and E. Ghaderpour, "Deep transfer learning for land use land cover classification: A comparative study," Nov. 2021, arXiv:2110.02580.
[28] D. Zhang, Z. Liu, and X. Shi, "Transfer learning on EfficientNet for remote sensing image classification," in Proc. 5th Int. Conf. Mech., Control Comput. Eng. (ICMCCE), Dec. 2020, pp. 2255-2258.
[29] A. Alem and S. Kumar, "Transfer learning models for land cover and land use classification in remote sensing image," Appl. Artif. Intell., vol. 36, no. 1, Dec. 2022, Art. no. 2014192.
[30] M. Rashid, M. A. Khan, M. Alhaisoni, S.-H. Wang, S. R. Naqvi, A. Rehman, and T. Saba, "A sustainable deep learning framework for object recognition using multi-layers deep features fusion and selection," Sustainability, vol. 12, no. 12, p. 5037, Jun. 2020.

ABEBAW ALEM received the B.Sc. degree in information science from Haramaya University, Haramaya, Oromia, Ethiopia, in 2010, and the M.Sc. degree in information science from Addis Ababa University, Addis Ababa, Ethiopia, in 2014. He is currently pursuing the Ph.D. degree in computer science and engineering with Delhi Technological University, India.
Since September 2010, he has been a Faculty Member with the Computer Science and IT Department, Debre Tabor University, Debre Tabor, Ethiopia. He has middle-level management experience, having served as the Vice Dean for the Technology Faculty, from November 2014 to October 2016, and as a Technology Faculty Educational Development Center Coordinator, from October 2016 to July 2018. His research interests include deep learning for remote sensed image classification, machine learning for image analysis, case-based reasoning, and computer vision.
Mr. Alem is a member of the Black Artificial Intelligence Professional Group, the Global Initiative of Academic Networks (GIAN), and the Young African Leaders Initiative (YALI).

SHAILENDER KUMAR (Member, IEEE) received the B.E. degree in computer science and engineering, in 2001, the M.Tech. degree in computer science, in 2005, and the Ph.D. degree in computer science and engineering from Maharshi Dayanand University, Haryana, India, in 2017.
He has more than 20 years of teaching experience in the Computer Science and Engineering Departments at various esteemed engineering colleges, like Delhi College of Engineering, Netaji Subhas Institute of Technology, and Ambedkar Institute of Advanced Communication Technologies and Research, Delhi, India. Moreover, he has positional experience, such as serving as the Officer-in-Charge of the Data Mining Laboratory and the Ph.D. Student Coordinator with the Computer Science and Engineering Department, Delhi Technological University, Delhi, India, where he is currently a Professor of computer science and engineering. He has published over 50 research papers in various reputed international journals and conferences. His research interests include database systems, data mining, big data analytics, machine learning, information security, compiler construction, and computer networks.
