A Lightweight Meta-Ensemble Approach For Plant Disease Detection Suitable For IoT-Based Environments


Received 24 January 2024, accepted 11 February 2024, date of publication 19 February 2024, date of current version 26 February 2024.

Digital Object Identifier 10.1109/ACCESS.2024.3367443

A Lightweight Meta-Ensemble Approach for Plant Disease Detection Suitable for IoT-Based Environments
RITESH MAURYA 1, SATYAJIT MAHAPATRA 2, AND LUCKY RAJPUT 3
1 Amity Centre for Artificial Intelligence, Amity University, Noida 201301, India
2 Department of Information and Communication Technology, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal 576104, India
3 Amity School of Engineering and Technology, Amity University, Noida 201301, India

Corresponding author: Satyajit Mahapatra ([email protected])

ABSTRACT By providing food to billions of people, agriculture contributes significantly to the global economy. Plant diseases, however, can reduce crop yields and result in financial losses. An artificial intelligence (AI)-based method for the automatic identification of plant diseases using resource-constrained Internet of Things (IoT) devices has been presented to address this issue. However, the deployment of state-of-the-art convolutional neural networks (CNNs) and Vision Transformers (ViTs) on IoT devices is not feasible due to their large number of trainable parameters. To overcome this limitation, a meta-ensemble of lightweight MLP-Mixer and faster Long Short-Term Memory (LSTM) models has been proposed for plant disease detection on the low-powered micro-controllers (MCUs) of IoT devices. The MLP-Mixer model is based on a simple multi-layer perceptron network. The proposed meta-ensemble consists of two levels: predictions made by the trained models at the first level are used to train the machine learning classifier at the next level, resulting in further improvement of categorisation accuracy. The proposed meta-ensemble has been tested on three diverse datasets of varying sizes and plant species: Maize, Cotton, and a dataset derived from the Plant Village (PV) dataset. Experimental results demonstrate that the proposed technique obtained classification performance of 94.27%, 98.43%, and 97.45% on the Maize, Cotton, and derived PV datasets, respectively. Moreover, the prediction time of the proposed meta-ensemble is low, and it has considerably fewer trainable parameters than CNN and other transformer-based architectures. Therefore, the proposed meta-ensemble is an efficient and effective solution for plant disease detection with limited resources.

INDEX TERMS Convolution neural network, ensemble, artificial intelligence, deep learning, Internet of
Things, disease detection.

The associate editor coordinating the review of this manuscript and approving it for publication was Claudio Loconsole.

© 2024 The Authors. This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/

R. Maurya et al.: Lightweight Meta-Ensemble Approach for Plant Disease Detection, VOLUME 12, 2024

I. INTRODUCTION

Plant diseases pose massive threats to agricultural productivity, making their early detection of utmost importance for effective disease management. Manual analysis of plants by pathologists is time-consuming and subject to the knowledge of the domain expert, which has led to the development of artificial intelligence-based automated systems for faster and more accurate identification of plant diseases. Conventionally, machine learning (ML)-based methods have been utilised, leveraging features extracted from diseased plant images through a manual feature engineering process. In contrast to ML-based methods [1], [2], [3], deep learning methods, particularly convolutional neural networks (CNNs), have shown promising results in learning relevant features automatically.

State-of-the-art CNNs like VGG16, ResNet50, DenseNet, InceptionNet, and MobileNet, trained on ImageNet, have demonstrated significant improvement in performance across different domains [4], [5], [6]. Customised CNNs [7], [8] and attention-based techniques [9], [10], [11] have also been deployed for plant disease classification. However, the deployment of such models on low-powered Internet of Things (IoT) devices with limited computational resources remains a challenge.

Recently, the lightweight MLP-Mixer architecture has gained attention due to its lower architectural complexity and competitive performance on the ImageNet dataset [12]. This architecture, which relies solely on multi-layer perceptrons (MLPs) without convolution or attention mechanisms, presents a promising solution for resource-constrained IoT environments.

A. MOTIVATION AND CONTRIBUTION

Despite existing literature on machine learning and deep learning algorithms for plant disease diagnostics, there is still a need for the development of lightweight solutions that can be easily implemented in resource-constrained environments with limited memory and computation power. This research presents a unique two-tier meta-ensemble approach to address the need for lightweight models that can be deployed in resource-constrained IoT-based situations for automated plant disease diagnosis. The proposed approach harnesses the benefits of the MLP-Mixer and Long Short-Term Memory (LSTM) models to improve classification performance while staying appropriate for usage in resource-constrained contexts.

The adoption of the suggested meta-ensemble technique is supported by its lightweight nature, which makes it suited for deployment in resource-constrained situations such as IoT devices. Integrating MLP-Mixer and LSTM models into the proposed meta-ensemble allows for the use of their complementary capabilities, thereby enhancing the classification performance.

The rest of the article has been split up into the following sections: Section II details the related works. Section III provides details about the methods deployed in the proposed work. Section IV displays experimental results and provides detailed discussions of them. Section V provides a concrete outline of the proposed work.

II. RELATED WORKS

Some of the prior works related to the plant disease categorisation task are discussed in this section. This paragraph describes some of the convolutional neural network-based models proposed by different researchers. Zhao et al. [9] have proposed a method consisting of an inception module and residual connection for the identification of diseases related to corn, potato and tomato plants. They also suggested the use of a web-based system for the real-time identification of plant diseases [9]. Pandey and Jain have proposed an attention-based dense CNN model for the detection of 44 diverse types of plant diseases using a dataset constructed from 10,851 images captured from the field and achieved 97.33% categorisation accuracy [13]. Bedi and Gole proposed a convolutional autoencoder and CNN-based method for the categorisation of Bacterial Spot disease of peach plants with 98.38% categorisation accuracy [14]. Ferentinos has tested different CNNs such as AlexNet, GoogleNet, and VGGNet for the categorisation of 17,548 images of 58 different classes of plant disease and obtained a categorisation accuracy of 99.53% [15]. The MobileNet model developed by Kamal et al. with depthwise separable convolution achieved 97.65% categorisation accuracy on the PlantVillage dataset [16]. A customised CNN model has been suggested by Chohan et al. for the classification of diseases in 15 distinct plants [17]. The InceptionResNet model was suggested by Hassan and Maji for the categorisation of 15 different plant disease types [18]. Atila et al. have proposed the EfficientNet model for the categorisation of 39 different diseases present in the PV dataset [19]. Amin et al. have proposed a method for corn leaf disease classification by combining the features extracted from the EfficientNetB0 and DenseNet121 deep CNN models and achieved 98.56% classification accuracy [20]. Maurya et al. have proposed a method for classification of diseases present in the PlantVillage dataset using a pre-trained Vision Transformer network and interpreted the performance of the model using the GradCAM algorithm [21].

Some of the works under the miscellaneous category, proposed by different researchers for plant disease categorisation, are summarised as follows. Abbas et al. have utilised generative adversarial networks to produce synthetic images of the diseased leaves of the tomato plant [22]. Five different types of potato plant diseases have been classified with the DenseNet121 model with 97.11% categorisation accuracy. Thakur et al. have utilised the ViT architecture for the categorisation of images of plant diseases and achieved an average accuracy of more than 93% on the Apple, Maize, and Rice datasets [23]. For tomato leaf disease classification, Karthik et al. [24] proposed a strategy based on the use of the attention mechanism in a deep CNN. Their suggested model achieved 98% categorisation accuracy when evaluated on 24,001 images [24]. Shah et al. suggested a teacher/student architecture for identifying 14 different plant diseases [25].

Most of the works discussed above used either convolution or attention mechanisms embedded in the CNN architecture. These models cannot be adapted to an IoT-based environment where memory and computational power are limited. The Internet of Things faces several challenges, such as limited resources in terms of computing, power and memory capacity [26]. Therefore, in the proposed work, a lightweight approach has been presented which does not rely on convolution or attention mechanisms and is therefore well suited for IoT-based deployment. The proposed model also utilises the multi-tier meta-ensemble approach, in which the prediction probabilities obtained from the trained models at the first level are used as a feature set to train the model at the second level. The meta-ensemble approach helps in further improving the categorisation performance of the proposed method.

III. DATASET USED

Three different publicly available datasets pertaining to various plants were used to test the proposed framework. Examples of the images found in these datasets are shown in Fig. 1.

FIGURE 1. Images of the samples taken from each dataset (a) Cotton Dataset (b) Tomato/Potato/Pepper Dataset (c) Maize Dataset.

A. COTTON LEAF DISEASE DATASET (COTTON DATASET)

This dataset [27] consists of 1518 images of four different classes of cotton leaves, captured under real-world conditions and also collected from the internet. The four classes are ‘Curl virus’, ‘bacterial blight’, ‘fusarium wilt’ and ‘healthy plant’ leaves. The number of sample images present in each class of the Cotton Dataset is presented in Table 1.

TABLE 1. Number of sample images present in each class.

B. MAIZE LEAF DISEASE DATASET (MAIZE DATASET)

This dataset [28], [29] has been derived from popular datasets such as the PlantDoc and PV datasets. It consists of 2529 images in ‘.jpg’ format and covers four different classes of maize leaves: ‘Common Rust’, ‘Grey Leaf Spot’, ‘Blight’, and ‘Healthy’. The number of sample images present in each class of this dataset is presented in Table 2.

TABLE 2. Number of sample images present in each class.

C. TOMATO/POTATO/PEPPER DATASET (TPP DATASET)

This dataset has been derived from the PV dataset [29] and consists of 20637 images of plant disease in ‘.jpg’ format. It contains images of diseases belonging to three different types of plants: tomato, potato and pepper. This dataset is termed ‘TPP’ (Tomato/Potato/Pepper) throughout the rest of the paper. Three classes of this dataset belong to the potato plant, two classes belong to the pepper plant, and the remaining classes belong to the tomato plant. The number of sample images present in each class of the TPP dataset is shown in Table 3.

TABLE 3. Number of sample images present in each class.

IV. PROPOSED METHODOLOGY

The methodology for the proposed meta-ensemble framework for plant disease detection is shown in Fig. 2. The whole methodology has been divided into four steps: (i) In the first step, the whole dataset is split into a training set and a test set. (ii) In the next step, the pre-processed training set images are used to train the models (Mixer and LSTM) present at level 1. (iii) After the level 1 models are trained, the level 2 support vector machine (SVM) classifier is trained using the features that are extracted from these models (as an output of these models). (iv) After training the models present at both levels, the test set images are first given as input to the trained models present at level 1 to extract the features. The extracted features of these models are then concatenated and given as input to the trained SVM model present at level 2 to reach the final decision. The different components of the proposed methodology are explained as follows:

A. SPLIT THE DATASET

Training and test sets have been created from the entire dataset. The training set images were utilised to train the models, while the test set images were used to gauge how well the proposed meta-ensemble framework performed at categorising images. The number of sample images used for training and testing is described in the experimental results section.
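
As a minimal sketch of the splitting step, the following example partitions image indices into training and test sets. The split ratio and the splitting strategy are not stated in this section, so the 80/20 ratio, the per-class (stratified) sampling, the fixed seed, and the function name `stratified_split` are all assumptions made for illustration only.

```python
import random
from collections import defaultdict


def stratified_split(labels, test_fraction=0.2, seed=0):
    """Return (train_indices, test_indices) with similar class proportions.

    The 0.2 default is an assumed ratio, not a value taken from the paper.
    """
    if not 0.0 < test_fraction < 1.0:
        raise ValueError("test_fraction must lie strictly between 0 and 1")

    # Collect the image indices that belong to each class.
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)

    rng = random.Random(seed)  # fixed seed so the split is reproducible
    train, test = [], []
    for label in sorted(by_class):
        indices = by_class[label]
        rng.shuffle(indices)
        # Hold out at least one image per class for testing.
        n_test = max(1, round(len(indices) * test_fraction))
        test.extend(indices[:n_test])
        train.extend(indices[n_test:])
    return sorted(train), sorted(test)


# Hypothetical labels: ten images in each of four classes.
labels = [c for c in range(4) for _ in range(10)]
train_idx, test_idx = stratified_split(labels, test_fraction=0.2)
```

Sampling within each class keeps rare disease classes represented in both sets, which matters for small datasets such as the Cotton dataset. Note that a class with a single image would end up only in the test set under this sketch.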