Plant Disease Detection Using Deep Convolutional Neural Network For Corn, Rice and Banana Crop
Plant Disease Detection Using Deep Convolutional Neural Network For Corn, Rice and Banana Crop
1
School of Computer Engineering, Dr. Vishwanath Karad MIT World Peace
University, Pune, Maharashtra
Abstract. Among the most widely farmed and eaten crops globally are
corn, rice, and bananas. However, a variety of both biological and
environmental variables, including weather, pests, soil, and water quality,
can influence these crops, leading to illnesses that have a substantial impact
on crop quality and output. These illnesses not only cause financial loss but
also represent a serious danger to the farming sector. Traditional methods of
detecting diseases are insufficient and frequently depend on laborious eye
examinations.Consequently, having efficient techniques for identifying and
treating these illnesses is crucial. Using a Deep Convolutional Neural
Network for the crops of corn, rice, and bananas, this article tackles this
problem. a layout in which max pooling layers are arranged in a deliberate
manner after convolutional layers with progressively larger filters to
guarantee the identification of certain illness patterns. The findings showed
that the accuracy was 99.7% for Bananas, 98.8% for Rice, and 99.8% for
Corn. This study aims to empower farmers worldwide and support them in
protecting the health and productivity of their crops by showcasing the
application of convolutional neural networks in crop disease identification.
Keywords: Machine Learning, Deep Learning, Neural Networks,
Convolutional Neural Network
1 Introduction
the application of machine learning (ML) and deep learning (DL) approaches has
shown remarkable promise in transforming the field of crop illness
detection.These techniques leverage the power of computational algorithms to
analyze and interpret large amounts of data, particularly images of diseased plants,
enabling the accurate and early identification of various plant diseases. Khatoon et
al. [1] provides a comprehensive overview of the latest advancements in deep
learning for plant disease detection. It covers various convolutional neural
network architectures, data enhancement techniques, and disease classification
strategies. Plant health and illness may be recognized with high accuracy using
ML and DL models that are trained to collect and analyze information from
photos, such as its hue, texture, and shape. Saleem, Potgieter and Arif’s extensive
review paper [2], provides a thorough analysis of state-of-the-art deep learning
techniques for plant disease detection and classification.
Convolutional Neural Networks (CNNs) are a unique class of neural networks that
process data with a grid topology, making them ideal for image processing. CNNs
have shown great potential in revolutionizing the field of plant disease detection,
enabling the accurate and prompt recognition of various crop illnesses. By
extracting and analyzing features such as color, texture, and shape from images, In
a recent study conducted by Andrew J et al. [3], the potential of deep learning
techniques, specifically convolutional neural networks (CNNs), was investigated
in terms of their ability to accurately distinguish between healthy and diseased
plants. The study found that CNNs demonstrated great precision in this task. The
researchers managed to accomplish a remarkable 96% accuracy rate,
demonstrating the effectiveness of AI in early disease detection.While Afifi,
Alhumamand and Abdelwahab investigate the use of CNNs for recognizing
various diseases on grape leaves [4]. The developed model achieved a remarkable
99.8% accuracy, highlighting the power of AI in differentiating between healthy
and diseased plant tissues. Simhadri and Kondaveeti [5] focused on developing a
deep learning model for rice disease detection that performs well even in resource-
limited environments with limited computing power and data availability. On the
other hand, we used verified datasets related to these crops to train our model. In
this research, we propose a mechanism that uses CNNs to detect diseases in two
prominent crops, corn and rice. It predicts diseases related to these crops by
processing various images and weather datasets. Furthermore, our work presents
an architecture that combines photos of plants with weather data.. This
architecture utilizes simple pooling operations and convolutions, and is referred to
as CNN augmented with cellular automata rules. This approach aims to effectively
identify plant diseases by making use of the combined information from weather
data and plant images.
This paper is organized into six sections, with section 1 providing the introduction
to the problem, section 2 providing an extensive literature survey, section 3 giving
the design and methodology used in our work, section 4 providing the results and
comparisons to the existing work, section 5 providing the conclusion to our work,
3
and section 6 discussing the future scope of our research. Seeking an in-depth
understanding of advanced methods and technologies in the field, the goal is to
contribute to the development of more trustworthy and efficient ways for
identifying and controlling pests and plant diseases.
2 Literature Review
Reshmi A.M and Prameeja Prasidhan's research [8] stands out for its pioneering
use of an efficient Convolutional Neural Network (CNN) algorithm. Achieving an
impressive 99.5% accuracy, the study employs several methods for image
processing to detect and classify crop diseases, providing valuable insights for
farmers to mitigate losses and enhance productivity.Examining the landscape of
automatic crop disease identification, the study by Justine Boulent et al. (2019) [9]
utilizes Convolutional Neural Networks (CNNs) to analyze 19 relevant works.
With a focus on tomatoes and a predominant leaf-centric approach, the research
highlights the potential of deep learning techniques in advancing practical tools
for sustainable agriculture and food production securityIn their 2023 research,
4
gathering and analyzing a diverse leaf image dataset, the authors train and fine-
tune the CNN, achieving an impressive 88% accuracy in overall disease
classification. The proposed methodology, emphasizing simplicity and efficiency,
marks a significant leap in automating plant disease identification, offering
promise for improved precision in agriculture.
3 Data Engineering
Any machine learning model's ability to perform well is closely correlated with
the caliber and variety of the dataset used for training. The leaves of corn, rice,
and banana plants are the main subjects of our research. The creation of the model
and its analysis depend heavily on this part. Researchers and practitioners can
access all of the utilized datasets for free because they are all operated under
public domain licenses.
It consists of a rich variety of carefully annotated corn leaf images. The images are
categorized into 4 classes on the basis of their diseases, common rust, gray leaf
spot, blight and healthy.
large cigar-shaped lesions. Leaves appear grayish in color due to the presence of
fungal spores.
Healthy. Healthy corn leaves are generally green, but can also be purple or
reddish-brown. They are a monocot plant, so they have parallel venation. They
can be 0.3–1.5 meters long and 5–15 centimeters wide.The margins of the leaves
are entire, meaning they don't have obvious teeth. They can also have narrow
longitudinal wrinkles and are sometimes vertically wavy.
7
This dataset consists of a diverse array of diseased rice leaf images.A total of 120
images is present in this dataset. The images are further classified into 3 classes
namely, Leaf smut, brown spot, bacterial leaf blight.
Leaf Smut. Entyloma oryzae is the casuing agent of leaf smut. It's a common but
mild illness that generates little black spots on rice leaves that are angular and
slightly elevated. The spots are linear, rectangular, angular, or elliptical, and
measure 0.5–2.0. On a single leaf, there may be many spots, but they all stay
unique from one another. When the epidermis becomes moist/wet, it breaks open
and releases the black spores. The edges of heavily diseased leaves die and
become pale, while the leaves themselves turn yellow.
lesions that are yellow-brown or brown in color. These lesions have the potential
to encircle the coleoptile (the protective sheath covering the emerging shoot) and
cause distortion in both primary and secondary leaves.
Cordana. It is a fungal disease called cordana leaf spot (CLS) mostly affects
banana trees. Banana diamond leaf spot is the most popular term used to describe
this illness.Two fungi species cause CLS, C. johnstonii and C. musae.. Even
though it is common worlwide, has generally little impact on production.
Sigotaka. It is a fungal disease that infects the leaves, causing spots that can lead
to leaf drying and defoliation. This can affect the quality of the fruit and reduce
yield. The fungus Cercospora musae is the cause of sigatoka, often referred to as
Cercospora leaf spot. It is one of the most harmful banana illnesses common in
banana plants across the world.
Healthy. Healthy banana leaves are large, wide, and slightly rounded with a
smooth surface. They are waxy, flexible, and glossy, and can grow up to two
meters long and half a meter wide. Healthy leaves are generally dark green in
color, flexible and intact with smooth edges.
Data Splitting. Two divisions were made in the dataset, Training set and the
Testing set. The division for the disease detection model was 75-25,implying that
75% of the dataset was used for training, while the remaining 25% was set aside
for testing the model. The rationale behind this approach was to ensure sufficient
data for training while also having a substantial set for unbiased evaluation. The
11
split was executed by randomly shuffling the dataset and subsequently partitioning
it into two disjoint sets. This methodology allowed for a balanced consideration of
training effectiveness and the model's capacity to generalize unseen data.
Rice Leaf
Leaf smut, Brown spot, Bacterial
Diseases 40 each 120
leaf blight
Dataset
CNN becomes more complex, eventually dealing with more significant elements
of the object till it successfully performs identification. [17]
[18]. The final layer aligns with the number of classes corresponding to the
specific crop under consideration. The model is purposefully tailored in a
hierarchical fashion. The data then moves through the layers causing the model to
gradually evolve its understanding from basic features such as colours, edges to
more advanced elements providing a comprehensive analysis of plant health. Its
progressive nature captures the patterns that are relevant to the health of crops and
offers a thorough method for detecting infections in bananas, rice, and maize early
on. This design helped us capture the nuanced patterns specific to crop diseases.
There are 2 graphs shown for each crop, ‘Model Accuracy’ and ‘Model Loss’.
14
The accuracy of the model was calculated by using the following formula.
n
LCE = − ∑ t i log ( p i ), for n classes, (2)
i=0
where ti is the truth table and pi is the Softmax probablity for the ith class [23].
The left graph shows the model's accuracy over time, measured as the fraction of
crops the model correctly identified as diseased or healthy. The model's loss over
time, which is a gauge of how inaccurate the model's predictions are, is displayed
on the right graph.The quantity of training epochs is indicated on the x-axis.An
epoch denotes the end of an iteration over the training set. The model is exposed
to all of the training examples during each epoch, and it modifies its parameters in
response to the patterns it discovers and gains knowledge from the data.
5.1 Rice
Fig. 14. Model accuracy and loss during training for rice disease detection
The accuracy graph starts around 0.38, calculated using equation 1, indicating that
the model correctly identified about 38% of the rice plants as diseased at the
beginning.The accuracy then continues to rise steadily and finally peaking at 30
epochs.The loss in data starts around 3.0, calculated using equation 2.Over the
course of the 30 epochs, the loss line drops dramatically, reaching a value of 0.007
at epoch 30. This indicates that as the model learnt, its prediction errors decreased
dramatically.
15
5.2 Banana
Fig. 15. Model accuracy and loss during training for banana disease detection
The accuracy graph starts around 0.5, calculated using equation 1, indicating that
the model correctly identified about 70% of the banana plants as diseased at the
beginning.The accuracy line rises steadily over the 30 epochs, reaching a peak of
0.99 (99%) at epoch 30. The loss graph starts around 1.2, calculated using
equation 2, shows how far off the model's initial estimates were from the right
answers.The loss line then falls sharply over the 30 epochs, reaching a value close
to 0.007 at epoch 30.
5.3 Corn
16
Fig. 16. Model accuracy and loss during training for corn disease detection
The accuracy graph begins at 0.24, calculated using equation 1, meaning that
around 24% of the rice plants were initially accurately recognized by the model as
unhealthy.After then, the accuracy increases gradually before peaking at 30
epochs.About 0.7 is where the data loss begins.The loss line experiences a
significant decline over the course of the 30 epochs, peaking at 0.005 at epoch 30,
calulated using equation 2.
Thus, the results observed are a testimony for the potential application of this
CNN model in real-world scenarios to aid in early and accurate diagnosis of plant
diseases.
Table 2. Model’s accuracy and data loss for each crop after 30 epochs
Crop Name Accuracy Data Loss
5 Conclusion
structure to our solution. The CNN was tailored to include convolutional layers
with increasing filter size, followed by strategic placement of max pooling layers
that minimize the spatial dimensions of the input. Due to this the learning curve of
the model was improved drastically. The hierarchical design resulted in highly
dense layers, which proved to be instrumental in capturing nuanced patterns
specific to crop diseases. Through meticulous testing and evaluation, the model
displayed remarkable efficiency. The learning capabilities of the model are
displayed via the accuracy and loss graphs, which indicate the performance peak
at 30 epochs. The model showed staggering accuracy of 98.8% for Rice, 99.7%
for Banana and 99.8% for Corn.
6 Future Scope
Looking ahead, the future implications of this research are expansive. This
includes scaling the model to encompass a wide range of crops will ensure the
versatility of the system. The development of real time monitoring systems with
user-friendly interfaces holds the power to empower farmers, allowing them to see
instant and actionable insights. Collaborative efforts with agricultural institutions
will contribute to the real-world implementation of the model. This trajectory not
only promises further technological advancements in the agriculture sector, but
also a positive impact on global agriculture.
References
accessed 2023/12/22
22. Banana leaf disease dataset,
https://www.kaggle.com/datasets/shifatearman/bananalsd, last accessed
2023/12/22
23. Boudjemaa Boudaa,Hadil Touhami (2023) A Graph ATtention Networks
Model for Session-Based Recommender Systems