Performance Evaluation of Machine Learning Algorithms For A Cluster-Based Crop Recommendation System
Performance Evaluation of Machine Learning Algorithms For A Cluster-Based Crop Recommendation System
2023 17th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS) | 979-8-3503-7091-1/23/$31.00 ©2023 IEEE | DOI: 10.1109/SITIS61268.2023.00079
Performance Evaluation of
Machine Learning Algorithms for
a Cluster-based Crop Recommendation System
Dalhatu Muhammed Ehsan Ahvar Shohreh Ahvar
Institut Supérieur d’Electronique de Paris (ISEP) Nokia Nokia
Paris, France Massy, France Massy, France
[email protected] [email protected] [email protected]
Maria Trocan
Institut Supérieur d’Electronique de Paris (ISEP)
Paris, France
[email protected]
Abstract—A cluster-based crop recommendation system cate- opportunity to the farmer to choose one of the crops of the
gorizes the crop candidates into several groups or classes (e.g., recommended group based on his/her interests.
based on soil and environment parameters similarities). After In addition, the grouping of crops can reduce the complexity
receiving a request from a farmer, it recommends the most
appropriate group of crops to the farmer. Proposing a group and, therefore, can reduce the resource usage (e.g., compute
of crops (i.e., more than one crop) can allow farmers to consider resource) of the recommendation system.
their personal interests as well. In addition, the cluster-based crop On the other hand, an appropriate Machine Learning (ML)
recommendation system can reduce the complexity and utilized algorithm can play an important role in the performance of
resources (e.g., compute resource). As the main contribution of a cluster-based crop recommendation system. To this end,
this paper, we evaluate the performance of different Machine
Learning (ML) algorithms to find the most appropriate one for performance evaluation of different ML classification algo-
using in the cluster-based crop recommendation system. rithms to find the most appropriate one is necessary. The key
Index Terms—Crop Recommendation, Machine Learning contributions of this paper are as follows:
(ML) Algorithms, Cluster-based and Performance Evaluation • We present the concept of a cluster-based crop recom-
mendation system.
I. I NTRODUCTION • We group the crops based on their soil and environment
parameters similarities.
442
Authorized licensed use limited to: Indian Institute of Technology Hyderabad. Downloaded on April 28,2024 at 11:00:52 UTC from IEEE Xplore. Restrictions apply.
Fig. 1. The cluster-based crop recommendation system
443
Authorized licensed use limited to: Indian Institute of Technology Hyderabad. Downloaded on April 28,2024 at 11:00:52 UTC from IEEE Xplore. Restrictions apply.
Fig. 2. Dataset features correlation matrix
for non-temporal datasets. We consider RF, KNN, Logistic see that the worst results belong to Logistic Regression. In
Regression, DT, SVM, XGBoost, Naive Bayes, Bagging, Gra- general, increasing size of the dataset from 2000 to 6000 and
dientBoost and LGBM ML models. even 12000 could not make a noticeable effects on the results.
We used 70% of the initial dataset (2200 samples) for
training and hyperparameter tuning and 30% of the dataset V. C ONCLUSION
for testing. GridSearchCV method was utilized for the hyper- This paper first presented the concept of a cluster-based crop
parameter tuning. We also evaluated the performance of the recommendation system where the crops were grouped based
ML models. Table I shows the obtained hyperparameter values on their soil and environment parameters similarities. The
for the ML models. The best score represents the accuracy of cluster-based crop recommendation system received a farm
the ML models. information from a farmer as input and, then, recommended
a group of crops to the farmer. The benefits were 1) having
TABLE II
ACCURACY COMPARISON USING DATASETS WITH SIZES OF 2200, 6000 more than one option for farmers to consider their interests
AND 12000 as well, 2) reducing complexity and resource usage for the
recommendation system.
Models 2200 6000 12000
i=5-i=10 i=5-i=10 i=5-i=10 We then evaluated the performance of several well-known
RF 0.9941- 0.9946- 0.9945- ML algorithms to see which one is the best candidate to
0.9952 0.9952 0.9951 be used in the cluster-based crop recommendation system.
KNN 0.9793- 0.9791- 0.9791-
0.9798 0.9800 0.9800 The result showed that RF obtained the highest accuracy.
Logistic 0.9553- 0.9559- 0.9559- XGBoost, LGBM and GradientBoost were close to RF. The
Regression 0.9594 0.9586 0.9607 dataset utilized in this work was small. We saw that even
DT 0.9875- 0.9874- 0.9876-
0.9903 0.9907 0.9910 augmenting the dataset could not change the results noticeably.
SVM 0.9777- 0.9773- 0.9773- As a future work, the ML models under study can be compared
0.9789 0.9791 0.9791 considering a larger dataset.
XGBoost 0.9925- 0.9927- 0.9927-
0.9948 0.9950 0.9950
Naive Bayes 0.9887- 0.9845- 0.9845-
R EFERENCES
0.9885 0.9841 0.9841 [1] M. Pathan, N. Patel, H. Yagnik, and M. Shah, “Artificial cognition for
Bagging 0.9899- 0.9898- 0.9897- applications in smart agriculture: A comprehensive review,” Artificial
0.9914 0.9908 0.9908 Intelligence in Agriculture, vol. 4, pp. 81–95, 2020.
GradientBoost 0.9908- 0.9904- 0.9905- [2] D. Muhammed, E. Ahvar, S. Ahvar, and M. Trocan, “A user-friendly
0.9922 0.9928 0.9929 aiot-based crop recommendation system (uacr): concept and architec-
LGBM 0.9916- 0.9927- 0.9927- ture,” in 2022 16th International Conference on Signal-Image Technol-
0.9921 0.9931 0.9932 ogy & Internet-Based Systems (SITIS). IEEE, 2022, pp. 569–576.
[3] T. Setiadi, F. Noviyanto, H. Hardianto, A. Tarmuji, A. Fadlil, and
In next step, we evaluated the performance of the ML M. Wibowo, “Implementation of naïve bayes method in food crops
planting recommendation,” Int. J. Sci. Technol. Res, vol. 9, no. 02, pp.
algorithms using the k-fold cross validation method (k=5). 4750–4755, 2020.
In order to get more accurate results, we run the 5-fold [4] T. K. Mishra, S. K. Mishra, K. J. Sai, S. Peddi, and M. Surusomayajula,
cross validation 5 times (i=5) and get the average of these “Crop recommendation system using support vector machine consider-
ing indian dataset,” in Advances in Distributed Computing and Machine
5 times results for every ML algorithm. Similarly, we run the Learning: Proceedings of ICADCML 2022. Springer, 2022, pp. 501–
5-fold cross validation 10 times (i=10). Table II shows the 510.
evaluation results. The results clearly show that RF algorithm [5] K. G. Sandhya, S. Vemuri, K. S. Deeksha, and T. Anvitha, “Crop
recommendation system using ensembling technique,” in 2022 Inter-
could outperform other algorithms in all cases. The results of national Conference on Breakthrough in Heuristics And Reciprocation
XGBoost, GradientBoost and LGBM are close to RF. We can of Advanced Technologies (BHARAT). IEEE, 2022, pp. 55–58.
444
Authorized licensed use limited to: Indian Institute of Technology Hyderabad. Downloaded on April 28,2024 at 11:00:52 UTC from IEEE Xplore. Restrictions apply.
[6] A. Chougule, V. K. Jha, and D. Mukhopadhyay, “Crop suitability and
fertilizers recommendation using data mining techniques,” in Progress
in Advanced Computing and Intelligent Engineering. Springer, 2019,
pp. 205–213.
[7] A. Motwani, P. Patil, V. Nagaria, S. Verma, and S. Ghane, “Soil
analysis and crop recommendation using machine learning,” in 2022
International Conference for Advancement in Technology (ICONAT).
IEEE, 2022, pp. 1–7.
[8] A. Kedlaya, A. Sana, B. A. Bhat, S. Kumar, N. Bhat et al., “An efficient
algorithm for predicting crop using historical data and pattern matching
technique,” Global Transitions Proceedings, vol. 2, no. 2, pp. 294–298,
2021.
[9] N. Patil, S. Kelkar, M. Ranawat, and M. Vijayalakshmi, “Krushi sahyog:
Plant disease identification and crop recommendation using artificial
intelligence,” in 2021 2nd International Conference for Emerging Tech-
nology (INCET). IEEE, 2021, pp. 1–6.
[10] M. Suchithra and M. L. Pai, “Data mining based geospatial clustering
for suitable recommendation system,” in 2020 International Conference
on Inventive Computation Technologies (ICICT). IEEE, 2020, pp. 132–
139.
[11] M. D. Hossain, M. A. Kashem, and S. Mustary, “Iot based smart soil fer-
tilizer monitoring and ml based crop recommendation system,” in 2023
International Conference on Electrical, Computer and Communication
Engineering (ECCE). IEEE, 2023, pp. 1–6.
[12] D. Modi, A. V. Sutagundar, V. Yalavigi, and A. Aravatagimath, “Crop
recommendation using machine learning algorithm,” in 2021 5th Inter-
national Conference on Information Systems and Computer Networks
(ISCON). IEEE, 2021, pp. 1–5.
[13] M. S. Teja, T. S. Preetham, L. Sujihelen, S. Jancy, M. P. Selvan et al.,
“Crop recommendation and yield production using svm algorithm,” in
2022 6th International Conference on Intelligent Computing and Control
Systems (ICICCS). IEEE, 2022, pp. 1768–1771.
[14] L. Meenachi, S. Ramakrishnan, M. Sivaprakash, C. Thangaraj, and
S. Sethupathy, “Multi class ensemble classification for crop recommen-
dation,” in 2022 International Conference on Inventive Computation
Technologies (ICICT). IEEE, 2022, pp. 1319–1324.
[15] V. Vagisha, E. Rajesh, and P. Johri, “Crop recommendation system
for intelligent smart farming technology,” in 2022 4th International
Conference on Advances in Computing, Communication Control and
Networking (ICAC3N). IEEE, 2022, pp. 249–253.
[16] A. Priyadharshini, S. Chakraborty, A. Kumar, and O. R. Pooniwala,
“Intelligent crop recommendation system using machine learning,” in
2021 5th International Conference on Computing Methodologies and
Communication (ICCMC). IEEE, 2021, pp. 843–848.
[17] S. K. S. Durai and M. D. Shamili, “Smart farming using machine
learning and deep learning techniques,” Decision Analytics Journal,
vol. 3, p. 100041, 2022.
[18] A. Arora, N. Shoeibi, V. Sati, A. González-Briones, P. Chamoso, and
E. Corchado, “Data augmentation using gaussian mixture model on
csv files,” in Distributed Computing and Artificial Intelligence, 17th
International Conference. Springer, 2021, pp. 258–265.
[19] Sudirman, A. P. Windarto, and A. Wanto, “Data mining tools| rapid-
miner: K-means method on clustering of rice crops by province as
efforts to stabilize food crops in indonesia,” in IOP Conference Series:
Materials Science and Engineering, vol. 420. IOP Publishing, 2018,
p. 012089.
[20] F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion,
O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg et al.,
“Scikit-learn: Machine learning in python,” the Journal of machine
Learning research, vol. 12, pp. 2825–2830, 2011.
445
Authorized licensed use limited to: Indian Institute of Technology Hyderabad. Downloaded on April 28,2024 at 11:00:52 UTC from IEEE Xplore. Restrictions apply.