SmartCare A Symptoms Based Disease Prediction Model Using Machine Learning Approach
SmartCare A Symptoms Based Disease Prediction Model Using Machine Learning Approach
https://doi.org/10.22214/ijraset.2022.47434
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 10 Issue XI Nov 2022- Available at www.ijraset.com
Abstract: The breakthrough on computer-based technology has resulted in storage of a lot of electronic data in the healthcare
industry. Machine Learning technology has been proven beneficial in giving an immeasurable platform in the medical field so
that health care issues can be resolved effortlessly and expeditiously. Prediction of disease at early stage could help people from
getting the necessary treatment on time. These days many virtual prediction models are available for the same. The existing
systems either made use of only one algorithm or prediction system were capable for predicting only one disease. The maximum
accuracy of the existing systems range between 52% to 88%. The algorithms used in various prediction system consisted of
Linear Regression, Decision Tree, Naïve Bayes, KNN, CNN, Random Forest Tree, etc. In our project i.e., “SmartCare: A
Symptoms Based Disease Prediction Model Using Machine Learning Approach”, it is possible to predict more than one disease
at a time. So, the user does not need to traverse many models to predict the diseases. It will help to reduce the time and cost of
predicting diseases at prior stages, so as to prevent the extremities of it and thus, there is a chance of reducing mortality rate.
Keyword: Machine Learning, Decision Tree Algorithm, K-Nearest Neighbour Algorithm, Naïve Bayes Algorithm, Random
Forest Algorithm, GPS Connectivity.
I. INTRODUCTION
Healthy lifestyle, healthcare and medicines are few of the essential elements of human lifestyles and economy. There is a
tremendous change in the world we are living in now and the world that existed few months back. Everything has turned ugly and
divergent. In this case, where the entirety has grown to become digital or let us say virtual, the doctors and nurses are giving their
maximum efforts to keep people's lives and people’s health even though they ought to danger their very own.
Even now in some parts of the world there are still some far-flung villages, remote places which lack clinical centers, health
facilities. Machines have started to gain popularity and dependency by humans as, without any human mistakes, they could perform
duties greater efficaciously and with a steady degree of accuracy.
A disease predictor is nothing but a virtual doctor, which can predict the disorder of any affected person without any human errors.
The first disease prediction system focused on input of blood report values. Whereas The Symptoms Based Disease Prediction
Model predicts the disease of the patient based on the input of symptoms. Depending on the disease being diagnosed a specialized
doctor will be assigned for the patient.
The following algorithms are used in developing the Symptoms Based Disease Prediction Model: -
1) Decision Tree is a Supervised learning technique that can be used for both classification and Regression problems, but mostly it
is preferred for solving Classification problems. It is a tree-structured classifier, where internal nodes represent the features of a
dataset, branches represent the decision rules and each leaf node represents the outcome.[1]
2) Random Forest is a popular machine learning algorithm that belongs to the supervised learning technique. It can be used for
both Classification and Regression problems in ML. It is based on the concept of ensemble learning, which is a process of
combining multiple classifiers to solve a complex problem and to improve the performance of the model.[2]
3) The k-nearest neighbors’ algorithm, also known as KNN or k-NN, is a non-parametric, supervised learning classifier, which
uses proximity to make classifications or predictions about the grouping of an individual data point.[3]
4) Naïve Bayes algorithm is a supervised learning algorithm, which is based on Bayes theorem and used for solving classification
problems. It is mainly used in text classification that includes a high-dimensional training dataset. Naïve Bayes Classifier is one
of the simple and most effective Classification algorithms which helps in building the fast machine learning models that can
make quick predictions.[4]
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 709
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 10 Issue XI Nov 2022- Available at www.ijraset.com
B. Problem Statement
The “SmartCare: A Symptom-Based Disease Prediction Model Using Machine Learning Approach” does not focus on the
prediction of a specific disease; instead, it predicts disease based on the symptoms given by the user. As a result, the user does not
need to traverse many models to predict the disease. There is a probability of lowering the death rate due to the prediction of disease
at an early stage. Utilizing machine learning methods, our goal is to create a symptom-based disease prediction model. The Frontend
of the system would consist of a responsive website that can be accessed through any device. The website will be developed using
HTML, CSS and JavaScript will be connected to the ML model using Flask. The user has to input the parameters for a specific
disease and the model will detect if the disease is present or not. Based on more than 5000 records of patients, our goal is to create a
prediction model that analyses the user’s symptoms, determines the disease he or she is more likely to have, and directs the user to
the closest hospitals based on location.
B. Software Requirements
1) Browsers supported: Chrome, Brave, Microsoft Edge, Mozilla Firefox
2) Technology used: HTML, CSS, JavaScript, JavaScript Libraries, Flask
3) Complier used: Sublime Text Code Editor and Google Collab
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 710
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 10 Issue XI Nov 2022- Available at www.ijraset.com
V. SYSTEM DESCRIPTION
Users can give various symptoms and the issues they are facing. The application takes the user's symptoms as inputs to check for
various illnesses that could be associated with it using the algorithms. The system also provides the users with a list of hospitals near
them which they could visit for further consultancy. The model will be available as a website for the user to use and is simple as
well as easy to use. The traditional diagnosis approach demands an affected person visiting a doctor, undergoing many clinical
assessments, and then reaching a conclusion. This whole process was very time consuming. This project proposes an automated
disease prediction system using machine learning approach to save time and cost by predicting diseases at prior stages, so as to
prevent the extremities of it and thus, there is a chance of reducing mortality rate.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 711
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 10 Issue XI Nov 2022- Available at www.ijraset.com
B. Modules Used
The website contains five pages i.e., Home Page, Disease Prediction Page, Consultant Page, About Page and Feedback Page.
1) Home Page: The user will see navbar in this area which contains the following buttons, “Home, Prediction, Consultancy,
About, Feedback”. These buttons are used to direct the user to the respective page.
2) Prediction page: The patient will be able to enter min 3 and max 5 symptoms and after clicking on submit, the four algorithms
working in the backend will display the result.
3) Consultancy page: Based on the location of the user nearby hospitals will be recommended.
4) About page: Brief description about the website including the list of diseases that can be predicted by the website.
5) Feedback page: The user can give us their feedback through the feedback page.
C. Results
Fig. 2 depicts the Home Page of the system from which they can travel to any page of the website using the navbar that include the
Prediction, Consultancy, About and Feedback.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 712
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 10 Issue XI Nov 2022- Available at www.ijraset.com
Fig. 4 represents the Prediction page where after giving five symptoms as input the predicted disease will be displayed with the
help of four algorithms that consist of the Naive Bayer’s, Decision Tree, K Nearest Neighbour and the Random Forest Algorithm.
Fig. 5 shows the Consultancy page where after allowing access to the user’s current location, the GPS will track and display all the
nearby hospitals immediately.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 713
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 10 Issue XI Nov 2022- Available at www.ijraset.com
Fig. 6 depicts the About page that gives the user a brief idea about our website and also displays the list of diseases that can be
predicted by the system.
Fig. 7 represents the Feedback page where the user can share their views, comments or feedback.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 714
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 10 Issue XI Nov 2022- Available at www.ijraset.com
VII. CONCLUSION
Making predictions from data is a strong use of machine learning. But it's crucial to keep in mind that machine learning is only as
effective as the data used to train the algorithms. The website has been created in such a way that using it will be simple and easy for
users. The project successfully implemented a website that could predict a disease with a 97% accuracy rate after comparing the
four algorithms. Along with disease prediction, the website includes an about page, consultancy page and a feedback page where
users can provide valuable feedback. In terms of future work, we intend to store the data of the users and use that information in the
existing dataset and work on increasing the accuracy rate as well as include a greater number of diseases which the model can
predict, generation of report and include services like virtual doctor appointments and online medicine shopping.
VIII. ACKNOWLEDGEMENT
We would like to express our sincere gratitude to our teacher Asst. Prof. Nilambari Narkar, our principal Dr. Y.D. Venkatesh, and
our department head Dr. Vaishali Gaikwad for providing us the golden opportunity to do this wonderful project on the topic. The
research carried out for the project helped us learn a lot of things and it also gave us practical experience with machine learning. We
are really grateful to them.
REFERENCES
[1] studocu. What is a decision tree?
[2] Board Infinity. Explain random forest algorithm in machine learning?
[3] IBM. What is the k-nearest neighbors algorithm?
[4] Reshma. Classification using naive bayes.
[5] Akkem Yaganteeswarudu. Multi disease prediction model by using machine learning and flask api. In 2020 5th International Conference on Communication
and Electronics Systems (ICCES), pages 1242–1246. IEEE, 2020.
[6] P Hamsagayathri and S Vigneshwaran. Symptoms based disease prediction using machine learning techniques. In 2021 Third International Conference on
Intelligent Communication Technologies and Virtual Mobile Networks (ICICV), pages 747–752. IEEE, 2021.
[7] Archana Singh and Rakesh Kumar. Heart disease prediction using machine learning algorithms. In 2020 international conference on electrical and electronics
engineering (ICE3), pages 452–457. IEEE, 2020.
[8] Yanli Liu, Yourong Wang, and Jian Zhang. New machine learning algorithm: Random forest. In International Conference on Information Computing and
Applications, pages 246–252. Springer, 2012.
[9] Daniel Berrar. Bayes’ theorem and naive bayes classifier. Encyclopedia of Bioinformatics and Computational Biology: ABC of Bioinformatics, 403, 2018.
[10] Bahzad Charbuty and Adnan Abdulazeez. Classification based on decision tree algorithm for machine learning. Journal of Applied Science and Technology
Trends, 2(01):20–28, 2021.
[11] Gongde Guo, Hui Wang, David Bell, Yaxin Bi, and Kieran Greer. Knn model-based approach in classification. In OTM Confederated International
Conferences” On the Move to Meaningful Internet Systems”, pages 986–996. Springer, 2003.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 715