A Machine Learning Approach to Early Heart Disease Paper
A Machine Learning Approach to Early Heart Disease Paper
This study systematically assesses key indicators using Logistic Regression as the
primary predictive model. The proposed approach demonstrates the efficiency of ML
algorithms in medical diagnostics, assisting healthcare professionals in risk
assessment and early intervention. By integrating these methodologies into a
predictive model, the study aims to enhance the accuracy and reliability of heart
disease detection.
Keywords:
Heart Disease Prediction, Machine Learning, Logistic Regression, Healthcare
Analytics, Predictive Modeling
1.INTRODUCTION
Heart disease is a global health challenge, accounting for nearly 17.9 million deaths
annually, according to the World Health Organization (WHO) [1]. Early detection
plays a vital role in reducing mortality rates and improving treatment outcomes.
Traditional methods rely on ECG scans, blood tests, and clinical assessments,
which may not always be efficient for early diagnosis.
With advancements in Machine Learning (ML) and Artificial Intelligence (AI),
predictive has become a promising tool in healthcare decision-making. This study
implements Logistic Regression to predict heart disease risks using patient health
data. The objective is to develop a low-cost, automated predictive model that can
assist doctors, hospitals, and healthcare professionals in risk assessment.
The remainder of this paper is organized as follows: Section II covers the literature
review, Section III explains the methodology, Section IV presents the results and
discussion, and Section V concludes with future work.
II. LITERATURE REVIEW
Several research studies have explored the application of machine learning (ML)
techniques in heart disease prediction. Traditional diagnostic methods rely on ECG
readings, blood tests, and clinical assessments, which may not always detect early-
stage conditions.
II. METHODOLOGY
A. Dataset Description
The dataset used in this study is obtained from Kaggle and contains key health
features, including,Age,Cholesterol levels,Blood pressure,ECG results,Maximum
heart rate achieved
The target variable classifies patients into two categories:
B. Data Preprocessing
1. Handling Missing Values: Missing data were replaced using mean
imputation.
2. Feature Scaling: Standardization using StandardScaler for normalization.
3. Train-Test Split: The dataset was divided into 80% training and 20%
testing data.
IV. RESULTS
A. Model Performance
The integration of machine learning with predictive modeling has the potential to
transform heart disease diagnosis. By leveraging ML techniques, medical
professionals can identify high-risk patients at an early stage, enabling timely
interventions and reducing mortality rates.This research demonstrates that Logistic
Regression effectively predicts heart disease risk with an accuracy of 85%. Future
research will explore:
VI. REFERENCES
[3] Heart Disease Prediction with Data Mining: Patel (2022) focuses on the use of
data mining techniques for predicting heart disease risk. By analyzing large datasets,
data mining algorithms can identify patterns and predict the likelihood of a patient
developing cardiovascular issues