0% found this document useful (0 votes)

43 views27 pages

Smoking & Drinking Prediction ML

Uploaded by

Yohannes Dereje

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

43 views27 pages

Smoking & Drinking Prediction ML

Uploaded by

Yohannes Dereje

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 27

Smoking and Alcohol

drinking prediction system

Machine Learning Project

Group Eight
1. Introduction
 The primary objective of this research is to
shed light on the potential applications and
implications of smoking and drinking vs.
body signal prediction systems in modern
healthcare. By understanding the
physiological impact of smoking and
drinking through the lens of data analysis
and prediction, we can empower
individuals to make healthier choices and
enable healthcare professionals to offer
personalized interventions and support.
2. Statement Of the
problem

 Theproblem at hand is the lack of an

effective prediction system that can
accurately assess the impact of
smoking and drinking on the human
body by analyzing physiological
signals and parameters.
Contd ……….
 healthcareprofessionals struggle to provide
targeted interventions and support due to a
limited understanding of the unique
physiological effects of smoking and
drinking on individuals. Thus, there is a need
to develop a robust prediction system that
offers personalized feedback and enables
healthcare professionals to deliver effective
interventions, ultimately reducing the
burden of smoking and drinking-related
diseases.
3. Objective of the
study

3.1. General Objective

 The general objective of this
research is to develop a
comprehensive and accurate
prediction system that utilizes data
analysis techniques to assess the
impact of smoking and drinking on
the human body
3.2. Specific Objective
1. Review and analyze existing literature on the
physiological effects of smoking and drinking.
2. Develop a robust data collection framework to
capture relevant physiological signals.
3. Apply data analysis techniques to identify
patterns and correlations related to smoking and
drinking habits.
4. Develop a user-friendly prediction system that
provides personalized feedback.
5. Validate the prediction system and assess its
effectiveness.
6. Evaluate the potential impact of the prediction
system on public health.
4. Methodology
The "Smoker and Alcohol drinker Prediction" ML
Project Methodology:
 Define Problem
 Data Collection & Ethics
 Exploratory Data Analysis
 Preprocessing
 Feature Engineering
 Model Selection
 Training & Evaluation
 Interpretability
 Deployment & Feedback Loop
 Ethical Considerations & Documentation.
4.1. Data Collection

 Thedata set we used to train our ML

model is derived from from National
Health Insurance Service in Korea.

 We found the data in the link :-

https://www.kaggle.com/datasets/soo
youngher/smoking-drinking-dataset
4.2. Data type
 Inour "Smoker and Drinker Prediction" ML
project, we leveraged numeric continuous data
as our primary data type for training the
predictive models. This approach allows us to
analyze and extract meaningful patterns from a
range of quantitative variables, contributing to
the accuracy and effectiveness of our
predictions. By focusing on numerical attributes,
we aim to capture nuanced correlations and
dependencies within the dataset, enhancing the
reliability of our machine learning algorithms in
predicting smoking and alcohol consumption
behaviors.
4.3. Machine Learning
Algorithm
 We used Logistic Regression Algorithm for our
project.
 Logistic regression is a commonly used algorithm in
predictive modeling and classification tasks, and it
can be employed in the smoking and drinking
prediction system project. It is a statistical model
that determines the relationship between a binary
dependent variable (such as smoking or drinking
behavior) and one or more independent variables
(such as physiological signals and self-reported
data). The goal is to predict the likelihood of an
individual exhibiting a certain behavior (smoking or
drinking) based on the given set of input variables.
Contd ……
 The logistic regression algorithm uses the
logistic or sigmoid function to transform
the output into a probability value between
0 and 1. This probability represents the
likelihood of the individual belonging to a
particular class (e.g., smoker or non-
smoker) based on the input variables. The
logistic regression model estimates the
coefficients of the independent variables,
which indicate the strength and direction
of their influence on the outcome variable.
Contd……
 We used Logistic regression to model the
relationship between the physiological
signals, self-reported data, and the
likelihood of an individual engaging in
smoking or drinking behaviors. The
algorithm will be trained on a labeled
dataset, where the input variables are the
physiological signals and self-reported
data, and the output variable represents
the smoking or drinking behavior (e.g.,
smoker or non-smoker).
5. Data preprocessing/Data
Preparation
 Inthe Data Preprocessing phase of our
project, we meticulously prepared and
cleaned the dataset to ensure its quality and
suitability for machine learning. This involved
handling missing values, addressing outliers,
and normalizing or scaling features to create
a standardized foundation for model training.
Our rigorous data preparation process is
pivotal in optimizing the performance of our
machine learning algorithms, enhancing their
ability to discern meaningful patterns and
trends from the input data.
5.1. Importing the
dependencies
5.2. Data Cleaning
 Data cleaning is the systematic process of
identifying and rectifying errors or
inconsistencies in a dataset, ensuring its
accuracy and reliability for analysis or
machine learning.

 Handlingmissing values involves addressing

the absence of data points in a dataset,
employing techniques such as imputation or
removal to ensure completeness and
accuracy in analysis or modeling.
5.3. Data Labeling and
Annotation

 Handling missing values involves

addressing the absence of data
points in a dataset, employing
techniques such as imputation or
removal to ensure completeness and
accuracy in analysis or modeling.
Contd ………
6. Training the model

 LogisticRegression is a statistical
method used for binary
classification, predicting the
probability of an event's occurrence.
It's widely applied in machine
learning for scenarios where the
outcome is either yes/no or 0/1.
To understand it……….

 Imagine you have a superhero friend

who helps you decide if you should
wear a jacket or not. This superhero
looks at the weather and says, "I'm
80% sure it will rain today." That's a
bit like logistic regression! It helps
predict things with two choices, like
"rain" or "no rain," based on different
factors, and it's really good at
making these yes/no predictions.
Contd…..
7. Model Evaluation

 Model evaluation in machine

learning is the process of assessing
how well a trained model performs
on new, unseen data. It involves
using metrics like accuracy and
precision to measure the model's
effectiveness in making predictions.
The goal is to ensure the model
works reliably on different datasets.
Contd…..
 Model evaluation in machine learning is like
checking how well our superhero friend (the
model) is doing in predicting things. We
give our superhero some test questions
(new data it hasn't seen before) and see if
it gets them right or wrong. It's like asking,
"Did you correctly predict if it would rain
today?" We use metrics, like accuracy, to
measure how good our superhero is at
making predictions. The better our
superhero performs on new challenges, the
more we trust it to help us in the real world!
Contd……
8. Making Predictions
 Predictingsmoking and alcohol consumption
involves using a trained model to anticipate
whether an individual is likely to be a smoker
or drink alcohol based on input features. The
model takes relevant factors into account,
such as demographic information, lifestyle
choices, or health indicators, and produces
predictions indicating the likelihood of
smoking and alcohol consumption. These
predictions aid in understanding and
addressing potential health-related behaviors
in individuals.
Contd…….

 The Smoker and Alcohol Drinker

Prediction System holds the potential
to advance public health by
predicting risky behaviors. Future
improvements may enhance
prediction accuracy, incorporate
diverse data, and foster
collaborations for impactful
preventive interventions.
Contd…..
Thank YOU

Tobacco Use and Mortality, 2004-2015
No ratings yet
Tobacco Use and Mortality, 2004-2015
12 pages
Industrial Report
No ratings yet
Industrial Report
52 pages
G26 Report
No ratings yet
G26 Report
4 pages
Aih Lab1
No ratings yet
Aih Lab1
10 pages
Mini Project Report
No ratings yet
Mini Project Report
21 pages
Ek125 Final Project
No ratings yet
Ek125 Final Project
13 pages
Diabetes Prediction Project ShinyAS
No ratings yet
Diabetes Prediction Project ShinyAS
11 pages
Module-2 - Logistic Regression in Machine Learning
No ratings yet
Module-2 - Logistic Regression in Machine Learning
28 pages
B-56 Sanket Jambhulkar MLA-3
No ratings yet
B-56 Sanket Jambhulkar MLA-3
7 pages
Diabetes Prediction Presentation
No ratings yet
Diabetes Prediction Presentation
12 pages
Disease Pred Report
No ratings yet
Disease Pred Report
42 pages
Diabetes
No ratings yet
Diabetes
41 pages
DM Final
No ratings yet
DM Final
79 pages
Classification Models
No ratings yet
Classification Models
3 pages
IPL Winning Prediction Intern Report
No ratings yet
IPL Winning Prediction Intern Report
52 pages
Smoking Behavior ADS Project 1
No ratings yet
Smoking Behavior ADS Project 1
17 pages
Ads Exp 10
No ratings yet
Ads Exp 10
10 pages
IoT in Hospital Management
No ratings yet
IoT in Hospital Management
7 pages
Machine Learning For Preventive Healthcare
No ratings yet
Machine Learning For Preventive Healthcare
10 pages
Estimating Diabetic Risk Accurately
No ratings yet
Estimating Diabetic Risk Accurately
26 pages
Course Regression Model Strategies PDF
No ratings yet
Course Regression Model Strategies PDF
307 pages
Predicting Disease With Machine Learning
No ratings yet
Predicting Disease With Machine Learning
20 pages
Logistic Regression Lecture Notes
No ratings yet
Logistic Regression Lecture Notes
11 pages
Diabetes Prediction PP T
No ratings yet
Diabetes Prediction PP T
16 pages
Final Year Minor Project
No ratings yet
Final Year Minor Project
9 pages
MLPC Midterm
No ratings yet
MLPC Midterm
18 pages
Thesis
No ratings yet
Thesis
45 pages
Ai Datascience Project Grade 10
No ratings yet
Ai Datascience Project Grade 10
14 pages
BE 883 Data and Analytics Group Presentation-V2
No ratings yet
BE 883 Data and Analytics Group Presentation-V2
38 pages
Seetu Papers 1
No ratings yet
Seetu Papers 1
6 pages
CIEA Term Project
No ratings yet
CIEA Term Project
19 pages
Medhun Final 1
No ratings yet
Medhun Final 1
4 pages
L&T Interview
No ratings yet
L&T Interview
14 pages
Previewpdf
No ratings yet
Previewpdf
45 pages
Final Research Paper
No ratings yet
Final Research Paper
3 pages
LS Project Report
No ratings yet
LS Project Report
10 pages
ML Exp 8
No ratings yet
ML Exp 8
22 pages
Project Synopsis - Disease Prediction System Using Multivariate Health Data
No ratings yet
Project Synopsis - Disease Prediction System Using Multivariate Health Data
2 pages
Developing A Machining Learning Models From Start To Finish.
No ratings yet
Developing A Machining Learning Models From Start To Finish.
59 pages
Logistic Regression for Beginners
No ratings yet
Logistic Regression for Beginners
3 pages
Statistical Prediction and Machine Learning
100% (6)
Statistical Prediction and Machine Learning
314 pages
Exer5 Cabugnason
No ratings yet
Exer5 Cabugnason
7 pages
AI Project Report: By: Neha Kalra (17csu122) and Prerna Pathak (17csu143)
No ratings yet
AI Project Report: By: Neha Kalra (17csu122) and Prerna Pathak (17csu143)
22 pages
Rms PDF
No ratings yet
Rms PDF
506 pages
Classification
No ratings yet
Classification
9 pages
Broadly, There Are 3 Types of Machine Learning Algorithms.
No ratings yet
Broadly, There Are 3 Types of Machine Learning Algorithms.
33 pages
AI Disease Prediction for Healthcare
No ratings yet
AI Disease Prediction for Healthcare
25 pages
ML 01 (Pranavv)
No ratings yet
ML 01 (Pranavv)
14 pages
Google Docs
No ratings yet
Google Docs
26 pages
AI and ML Lab Ex3 To 12
No ratings yet
AI and ML Lab Ex3 To 12
27 pages
Linear Regression Lab Guide
100% (1)
Linear Regression Lab Guide
8 pages
2 Modele Lineare
No ratings yet
2 Modele Lineare
43 pages
Mini Project 5
No ratings yet
Mini Project 5
27 pages
Preview-9781000427899 A41277316
No ratings yet
Preview-9781000427899 A41277316
28 pages
Keith McNulty - Handbook of Regression Modeling in People Analytics-Routledge (2021)
100% (1)
Keith McNulty - Handbook of Regression Modeling in People Analytics-Routledge (2021)
272 pages
Commonly Used Machine Learning Algorithms
No ratings yet
Commonly Used Machine Learning Algorithms
27 pages
Internmod
No ratings yet
Internmod
44 pages
Cloud Article Review G4 Sec 1
No ratings yet
Cloud Article Review G4 Sec 1
7 pages
Chapter 9 - LEX - LabManual
No ratings yet
Chapter 9 - LEX - LabManual
26 pages
Compiler CH-1
No ratings yet
Compiler CH-1
33 pages
Compiler Design and Language Translation
No ratings yet
Compiler Design and Language Translation
60 pages
Machine Learning Model Evaluation Guide
No ratings yet
Machine Learning Model Evaluation Guide
65 pages
Chapter 3 Finite Automata and Lexical Analysis
No ratings yet
Chapter 3 Finite Automata and Lexical Analysis
95 pages
Lab Demo
No ratings yet
Lab Demo
1 page
Chapter Five System Design: Identifying Design Goals Decomposing The System Addressing Design Goals
No ratings yet
Chapter Five System Design: Identifying Design Goals Decomposing The System Addressing Design Goals
31 pages
Chapter 8 Code Optimization and Code Generation
No ratings yet
Chapter 8 Code Optimization and Code Generation
58 pages
CH 5
No ratings yet
CH 5
17 pages
Chapter Five
No ratings yet
Chapter Five
38 pages
Compiler Design for CS Students
No ratings yet
Compiler Design for CS Students
34 pages
Lab Demos
No ratings yet
Lab Demos
13 pages
Software Testing Techniques Guide
No ratings yet
Software Testing Techniques Guide
40 pages
ML - Chapter 5 - Neural Network
No ratings yet
ML - Chapter 5 - Neural Network
64 pages
Chapter 4 Syntax Analysis
No ratings yet
Chapter 4 Syntax Analysis
90 pages
Chapter 3 Database Modeling
No ratings yet
Chapter 3 Database Modeling
51 pages
Chapter 5 Syntax-Directed Translation
No ratings yet
Chapter 5 Syntax-Directed Translation
25 pages
Chapter 1 Query Processing
100% (1)
Chapter 1 Query Processing
63 pages
Chapter 2-Computer Security Attacks and Threats
No ratings yet
Chapter 2-Computer Security Attacks and Threats
40 pages
CH 1 Web - Programming
No ratings yet
CH 1 Web - Programming
34 pages
Chapter 2 DB Security
No ratings yet
Chapter 2 DB Security
40 pages
Grand Hyatt Dubai Fact Sheet English
No ratings yet
Grand Hyatt Dubai Fact Sheet English
1 page
Munchee History
96% (23)
Munchee History
69 pages
Fruit-Bearing Trees & Orchards Guide
100% (1)
Fruit-Bearing Trees & Orchards Guide
16 pages
Lara Bar Flavors & Ingredients
No ratings yet
Lara Bar Flavors & Ingredients
5 pages
Libation Ceremony
No ratings yet
Libation Ceremony
2 pages
Laboratory Manual 5
No ratings yet
Laboratory Manual 5
3 pages
Reading Comprehension (For Kids)
No ratings yet
Reading Comprehension (For Kids)
46 pages
Vietnam Consumer & Retail Report - Q4 2022
No ratings yet
Vietnam Consumer & Retail Report - Q4 2022
47 pages
English 6 1-15B
No ratings yet
English 6 1-15B
1 page
Family Assessment Questionnaire
No ratings yet
Family Assessment Questionnaire
6 pages
Community Nutrition Thesis Topics
50% (2)
Community Nutrition Thesis Topics
7 pages
List of Colors (Alphabetical)
No ratings yet
List of Colors (Alphabetical)
101 pages
Philippine National Standard: Halâl Feeds
No ratings yet
Philippine National Standard: Halâl Feeds
23 pages
Mixture & Alligation
No ratings yet
Mixture & Alligation
6 pages
Filipino Caramel Cake Recipe
No ratings yet
Filipino Caramel Cake Recipe
6 pages
HS - Street Food
No ratings yet
HS - Street Food
10 pages
Best Chewy Chocolate Chip Cookies Recipe
No ratings yet
Best Chewy Chocolate Chip Cookies Recipe
1 page
Spinneys Integrated Report 2024 1741239053
No ratings yet
Spinneys Integrated Report 2024 1741239053
83 pages
Billions S01E04 Axe and Elise
No ratings yet
Billions S01E04 Axe and Elise
3 pages
Product List: NO. Product Name EX Packing
100% (1)
Product List: NO. Product Name EX Packing
1 page
Patient Information: Ciprofloxacin 500 MG Oral Tablet
No ratings yet
Patient Information: Ciprofloxacin 500 MG Oral Tablet
1 page
Term Paper: How Does Globalization Occur Subjectively?
No ratings yet
Term Paper: How Does Globalization Occur Subjectively?
12 pages
Ark Sheet PDF
No ratings yet
Ark Sheet PDF
2 pages
Measurement of Uncertainty and Verification in Microbiological Analysis of Water and Food Samples
No ratings yet
Measurement of Uncertainty and Verification in Microbiological Analysis of Water and Food Samples
7 pages
Approved Indian Abattoirs
No ratings yet
Approved Indian Abattoirs
13 pages
LP-Format-EFDT TLE 7.8 &9
100% (1)
LP-Format-EFDT TLE 7.8 &9
3 pages
Life in The Past - Year 6 Worksheets
100% (1)
Life in The Past - Year 6 Worksheets
11 pages
Overdue Retur Tahun 2024 S.D 2025 - 30 Hari (18.06.2025)
No ratings yet
Overdue Retur Tahun 2024 S.D 2025 - 30 Hari (18.06.2025)
13 pages
WLP - Q4 - Music 10 - Week 4
No ratings yet
WLP - Q4 - Music 10 - Week 4
8 pages
7 Day Weight Gain Diet Plan
No ratings yet
7 Day Weight Gain Diet Plan
15 pages

Smoking & Drinking Prediction ML

Uploaded by

Smoking & Drinking Prediction ML

Uploaded by

Smoking and Alcohol

drinking prediction system

Machine Learning Project

 Theproblem at hand is the lack of an

3.1. General Objective

 Thedata set we used to train our ML

 We found the data in the link :-

 Handlingmissing values involves addressing

 Handling missing values involves

 Imagine you have a superhero friend

 Model evaluation in machine

 The Smoker and Alcohol Drinker

You might also like