Ml-Exp-2 - Jupyter Notebook

Uploaded by

engageelite1407

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views2 pages

Ml-Exp-2 - Jupyter Notebook

Uploaded by

engageelite1407

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

10/20/24, 11:16 PM ml-exp-2 - Jupyter Notebook

In [24]:  # Import necessary libraries

import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC
from sklearn.metrics import classification_report, accuracy_score

# Load the dataset
url = '/kaggle/input/email-spam-classification-dataset-csv/emails.csv'
df = pd.read_csv(url)

# Display the first few rows of the dataset
#print("Initial Dataset:")
#print(df.head())

# Check unique values in 'Prediction' before mapping
#print("\nUnique values in 'Prediction' before mapping:")
#print(df['Prediction'].unique())

# Drop rows where 'Prediction' is NaN to avoid issues
#df = df.dropna(subset=['Prediction'])

# Check the size of the DataFrame after dropping NaNs
#print(f"\nNumber of rows after dropping NaNs: {len(df)}")

# Check unique values in 'Prediction' after dropping NaNs
#print("\nUnique values in 'Prediction' after dropping NaNs:")
#print(df['Prediction'].unique())

# Split dataset into features and labels
X = df['Email No.']
y = df['Prediction']

# Check if the data has sufficient samples
if len(X) == 0 or len(y) == 0:
raise ValueError("The dataset is empty after preprocessing. Please

# Split the dataset into training and test sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2

# Convert the text data into numerical data using TF-IDF vectorization
tfidf = TfidfVectorizer(stop_words='english', max_df=0.7)
X_train_tfidf = tfidf.fit_transform(X_train)
X_test_tfidf = tfidf.transform(X_test)

# K-Nearest Neighbors Classifier
knn = KNeighborsClassifier(n_neighbors=5)
knn.fit(X_train_tfidf, y_train)
y_pred_knn = knn.predict(X_test_tfidf)

# Support Vector Machine Classifier
svm = SVC()
svm.fit(X_train_tfidf, y_train)
y_pred_svm = svm.predict(X_test_tfidf)

# Evaluate the models
print("K-Nearest Neighbors (KNN) Results:")
print(classification_report(y_test, y_pred_knn))
print(f"Accuracy: {accuracy_score(y_test, y_pred_knn) * 100:.2f}%")

localhost:8888/notebooks/Downloads/ml-exp-2.ipynb 3/5
10/20/24, 11:16 PM ml-exp-2 - Jupyter Notebook
print("\nSupport Vector Machine (SVM) Results:")
print(classification_report(y_test, y_pred_svm))
print(f"Accuracy: {accuracy_score(y_test, y_pred_svm) * 100:.2f}%")

K-Nearest Neighbors (KNN) Results:

precision recall f1-score support

0 0.71 1.00 0.83 739

1 0.00 0.00 0.00 296

accuracy 0.71 1035

macro avg 0.36 0.50 0.42 1035
weighted avg 0.51 0.71 0.59 1035

Accuracy: 71.40%

Support Vector Machine (SVM) Results:

precision recall f1-score support

0 0.71 1.00 0.83 739

1 0.00 0.00 0.00 296

accuracy 0.71 1035

macro avg 0.36 0.50 0.42 1035
weighted avg 0.51 0.71 0.59 1035

Accuracy: 71.40%

/opt/conda/lib/python3.10/site-packages/sklearn/metrics/_classificati
on.py:1344: UndefinedMetricWarning: Precision and F-score are ill-def
ined and being set to 0.0 in labels with no predicted samples. Use `z
ero_division` parameter to control this behavior.
_warn_prf(average, modifier, msg_start, len(result))
/opt/conda/lib/python3.10/site-packages/sklearn/metrics/_classificati
on.py:1344: UndefinedMetricWarning: Precision and F-score are ill-def
ined and being set to 0.0 in labels with no predicted samples. Use `z
ero_division` parameter to control this behavior.
_warn_prf(average, modifier, msg_start, len(result))
/opt/conda/lib/python3.10/site-packages/sklearn/metrics/_classificati
on.py:1344: UndefinedMetricWarning: Precision and F-score are ill-def
ined and being set to 0.0 in labels with no predicted samples. Use `z
ero_division` parameter to control this behavior.
_warn_prf(average, modifier, msg_start, len(result))
/opt/conda/lib/python3.10/site-packages/sklearn/metrics/_classificati
on.py:1344: UndefinedMetricWarning: Precision and F-score are ill-def
ined and being set to 0.0 in labels with no predicted samples. Use `z
ero_division` parameter to control this behavior.
_warn_prf(average, modifier, msg_start, len(result))
/opt/conda/lib/python3.10/site-packages/sklearn/metrics/_classificati
on.py:1344: UndefinedMetricWarning: Precision and F-score are ill-def
ined and being set to 0.0 in labels with no predicted samples. Use `z
ero_division` parameter to control this behavior.
_warn_prf(average, modifier, msg_start, len(result))
/opt/conda/lib/python3.10/site-packages/sklearn/metrics/_classificati
on.py:1344: UndefinedMetricWarning: Precision and F-score are ill-def
ined and being set to 0.0 in labels with no predicted samples. Use `z
ero_division` parameter to control this behavior.
_warn_prf(average, modifier, msg_start, len(result))

localhost:8888/notebooks/Downloads/ml-exp-2.ipynb 4/5

ESCL-SOP-019, Procedure For Control of Inspection, Measurement and Test Equipment
No ratings yet
ESCL-SOP-019, Procedure For Control of Inspection, Measurement and Test Equipment
5 pages
Transfer of Analytical Procedures 1224 Usp42 - NF
No ratings yet
Transfer of Analytical Procedures 1224 Usp42 - NF
2 pages
Core Tools: Measurement Systems Analysis (MSA)
No ratings yet
Core Tools: Measurement Systems Analysis (MSA)
6 pages
Model and Simulation PDF
No ratings yet
Model and Simulation PDF
253 pages
Full
No ratings yet
Full
605 pages
1-Basic Concept of Measurement Systems
No ratings yet
1-Basic Concept of Measurement Systems
51 pages
University of Bath Department of Mechanical Engineering: Me10304 Mathematics 1 Drdasrees
No ratings yet
University of Bath Department of Mechanical Engineering: Me10304 Mathematics 1 Drdasrees
13 pages
Validation in Clinical Chemistry: Elvar Theodorsson
No ratings yet
Validation in Clinical Chemistry: Elvar Theodorsson
26 pages
Grade 7 Science Topic 3
No ratings yet
Grade 7 Science Topic 3
17 pages
Enzyme Technology: Activity Brief
No ratings yet
Enzyme Technology: Activity Brief
10 pages
2 Physics-Module 1-INTRODUCTION - Student PDF
No ratings yet
2 Physics-Module 1-INTRODUCTION - Student PDF
14 pages
Horizontal Accuracy Reporting Using Terrasolid Products: Terramatch, Terraphoto Versions 012 and Above
No ratings yet
Horizontal Accuracy Reporting Using Terrasolid Products: Terramatch, Terraphoto Versions 012 and Above
8 pages
Measurement, Data Processing & Analysis (Second Test)
No ratings yet
Measurement, Data Processing & Analysis (Second Test)
7 pages
Micromachines 13 00947
No ratings yet
Micromachines 13 00947
12 pages
Heart: Our "Goal" Predict The Presence of Heart Disease in The Patient
100% (1)
Heart: Our "Goal" Predict The Presence of Heart Disease in The Patient
73 pages
MELC6 Contrast Indigenous Media To The More Common Sources of Information Such As Library Internet Etc.
100% (1)
MELC6 Contrast Indigenous Media To The More Common Sources of Information Such As Library Internet Etc.
40 pages
A Phase-Based Technique For Localization of UHF-RFID Tags Moving On A Conveyor Belt Performance Analysis and Test-Case Measurements
No ratings yet
A Phase-Based Technique For Localization of UHF-RFID Tags Moving On A Conveyor Belt Performance Analysis and Test-Case Measurements
10 pages
MTEB: Massive Text Embedding Benchmark
No ratings yet
MTEB: Massive Text Embedding Benchmark
24 pages
Chapter6: Experiment On Fps Elimination
No ratings yet
Chapter6: Experiment On Fps Elimination
8 pages
Machine Learnin1
100% (1)
Machine Learnin1
41 pages
Measurement Terminology
No ratings yet
Measurement Terminology
3 pages
Pharmaceutical Validation - A Review 2021
No ratings yet
Pharmaceutical Validation - A Review 2021
6 pages
Lab Week 7
No ratings yet
Lab Week 7
3 pages
Assignment 3
No ratings yet
Assignment 3
7 pages
I Avaliação Parcial - 25.0 PTS - Gabarito
No ratings yet
I Avaliação Parcial - 25.0 PTS - Gabarito
9 pages
Lecture 02
No ratings yet
Lecture 02
12 pages
A Novel Method For Facial Recognition Ba
No ratings yet
A Novel Method For Facial Recognition Ba
5 pages
Scikit-Learn: Scikit-Learn Is An Open Source Python Library That
100% (1)
Scikit-Learn: Scikit-Learn Is An Open Source Python Library That
1 page
Pract5 1
No ratings yet
Pract5 1
3 pages
Scikit-Learn Cheat Sheet Python For Data Science: Preprocessing The Data Evaluate Your Model's Performance
100% (1)
Scikit-Learn Cheat Sheet Python For Data Science: Preprocessing The Data Evaluate Your Model's Performance
1 page
Case Study - Classifier
No ratings yet
Case Study - Classifier
5 pages
Qualitative and Quantitative Analysis
No ratings yet
Qualitative and Quantitative Analysis
18 pages
6 - 2 - SVMS, - Randon - Forests - and - KNN - Ipynb - Colaboratory
No ratings yet
6 - 2 - SVMS, - Randon - Forests - and - KNN - Ipynb - Colaboratory
4 pages
Python For Data Science Cheat Sheet: Scikit-Learn Create Your Model Evaluate Your Model's Performance
100% (1)
Python For Data Science Cheat Sheet: Scikit-Learn Create Your Model Evaluate Your Model's Performance
1 page
ML 11 Decision Trees
No ratings yet
ML 11 Decision Trees
4 pages
Scikit Learn Cheat Sheet Python
No ratings yet
Scikit Learn Cheat Sheet Python
1 page
Machine Learning
No ratings yet
Machine Learning
3 pages
Scikit-Learn Cheat Sheet
No ratings yet
Scikit-Learn Cheat Sheet
1 page
Some Basic Concepts of Chemistry
No ratings yet
Some Basic Concepts of Chemistry
128 pages
Python CA 4
No ratings yet
Python CA 4
9 pages
ML Lab6
No ratings yet
ML Lab6
4 pages
Apply Logistic Regression To Amazon Reviews Data Set (M)
No ratings yet
Apply Logistic Regression To Amazon Reviews Data Set (M)
11 pages
G 203008076 - 4 - Christhian Quiñonez - Ex1 - 2 A PDF
No ratings yet
G 203008076 - 4 - Christhian Quiñonez - Ex1 - 2 A PDF
20 pages
Unit2 ML Programs
No ratings yet
Unit2 ML Programs
7 pages
ML 2 16
No ratings yet
ML 2 16
6 pages
ML 5
No ratings yet
ML 5
3 pages
CSC 301 Ass.1
No ratings yet
CSC 301 Ass.1
7 pages
Scikit-Learn Cheat Sheet
No ratings yet
Scikit-Learn Cheat Sheet
1 page
Classification of Instruments
No ratings yet
Classification of Instruments
18 pages
Praveen Ai
No ratings yet
Praveen Ai
6 pages
ML Assignment 4
No ratings yet
ML Assignment 4
7 pages
Multi Classification - Py (For 1 Class TP, TN, FP, FN)
No ratings yet
Multi Classification - Py (For 1 Class TP, TN, FP, FN)
25 pages
ML Lab Prgms Split
No ratings yet
ML Lab Prgms Split
3 pages
TP - Ipynb - Colab
No ratings yet
TP - Ipynb - Colab
6 pages
Prac7 23bme053
No ratings yet
Prac7 23bme053
2 pages
Experiment 2 FDL - Jupyter Notebook
No ratings yet
Experiment 2 FDL - Jupyter Notebook
2 pages
Hatespeech Code Ipynb
No ratings yet
Hatespeech Code Ipynb
31 pages
A Dataset For Multimodal Information Retrieval in PDF-based Visual Question Answering
No ratings yet
A Dataset For Multimodal Information Retrieval in PDF-based Visual Question Answering
22 pages
6.2 Cloning and Biotechnology MS
No ratings yet
6.2 Cloning and Biotechnology MS
33 pages
Scaling in One Range: 5172 Rows × 3002 Columns
No ratings yet
Scaling in One Range: 5172 Rows × 3002 Columns
2 pages
Ann Experiential Learning
No ratings yet
Ann Experiential Learning
43 pages
TASK 8: Deploy Support Vector Machine, Apriori Algorithm: BTCS619-18
No ratings yet
TASK 8: Deploy Support Vector Machine, Apriori Algorithm: BTCS619-18
5 pages
ML Yogesh
No ratings yet
ML Yogesh
23 pages
ML Keshav
No ratings yet
ML Keshav
23 pages
1 Pharmaceutical Chemistry Sample Paper Noteskarts
No ratings yet
1 Pharmaceutical Chemistry Sample Paper Noteskarts
2 pages
ML101 Graded Assignment 2.ipynb - Colab
No ratings yet
ML101 Graded Assignment 2.ipynb - Colab
6 pages
ML Expt 4
No ratings yet
ML Expt 4
4 pages
Machine Learning Assignment
No ratings yet
Machine Learning Assignment
8 pages
IDS U-5 Answers
No ratings yet
IDS U-5 Answers
16 pages
Bi 6 New
No ratings yet
Bi 6 New
6 pages
ML Lab Programs 2
No ratings yet
ML Lab Programs 2
16 pages
Machine Learning Assignment
No ratings yet
Machine Learning Assignment
7 pages
Experiment 7
No ratings yet
Experiment 7
3 pages
ML Lab 8
No ratings yet
ML Lab 8
9 pages
Synt 3M 01 09 04 03 B - en
No ratings yet
Synt 3M 01 09 04 03 B - en
63 pages
KNN
No ratings yet
KNN
4 pages
Dsbda 5
No ratings yet
Dsbda 5
4 pages
EX - NO:3: Algorithm
No ratings yet
EX - NO:3: Algorithm
11 pages
Machine Learning Final Report
No ratings yet
Machine Learning Final Report
8 pages
Shobit Sharma (2124399) ML Lab File PDF
No ratings yet
Shobit Sharma (2124399) ML Lab File PDF
19 pages
Dsbda 10
No ratings yet
Dsbda 10
5 pages
ML Mini Project
No ratings yet
ML Mini Project
9 pages
ML Mini Project
No ratings yet
ML Mini Project
9 pages
ML Functions
No ratings yet
ML Functions
12 pages
TensorFlow深度学习项目实战: Chinese Edition
From Everand
TensorFlow深度学习项目实战: Chinese Edition
Posts & Telecom Press
No ratings yet
K Nearest Neighbor Algorithm: Fundamentals and Applications
From Everand
K Nearest Neighbor Algorithm: Fundamentals and Applications
Fouad Sabry
No ratings yet
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
MCS-011: Problem Solving and Programming
From Everand
MCS-011: Problem Solving and Programming
Dr. DK Sukhani
No ratings yet
DATA MINING and MACHINE LEARNING: CLUSTER ANALYSIS and kNN CLASSIFIERS. Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING: CLUSTER ANALYSIS and kNN CLASSIFIERS. Examples with MATLAB
César Pérez López
No ratings yet
Advanced C Concepts and Programming: First Edition
From Everand
Advanced C Concepts and Programming: First Edition
Gayatri
3/5 (1)

Ml-Exp-2 - Jupyter Notebook

Uploaded by

Ml-Exp-2 - Jupyter Notebook

Uploaded by

10/20/24, 11:16 PM ml-exp-2 - Jupyter Notebook

In [24]:  # Import necessary libraries

K-Nearest Neighbors (KNN) Results:

0 0.71 1.00 0.83 739

accuracy 0.71 1035

Support Vector Machine (SVM) Results:

0 0.71 1.00 0.83 739

accuracy 0.71 1035

You might also like