Customer Churn Analysis - Jupyter Notebook

This document summarizes the steps taken to analyze customer churn in a telecom dataset:
1. Libraries are imported and the dataset is read into a pandas DataFrame with 7,043 rows and 21 columns of customer data.
2. The customerID column is dropped, TotalCharges is converted to numeric, and the 11 rows with resulting null values are dropped.
3. Exploratory data analysis begins with a bar plot of churned versus retained customers.

Uploaded by

akash.050501

07/12/2023, 15:50 Customer Churn Analysis - Jupyter Notebook

Import necessary libraries

In [1]:
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns

Read dataset
In [2]:
df = pd.read_csv('Tel_Customer_Churn_Dataset.csv')
df.head()

Out[2]:
   customerID  gender  SeniorCitizen  Partner  Dependents  tenure  PhoneService     MultipleLines  InternetService  ...
0  7590-VHVEG  Female              0      Yes          No       1            No  No phone service              DSL  ...
1  5575-GNVDE    Male              0       No          No      34           Yes                No              DSL  ...
2  3668-QPYBK    Male              0       No          No       2           Yes                No              DSL  ...
3  7795-CFOCW    Male              0       No          No      45            No  No phone service              DSL  ...
4  9237-HQITU  Female              0       No          No       2           Yes                No      Fiber optic  ...

5 rows × 21 columns

In [3]:
df.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 7043 entries, 0 to 7042
Data columns (total 21 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 customerID 7043 non-null object
1 gender 7043 non-null object
2 SeniorCitizen 7043 non-null int64
3 Partner 7043 non-null object
4 Dependents 7043 non-null object
5 tenure 7043 non-null int64
6 PhoneService 7043 non-null object
7 MultipleLines 7043 non-null object
8 InternetService 7043 non-null object
9 OnlineSecurity 7043 non-null object
10 OnlineBackup 7043 non-null object
11 DeviceProtection 7043 non-null object
12 TechSupport 7043 non-null object
13 StreamingTV 7043 non-null object
14 StreamingMovies 7043 non-null object
15 Contract 7043 non-null object
16 PaperlessBilling 7043 non-null object
17 PaymentMethod 7043 non-null object
18 MonthlyCharges 7043 non-null float64
19 TotalCharges 7043 non-null object
20 Churn 7043 non-null object
dtypes: float64(1), int64(2), object(18)
memory usage: 1.1+ MB

Dropping unwanted columns

localhost:8888/notebooks/KINGS LABS/4. Customer churn analysis/Customer Churn Analysis.ipynb 1/10

In [4]:
df = df.drop(["customerID"], axis=1)
df.head()

Out[4]:
   gender  SeniorCitizen  Partner  Dependents  tenure  PhoneService     MultipleLines  InternetService  OnlineSecurity  ...
0  Female              0      Yes          No       1            No  No phone service              DSL              No  ...
1    Male              0       No          No      34           Yes                No              DSL             Yes  ...
2    Male              0       No          No       2           Yes                No              DSL             Yes  ...
3    Male              0       No          No      45            No  No phone service              DSL             Yes  ...
4  Female              0       No          No       2           Yes                No      Fiber optic              No  ...

Converting the 'TotalCharges' column to numeric values

In [5]:
df['TotalCharges'] = pd.to_numeric(df['TotalCharges'], errors='coerce')
df.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 7043 entries, 0 to 7042
Data columns (total 20 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 gender 7043 non-null object
1 SeniorCitizen 7043 non-null int64
2 Partner 7043 non-null object
3 Dependents 7043 non-null object
4 tenure 7043 non-null int64
5 PhoneService 7043 non-null object
6 MultipleLines 7043 non-null object
7 InternetService 7043 non-null object
8 OnlineSecurity 7043 non-null object
9 OnlineBackup 7043 non-null object
10 DeviceProtection 7043 non-null object
11 TechSupport 7043 non-null object
12 StreamingTV 7043 non-null object
13 StreamingMovies 7043 non-null object
14 Contract 7043 non-null object
15 PaperlessBilling 7043 non-null object
16 PaymentMethod 7043 non-null object
17 MonthlyCharges 7043 non-null float64
18 TotalCharges 7032 non-null float64
19 Churn 7043 non-null object
dtypes: float64(2), int64(2), object(16)
memory usage: 1.1+ MB
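
The drop from 7043 to 7032 non-null values in TotalCharges is what errors='coerce' would produce if 11 entries could not be parsed as numbers: such strings become NaN instead of raising an error. A minimal sketch on a made-up Series (the values are illustrative and not taken from the dataset) shows the effect:

```python
import pandas as pd

# Hypothetical sample values; the blank string stands in for an unparseable entry.
raw = pd.Series(["29.85", "1889.5", " ", "108.15"])

# errors='coerce' converts unparseable strings to NaN instead of raising ValueError.
parsed = pd.to_numeric(raw, errors="coerce")

print(parsed.dtype)          # dtype after conversion
print(parsed.isna().sum())   # number of entries that were coerced to NaN
```

Inspecting the original strings at the positions where the parsed result is NaN is one way to see what the unparseable entries looked like before deciding to drop them.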

Checking for null values

In [6]:
df.isnull().sum()

Out[6]: gender 0
SeniorCitizen 0
Partner 0
Dependents 0
tenure 0
PhoneService 0
MultipleLines 0
InternetService 0
OnlineSecurity 0
OnlineBackup 0
DeviceProtection 0
TechSupport 0
StreamingTV 0
StreamingMovies 0
Contract 0
PaperlessBilling 0
PaymentMethod 0
MonthlyCharges 0
TotalCharges 11
Churn 0
dtype: int64

Treating null values

In [7]:
df = df.dropna()

In [8]:
df.isnull().sum().sum()

Out[8]: 0

EDA

In [9]:
# churn distribution
# Take labels and bar heights from the same value_counts() result so they stay aligned.
churn_counts = df['Churn'].value_counts()
plt.figure(figsize=(8, 6))
plt.bar(churn_counts.index, churn_counts.values, color=['green', 'yellow'])
plt.title('Churn Distribution', fontsize=16, fontweight='bold')
plt.xlabel('Churn')
plt.ylabel('Count')
plt.grid(axis='y', linestyle='--', alpha=0.7)
plt.show()
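
The bar plot shows absolute counts. When class imbalance matters, as it does later for the classifiers, the class shares are often easier to read. A small sketch, using a made-up Series in place of df['Churn']:

```python
import pandas as pd

# Hypothetical labels standing in for df['Churn'].
churn = pd.Series(["No", "No", "Yes", "No", "Yes", "No", "No", "No"])

counts = churn.value_counts()                # absolute count per class
shares = churn.value_counts(normalize=True)  # fraction of rows per class

print(counts)
print(shares.round(3))
```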

In [10]:
# churn vs gender
sns.countplot(x='gender', hue="Churn", data=df, palette=['green', 'yellow'], edgecolor = 'Bl
sns.despine()
plt.grid(axis='y', linestyle='--', alpha=0.7)

In [11]:
# churn vs SeniorCitizen
sns.countplot(x='SeniorCitizen', hue="Churn", data=df, palette=['green', 'yellow'], edgecolo
sns.despine()
plt.grid(axis='y', linestyle='--', alpha=0.7)

In [12]:
# churn vs partner
sns.countplot(x='Partner', hue="Churn", data=df, palette=['green', 'yellow'], edgecolor = 'B
sns.despine()
plt.grid(axis='y', linestyle='--', alpha=0.7)

In [13]:
# churn vs dependents
sns.countplot(x='Dependents', hue="Churn", data=df, palette=['green', 'yellow'], edgecolor =
sns.despine()
plt.grid(axis='y', linestyle='--', alpha=0.7)

In [14]:
# churn vs contract
sns.countplot(x='Contract', hue="Churn", data=df, palette=['green', 'yellow'], edgecolor =
sns.despine()
plt.grid(axis='y', linestyle='--', alpha=0.7)

In [15]:
# churn vs MonthlyCharges
ax = sns.kdeplot(df['MonthlyCharges'][df["Churn"] == 'No'], fill=True, color='green')
ax = sns.kdeplot(df['MonthlyCharges'][df["Churn"] == 'Yes'], ax=ax, fill=True, color='skyblue')
ax.legend(["Not Churn", "Churn"], loc='upper right')
ax.set_ylabel('Density')
ax.set_xlabel('Monthly Charges')
ax.set_title('Distribution of monthly charges by churn')

Out[15]: Text(0.5, 1.0, 'Distribution of monthly charges by churn')

In [16]:
# churn vs TotalCharges
ax = sns.kdeplot(df['TotalCharges'][df["Churn"] == 'No'], fill=True, color='green')
ax = sns.kdeplot(df['TotalCharges'][df["Churn"] == 'Yes'], ax=ax, fill=True, color='skyblue')
ax.legend(["Not Churn", "Churn"], loc='upper right')
ax.set_ylabel('Density')
ax.set_xlabel('TotalCharges')
ax.set_title('Distribution of Total Charges by churn')

Out[16]: Text(0.5, 1.0, 'Distribution of Total Charges by churn')

Label encoding
In [17]:
df["gender"] = df["gender"].map({"Female": 0, "Male": 1})
df["Partner"] = df["Partner"].map({"No": 0, "Yes": 1})
df["Dependents"] = df["Dependents"].map({"No": 0, "Yes": 1})
df["PhoneService"] = df["PhoneService"].map({"No": 0, "Yes": 1})
df["PaperlessBilling"] = df["PaperlessBilling"].map({"No": 0, "Yes": 1})
df["Churn"] = df["Churn"].map({"No": 0, "Yes": 1})
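
One thing to watch with dictionary-based encoding: Series.map returns NaN for any value that is missing from the dictionary, so an unexpected label silently becomes a null. A quick check after encoding can catch this. The sketch below uses a made-up column with a deliberately inconsistent label:

```python
import pandas as pd

# Hypothetical column with one unexpected label ("yes" in lower case).
partner = pd.Series(["Yes", "No", "yes", "No"])

encoded = partner.map({"No": 0, "Yes": 1})

# Values absent from the mapping become NaN; list them before modelling.
unmapped = partner[encoded.isna()].unique().tolist()
print(unmapped)
```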

In [18]:
df = pd.get_dummies(df, drop_first=True)
df.head()

Out[18]:
   gender  SeniorCitizen  Partner  Dependents  tenure  PhoneService  PaperlessBilling  MonthlyCharges  TotalCharges
0       0              0        1           0       1             0                 1           29.85         29.85
1       1              0        0           0      34             1                 0           56.95       1889.50
2       1              0        0           0       2             1                 1           53.85        108.15
3       1              0        0           0      45             0                 0           42.30       1840.75
4       0              0        0           0       2             1                 1           70.70        151.65

5 rows × 31 columns
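
The jump from 20 to 31 columns comes from one-hot encoding the remaining object columns. With drop_first=True, a column with k levels yields k-1 indicator columns, which avoids a redundant indicator per feature. A minimal sketch on a made-up frame (the column values are assumptions for illustration):

```python
import pandas as pd

# Hypothetical frame with one numeric and one three-level categorical column.
toy = pd.DataFrame({
    "tenure": [1, 34, 2],
    "Contract": ["Month-to-month", "One year", "Two year"],
})

# drop_first=True drops the first level (in sorted order) of each categorical
# column, so three levels become two indicator columns.
encoded = pd.get_dummies(toy, drop_first=True)
print(list(encoded.columns))
```

The dropped level ("Month-to-month" here) becomes the implicit baseline: a row with both indicators off belongs to it.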

Assigning dependent and independent variable

In [19]:
X = df.drop(columns="Churn")
y = df["Churn"]

Machine Learning classification model libraries

In [20]:
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn import metrics
from sklearn.metrics import classification_report
from sklearn.model_selection import train_test_split

Splitting the dataset into training and testing set

In [21]:
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size = 0.3, random_state =12
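
With test_size = 0.3, scikit-learn rounds the test share up, which is consistent with the 2110 test rows reported below for the 7032 rows left after the null rows were dropped. Because churners are the minority class, passing stratify=y is one option for keeping the class proportions the same in both splits. The visible part of the notebook's call does not show whether it does this, so the sketch below is an assumption on synthetic data, not the notebook's own split:

```python
import math
import numpy as np
from sklearn.model_selection import train_test_split

# Test-set size for 7032 rows at test_size=0.3 (scikit-learn rounds up).
print(math.ceil(0.3 * 7032))

# Synthetic data: 100 rows with 25% positives (illustrative only).
X = np.arange(200).reshape(100, 2)
y = np.array([0] * 75 + [1] * 25)

# stratify=y keeps the share of positives roughly equal in train and test.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)
print(len(y_test), y_test.mean())
```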

Logistic regression
In [22]:
logmodel = LogisticRegression(random_state=50)
logmodel.fit(X_train, y_train)
pred = logmodel.predict(X_test)

print(classification_report(y_test, pred))

              precision    recall  f1-score   support

           0       0.85      0.88      0.87      1567
           1       0.63      0.57      0.60       543

    accuracy                           0.80      2110
   macro avg       0.74      0.73      0.73      2110
weighted avg       0.80      0.80      0.80      2110

C:\Users\msi\anaconda3\Lib\site-packages\sklearn\linear_model\_logistic.py:460: ConvergenceWarning: lbfgs failed to converge (status=1):
STOP: TOTAL NO. of ITERATIONS REACHED LIMIT.

Increase the number of iterations (max_iter) or scale the data as shown in:
    https://scikit-learn.org/stable/modules/preprocessing.html
Please also refer to the documentation for alternative solver options:
    https://scikit-learn.org/stable/modules/linear_model.html#logistic-regression
  n_iter_i = _check_optimize_result(
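
The warning names two remedies: scaling the data and raising max_iter. A hedged sketch of both, using a scikit-learn pipeline on synthetic data (the data, the max_iter value of 1000, and the pipeline itself are assumptions for illustration; they are not part of the notebook):

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic data with one feature on a much larger scale, loosely mimicking
# a charges column sitting next to 0/1 indicator columns (illustrative only).
X, y = make_classification(n_samples=500, n_features=6, random_state=0)
X[:, 0] *= 1000

# Standardise the features, then fit; max_iter is raised above the default of 100.
model = make_pipeline(
    StandardScaler(),
    LogisticRegression(max_iter=1000, random_state=50),
)
model.fit(X, y)

# Number of lbfgs iterations actually used by the fitted classifier.
n_iter = model.named_steps["logisticregression"].n_iter_[0]
print(n_iter)
```

Keeping the scaler inside the pipeline means it is fitted on the training data only, so calling model.predict on held-out rows reuses the training statistics.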

Decision Tree

In [23]:
dtmodel = DecisionTreeClassifier(criterion = "gini", random_state = 50)
dtmodel.fit(X_train, y_train)
dt_pred = dtmodel.predict(X_test)

print(classification_report(y_test, dt_pred))

              precision    recall  f1-score   support

           0       0.83      0.79      0.81      1567
           1       0.47      0.52      0.49       543

    accuracy                           0.72      2110
   macro avg       0.65      0.66      0.65      2110
weighted avg       0.73      0.72      0.73      2110

Random Forest
In [24]:
rfmodel = RandomForestClassifier(n_estimators = 100, criterion = 'entropy', random_state = 0
rfmodel.fit(X_train, y_train)
rf_pred = rfmodel.predict(X_test)

print(classification_report(y_test, rf_pred))

              precision    recall  f1-score   support

           0       0.84      0.89      0.86      1567
           1       0.62      0.50      0.55       543

    accuracy                           0.79      2110
   macro avg       0.73      0.69      0.71      2110
weighted avg       0.78      0.79      0.78      2110
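
Since churners (label 1) are the class of interest and the minority, overall accuracy can hide differences between the models. The sketch below takes the churn-class precision and recall printed in the three reports above, recomputes F1 as their harmonic mean, and ranks the models on it:

```python
# Churn-class (label 1) precision and recall as printed in the three reports.
reports = {
    "Logistic regression": (0.63, 0.57),
    "Decision tree": (0.47, 0.52),
    "Random forest": (0.62, 0.50),
}

def f1(precision, recall):
    # F1 is the harmonic mean of precision and recall.
    return 2 * precision * recall / (precision + recall)

scores = {name: round(f1(p, r), 2) for name, (p, r) in reports.items()}
best = max(scores, key=scores.get)

for name, score in scores.items():
    print(f"{name}: F1 = {score}")
print("Highest churn-class F1:", best)
```

The recomputed values agree with the f1-score column of the reports. On this split, logistic regression has the highest churn-class F1 and recall of the three models.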

In [25]:
# Factors contributing to customer attrition:
# 1. Contract - churn rate is high when it is 'Month to month'
# 2. Monthly charges - churn rate is high when they are between 65 and 110
# 3. Total charges - churn rate is high when they are less than 2000
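
These conclusions are read off the plots. A per-group churn rate is one way to put a number on the first of them. The sketch below uses made-up rows; on the real data it would apply to the frame before one-hot encoding, with Churn already mapped to 0/1:

```python
import pandas as pd

# Hypothetical rows for illustration; not taken from the dataset.
toy = pd.DataFrame({
    "Contract": ["Month-to-month"] * 4 + ["One year"] * 2 + ["Two year"] * 2,
    "Churn": [1, 1, 1, 0, 0, 1, 0, 0],
})

# The mean of a 0/1 column within each group is that group's churn rate.
rate = toy.groupby("Contract")["Churn"].mean()
print(rate)
```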
