CH05 Business Analytics Process and Data Exploration
Business Analytics
Course Overview
• This chapter covers the data exploration, validation, and cleaning required
for data analysis. You'll learn the purpose of data cleaning, why data
preparation is needed, how to handle missing values, and some of the
data-cleaning techniques used in the industry.
Course Contents
• Business Analytics Life Cycle
• Understanding the Business Problem
• Collecting and Integrating the Data
• Preprocessing the Data
• Exploring and Visualizing the Data
• Using Modeling Techniques and Algorithms
• Evaluating the Model
• Presenting a Management Report and Review
• Deploying the Model
Business Analytics Life Cycle
• The purpose is to derive information from data in order to make
appropriate business decisions.
• The life cycle consists of eight phases:
a. Understand the Business Problem
b. Collect and Integrate the Data
c. Preprocess the Data
d. Explore and Visualize the Data
e. Choose Modeling Techniques and Algorithms
f. Evaluate the Model
g. Report to Management and Review
h. Deploy the Model
Phase 1 → Understand the Business Problem
• The focus is to understand the problem, objectives, and requirements
from the perspective of the business.
• The business problem is then converted into a data analytics problem, with
the aim of solving it by using appropriate methods to achieve the objective.
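• RMSE = $\sqrt{\frac{1}{n} \sum_{k=1}^{n} (\hat{y}_k - y_k)^2}$, where $\hat{y}_k$ is the
predicted value, $y_k$ is the actual value, and $n$ is the number of observations.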
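• The RMSE (root mean squared error) calculation can be sketched in a few lines of Python. This is a minimal illustration, not a prescribed implementation: the function name `rmse` and the sample values are assumptions made up for this example, and the sum runs over the n observations, dividing by n.

```python
import math


def rmse(predicted, actual):
    """Root mean squared error between two equal-length sequences of numbers."""
    n = len(predicted)
    if n != len(actual):
        raise ValueError("predicted and actual must have the same length")
    if n == 0:
        raise ValueError("at least one observation is required")
    # Square each prediction error, average over n, then take the square root.
    squared_errors = [(p - a) ** 2 for p, a in zip(predicted, actual)]
    return math.sqrt(sum(squared_errors) / n)


# Hypothetical predictions and actual values, for illustration only.
y_hat = [2.0, 4.0, 6.0]
y = [1.0, 4.0, 8.0]
print(rmse(y_hat, y))
```

• A lower RMSE means the predictions are closer to the actual values on average; because the errors are squared before averaging, large errors are penalized more heavily than small ones.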
Presenting a Management Report and Review
• Problem Description
• Data Set Used
• Data Cleaning Carried Out
• Method Used to Create the Model
• Model Deployment Prerequisites
• Model Deployment and Usage
• Issue Handling
Deploying the Model
• A challenging phase of the project.
• The model is now deployed for end users in a production
environment, where it analyzes live data.
• Success of the deployment depends on the following:
a. Proper sizing of the hardware, ensuring required performance
b. Proper programming to handle the capabilities of the hardware
c. Proper data integration and cleaning
d. Effective reports, dashboards, views, decisions, and interventions to be
used by end users or end-user systems
e. Effective training for the users of the model