BAUE
BAUE
What is business
analytics
A c c o r d i n g t o W i k i p e d i a ,” B u s i n e s s a n a l y t i c s r e f e r s t o t h e
skills, technologies, practices for continuous iterative
exploration and investigation of past business
performance to gain insight and drive business planning
Evolution of
It has its roots in operations research, which
Business was extensively used during World War II
Analytics
Operations research was an analytical way to
look at data to conduct military operations
Components of Business
Analytics
Business Problem Framing: In this step, we basically find out what business problem we
are trying to solve, e.g., when we are looking to find out why the supply chain isn’t as
effective as it should be or why we are losing sales
Analytics Problem Framing: Once we have the problem statement, what we need to think
of next is how analytics can be done for that business analytics problem
Data: The moment we identify the problem in terms of what needs to be analyzed, the
next thing that we need is data, which needs to be analyzed
Methodology selection and model building: Once the data gets ready, the tricky part
begins
The Business Analytics Process
Deployment: Post the selection of the model and the statistical ways
of analyzing data for the solution, the next thing we need to do is to
test the solution in a real-time scenario
Use cases of Business Analytics
A ) F i n a n c e B u s i n e s s A n a l y t i c s c a n h e l p fi n a n c i a l
organizations to optimize budgeting, determine
creditworthiness in case of a loan, and also suggest the
chances of a customer defaulting on a loan
B ) B u s i n e s s A n a l y t i c s h e l p s i n e x t ra c t i n g c r u c i a l i n f o r m a t i o n
h i d d e n b e h i n d t h e c r e d i t a n d d e b i t t ra n s a c t i o n s a n d l e t s t h e
b u s i n e s s k n o w, t h e s p e n d i n g h a b i t s , l i f e s t y l e p r e f e r e n c e s ,
a n d fi n a n c i a l s t a n d i n g , ra i s i n g r e d fl a g s w h e r e v e r t h e r e i s a
probability of loss of business
C ) B u s i n e s s a n a l y t i c s b u i l t i n t o t o d a y ’s C R M s y s t e m s , e n a b l e
b u s i n e s s e s t o g a i n d e e p i n s i g h t s i n t o d e m o g ra p h i c s , s o c i o -
economic information and lifestyle of their customer groups
a n d w h a t w o u l d b e t h e b e s t fi t s t ra t e g y t o r e t a i n a n d
increase the customer base
D) Manufacturing
It becomes crucial to stay on top of things, so you cover enough to stay protected
against equipment downtime, delays in raw material supply, the inventory levels to
maintain, and the maintenance expense of machines among others
Business analytics helps you decide on the optimum levels of inventory to maintain and
how much to make up for equipment downtime and keep production at optimum levels
and much more
Business analytics uses data from It uses business data such as annual It uses the database which contains
three sources for construction of the reports, financial ratios, marketing various computer files and
business model research, etc information coming from data
analysis
TYPES OF
BUSINESS
ANALYTICS
A) Descriptive Analytics
R o o t C a u s e A n a l y s i s -W h y t h i s h a p p e n ?
D a t a M i n i n g - Id e n t i f y i n g c o r r e l a t e d d a t a
Pa t t e r n Id e n t i fi c a t i o n a n d A l e r t s – W h e n s h o u l d
action be invoked to correct a process
D) Prescriptive
Analytics
S i g n i fi c a n t l y i n c r e a s i n g s e r v i c e m e t r i c s
performance
A c c o u n t i n g P r o c e s s e s A r e S i m p l i fi e d
L a s t l y, t h e c h a p t e r e x p l o r e s t h e v a l u e o f v i s u a l
data storytelling for data communication, and
establishes how data storytelling is the perfect
skill to bridge the very broad and expansive
business— IT gap
A VISUAL
REVOLUTION
Raw Data
Excel
AN EXAMPLE OF “OLD”
DATA VISUALIZATION COMPARED
TO ITS MODERN EQUIVALENT
FROM VISUALIZATION TO VISUAL DATA STORYTELLING: AN
EVOLUTION
A l o n g t h e w a y, t h e p ra c t i c e o f d a t a v i s u a l i z a t i o n h a s b e e n
a i d e d b y b o t h a d va n c e m e n t s i n v i s u a l d e s i g n a n d c o g n i t i v e
science as well as technology and business intelligence, and
t h e s e h a v e g i v e n r i s e t o t h e a d va n c e m e n t s t h a t h a v e l e d t o
our current state of data visualization
I n t o d a y ’s d a t a - d r i v e n b u s i n e s s e n v i r o n m e n t , a n e m e r g i n g
new approach to storytelling attempts to combine data with
g ra p h i c s a n d t e l l t h e w o r l d ’s s t o r i e s t h r o u g h t h e p o w e r o f
information visualization
A m e r i c a n a u t h o r Ku r t Vo n n e g u t i s q u o t e d a s h a v i n g f a m o u s l y
said, “There is no reason that the simple shapes of stories
c a n’ t b e f e d i n t o a c o m p u t e r — t h e y h a v e b e a u t i f u l s h a p e s .”
FROM VISUALIZATION TO VISUAL DATA STORYTELLING: AN
EVOLUTION
Choose an
effective
visual
• Continue to examine how people see an
how you can use that to your advantage
when crafting visuals.
Focus your
• This includes a brief discussion on sight
audience’s and memory that will act to frame up
the importance of pre-attentive
attention
attributes like size, colour, and position
on page.
FROM VISUAL TO
STORY: BRIDGING
THE GAP
According to the
BIC3 Survey
published in 2014,
communication skills
outrank technical
skills for getting a
business analysis job
• Visualizations distill complex data into digestible
forms, aiding comprehension and pattern
recognition.
5. Visual Consistency
6. Interactive Storytelling
Conclusion • By bridging the gap between
visualizations and storytelling.
I t ’s w o r t h n o t i n g t h a t N a t i o n a l G e o g ra p h i c i s d o m i n a t i n g
visual storytelling online, using powerful imagery to
c a p t i va t e a n d e d u c a t e 1 9 m i l l i o n S n a p c h a t u s e r s , 6 0 m i l l i o n
I n s t a g ra m f o l l o w e r s , a n d 5 0 m i l l i o n Fa c e b o o k f o l l o w e r s
H uma n biology a s ide to s ur vive in compe titive a nd of te n uns ta ble e nvir onme nts
—w he the r w ilde r ne s s or bus ine s s —one thing w e ’ ve a lways ha d to do is
unde r s ta nd othe r pe ople
French civil engineer Charles Joseph Minard has been credited for several
signifi cant contributions in the fi eld of information graphics, among them his
ver y unique visualizations of two militar y campaigns—Hannibal’s march from
Spain to Italy some 2,200 years ago and Napoleon’s invasion of Russia
However, as a visual stor y around human drama, it has earned the distinction
of becoming known as one of the best stor ytelling examples in histor y
Napoleon’s March
Is relatively non-technical.
Tableau
vs Excel
Getting Started with
Tableau
T h e g o a l o f t h i s c h a p t e r i s t o h e l p yo u g e t yo u r
fo o t i n g w i t h t h e Ta b l e a u p r o d u c t e c o s y s t e m a n d
u s e t h e b a s i c Ta b l e a u i n t e r f a c e s o t h a t yo u a r e
familiar enough with the tool to begin working
hands-on with data
T h i s c h a p t e r c o ve r s h o w t o g e t s t a r t e d w i t h
Ta b l e a u , r e v i e w s t h e t o o l ’s b a s i c f u n c t i o n a l i t y,
d i s c u s s e s h o w t o c o n n e c t t o d a t a , a n d p r ov i d e s a n
o ve r v i e w o f d a t a t y p e s i n Ta b l e a u
F r o m h e r e , yo u w i l l b e a b l e t o m ove o n t o t h e
v i s u a l a n a l y s i s p r o c e s s , c u ra t i n g v i s u a l s , a n d
building stories
USING TABLEAU
Standing out against many other data visualization tools on the market,
Tableau is an industr y-leading, best-of-breed tool that delivers an
approachable, intuitive environment for self-service users of all levels to help
them prepare, analyze, and visualize their data
Tableau’s stated mission is to help everyone “see and understand” their data,
and to facilitate this the company off ers a suite of software products,
including a recently released free mobile app called Vizable, designed to suit
the needs of a diverse group of clients from enterprise-level organizations to
academic users and visualization hobbyists who want to visualize data in a
mobile-fi rst format
USING TABLEAU
5 . Ta b l e a u M o b i l e : Ta b l e a u M o b i l e i s a m o b i l e a p p
t h a t a l l o w s u s e r s t o a c c e s s a n d i n t e ra c t w i t h
Ta b l e a u d a s h b o a r d s a n d v i s u a l i z a t i o n s o n t h e i r
smartphones and tablets
6 . Ta b l e a u P u b l i c : Ta b l e a u P u b l i c i s a f r e e ve r s i o n
o f Ta b l e a u t h a t a l l o w s u s e r s t o c r e a t e a n d s h a r e
i n t e ra c t i ve v i s u a l i z a t i o n s p u b l i c l y o n t h e w e b
7 . Ta b l e a u Re a d e r : Ta b l e a u Re a d e r i s a f r e e
desktop application that allows users to view and
i n t e ra c t w i t h Ta b l e a u v i s u a l i z a t i o n s c r e a t e d b y
others
GETTING STARTED
The fi rst thing you need to do to get started
with Tableau is to get your hands on a license
Open: As you create your own workbooks, recently opened workbooks appear
Open here for quick access
Discover: This pane connects you to various Tableau training, visualization, and
Discover other resources
Connecting to Tables
Connect to a fi le : Tableau allows you to connect to fi les
such as Excel spreadsheets, CSV fi les, and text fi les
• Select the database you want to connect to and enter your credentials
• Drag and drop the tables onto the "Join area" in the bottom lef t corner of the
screen
• Specify the join conditions by dragging and dropping the fi elds from each
table onto the appropriate join fi elds
• Once you have previewed the data, click "Sheet" to star t building your
visualizations
• It is often necessary to combine data from multiple places
—different tables or even data sources—to perform a
desired analysis.
Inner Join
A left join returns all records from the left
table and only matching records from the
right table
Left Join
A right join returns all records from the
right table and only matching records
from the left table
Right Join
A full outer join returns all records from
both tables, including records that do not
have a matching value in the other table
Full Outer
Join
BASIC DATA PREP WITH DATA INTERPRETER
Format with are not at the same level of analysis. There are
titles, footnotes, empty cells, merged cells, and pre-
Data Interpreter.
Data interprets as the main data and the red area shows
what Tableau interprets as column names. The
D a s h b o a r d s : Fo r c o m b i n i n g m u l t i p l e s h e e t s a s w e l l a s o t h e r
o b j e c t s l i ke i m a g e s , t e x t , a n d w e b p a g e s , a n d a d d i n g
i n t e ra c t i o n s b e t w e e n t h e m l i ke fi l t e r i n g a n d h i g h l i g h t i n g
S t o r i e s : T h e s e f ra m e w o r k s c a n b e b a s e d o n v i s u a l i z a t i o n s o r
d a s h b o a r d s , o r b a s e d o n d i ff e r e n t v i e w s a n d e x p l o ra t i o n s o f a
s i n g l e v i s u a l i z a t i o n , s e e n a t d i ff e r e n t s t a g e s , w i t h d i ff e r e n t
m a r k s fi l t e r e d a n d a n n o t a t i o n s a d d e d — h o w e ve r i s b e s t s u i t e d t o
n a r ra t e t h e s t o r y i n yo u r d a t a
Data window
S h e l ve s a n d c a r d s
Legends
M e n u s a n d To o l b a r
Save: There is no
automatic save in Tableau
Data Window
M e a s u r e s a r e g e n e ra l l y n u m e r i c a l d a t a o n
w h i c h yo u w a n t t o p e r fo r m c a l c u l a t i o n s —
s u m m i n g , a ve ra g i n g , a n d s o o n
Re m e m b e r, s e t t i n g a fi e l d a s a m e a s u r e o r
dimension can be adjusted in the Data Source
screen by clicking on the data type icon
Yo u c a n a l s o c h a n g e t h i s d i r e c t l y i n t h e s h e e t
b y e i t h e r d ra g g i n g a n d d r o p p i n g a d i m e n s i o n
t o m e a s u r e , o r v i c e ve r s a , o r b y c l i c k i n g t h e
d r o p - d o w n m e n u b y a n y fi e l d a n d s e l e c t i n g
t h e C o n ve r t to M e a s u r e o p t i o n
MODULE 4
DESCRIPTIVE
ANALYTICS
Visualizing and Exploring Data
Impor t your data: Start by importing your data into
Excel
R a n g e : T h e r a n g e i s t h e d i ff e r e n c e b e t w e e n t h e m a x i m u m
and minimum values in a data set
Va r i a n c e : T h e v a r i a n c e i s a m e a s u r e o f t h e v a r i a b i l i t y o f a
data set
N o r m a l D i s t r i b u t i o n : E x c e l h a s t h e f u n c t i o n s N O R M . DI ST a n d
N O R M . I N V t o c a l c u l a t e p r o b a b i l i t i e s a n d i n ve r s e p r o b a b i l i t i e s o f
n o r m a l d i s t r i b u t i o n r e s p e c t i ve l y
B i n o m i a l D i s t r i b u t i o n : E xc e l h a s t h e f u n c t i o n s B I N O M . DI ST a n d
B I N O M . I N V t o c a l c u l a t e p r o b a b i l i t i e s a n d i n ve r s e p r o b a b i l i t i e s o f
b i n o m i a l d i s t r i b u t i o n r e s p e c t i ve l y
Po i s s o n D i s t r i b u t i o n : E xc e l h a s t h e f u n c t i o n s P OI SS O N . DI ST a n d
P OI SS O N . I N V t o c a l c u l a t e p r o b a b i l i t i e s a n d i n ve r s e p r o b a b i l i t i e s
o f Po i s s o n d i s t r i b u t i o n r e s p e c t i ve l y
D a t a M o d e l i n g : E xc e l c a n a l s o b e u s e d t o m o d e l d a t a u s i n g
va r i o u s s t a t i s t i c a l t e c h n i q u e s
Probability Distributions and
Data Modelling
Sampling: Sampling is the process of selecting a subset of individuals from a larger population to
study
R A N D B E T W E E N : T h i s f u n c t i o n g e n e r a t e s a r a n d o m i n t e g e r b e t w e e n t w o s p e c i fi e d va l u e s
Inferential Statistical Methods: Inferential statistical methods are used to make inferences about a
population based on a sample
T. T E ST: T h e T. T E ST f u n c t i o n i s u s e d t o p e r f o r m a t - t e s t t o d e t e r m i n e i f t h e r e i s a s i g n i fi c a n t d i ff e r e n c e
between the means of two samples
A r r a y 1 : T h e fi r s t s e t o f d a t a
Ta i l s : T h e n u m b e r o f t a i l s f o r t h e t e s t
Ty p e : T h e t y p e o f t - t e s t t o p e r f o r m
C O N F I D E N C E : T h e C O N F I D E N C E f u n c t i o n i s u s e d t o c a l c u l a t e t h e c o n fi d e n c e i n t e r va l f o r a s a m p l e m e a n
Sampling and Inferential
statistical methods
A l p h a : T h e s i g n i fi c a n c e l e ve l
Z . T E ST: T h e Z . T E ST f u n c t i o n i s u s e d t o p e r fo r m a z- t e s t
t o d e t e r m i n e i f t h e r e i s a s i g n i fi c a n t d i ff e r e n c e b e t w e e n
the means of two samples
A r ra y 1 : T h e fi r s t s e t o f d a t a
A r ra y 2 : T h e s e c o n d s e t o f d a t a
I n s t a l l i n g t h e D a t a A n a l y s i s A d d -I n : I f t h e D a t a A n a l y s i s
add-in is not already installed in your version of Excel, you
will need to install it
E s t i m a t i o n : T h e D a t a A n a l y s i s a d d - i n p r o v i d e s s e v e ra l
tools for estimation, including
Re g r e s s i o n : T h i s t o o l i s u s e d t o e s t i m a t e t h e r e l a t i o n s h i p
b e t w e e n t w o o r m o r e va r i a b l e s
M o v i n g Av e ra g e : T h i s t o o l i s u s e d t o e s t i m a t e t h e t r e n d i n
a t i m e s e r i e s d a t a s e t b y c a l c u l a t i n g t h e a v e ra g e o f a
c e r t a i n n u m b e r o f o b s e r va t i o n s
Using Excel Data Analysis add in for
estimation and hypothesis testing
Exponential Smoothing: This tool is used to estimate the trend
i n a t i m e s e r i e s d a t a s e t b y w e i g h t i n g t h e o b s e r va t i o n s s o t h a t
m o r e r e c e n t o b s e r va t i o n s h a ve a g r e a t e r i m p a c t o n t h e e s t i m a t e
H y p o t h e s i s Te s t i n g : T h e D a t a A n a l y s i s a d d - i n a l s o p r o v i d e s
s e ve ra l t o o l s fo r h y p o t h e s i s t e s t i n g , i n c l u d i n g
Tw o - S a m p l e A s s u m i n g E q u a l Va r i a n c e s : T h i s t e s t i s u s e d w h e n
t h e va r i a n c e s o f t h e t w o p o p u l a t i o n s a r e a s s u m e d t o b e e q u a l
Tw o - S a m p l e A s s u m i n g U n e q u a l Va r i a n c e s : T h i s t e s t i s u s e d
w h e n t h e va r i a n c e s o f t h e t w o p o p u l a t i o n s a r e a s s u m e d t o b e
unequal
Pa i r e d Tw o - S a m p l e fo r M e a n s : T h i s t e s t i s u s e d w h e n t h e t w o
s a m p l e s a r e r e l a t e d o r p a i r e d , s u c h a s i n a b e fo r e - a n d - a f t e r
study
Using Excel Data Analysis add in for
estimation and hypothesis testing
Data collection : The fi rst step in the process is to collect and prepare the
data that will be used in the analysis
Data cleaning : Once the data has been collected, it must be cleaned and
preprocessed to remove any errors, outliers, or missing values
Data exploration : Af ter the data has been cleaned, it can be explored to
identify patterns, trends, and relationships between variables
Model building : Once the data has been explored, a predictive model can
Predictive
Analytics
Statistical Model
4. Multilevel models : These models are used to
analyze data that have a hierarchical structure,
such as data from individuals within groups
Statistical
Model
4. Bayesian models: These models are used to
incorporate prior knowledge or beliefs into the
Statistical statistical analysis
Model
Explaining relationships between variables :
Statistical models can help identify the
relationships between variables and explain how
they are related
Statistical Predicting outcomes: Statistical models can be
used to predict future outcomes based on historical
Model data
T h e i n f e r e n c e a b o u t t h e r e g r e s s i o n c o e ffi c i e n t i s b a s e d o n
t h e h y p o t h e s i s t e s t i n g f ra m e w o r k , w h e r e t h e n u l l h y p o t h e s i s
i s t h a t t h e r e g r e s s i o n c o e ffi c i e n t i s e q u a l t o z e r o , a n d t h e
alternative hypothesis is that it is not equal to zero
T h e t- t e s t c a l c u l a t e s t h e t- s t a t i s t i c , w h i c h m e a s u r e s t h e
d i ff e r e n c e b e t w e e n t h e e s t i m a t e d r e g r e s s i o n c o e ffi c i e n t a n d
t h e h y p o t h e s i z e d va l u e , r e l a t i v e t o t h e s t a n d a r d e r r o r o f t h e
estimate
• Diffi culty in interpreting the coeffi cients :
When independent variables are highly
correlated, it becomes diffi cult to interpret
the coeffi cients of the regression model, as
the eff ects of one independent variable
cannot be distinguished from the eff ects of
the other independent variables
Outlie rs c an oc c ur du e to me a su re me nt e rrors,
da ta e n t r y e rrors , or due to n a t ura l va ria t ion in
the da t a
I t i n vo l ve s c h e c k i n g w h e t h e r t h e m o d e l i s a g o o d fi t fo r t h e d a t a a n d w h e t h e r i t i s a b l e t o
g e n e ra l i z e w e l l t o n e w d a t a
T h e fi r s t s te p i n va l i d a t i o n o f fi t i s t o s p l i t t h e d a t a i n t o t ra i n i n g a n d t e s t i n g s e t s
T h e r e fo r e , i t ' s i m p o r t a n t t o p e r fo r m f u r t h e r va l i d a t i o n o f fi t u s i n g t e c h n i q u e s l i ke c r o s s -
va l i d a t i o n , w h i c h i n vo l ve s s p l i t t i n g t h e d a t a i n t o m u l t i p l e t ra i n i n g a n d t e s t i n g s e t s a n d
e va l u a t i n g t h e m o d e l ' s p e r fo r m a n c e o n e a c h s e t
I n s u m m a r y, va l i d a t i o n o f fi t i s a n i m p o r t a n t s t e p i n m o d e l i n g t h a t i n vo l ve s c h e c k i n g w h e t h e r
t h e m o d e l i s a g o o d fi t fo r t h e d a t a a n d w h e t h e r i t i s a b l e t o g e n e ra l i z e w e l l t o n e w d a t a
Binomial Logistic Regression and Multinomial Logistic
Regression
Characteristics :
Characteristics:
T h e fo r m u l a fo r s i n g l e e x p o n e n t i a l s m o o t h i n g i s g i ve n
b e l o w : w h e r e , F t + 1 = Fo r e c a s t fo r t h e n e x t t i m e p e r i o d
Y t = A c t u a l va l u e fo r t h e c u r r e n t t i m e p e r i o d F t =
Fo r e c a s t fo r t h e c u r r e n t t i m e p e r i o d α = S m o o t h i n g
p a ra m e t e r T h e s m o o t h i n g p a ra m e t e r d e t e r m i n e s t h e
w e i g h t g i ve n t o t h e m o s t r e c e n t o b s e r va t i o n
Double Exponential Smoothing
Double Exponential Smoothing is an extension of Single
Exponential Smoothing that can handle trends in the data
T h i s m e t h o d u s e s t w o s m o o t h i n g p a ra m e t e r s o r a l p h a a n d
beta to calculate the forecast
The ACF is of ten plotted as a function of the lag k, with the ACF
values on the y-axis and the lag values on the x-axis
ARMA Model
p a s t va l u e s , a s w e l l a s t h e r e l a t i o n s h i p b e t w e e n a
va r i a b l e a n d t h e e r r o r t e r m s o f a m ov i n g a ve ra g e m o d e l
T h e g e n e ra l fo r m o f a n A R M A m o d e l i s : T h e A R M A m o d e l
is a popular tool in time series analysis because it can
capture both the trend and seasonal patterns in the
d a t a , a n d i t c a n b e u s e d t o m a ke fo r e c a s t s fo r f u t u r e
time periods
ARIMA Model
ARIMA is a type of time series model used for
forecasting