BA Questions
BA Questions
Q1a. How might business analytics help the supermarket? What data would be
needed to facilitate good decisions?
Data Required:
Transaction timestamps
Number of active registers
Customer entry and exit times
Sales volume by day and hour
Employee shift schedules
Q1b. Do you agree with the statement "Data Scientist: The Sexiest Job of the
21st Century"? Justify.
b) The statement "Data Scientist: The Sexiest Job of the 21st Century" by Davenport
and Patil is justified due to the increasing reliance on data-driven decision-making.
The demand for skilled data scientists continues to rise, making it one of the most
sought-after careers.
Q2a. Suggest some metrics a fast-food restaurant manager might want to collect.
How might the manager use the data to facilitate better decisions?
Managers can use these metrics to streamline operations, reduce waste, and improve
customer experience.
Q2b. How do you perform exploratory analysis using R? What package do you
use and how do you analyze the data through exploratory analysis? What
descriptive techniques might you want to use?
Example Code:
library(ggplot2)
library(dplyr)
data %>% summary()
ggplot(data, aes(x=variable)) + geom_histogram()
Q3a. How do you build an algorithm that predicts loan defaulters in advance?
Should you use a Decision Tree or a regression model? What will be the
procedure?
Q3b. A marketing team wants to target its set of customers and use an algorithm
that can divide them. What algorithm would you suggest and explain the steps
involved?
a) Amazon and Netflix use collaborative filtering and content-based filtering for
recommendations. The process:
Collect user interactions (views, purchases, ratings)
Compute similarity scores (user-based or item-based)
Generate recommendations based on preferences
Q4b. How important is sampling in data analysis? What challenges might arise
in diagnosing COVID-19 data, and how do you handle them?
Q4c. You have run a classification model, namely logistic regression. How will
you present the effectiveness of the model to the business team?
Q5a. Given patient health data, what kind of algorithm can you use to predict
fever? Why? Explain the process.
a) Given patient health data, a Naïve Bayes classifier can be used for fever
prediction. Steps:
Source SS df MS F P-value
Regression 1200 4 300 15 0.001
Residual 500 295 1.7 - -
Total 1700 299 - - -
Interpretation: A low p-value (< 0.05) indicates that at least one predictor significantly
affects the response variable.
Q6a. Compute the 30th Percentile and Five-Number Summary for BA Quiz
Scores.
Given Scores: 95, 81, 81, 55, 68, 111, 88, 100, 94, 87, 65, 93,
85, 79, 106, 92, 15, 67, 83
30th Percentile ≈ 72
Elements: 8
Variables: 5 (Cost, OS, Display, Battery Life, CPU)
Categorical: OS, CPU Manufacturer
Quantitative: Cost, Display Size, Battery Life
Kohli’s Runs: [82, 21, 61, 50, 6, 59, 0, 54, 31, 55, 1, 18, 100,
101]
Du Plessis’ Runs: [73, 23, 79, 22, 62, 84, 62, 17, 44, 45, 65, 55,
71, 28]
Findings: