FORM B of 2024 Quantitative Data Analysis QUIZ - Google Docs 1
Q
(FORM B)
Please download the “LOAN INFO” dataset from the OnCampus assignment to be used in
Excel and RStudio to complete the questions below.
1. Create a histogram of the “int_rate” variable. (Add a title and color) Copy and paste your
graph and use it to answer a-d below:
a) Identify AND provide the best measure of CENTER for the data. Justify your reasoning.
b) Identify AND provide the best measure of SPREAD for the data. Justify your reasoning.
d) Are there any official outliers for this dataset? What is the threshold before it becomes an
outlier? Justify your reasoning and show your work.
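
For part d, one common reading of "official" outliers is the 1.5 × IQR rule, where the thresholds are Q1 − 1.5·IQR and Q3 + 1.5·IQR. That reading is an assumption, not something the quiz states. The sketch below illustrates the arithmetic in Python on made-up interest rates (not the LOAN INFO data); the helper name `iqr_fences` is hypothetical, and quartile conventions differ slightly between tools, so results in Excel or RStudio may not match exactly.

```python
import statistics

def iqr_fences(values):
    """Return (lower_fence, upper_fence) under the 1.5 * IQR rule."""
    # quantiles(..., n=4) returns the three quartile cut points: Q1, median, Q3.
    q1, _median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr

# Made-up interest rates in percent, for illustration only.
rates = [5.0, 6.0, 7.0, 8.0, 9.0, 10.0, 11.0, 12.0, 30.0]

low, high = iqr_fences(rates)
# Any value outside the fences is flagged as an outlier.
outliers = [r for r in rates if r < low or r > high]
print(low, high, outliers)
```

Showing your work here amounts to reporting Q1, Q3, the IQR, and both fences, then listing any values that fall outside them.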
2. Create a boxplot of the “loan_amt” variable broken down by the “grade” that each loan was
given. (Add a title, x and y axis labels, and color) Copy and paste your graph and use it to
answer a-c below:
a) What clear trend do you notice on the graph? Why does this make sense in the context
of taking out a loan?
b) Which loan grade has the highest median amount? Estimate what that amount is and
justify your reasoning.
c) Which loan grade has the smallest IQR? Justify your reasoning.
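
Parts b and c ask for estimates read off the boxplot, but a visual estimate can be checked numerically. The sketch below computes the median and IQR per group in Python, as an illustration only: the (grade, amount) pairs are made up, and the helper name `summarize_by_group` is hypothetical.

```python
import statistics
from collections import defaultdict

def summarize_by_group(pairs):
    """Map each group label to its median and IQR."""
    groups = defaultdict(list)
    for label, amount in pairs:
        groups[label].append(amount)
    summary = {}
    for label, amounts in groups.items():
        # Q1 and Q3 mark the edges of the box; the median is the line inside it.
        q1, median, q3 = statistics.quantiles(amounts, n=4, method="inclusive")
        summary[label] = {"median": median, "iqr": q3 - q1}
    return summary

# Made-up (grade, loan_amt) pairs, for illustration only.
loans = [
    ("A", 5000), ("A", 7000), ("A", 9000), ("A", 11000), ("A", 13000),
    ("B", 8000), ("B", 12000), ("B", 16000), ("B", 20000), ("B", 24000),
]

summary = summarize_by_group(loans)
highest_median = max(summary, key=lambda g: summary[g]["median"])
smallest_iqr = min(summary, key=lambda g: summary[g]["iqr"])
print(summary, highest_median, smallest_iqr)
```

On the graph itself, the IQR corresponds to the height of each box, so the grade with the shortest box has the smallest IQR.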
3. Identify the mean and standard deviation of the “loan_amt” variable and use those numbers
to complete a-e below. NOTE: Let us act as if the “loan_amt” variable is unimodal and
symmetric for this section of the quiz!
b) Create a sketch of a Normal Model, including all values and percentages, using the
mean and standard deviation found in part a.
e) Combine the z-scores of the “loan_amt” variable and the “int_rate” variable to find the
top 3 loans that fall into each of the categories below: (screenshot code)
i) Take your Time Loans: Loans that have a high amount paired with a low rate
ii) Pay off Quick Loans: Loans that have a low amount paired with a high rate
Write 2-3 sentences describing what you did and outlining the steps you took to solve
this problem:
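
One possible way to combine the two z-scores in part e is to subtract them: a large value of z(loan_amt) − z(int_rate) means a high amount paired with a low rate, and a very negative value means the opposite. Treating the difference as the combination rule is an assumption, since the quiz does not specify one. The Python sketch below uses made-up values, the sample standard deviation, and a hypothetical helper `z_scores`.

```python
import statistics

def z_scores(values):
    """Standardize values: (value - mean) / sample standard deviation."""
    mean = statistics.mean(values)
    sd = statistics.stdev(values)
    return [(v - mean) / sd for v in values]

# Made-up loans, for illustration only; position i in each list is loan i.
loan_amt = [5000, 10000, 15000, 20000, 25000, 30000]
int_rate = [18.0, 7.0, 12.0, 15.0, 9.0, 6.0]

z_amt = z_scores(loan_amt)
z_rate = z_scores(int_rate)

# High amount and low rate give a large positive score.
score = [za - zr for za, zr in zip(z_amt, z_rate)]
order = sorted(range(len(score)), key=lambda i: score[i], reverse=True)

take_your_time = order[:3]       # highest scores: high amount, low rate
pay_off_quick = order[::-1][:3]  # lowest scores: low amount, high rate
print(take_your_time, pay_off_quick)
```

The indices returned identify rows in the original data, so the matching loans can be looked up and reported alongside their amounts and rates.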
Question 2: Compare and contrast the two distributions below. You must support your
answer with an argument that includes comparisons of appropriate measures of shape
and center, and descriptions of spread, for both distributions.