Skittles Project
Skittles Project
In the Pie and Pareto Charts above, we can clearly observe that the colors are fairly
equally distributed. While they are not exactly the same, it is easy to see that they are close. I
expected to see something like this because as a company, Skittles Candies should try to have
each flavor be approximately equal among packages. While it is unrealistic for each package to
have the exact same number of each candy, it is expected for them to be fairly close. In my
personal package of candies, the number of each color in my package had greater variation.
Below are tables showing the data from both my package and the entire class sample.
My Package
Red
9
14
12
57 candies
Q1:
59 candies
Med: 60 candies
Q2:
62 candies
Max: 97 candies
Standard Deviation: 9.67 candies
Mean: 62.7 candies
Reflection:
Categorical data is defined as information organized into groups. Quantitative data is
defined as information that can be measured and written down with numbers. In other words,
categorical data does not contain numerical valuesoutside of frequencywhereas quantitative
data does. Graphs and charts that make sense to use with categorical data include: pie, Pareto,
and frequency tables because these forms of organizing data do not necessarily need
quantitative values. Graphs and charts that make sense to use with quantitative data include:
histogram, boxplot, bar graph, line graph, dot plot, and stem plot because these charts and
graphs generally do need numerical values in order to convey the data properly. For categorical
data, the frequency of the categories needs to be calculated because it is the one numerical
value used in the graphing of categorical data. Calculations that need to be made for
quantitative data include: five-number-summary, standard deviation, mean, frequency and
others depending on the kind of chart or graph that is chosen to be used to display the data.
Mean:
(0.191)
E = 1.96(0.809)
941
= 0.025
9.67
E = 2.581
941 = 0.814
Standard Deviation:
2
940(9.67)<
<
124.116
940(9.67)2
61.754
Hypothesis Tests:
A hypothesis test is a method that uses sample data to decide between two claims about a
population characteristic.
Claim: 20% of all Skittles are green
0.206 0.20
= 0.01
(0.20)(0.80)
941
H0: p = 0.20
z=
= 0.4601
H1: p 0.20
to
original claim.
Claim: The mean number of Skittles is 56.
62.7 56
H0: = 56
t=
9.67 = 2.68
15
H1: 56
to
claim.
t/2 = 2.145
Fail to Reject H0
There is not sufficient evidence
support the original
Reflection:
The requirements to calculate interval estimates for population proportions are:
1. The sample is a simple random sample.
2. The conditions for the binomial distribution are satisfied:
- There is a fixed number of trials
- The trials are independent
- There are two categories or outcomes
- the probabilities remain constant for each trial
3. There are at least five successes and at least five failures