Coca-Cola Study
Coca-Cola Study
in a Football Club
Data analysis
Mariona Martín (8)
Samuel Ferrer (9)
Jan Madí (9)
Laura Martí (9)
Diego Martínez (9)
Passeig Pujades, 1 - 08003 Barcelona
www.esci.upf.edu - [email protected]
1
Index
1. Introduction
……………………………………………………………………………………………………...3
2. Methodology………………………………………………………………………………………………4
3. General characteristics of the study
…………………………………………………………….…...5
4. Graphing our qualitative
data…………………………………………………………………………….7
5. Center measurements
…………………………………………………………………………………..…15
6. Quantitative
variables………………………………………………………………………………..…....16
7. Analysis of the relation of two continuous
variables………………………………………..20
8. Relating
variables…………………………………………………………………………………………….23
9. Conclusions…………………………………………………………………………………………………25
10.Webographie…………………………………………………………………………………………………27
2
1. INTRODUCTION
This study has been conducted by Laura Martí, Diego Martínez, Jan Madi, Samuel Ferrer and
Mariona Martín.
Over the last few years, the growing concern about the consumption of soda drinks among
youngsters has increased dramatically. More specifically, the Coca-Cola company has
increased their marketing budget in order to raise their drink consumption among
youngsters.
It is well known that this type of drink represents a threat to public health, and in our study
which involves athletes, even more.
Due to this we decided to carry out a study on the consumption of an amateur soccer club.
The aim of this project will be analysing the Coca-cola consumption among the group aged
16 to 25, as well as, their gender, the drink size, and finally, which type of cola sells the most.
When defining the team project objectives we have considered various variables, such as
quantitative and qualitative.
We included them on a quiz, in order to find possible and common patterns about the group’s
consumption we previously selected.
These patterns could possibly be: age, drink size, where they live and much more.
3
Thus, in this study we will observe the patterns of consumption and shared habits that can
harm not only the general performance of the team but the general well-being of the players.
2. METHODOLOGY
In order to set the basis of this project, we decided that our aim was going to be analysing the
Coca-Cola consumption.
And so, we decided we wanted our population to be sports-related, due to the fact that one
of our group-members currently plays in a football team.
We wanted to have different answers from both genders, so we decided to not focus on just
his team, and extended it to all of the club.
After defining our sample, as we said, we started determining the variables (qualitative and
quantitative) and constructed the questionnaire, by doing this we started the process for
precise data collection.
We carefully chose which were the variables that interested us the most for our study.
Once selected we created the quest and sent it over to the football club so the responses
started to kick in.
The survey we elaborated contained photos and references in which we asked the football
club members about their consumption habits and preferences.
What was their usual size bottle they buy if drinking, how often did they consume it, did they
consume it, etc...
4
Once we got enough information to conduct the analysis, we created a series of graphs and
data models to track down any patterns or casualties of interest related to our study. Of
course, we only included the information which fell in the range we were observing.
As we said, using different programs and methods we created a series of graphs and data
models from the survey. These graphs and data models have been carefully studied and have
been the crucial key points of interpretation, making it at ease for us to make conclusions
based on the plotted data.
Starting off we have our population, in our case, a mixed amateur football club, we also find
ourselves with another variable, the sample.
With that being said, once we obtained the answers about coca-cola’s consumption among
our 94 football players, the main objective of the study was included in the answers of each
of the 94 members of our soccer club.
- QUANTITATIVE VARIABLES:
5
- Their Age: We considered this variable since we wanted to determine which
age-group consumed the most.
- Their Coca-Cola consumption on a daily basis: This variable
allowed us to see with which frequency did footballers consume this soda.
- The bottle or can size they usually buy : This variable’s selection
had the main purpose of seeing which bottle or can size is the most
common/usual among soccer players.
- Their Monthly income: Considering this variable we would have a clearer
insight on the amount of money our soccer players had, since this would
directly affect the expenditure on the studied soda. We could also observe if
their income made their consumption decrease or increase.
- QUALITATIVE VARIABLES:
- Their Gender: This variable would show us the differences between men
and women on coca-cola consumption.
- The Feeling after drinking coca-cola: A variable that would make
us able to see how the footballers feel; satisfied, unsatisfied or indifferent.
6
4. GRAPHING OUR QUALITATIVE DATA
Since our data has a qualitative nature, we decided that the best way to represent
graphically the following variables was using pie charts. This type of graph shows
proportions in relation to the sample size.
1. The first question of our survey asked our subjects about their individual Coca-Cola
daily consumption. Once results were obtained, we could quickly see that the answer
“once a week” was highly popular among our athletes. Contrary to this observation, we
also obtained the answer “more than 10 times a week” as the most unpopular of them
all, since just the 12,1% chose it.
7
2. The second question of the
survey asked the individuals what
their age was. We obtained the following graph that shows that the highest percentage
is located between 18 and 19 years old, while the lowest one was more than 25 years.
This tells us that the age range of our sample is mostly concentrated between 18 and
19.
8
3. The third question asked their gender. The results of our 91 answers appeared as the
following: 53,8% males, 40,7% females and a final 5,5% others.
According to this sample we notice that females and male have similar weight since the
percentages aren’t very different.
9
4. In the following graph we analyzed which size do people buy. We included 5 different types
of sizes which went from 250ml to 2L. The last variable was “I don’t drink”.
According to these values we obtain the following results: 8,8% (2L), 16,5%(1L),
20,9%(250ML), 23,1%(Don’t drink) and finally 30,8%(500ml).
10
Based on the results, we conclude that the most common bottle size that people buy is 500ml
, because it is represented as the highest percentage in our sample.
5. The fifth question was feeling related and offered 3 different possible answers; satisfied,
disappointed and indifferent. We obtained the following results: 11% of people felt
disappointed, 29,7% of people felt indifferent and finally 59,3% of people felt satisfied.
11
According to these percentages, the feeling that our individuals feel after drinking Coca Cola
was satisfaction-related while the lowest one was pure disappointment.
12
700 and 12.1% had more. Again, we define the highest value as 0 to 100 (as an income) and
the lowest as 600 to 700.
13
The last two variables are continuous, so, in order to analyze them we used a histogram since
we find that is the best way to study our data. In order to do that, we need to make intervals
using the Sturge’s rule.
Sturge’s rule: 1 + 3.322 · log(91) = 7,5, but, since we can’t have decimals, we round up to 8
Question 7: How many euros would you pay for a regular can of Coca Cola?
Max: 3,5
Min: 1,2
14
to 3 euros ans, that nearly no one would buy a can for more than
3€.
Last question: How many euros do you think you spend on coca cola monthly?
Intervals:
5 - 9: 22
10 - 14: 19
15 - 19: 23
20 - 24: 9
25 - 29: 16
30 - 34: 2
This graph is also asymmetrical and shows that most people spend between 5 to 19 euros
per month.
15
5. CENTER MEASUREMENTS
Using excel, we calculated all the possible center measurements in order to analyze in detail
the obtained data.
Taking a look at the aim of the study we concluded that question number 1, How often do
you consume Coca-Cola on a daily basis, was the one that summed up perfectly to what we
wanted to obtain from our study. Knowing this we decided to study the center measures in
order to determine the variable behaviour. In order to make the proper calculations we
changed the answers to numbers: Don't drink= 0, Once a week= 1, three times per week=
3 and more than ten times = 10.
Defining the mean: is the average you're used to, where you would sum up all observations
to then divide it by the number of elements summed. In this case, according to the answers
we obtained, the mean is to consume Coca-Cola 2 times per week (2,49).
Defining median: the middle value in the list of numbers. To find the median, your numbers
have to be listed in numerical order from smallest to largest, so you may have to rewrite
your list before you can find the median. Here the answer is the value 1 so the median is
once a week.
Mode: is the value that occurs most often. If no number in the list is repeated, then there is
no mode for the list. The mode is once a week due the fact that we got 30 to choose this
option.
Sample Variance: we use this to calculate how varied a sample is. The answer is 9,16.
Passeig Pujades, 1 - 08003 Barcelona
www.esci.upf.edu - [email protected]
16
Range: The difference between the lowest and highest values. The highest is 10 and the
lowest is 0 so the range would be 10.
Sum and Count: The total sum is 227 and total count is 91.
6. QUANTITATIVE VARIABLES
The best way to explain quantitative data is by doing an histogram. The first step is
determining which of our variables are quantitative.
In order to analyse the quantitative data, we’re going to make a histogram and a freqüencie
table.
The first quantitative variable was to analyze how often do people consume cola on a daily
basis.
In the following pages, we constructed an histogram and frequency table and explained the
structure of the graphs.
17
As it shows at the freqüency table, people tend to consume Coca Cola more than once a week,
between one to three times. The histogram is asymmetrical and has a center dispersion due
the fact that the columns that are more representative (what people voted the most) are the
options of the middle, in this case, consuming cola once a week and more than 10 times a week.
18
In the second question, we wanted to know the age to help us develop a better study of our
sample. Since our study was going to be focused on a football club, the age range was between
16 and 25 years old principally, because we knew that if we put an option of 75 y/o nobody
would select that.
19
The results construct a right skewed distribution since the histogram has a peak to the right
of center, more gradually tapering to the left side. The most frequent age is between 18 to 25.
In order to do this question properly, we decided to use an interval of two variables.
We wanted to know what type of can customers usually buy because this data would tell us
how much does people actually drink, which is the most selled size of soda, etc.
20
Based on the histogram, we can determine that the graph is skewed to the right and is
asymmetrical. That can be explained by the table of frequency, people temp to buy 500ml
small bottles and temp to buy less frequently liter bottles.
In conclusion, people prefer smaller cans/bottles, rather than the big bottles. To clarify that,
we can separate the variables into two different categories in order to calculate the difference
between can and small bottles to gran and max bottles.
Can + Small bottle = 51,65%
1L + 2L bottles = 25,27%
As we see, there’s a huge difference between the two types of sizes.
21
The last question of our survey was made to analyse if the monthly income affected
buying/consuming cola. As we see, the graph is right skewed and asymmetrical, meaning that
the majority of who answered did not have a high income since much of the data is
concentrated in the left part of the graph.
Most of the answers were that they had an income between 0 to 100€
representing 32,96% of the total.
After the study is done, let’s determine some relationships between variables.
22
In order to do that, we need to define which are the explanatory and explained variables. We
know for a fact that explanatory variables are independent, exogenous and explain the result.
They are represented with X. On the other hand, explained variables are dependent,
endogenous and measure the result or answer and differently from the independent variable,
this one is represented with a Y.
To analyse both continuous variables, we made a scatter plot using SPSS. We consider this
program to be the best tool to make the graph due to the fact that it automatically calculates
for you a table with extra information that helps to study with more detail our data and
distribution.
23
In the
scatter
plot the
distribution of the data points shows a linear trend since the regression line is completely
straight. The values increase, creating a positive tendency.
Based on the results provided by SPSS, we can conclude that the strength of the correlation
coefficient is very strong due the fact that the result is 0,979 so it’s so close to 1.
For our study, we wanted to analyse if the price of a can influences how much cans they buy.
In order to do that, we asked how much money they would spend on a regular can to have an
idea of the money they are willing to pay and their monthly budget, to have an approximate
estimation of their monthly soda consumption.
In conclusion, we can say that individuals will end up spending more on something, regardless
of the quantity, if they perceive the item to be of a specific value
After the graph and tables are made, we can conclude that it does affect, since the relationship
between the variables is positive and increases.
24
8. RELATING VARIABLES
To be able to analyze the data of our study more thoroughly, we made a comparison between the
quantitative and qualitative variables in order to know if there is any kind of relationship between
them.
Between the age ranges of 16-17 and 18-19 years, 51.51% of soccer players have an income level of
0$ or less than 100$. Therefore, more than half have the minimum monthly income. On the other hand,
in the age ranges between 20 and 25 years, 78% of players have a monthly income of $ 200-$ 300 or
more. And only 22% have income from $ 0 to $ 100. Those who are over 25 years old, 66, 66% have an
income of $ 400-500 or more.
With this comparison of variables we have been able to verify that the socioeconomic level of each
club player varies depending on age. The younger the player, the less income he has. Therefore we can
conclude that income increases depending on age, thus the incomes are a dependent variable of the
age.
In the age ranges 16,17,18,19 the results are very close to each other. For those aged between
20,21,22,23, the results are almost the same as well, except with 5.26% of the players who drink 10
coca-colas a day. Between the ages of 24, 25 or more, the consumption of 1 coca-cola daily stands out
with 47.61%.
We can see that age does not influence much in daily coca-cola consumption, since most soccer
players, regardless of age, consume a coca-cola per day. These represent 65.93%, that is, more than
half have the same frequency of consumption.
It is true that those over 24 consume a little less, but the data is so tight that we could not say anything.
In conclusion, we cannot determine that neither of the two variables is dependent on the other since
the variations they have are not caused by each other.
25
In the range of $ 0 to $ 100 of monthly income, the daily frequency of drinking Coca-Cola 0 times
represents 36.66%, that is, it is the majority. On the other hand, those who earn $ 200 to $ 500 the
frequency of drinking 1 coca-cola and 3 per day are the majority, with 46.34% each. Soccer players
who earn $ 600 or more have a much higher frequency, with 30% in 10 times a day.
We could say that the higher the player's income, the more frequently he consumes, but the data have
outliers, and very varied data, so we could not fully affirm it. The salary ranges do not have enough data
to be examined with certainty, in this way, with the data provided, we can conclude that the higher the
salary of the population, the greater the frequency of consumption, although it increases progressively
slowly.
26
9. CONCLUSIONS:
After this intense study on the consumption of Coca-Cola in a soccer club, we have tried to
summarize in a brief and clear way the main ideas that we get from the different data
collected. In addition to the brief summary that we have carried out for each of the graphs
and tables, we believe that it is important to offer a collection of knowledge extracted from
the research carried out. So, these are our conclusions:
The first surprise that we got is that even though it is a soccer club, the general problem of
the abusive consumption of extremely sugary soft drinks is present. More than 60% of the
members of the club consume between one and ten Cokes a week even knowing that it will
reduce their performance and can cause serious health problems. In addition, more than ten
people consume it up to more than ten times a week. It is commonly known that there is a
disregard of abusive consumption in society but we were not aware that this drink was so
frequently drunk by athletes who should initially seek maximum physical performance.
Another conclusion that we draw from this study is that the vast majority (60%) consider
their experience with Coca-Cola satisfactory and this is not a surprise.
The addictive and adulterating components of this drink are known to all and that is why
most of its consumers do not value so much the risk of recurring to the use of this soft drink.
When something gives us a gratifying sensation, we tend to not take too much account of its
harmful potential, which is why its consumption is more dangerous.
27
In addition, observing the data we can draw a more speculative conclusion, the Coca-Cola
company (valid for most soft drink companies) is aware and encourages these facts since
even today we continue to see advertisements and advertising actions of the company
linking its drink with obtaining happiness.
The data obtained from this study has helped us to identify certain problems with the Coca-
Cola drink and athletes and now offers us some solutions that we believe are certainly
beneficial for society. In the first place, we believe that it is necessary for sports clubs to
promote the non-consumption of soft drinks and invest a certain time and budget in
campaigns against their drinking.
Finally we would like to thank all the club members for helping us with the data collection
and their sincerity.
28
10. WEBOGRAPHIE
https://www.scribbr.com/methodology/explanatory-and-response-variables/
https://www.ck12.org/book/ck-12-probability-and-statistics-concepts/section/4.6/
29