0% found this document useful (0 votes)

183 views10 pages

Summarising and Analysing Data

Here are the steps to calculate the standard deviation for ungrouped data: 1. Find the mean (average) of the data set. 2. For each data point, subtract it from the mean and square the difference. 3. Add up all the squared differences. 4. Divide the sum of the squared differences by one less than the number of data points. 5. Take the square root of the result. Let's calculate the standard deviation for Class A: Mean = (40 + 50 + 50 + 60) / 4 = 50 Differences from mean: 40 - 50 = -10, 50 - 50 = 0, 50 - 50 = 0, 60 - 50 = 10

Uploaded by

Trya Salsabilla

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

183 views10 pages

Summarising and Analysing Data

Uploaded by

Trya Salsabilla

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 10

A company's sales in the last year in its three different markets were as follows:

$
Market 1 100,000
Market 2 149,000
Market 3 51,000
Total 300,000
In a pie chart representing the proportion of the sales made in each market,
what would be the angle of the section representing Market 3?
61 degrees
120 degrees
50 degrees
17 degrees

SUMMARISING AND ANALYSING DATA

a) Describe the two types of data: categorical (nominal and ordinal) and numerical
(continuous and discrete). [S]

Nominal
Categorical
Ordinal
Data
Discrete
Numerical
Continuous
Categorical data
 Nominal data are type of data that is used to label variables without providing any quantitative value. The
data cannot be ordered.
Example: Gender (0 = male, 1 = female)

 Ordinal data are a kind of categorical data with a set order or scale to it
Example: Consumer satisfaction (not satisfied, somewhat not satisfied, neutral, somewhat satisfied,
satisfied)

Numerical Data
 Discrete data are data which can only take on a finite or countable number of values within a given range.
Example: Number of workers (100, 250, 1000)

 Continuous data are data which can take on any value

Example: Weight of flour materials (1.890 kgs)

b) Explain the terms descriptive analysis and inferential analysis

Descriptive analysis: Collecting, summarizing, and describing data
Example: What is the average of monthly electricity cost in 2020?

Inferential analysis: Drawing conclusions and/or making decisions concerning a population based only sample
data
Example: Is there any differences between electricity cost in Factory A and Factory B?

c) Calculate the mean, mode and median for ungrouped data and the mean for
grouped data. [S]
d) Calculate measures of dispersion including the variance, standard
deviation and coefficient of variation both grouped and ungrouped data

The number of daily complaints to a local government office has a mean of 12 and a
standard deviation of 3 complaints.
What is the coefficient of variation as a %?

%

e) Calculate expected values for use in decision-making.[S]

f) Explain the properties of a normal distribution.[S]

g) Interpret normal distribution graphs and tables. [

The results of a chemistry examination are normally distributed with a mean score of
56 and a standard deviation of 12.
What is the percentage probability that a student will score more than 80?

%

Which TWO of the following are feasible values for the correlation coefficient?
+ 1.40
+ 1.04
0
- 0.94

The following statements relate to the advantages that linear regression analysis has
over the high-low method in the analysis of cost behaviour:

(1) The reliability of the analysis can be statistically tested
(2) It takes into account all of the data
(3) It assumes linear cost behaviour
Which of the above statements are TRUE?
2 and 3 only
1 and 2 only
1 only
1, 2 and 3

h) Describe the five characteristics of big data (volume, variety, velocity, value and
veracity).[K]
i) Explain the three types of big data: structured, semi-structured and unstructured.
[K]
j) Describe the main uses of big data and analytics for organisations.

Identify whether each of the following statements about the uses of big data
analytics in organisations is TRUE or FALSE.
. True False
It helps to analyse the efficiency of business processes in real time
It helps to better understand customer behaviour and preferences

Which of the following statements concerning Big Data analytics is correct?

 Big Data analytics relies on digital information
 Big Data analytics relies on written information
 Big Data analytics relies on verbal information
 Big Data analytics relies on confidential information

Big Data
LO:
a) Describe the five characteristics of big data (volume, variety, velocity, value and veracity)
b) Explain the three types of big data: structured, semi-structured and unstructured

Volume: organisations now hold huge volumes of data

Example:
 Social media companies, such as Facebook, will have a data store of all the postings an individual makes
(and where they were made), photos posted and contacts
 A supermarket will have a data store of all purchases made, when and where they were made, how they
were paid for and the use of coupons via loyalty cards swiped at the checkout

Variety: Big Data can include much more than simply financial information and can include other
organisational data which is operational in nature as well as other internal and external information. This data
can be both structured and unstructured in nature
 Structured data – for example, a bank will hold a record of all receipts and payments (date, amount and
source) for a customer.
 Unstructured data – can make up 80% of business data but is more difficult to store and analyse

Velocity: The data must be turned into useful information quickly enough to be of use in decision making and
performance management (in real time if possible). The sheer volume and variety of data makes this task
difficult and sophisticated methods are required to process the huge volumes of non-uniform data quickly

Veracity: The data must be accurate and reliable enough.

Value: Having access to big data is all well and good but that’s only useful if we can turn it into a value

LO:
c) Describe the main uses of big data and analytics for organisations
The processing of Big Data is known as Big Data analytics. For example, Google Analytics tracks many
features of website traffic

Big Data is relevant to management accountancy in a number of ways, such as:

 It can help the organisation to understand its customers’ needs and preferences which can then be used
to improve marketing and sales.
 It can improve forecasting, for example of future customer spending or of machine replacement cycles, so
that more appropriate decisions can be made.
 It can help the organisation to automate business processes resulting in improved efficiency.
 It can help to provide more detailed, relevant and up to date performance measurement
Learning outcomes:
a) Calculate the mean, mode and median for ungrouped data and the mean for grouped data

Mean
The arithmetic mean, also known as the ‘average’, is calculated by dividing the sum of the values in question by
the number of values

A shopkeeper is about to put his shop up for sale. As part of the details of the business, he wishes to quote the
average weekly sales. The sales in each of the last 6 weeks are:
Week 1 2 3 4 5 6
Sales $1,120 $990 $1,040 $1,030 $1,105 $1,015
Determine the mean weekly sale.

The following distribution shows the number of employees absent per day for a company over a 22 day period.
No of employees absent No. of days (frequency)
2 2
3 4
4 3
5 4
6 3
7 3
8 3
Find the arithmetic mean for the above distribution

Median
The median is defined as the middle of a set of values, when arranged in ascending (or descending) order. The
median can be used to overcome any issues of skewed data.

The previous illustration we saw that a shop's weekly sales were given by the following sample over six weeks.

Week 1 2 3 4 5 6
Sales $1,120 $990 $1,040 $1,030 $1,105 $1,015

The sample has an arithmetic mean of $1,050. A prospective buyer of the business notices that the mean is
higher than the sales in four of the 6 weeks.
Calculate the median for him

Mode
The mode or modal value of a data set is that value that occurs most often.

Find the mode for the following distribution:

Complaints per week No of weeks
0 5
1 12
2 7
3 2
4 1

Learning outcomes:
b) Calculate measures of dispersion including the variance, standard deviation and coefficient of variation both
grouped and ungrouped data

Standard deviation
The standard deviation (σ) is a way of measuring how far away on average the data points are from the mean.
They measure average variability about the mean.
Ungrouped data

Find the standard deviation of each class:

FA Score (Class A) FA Score (Class B)
40 10
50 20
50 50
60 80
60 80
60 90

Grouped data

Coefficient of variation
The coefficient of variation is a statistical measure of the dispersion of data points in a data series around the
mean.

Learning outcomes:
c) Calculate expected values for use in decision-making

An expected value is a weighted average value of the different possible outcomes from a decision, where
weightings are based on the probability of each possible outcome

Expected values indicate what an outcome is likely to be in the long term if the decision can be repeated
many times over.

Example 1:
Merry joins a gamble by tossing a coin. Merry will get Rp100.000 if the coin toss result in head. However, if the
coin toss result in tail, Merry need to pay Rp50.000. What is the expected value?
Head (Kepala) / Tails (Ekor) = 50:50

Merry gets 100.000 kalau head

Merry pays 50.000 kalau tail

Expected value?

Outcome Probability Outcome x Probability

Muncul Head 100.000 50% 50.000
Muncul Tails -50.000 50% -25.000
Expected value 25.000

Example 2:
A company has recorded the following daily sales over the last 200 days:
Daily sales (units) Number of days
100 40
200 60
300 80
400 20
What will be the expected sales level in the future? 240 units

Daily sales (units) Number of days Probability Daily sales x Prob

100 40 40/200 = 0,2 20
200 60 60/200 = 0,3 60
300 80 80/200 = 0,4 120
400 20 20/200 = 0,1 40
200 1 240

Example 3:
An entity must make a decision between three options, A, B and C. The possible profits and losses are:
1. Option A: a profit of $2,000 with probability 0.5 or otherwise a loss of $500
2. Option B: a profit of $800 with probability 0.3 or otherwise a profit of $500
3. Option C: a profit of $1,000 with probability 0.7, or $500 with probability 0.1 or otherwise a loss of $400
Using EV, which option should be chosen? Option A

Option A
Outcome Probability Outcome x probability
2.000 0.5 1.000
-500 (1-0.5) = 0.5 -250
Expected value 750

Option B
Outcome Probability Outcome x probability
800 0.3 240
500 (1-0.3) = 0.7 350
Expected value 590

Option C
Outcome Probability Outcome x probability
1.000 0.7 700
500 0.1 50
-400 (1-0.7-0.1) = 0.2 -80
Expected value 670

Learning outcomes:
d) Explain the properties of a normal distribution

A normal distribution has the following characteristics:

 the mean (μ) is shown in the centre of the diagram
 the curve is symmetrical about the mean. This means that 50% of the values will be below the mean and
50% of the values will be above the mean.
 the mean, median and mode will all be the same for a normal distribution

The total area under the curve is equal to 1.

In general 68% of values are within one standard deviation (between –1 and 1), 95% of values are within two
standard deviations (between –2 and 2) and 99.7% of values are within three standard deviations (between –3
and 3).

Learning outcomes:
e) Interpret normal distribution graphs and tables

RST is a food producer, specialising in dried fruit and nuts. The dried fruit and nuts are prepared within the
factory and packed into small bags which are sold as snacks in supermarkets.
The weights of the snack bags are normally distributed with a mean weight of 70g and a standard deviation of
5g.
RST can use normal distribution to calculate the probabilities that a bag selected at random would be of an
acceptable weight.
The food producer would like to know the probability that the bag selected at random weighs less than 60g

Step 1: Calculate z-score

Z-score is a standard normal distribution which has a mean of 0
a standard deviation of 1.

z is the z score
x is the value being considered
μ is the mean
σ is the standard deviation

60−70 −10
z= = =−2
5 5
Step 2: Looking up the normal distribution tables
Therefore, the probability that a bag selected at random weighs less than 60g = (0.5 – 0.4772) = 0.0228 or 2.3%.

Exercise:
A machine produces components with diameter of mean 5 cm and standard deviation 0.1 cm. The production of
this component follows a normal distribution.
What proportion of the components produced will have diameters of the following dimensions?
1) between 5 and 5.2 cm
2) over 5.15 cm
3) between 4.8 and 5.1 cm

A class has score of Accounting exam with a mean of 75 and a variance of 100. 1)
1) What is upper quartile of this distribution? (i.e What is the minimum score to be best 25% in this class?)
2) What is the minimum score to be best 10% in this class?

DATA4
No ratings yet
DATA4
259 pages
final UNIT II-DESCRIPTIVE ANALYTICS
No ratings yet
final UNIT II-DESCRIPTIVE ANALYTICS
128 pages
DBB2102 – QUANTITATIVE TECHNIQUES FOR MANAGEMENT
No ratings yet
DBB2102 – QUANTITATIVE TECHNIQUES FOR MANAGEMENT
11 pages
bisc-ma-chapter-5
No ratings yet
bisc-ma-chapter-5
19 pages
FBR & IT Applications: Compiled and Presented by DR - Deepak Joshi For Academic Use Only
No ratings yet
FBR & IT Applications: Compiled and Presented by DR - Deepak Joshi For Academic Use Only
77 pages
Updated Review Research Paper on Big Data
No ratings yet
Updated Review Research Paper on Big Data
14 pages
BDA U-3
No ratings yet
BDA U-3
30 pages
Univariate Statistics w24 Update
No ratings yet
Univariate Statistics w24 Update
144 pages
Unit_II_Part I_Importance of Statistics in Data Science
No ratings yet
Unit_II_Part I_Importance of Statistics in Data Science
10 pages
Introduction to Statistics..Final
No ratings yet
Introduction to Statistics..Final
221 pages
Data-Preprocessing
No ratings yet
Data-Preprocessing
138 pages
BA_Unit_1_merged[1] Highlighted
No ratings yet
BA_Unit_1_merged[1] Highlighted
103 pages
DS Unit 1
No ratings yet
DS Unit 1
99 pages
Descriptive Statistics: Overview of Using Data
No ratings yet
Descriptive Statistics: Overview of Using Data
47 pages
Unit 3 Descriptive Statistics
No ratings yet
Unit 3 Descriptive Statistics
25 pages
E-Commerce Capstone Project Presentation
No ratings yet
E-Commerce Capstone Project Presentation
26 pages
Data Science & Analytics Paper
No ratings yet
Data Science & Analytics Paper
55 pages
E-Note_33325_Content_Document_20250319114322AM
No ratings yet
E-Note_33325_Content_Document_20250319114322AM
69 pages
Data Analytics Fundamentals
No ratings yet
Data Analytics Fundamentals
35 pages
Problem Set On Desc Stats Regression - PGP 2024 - 06aug2024
No ratings yet
Problem Set On Desc Stats Regression - PGP 2024 - 06aug2024
73 pages
Descriptive Statistics SV
No ratings yet
Descriptive Statistics SV
77 pages
Statistical Learning - Introduction
No ratings yet
Statistical Learning - Introduction
20 pages
CH3. Multiple Linear Regression 2023
No ratings yet
CH3. Multiple Linear Regression 2023
76 pages
Data categories in Business Analytics
No ratings yet
Data categories in Business Analytics
1 page
Exp-3
No ratings yet
Exp-3
16 pages
IDV_Report_Group4
No ratings yet
IDV_Report_Group4
26 pages
Estimation Methods in Network Analysis
No ratings yet
Estimation Methods in Network Analysis
57 pages
Further Bound Reference
No ratings yet
Further Bound Reference
42 pages
Chapter1 2023
No ratings yet
Chapter1 2023
76 pages
DSA-REPORT
No ratings yet
DSA-REPORT
11 pages
Data Analysis3
No ratings yet
Data Analysis3
31 pages
Unit 3
No ratings yet
Unit 3
30 pages
Untitled document
No ratings yet
Untitled document
10 pages
DA Major Notes
No ratings yet
DA Major Notes
46 pages
DBB2102 – QUANTITATIVE TECHNIQUES FOR MANAGEMENT
No ratings yet
DBB2102 – QUANTITATIVE TECHNIQUES FOR MANAGEMENT
15 pages
Exploring and Producing Data For Business Decision Making Module 1
No ratings yet
Exploring and Producing Data For Business Decision Making Module 1
162 pages
Assignment No 3
No ratings yet
Assignment No 3
16 pages
Lecture 1 Introduction
No ratings yet
Lecture 1 Introduction
7 pages
Lecture 01-05 Data, Central Tendency PDF
No ratings yet
Lecture 01-05 Data, Central Tendency PDF
51 pages
Assignment_DBB2102_BBA 3_Set-1 and 2_July-Aug_2024
No ratings yet
Assignment_DBB2102_BBA 3_Set-1 and 2_July-Aug_2024
18 pages
ch-1.pdf
No ratings yet
ch-1.pdf
19 pages
Unit 4
No ratings yet
Unit 4
5 pages
BA_Unit 1
No ratings yet
BA_Unit 1
16 pages
Introductory Lecture
No ratings yet
Introductory Lecture
29 pages
DBB2102 QUANTITATIVE TECHNIQUES FOR MANAGEMENT (18)
No ratings yet
DBB2102 QUANTITATIVE TECHNIQUES FOR MANAGEMENT (18)
12 pages
Tatistics Is A Branch of Applied Mathematics That Involves T
No ratings yet
Tatistics Is A Branch of Applied Mathematics That Involves T
25 pages
Iit M Qualifier Exam Qpa1
No ratings yet
Iit M Qualifier Exam Qpa1
56 pages
Summarising and Analysing Data
No ratings yet
Summarising and Analysing Data
36 pages
Module 5 Research Methodology (3)
No ratings yet
Module 5 Research Methodology (3)
9 pages
Assignment DSBDS Insem
No ratings yet
Assignment DSBDS Insem
6 pages
BA_CH01
No ratings yet
BA_CH01
14 pages
Introduction To Statistics - Doc1
No ratings yet
Introduction To Statistics - Doc1
236 pages
Lesson 2 Notes
No ratings yet
Lesson 2 Notes
11 pages
Unit .......
No ratings yet
Unit .......
45 pages
Assignments-Aurr (1) Priyabs-11
No ratings yet
Assignments-Aurr (1) Priyabs-11
4 pages
Building Data Report
No ratings yet
Building Data Report
27 pages
It B.tech II Year II Sem DV (R18a0555)
No ratings yet
It B.tech II Year II Sem DV (R18a0555)
73 pages
DBBA2102
No ratings yet
DBBA2102
10 pages
Recasting and Joining of Dataframes: Nptel Noc18-Cs28
No ratings yet
Recasting and Joining of Dataframes: Nptel Noc18-Cs28
23 pages
20IT503 - Big Data Analytics - Unit2
No ratings yet
20IT503 - Big Data Analytics - Unit2
62 pages
06 Feature Engineering
No ratings yet
06 Feature Engineering
24 pages
Ahsan Stats
No ratings yet
Ahsan Stats
9 pages
MMW (Data Management) - Part 1
No ratings yet
MMW (Data Management) - Part 1
26 pages
Dissertation Analysis of Data Examples
100% (2)
Dissertation Analysis of Data Examples
5 pages
Data Science - g.scali (Lect1) (1)
No ratings yet
Data Science - g.scali (Lect1) (1)
22 pages
Biostatistics - Bme Yr Iii
No ratings yet
Biostatistics - Bme Yr Iii
38 pages
2.3 Experimental Design and Construction of Choice Sets 2.3.1 Design
No ratings yet
2.3 Experimental Design and Construction of Choice Sets 2.3.1 Design
18 pages
Midterms Gec Math Adooooor
No ratings yet
Midterms Gec Math Adooooor
6 pages
Complete_Survey_Questions_Jamovi_Analysis copy
No ratings yet
Complete_Survey_Questions_Jamovi_Analysis copy
4 pages
Statistics For Business and Economics: Describing Data: Graphical
No ratings yet
Statistics For Business and Economics: Describing Data: Graphical
53 pages
Miss Forest
No ratings yet
Miss Forest
10 pages
SINGLE VARIABLE Notes 5.3 Year 10
No ratings yet
SINGLE VARIABLE Notes 5.3 Year 10
9 pages
Fundamentals of Predictive Analytics A Business Analytics Course
No ratings yet
Fundamentals of Predictive Analytics A Business Analytics Course
36 pages
ASM 1 Thay Duong
No ratings yet
ASM 1 Thay Duong
8 pages
Data Analytics (Finished
No ratings yet
Data Analytics (Finished
4 pages
Stat Module With Assessment
No ratings yet
Stat Module With Assessment
11 pages
Lesson 1-04 Types of Data STAT
No ratings yet
Lesson 1-04 Types of Data STAT
19 pages
General Principles of Preclinical Study Design
No ratings yet
General Principles of Preclinical Study Design
15 pages
Loan Prediction Using Artificial Intelligence and Machine Learning
No ratings yet
Loan Prediction Using Artificial Intelligence and Machine Learning
24 pages
Different Kinds of Variables and Their Uses
67% (3)
Different Kinds of Variables and Their Uses
38 pages
Business Analytics (MIS171) Summary Notes
No ratings yet
Business Analytics (MIS171) Summary Notes
6 pages
Lecture 2 Additional Notes Introduction To Statistical Modelling
No ratings yet
Lecture 2 Additional Notes Introduction To Statistical Modelling
2 pages
4.02 Statistics Fundamentals
No ratings yet
4.02 Statistics Fundamentals
2 pages
Why Are We Using Logistic Regression To Analyze Employee Attrition?
No ratings yet
Why Are We Using Logistic Regression To Analyze Employee Attrition?
4 pages
CSIS 5420 Final Exam - Answers (13 Jul 05)
No ratings yet
CSIS 5420 Final Exam - Answers (13 Jul 05)
8 pages
Logit Analysis
No ratings yet
Logit Analysis
8 pages
SPSS Notes
No ratings yet
SPSS Notes
2 pages
Statistics For Management: Q.1 A) 'Statistics Is The Backbone of Decision Making'. Comment
No ratings yet
Statistics For Management: Q.1 A) 'Statistics Is The Backbone of Decision Making'. Comment
10 pages
Business Statistics I Essentials
From Everand
Business Statistics I Essentials
Louise Clark
5/5 (5)

Summarising and Analysing Data

Uploaded by

Summarising and Analysing Data

Uploaded by

A company's sales in the last year in its three different markets were as follows:

SUMMARISING AND ANALYSING DATA

 Continuous data are data which can take on any value

b) Explain the terms descriptive analysis and inferential analysis

e) Calculate expected values for use in decision-making.[S]

f) Explain the properties of a normal distribution.[S]

Which of the following statements concerning Big Data analytics is correct?

Volume: organisations now hold huge volumes of data

Veracity: The data must be accurate and reliable enough.

Big Data is relevant to management accountancy in a number of ways, such as:

Find the mode for the following distribution:

Find the standard deviation of each class:

Merry gets 100.000 kalau head

Outcome Probability Outcome x Probability

Daily sales (units) Number of days Probability Daily sales x Prob

A normal distribution has the following characteristics:

The total area under the curve is equal to 1.

Step 1: Calculate z-score

You might also like