Final Assigment 1 Statistic 570
Final Assigment 1 Statistic 570
Student declaration
I certify that the assignment submission is entirely my own work and I fully understand the consequences of plagiarism.
I understand that making a false declaration is a form of malpractice.
Student Signature
Nguyen Thai Bao
Grading grid
P1 P2 M1
1
Description of activity undertaken
Assessor name:
2
Summative Feedbacks Resubmission Feedbacks
3
Contents
Introduction ........................................................................................................................................ 5
Conclusion ......................................................................................................................................... 12
References ......................................................................................................................................... 12
4
Introduction
It is evident that society cannot be properly administered on the basis of hunches or trial and error,
and that much depends on the right interpretation of numerical data in business and economics.
Data-driven decisions will outperform those based on intuition or gut feelings. Statistics may be used
to solve all of these problems. Essentially, statistics is a scientific methodology to analyze numerical
data in order to maximize its interpretation, comprehension, and usage. This indicates that statistics
aids in the transformation of data into information; that is, data that has been evaluated,
comprehended, and is helpful to the recipient. The study will rely on a document including
information and data gathered in Vietnam between November 2014 and April 2016. This document
is part of the World Bank's Enterprise Survey (ES) program, which aims to develop an investment
climate, create jobs, and achieve long-term growth. ES is also used to create an enterprise data sheet,
which allows for the tracking of changes in the business environment over time, as well as measuring
the impact of reforms. The report includes the nature of economic and business data, different
methods of collecting data and the advantages and disadvantages of those approaches; as well as
the application using data from the document from the World Bank’s Enterprise Survey program.
Information
Information is the outcome of data processing, which is commonly done by a computer. Data is
meaningless on its own. When it is understood, it only takes on meaning and becomes information.
Data is made up of basic facts and numbers. When the data is processed into context-specific sets, it
gives information. This yields facts, allowing the processed data to be utilized in context and have
meaning (Boisot & Canals, 2004).
Knowledge
Knowledge is defined as what a person understands. This is known as their knowledge base. As a
person gains experience or learns new things, their knowledge base expands. Understanding what
information is based on one's experience and knowledge basis is required for knowledge. With
5
knowledge, data may be interpreted. Knowledge in computing is also what a computer knows
through the usage of a knowledge base built of rules and facts, as demonstrated in knowledge-based
systems, modeling, and simulation software (Boisot & Canals, 2004).
Application
28.74%
25.29%
South East 127 29.20%
6
thriving economies that attract both domestic and foreign investment. The facilities system is fairly
developed, with roads, railways, maritime, and aviation traffic axis system connecting locations
across the nation.
As a result, enterprises in the Food, garment and non-metallic sectors are concentrated in the three
regions: North Central Area & Central Coastal Area, Red River Delta. However, because having the
main source of daily use water as salt water, thus suffering from salinization and floods, low
intellectuals and slow-developing economy, the Mekong River Delta has a tiny number of
enterprises. These skills are critical for firms trying to expand in the future. Non-metallic product
firms, for example, should invest more in regions that has fast-changing economy such as the Red
River Delta, Southeast, and elsewhere rather than drilling in the Mekong Delta.
7
On the other hand, this technique also comes with an amount of disadvantages. One of the
descriptive analysis's major shortcomings is the respondents' lack of confidentiality and honesty.
Responders are not always truthful and do not always provide the proper response for a variety of
reasons. If the study questionnaire is not thoroughly and deliberately constructed, errors in the
overall research result may result. It is also usual for researchers to choose just material that supports
their study hypothesis and ignore other parts of the data that contradict the idea. This might lead to
bias and mistakes in research. Another downgrade of this type of this method is only the link and
association between two variables are shown by a descriptive correlational study. It simply defines
how closely two variables and qualities are connected to one another. Even if there is a very high and
substantial correlation between variables, this does not always imply causation or influence of one
variable on another (van der Voordt, 2002).
Descriptive Table
Mean 246.5150115
Median 52
Mode 20
Standard Deviation 841.0077621
Variance 707294.0559
Min 5
Max 9000
Range 8995
Count 433
Figure 2. Descriptive statistic
According to descriptive statistics theory: Mean is the average value of the variables represented in
the data, Mode is the most common or most prevalent in data sets; Median is the line drawn
between the big and small half of a variable. Standard deviation is the average variance of the
variables in the data set; the greater the standard deviation, the greater the variability in the data
set, and Variance quantifies how far apart a group of numbers is from its mean value. Range is the
distance between data points with the highest and lowest values. Finally, Minimum is the data that
has the smallest value and Maximum is the data that has the biggest value.
8
The average number of persons in the descriptive table above with the visible number of employee
variable is 247, the median is 52 employees, and the most common number of workers is 20.
Furthermore, the standard variance in the number of employees in various regions is around 841,
indicating that there is a lot of variety and the variance is about 707294 people. This shows that about
68% of 433 surveyed companies have from 5 to about 1,110 employees.
Finally, the lowest number of employees is 5, while the greatest is 9000. The number of persons in
the range between the minimum and maximum may be calculated as 8995. It can be seen that in
this region, there is a company that has the least number of employees (5 people), while there is
another company that has the greatest number of workers (9000 people). This shows that the range
of companies’ sizes is considerably huge.
9
of the trials. Another disadvantage of the methodology is that large samples are more reliable,
especially for a large number of variables (Hoyle, 2000).
140
120
100
f2 (Hours
80 operating/week)
60 Linear (f2 (Hours
40 operating/week))
20
0
0 100 200 300 400
Inventory
10
Exploratory analysis is a flexible and adjustable type of analysis. Second, data from the exploratory
study might be utilized to provide the framework for future research. As a result of greater topic
competency, more research problems are addressed, and the effectiveness of study findings is
significantly enhanced. It also assists in choosing the optimal technique for achieving a researcher's
objectives. Furthermore, because it detects and forecasts study outcomes, this analytical
methodology is useful in research and may save time and money (Hoaglin, et al., 1983).
The exploratory technique, however, has certain downsides since research results generate
information, and how that information is received might be deceiving, and there are no clear
outcomes. Furthermore, because this methodology usually utilizes a costly and underrepresented
number of samples for the target population, it is ineffective for making practical decisions (Hoaglin,
et al., 1983).
140
120
100
f2 (Hours
80 operating/week)
60 Linear (f2 (Hours
40 operating/week))
20
0
0 100 200 300 400
Inventory
11
Conclusion
The contents of data, information, and knowledge have been explained through definitions in the
preceding report, as well as instances for practical application. Furthermore, the paper
demonstrated three distinct analytical methodologies, including descriptive statistics, a
confirmatory approach, and an exploratory strategy. The examples of these three ways have
demonstrated the linkages and correlations between variables, assisting in the recognition of
findings and providing advice for companies based on the three techniques of analysis.
References
Boisot, M., & Canals, A. (2004). Data, information and knowledge: have we got it. Journal of
Evolutionary Economics, 2(2), 50-62.
BPP. (2010). Marketing Principles - Business essentials: supporting HND/HNC and Foundation
Degrees. London: BP Learning Media.
Hoaglin, D. C., Mosteller, F., & Tukey, J. W. (1983). Understanding Robust and Exploratory Data
Analysis. New York: Wiley Classics Library.
Hoyle, R. H. (2000). Confirmatory Factor Analysis. Handbook of applied multivariate statistics and
mathematical modeling, 465 - 497.
Kotler. (2012). Principles of Marketing, 14th edition. New York: Prentice Hall.
Kotler, P., Armstrong, G., Wong, V., & Saunders, J. (2008). Principles of Marketing - Fifth European
Edition. Essex: Prentice Hall.
van der Voordt, J. (2002). Ways to study - Descriptive research. In T. de Jong, & J. van der Voordt
(Eds.), Ways to study architectural, urban and technical design (p. 367). Delft: Delft
University Press.
12