Data Analysis Concepts
Data Analysis Concepts
CONCEPTS
DEFINITION WHY WE USE IT Data transformation usually LONG DATA WIDE DATA
involves:
is the process of changing the بيسهل االناليسز Adding, copying, or replicating Long data is data where each row Long data is preferred when Wide data is data where each row Wide data is preferred when
data’s format, structure, or values. data contains a single data point for a contains multiple data points for
particular item. the particular items identified in
the columns.
Deleting fields or records
وضح اكتر Storing a lot of variables about Performing advanced statistical Creating tables and charts with a Comparing straightforward line
each subject. For example, 60 analysis or graphing few variables about each subject graphs
Standardizing the names of years worth of interest rates for
Data organization: better organized data is easier to use variables each bank
Data merging: data with the same organization can be merged together
1.1.1. Long data is data where each row contains a single data point for a particular item.
1.1.2.1. Storing a lot of variables about each subject. For example, 60 years worth of interest rates for each bank
1.2.1. Wide data is data where each row contains multiple data points for the particular items identified in the columns.
1.2.2.1. Creating tables and charts with a few variables about each subject
2. data transformation
2.1. DEFINITION
2.2.1.1.2. EX.To make it easier to create charts, you may also need to transform long data to wide data. Consider the following example of transforming stock prices (collected as long data) to wide data.
2.2.1.1.3. Data compatibility: different applications or systems can then use the same data
2.2.1.1.4. Data migration: data with matching formats can be moved from one system to another
2.2.1.1.5. Data merging: data with the same organization can be merged together
2.2.1.1.5.1. Mario is a plumber who owns a plumbing company. After years in the business, he buys another plumbing company. Mario wants to merge the customer information from his newly acquired company with his own, but the other company uses a different database. So, Mario needs to make the data compatible. To do this, he has to transform the format of the acquired company’s data. Then, he must remove duplicate rows for customers they had in
common. When the data is compatible and together, Mario’s plumbing company will have a complete and merged customer database.
2.2.1.1.6. Data enhancement: data can be displayed with more detailed fields
2.2.1.1.7. Data comparison: apples-to-apples comparisons of the data can then be made
2.3.6. Saving a file in a different format. For example, saving a spreadsheet as a comma separated values (CSV) file.