Data analysis training workshop_Day 1 presentation
Data analysis training workshop_Day 1 presentation
Quantitative Qualitative
Quantitative Qualitative
2. Are the numbers used to impose some kind of hierarchy or order? If yes then it
is an ordinal scale
6. If yes to the above three questions and you can say that a value (eg: 20) is
twice as much as another value (eg: 40) then it is a ratio scale
Section B: Introduction to MS
Excel
1. File formats
2. Importing data in Excel
3. Basic commands in Excel
2. The default file format when saving data in Excel is .xlxs however,
the workbook can be saved in any of the discussed formats,
including .pdf
3. .xlxs files easily imported into statistical software packages like Stata
and SPSS
2. Importing Data into Excel
1. Click the File tab
2. Select Open
3. Search for the File Name
4. Select File
Import the ‘example1.xlsx’ file into Excel
Two Sheets:
1. Dataset 1
2. Dataset 2
3. Basic Commands in Excel
1. Sum – adds values 7. Min and Max – provides the
2. Average – provides the mean min and max number in a
for a certain range of values range of values
3. Count – counts only numeric 8. Concatenate – combines
values values from multiple cells into
one cell
4. CountA – counts numeric,
strings, dates, time, text, etc 9. Decimal places – formats
values
5. IF – allows you to restrict data
10. Percentage = (𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡
𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣
) ∗ 100
6. Trim – removes spaces in cells 𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣
1. Sum the test scores for each student across all tests
2. What is the average score of each test?
3. What is each students average score?
4. Which students scored higher in the test 1 compared to test 4?
5. Count the number of cells between D4:18
6. Count the number of cells between A15:F15
7. Count the number of cells in the entire dataset
8. What is the minimum score obtained in test 3?
9. What is the maximum score obtained in test 3?
10. Neaten up the dataset by removing unwanted spaces
11. Create a new variable which contains the names and surnames of students in one column
12. Create a new variable which rounds the average test score of each student to the nearest whole number
13. Reorder the new ‘names and surnames’ variable into alphabetic order
14. Reorder test 2 scores from lowest to highest
15. Reorder test 3 scores from highest to lowest
4. Graph Function in Excel
1. Graphs are an effective tool to communicate your data or results
graphically/visually
2. Used to display data and allow for further exploration of the topic
3. Used when tables don’t adequately demonstrate important
relationships or patterns between data points
4. Different types of charts that are available in MS Excel however, the
most common include:
a. Column graph
b. Line graph
c. Pie chart
d. Histogram
A. Column Graph
Histograms are
used to
graphically
summarize the
distribution of
data. It shows
how often each
value in the
dataset appears.
Exercise – In class or homework
1. Plot a graph showing Lisa Reddy’s performance across all tests
2. Plot a graph showing the average score of each test
3. Plot a graph showing the average score across all tests for the first
10 students
4. Using the second sheet (dataset 2) in the ‘example1.xlxs’ data
calculate the following:
a. The percentage and proportion for each category of the gender, race and cell phone
types variables
b. Plot the frequencies of the race variable
c. Plot the proportions of the race gender variable
d. Plot the percentage of the cell phone types variable
End of day one, thank you!