Home Assignment Dataliteracy
A. Multiple-choice questions
1. Which of the following best defines data literacy?
A) The ability to read and write data B) The ability to find and use data effectively
C) The ability to analyse data using AI D) The ability to collect and store data securely
Answer : Data literacy refers to the ability to read, interpret, create, and communicate data in a meaningful
way.
4. Which of the following statements about the quantity of data needed for machine learning projects is true?
A) More data is always better for good predictions.
B) Small batches of data are sufficient for complex models.
C) Data quantity depends solely on the number of features.
D) Data diversity is not essential for model performance.
Answer : Machine learning models generally perform better with more data because a larger, more varied sample
helps the model learn patterns and generalize effectively, which reduces overfitting.
6. What method of data collection involves direct communication with individuals or groups to gather information?
A) Observations B) Experiments C) Interviews D) Marketing campaigns
Answer : The distinguishing feature of ratio scale data is the presence of a true zero point, which indicates
the absence of the quantity being measured. This allows for meaningful comparisons using ratios (e.g., "twice as
much" or "half as much"). Examples include weight, height, age, and distance.
9. Which statistical measure is most suitable for data sets with evenly spread values and no exceptionally high or
low values?
A) Mean B) Median C) Mode D) Variance
Answer : The mean is the most suitable measure for data sets with evenly spread values and no outliers because
it considers all data points and provides the average value.
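The effect of outliers on the mean can be checked with a short sketch. The values below are hypothetical and chosen only for illustration; they are not part of the assignment.

```python
import statistics

# Hypothetical, evenly spread values (illustration only)
even_values = [10, 12, 14, 16, 18]

# The same values with one exceptionally high value added
with_outlier = even_values + [100]

# For evenly spread data the mean and the median agree
mean_even = statistics.mean(even_values)
median_even = statistics.median(even_values)

# With an outlier the mean is pulled upwards, the median barely moves
mean_outlier = statistics.mean(with_outlier)
median_outlier = statistics.median(with_outlier)

print("Evenly spread: mean =", mean_even, "median =", median_even)
print("With outlier:  mean =", round(mean_outlier, 2), "median =", median_outlier)
```

In this sketch the mean describes the evenly spread values well, while the single high value shifts the mean far more than the median.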
10. What is the term used to describe the graphical or pictorial representation of data?
A) Statistical summary B) Data organization
C) Data visualization D) Data interpretation
1. Explain the concept of data literacy and its importance in today's digital age.
Answer : Data literacy is the ability to read, understand, analyse, and communicate data effectively. It
enables individuals to make informed decisions, assess data critically, and solve problems using data.
Data visualisation is important in data analysis for the following reasons:
Simplifies complex data.
Reveals patterns, trends, and outliers.
Aids data-driven decision-making.
Enhances communication and understanding.
Saves time in interpreting data.
6. What is data? (HW)
7. What do you mean by web scraping? (HW)
D. Python Programs
1. The ages of a group of people in a community are: 25, 28, 30, 35, 40, 45, 50, 55, 60, 65.
Write a program to calculate the mean, median, and mode of the ages.
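Answer : One possible solution, sketched with Python's built-in statistics module (the variable names are our own choice). Every age in the list occurs exactly once, so there is no single mode; the sketch uses multimode, which returns all the most frequent values.

```python
import statistics

# Ages given in the question
ages = [25, 28, 30, 35, 40, 45, 50, 55, 60, 65]

mean_age = statistics.mean(ages)      # sum of the ages divided by their count
median_age = statistics.median(ages)  # average of the two middle values (even count)
modes = statistics.multimode(ages)    # list of the most frequent values

print("Mean:", mean_age)
print("Median:", median_age)

# If every value is equally frequent, the data has no single mode
if len(modes) == len(ages):
    print("Mode: no single mode, every age occurs once")
else:
    print("Mode:", modes)
```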
2. A company recorded the daily temperatures (in degrees Celsius) for five consecutive days:
20°C, 22°C, 25°C, 18°C, and 23°C. Determine the variance and standard deviation of the temperatures.
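Answer : A possible sketch using the statistics module. The question does not say whether the five days are the whole population or a sample, so both versions are computed here; which one is expected is an assumption to check with the teacher.

```python
import statistics

# Daily temperatures in degrees Celsius, from the question
temperatures = [20, 22, 25, 18, 23]

# Population formulas: divide by n
pop_variance = statistics.pvariance(temperatures)
pop_std_dev = statistics.pstdev(temperatures)

# Sample formulas: divide by n - 1
sample_variance = statistics.variance(temperatures)
sample_std_dev = statistics.stdev(temperatures)

print("Population variance:", round(pop_variance, 2))
print("Population standard deviation:", round(pop_std_dev, 2))
print("Sample variance:", round(sample_variance, 2))
print("Sample standard deviation:", round(sample_std_dev, 2))
```

The standard deviation is the square root of the variance in both cases.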
3. Plot a line chart representing the weekly number of customer inquiries received by a
customer service center:
• Week 1: 150 inquiries
• Week 2: 170 inquiries
• Week 3: 180 inquiries
• Week 4: 200 inquiries
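Answer : A minimal sketch with matplotlib. The title, axis labels, and marker style are our own choices and are not given in the question.

```python
import matplotlib.pyplot as plt

# Data from the question
weeks = ["Week 1", "Week 2", "Week 3", "Week 4"]
inquiries = [150, 170, 180, 200]

fig, ax = plt.subplots()

# Line chart with a marker at each weekly value
ax.plot(weeks, inquiries, marker="o")

# Title and axis labels (assumed wording)
ax.set_title("Weekly Customer Inquiries")
ax.set_xlabel("Week")
ax.set_ylabel("Number of inquiries")

plt.show()
```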
4. Plot a bar chart representing the number of books sold by different genres in a bookstore:
• Fiction: 120 books
• Mystery: 90 books
• Science Fiction: 80 books
• Romance: 110 books
• Biography: 70 books
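A possible sketch for the bar chart of books sold, again with matplotlib. The title and axis labels are assumed wording, not taken from the question.

```python
import matplotlib.pyplot as plt

# Data from the question
genres = ["Fiction", "Mystery", "Science Fiction", "Romance", "Biography"]
books_sold = [120, 90, 80, 110, 70]

fig, ax = plt.subplots()

# One bar per genre, bar height = number of books sold
ax.bar(genres, books_sold)

# Title and axis labels (assumed wording)
ax.set_title("Books Sold by Genre")
ax.set_xlabel("Genre")
ax.set_ylabel("Number of books sold")

plt.show()
```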
Answer :
import matplotlib.pyplot as plt
# Adding title
plt.title('Distribution of Transportation Modes Among Commuters', fontsize=14)