FDS QB

Uploaded by

shwethasri366

CS3352 Foundations of Data Science - Question Bank

Unit 1 - Introduction
Part A
1. What is Data Science?
2. What is Big Data?
3. Compare Data Science and Big Data.
4. List out and define the characteristics of big data.
5. List out any 4 benefits of big data.
6. List out the facets of data.
7. Write a brief answer about structured data.
8. Compare Structured and Unstructured data.
9. Briefly explain Graph based data.
10. Compare Natural language data and Machine generated data.
11. List out the seven steps in Data Science process.
12. Briefly discuss the data exploration process.
13. What is a model and how do you build it?
14. Differentiate Data Mining and Data Warehousing.
15. Problems to find Mean, Median, Mode, Standard Deviation, Variance, Range, IQR.
(i) Find the mean, median, mode and range for the given data:
44, 79, 94, 43, 53, 65, 87, 90, 70, 69, 65, 89, 85, 53, 47, 61, 27, 80
(ii) The weights, in grams, of seven sweet potatoes are 260, 225, 205, 240, 232, 205, and 214.
What is the median weight?
(iii) Find the variance for the following data: 11, 13, 14, 15, 19, 22, 24, and 26
(iv) The heights in inches of the students in your class are as follows: 58, 58, 59, 60, 62, 64,
64, 65, 66, 66, 66, 66, 68, 68, 69, 70, 71, 72, 72, 74, 75, and 77. Calculate the Variance and
Standard Deviation for the given data.
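
The problems in question 15 can be checked with a short Python sketch. It assumes the standard library `statistics` module (Python 3.8 or later for `multimode`); the helper name `summarize` is hypothetical. The questions do not say whether the data are a population or a sample, so both variances are reported: `pvariance` divides by n and `variance` divides by n - 1.

```python
import statistics

def summarize(data):
    """Return basic summary statistics for a list of numbers."""
    return {
        "mean": statistics.mean(data),
        "median": statistics.median(data),
        "modes": statistics.multimode(data),      # every most-frequent value
        "range": max(data) - min(data),
        "pvariance": statistics.pvariance(data),  # population: divide by n
        "variance": statistics.variance(data),    # sample: divide by n - 1
        "pstdev": statistics.pstdev(data),        # population standard deviation
        "stdev": statistics.stdev(data),          # sample standard deviation
    }

# Data copied from problems (i) to (iv).
scores = [44, 79, 94, 43, 53, 65, 87, 90, 70, 69, 65, 89, 85, 53, 47, 61, 27, 80]
weights = [260, 225, 205, 240, 232, 205, 214]
values = [11, 13, 14, 15, 19, 22, 24, 26]
heights = [58, 58, 59, 60, 62, 64, 64, 65, 66, 66, 66, 66, 68, 68,
           69, 70, 71, 72, 72, 74, 75, 77]

print(summarize(scores))           # (i) mean, median, mode and range
print(statistics.median(weights))  # (ii) median weight
print(summarize(values))           # (iii) variance
print(summarize(heights))          # (iv) variance and standard deviation
```

For (i), the data set has two modes (53 and 65 each occur twice), which is why the sketch uses `multimode` rather than `mode`.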
Part B
1. Briefly explain the facets of data.
2. Explain in detail the steps involved in the Data Science process.
3. What is Data mining? Explain Data mining architecture.
4. Discuss the measures of central tendency, with their formulae and examples.
5. Define Dispersion of data. What are the parameters included to find the dispersion of data?
List out their respective formulae and explain with examples.
6. Define IQR. Find IQR for the following sets of data.
(i) 4, 12, 1, 8, 9, 43, 21, 11, 15, 6, 31, 17, 10, 2, 5.
(ii) 42, 51, 62, 47, 38, 50, 54, 43, 59, 45.
7. Explain Data Warehousing in detail, with its architecture and working.
8. What is Exploratory Data Analysis? “Exploratory data analysis is important to find the
properties of data along with the traditional statistical measures”. Justify.
9.
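
The IQR problem in Part B, question 6, can be sketched in Python as follows. This is one possible approach, assuming the median-of-halves convention common in introductory texts: sort the data, split it into lower and upper halves (leaving out the middle value when n is odd), and take the median of each half as Q1 and Q3. Other quartile conventions exist and can give slightly different answers. The function name `iqr` is hypothetical.

```python
import statistics

def iqr(data):
    """Return (Q1, Q3, IQR) using the median-of-halves convention."""
    if len(data) < 2:
        raise ValueError("need at least two values")
    ordered = sorted(data)
    half = len(ordered) // 2
    lower = ordered[:half]    # values below the median position
    upper = ordered[-half:]   # values above the median position
    q1 = statistics.median(lower)
    q3 = statistics.median(upper)
    return q1, q3, q3 - q1

# Data copied from Part B, question 6.
set_i = [4, 12, 1, 8, 9, 43, 21, 11, 15, 6, 31, 17, 10, 2, 5]
set_ii = [42, 51, 62, 47, 38, 50, 54, 43, 59, 45]

print(iqr(set_i))   # (5, 17, 12)
print(iqr(set_ii))  # (43, 54, 11)
```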

Unit 2 - Describing Data
Part A
1. List out the types of data.
2. Compare Qualitative and Quantitative data with examples.
3. What is ranked data? Give an example.
4. Compare Continuous and discrete variables.
5. Differentiate Nominal and Ordinal data.
6. List out the four levels of measurement.
7. What is a dependent variable? Explain with an example.
8. What is a Frequency distribution?
9. Write the formula for calculating the class interval in a frequency distribution table.
10. Write the formulae for calculating relative frequency and cumulative frequency.
11. What is an Outlier? How does an outlier affect a data set?
12. List out the four typical shapes of a histogram or a frequency polygon.
13. What are the measures of variability in statistics?
14. Define a Normal curve.
15. What are the properties of a Normal curve?
16. What is a Z-Score? What is the formula used to calculate a Z-score?
17. Michael scored 85 in the mid-term mathematics examination. Assuming the scores of all
the students are normally distributed with mean 72 and standard deviation 10, find the
z-score for Michael’s exam grade.
18. The weight of chocolate bars from a particular chocolate factory has a mean of 8 ounces
with a standard deviation of 0.1 ounce. What is the z-score corresponding to a weight of 8.17
ounces?
19. Discuss the standard normal curve.
20. Define skewness and discuss its two types.
21. What is Kurtosis?
22. Calculate the mean of first 10 even numbers.
23. Discuss frequency distributions for grouped and ungrouped data.
24. What is meant by misleading graphs?
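
The z-score problems in questions 17 and 18 both apply z = (x - mean) / standard deviation. A minimal Python sketch, with the hypothetical helper name `z_score`, might look like this:

```python
def z_score(x, mean, std_dev):
    """Number of standard deviations that x lies above (+) or below (-) the mean."""
    if std_dev <= 0:
        raise ValueError("standard deviation must be positive")
    return (x - mean) / std_dev

# Question 17: score 85, mean 72, standard deviation 10.
print(round(z_score(85, 72, 10), 2))     # 1.3

# Question 18: weight 8.17 oz, mean 8 oz, standard deviation 0.1 oz.
print(round(z_score(8.17, 8, 0.1), 2))   # 1.7
```

The results are rounded because floating-point subtraction leaves a tiny error in the second case.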
Part B
1. Explain in detail the types of variables with an example for each type.
2. Thirty AA batteries were tested to determine how long they would last. The results, to the
nearest minute, were recorded as follows:
423, 369, 387, 411, 393, 394, 371, 377, 389, 409, 392, 408, 431, 401, 363, 391, 405, 382, 400,
381, 399, 415, 428, 422, 396, 372, 410, 419, 386, 390.
Construct a frequency distribution table to record,
a) Frequency
b) Relative frequency
c) Cumulative Frequency
d) Visualize the table using a histogram.
3. A survey was taken on Maple Avenue. In each of 20 homes, people were asked how many
cars were registered to their households. The results were recorded as follows: 1, 2, 1, 0, 3, 4,
0, 1, 1, 1, 2, 2, 3, 2, 3, 2, 1, 4, 0, 0. Construct a Frequency distribution table for the given data.
4. Explain in detail the various methods involved in describing data using graphs.
5. Explain with an example the guidelines for constructing frequency distributions based on
frequency, relative and cumulative frequency distribution.
6. Illustrate histogram and explain its important features.
7. (a) Interpret the typical shapes of a graph with example.
(b) Describe mean, median and mode.
8. Write the steps to calculate IQR and calculate the range and IQR with an example.
9. (a) Explain the steps to calculate Standard deviation.
(b) Describe how frequency distribution and graphical representation are carried out for
Qualitative data.
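
The battery problem in Part B, question 2, can be sketched in Python as below. The question leaves the choice of classes open, so the class width of 10 and the starting value of 360 are assumptions made here for illustration; the function name `frequency_table` is likewise hypothetical. The sketch assumes integer data and a start value no greater than the smallest observation, and it prints a text histogram in place of a plotted one.

```python
def frequency_table(data, start, width):
    """Group integer data into equal-width classes [lower, lower + width - 1].

    Returns rows of (lower, upper, frequency, relative frequency,
    cumulative frequency). Assumes start <= min(data).
    """
    n = len(data)
    n_classes = (max(data) - start) // width + 1
    counts = [0] * n_classes
    for value in data:
        counts[(value - start) // width] += 1

    rows, cumulative = [], 0
    for i, freq in enumerate(counts):
        lower = start + i * width
        cumulative += freq
        rows.append((lower, lower + width - 1, freq, freq / n, cumulative))
    return rows

# Battery lifetimes in minutes, copied from Part B, question 2.
minutes = [423, 369, 387, 411, 393, 394, 371, 377, 389, 409,
           392, 408, 431, 401, 363, 391, 405, 382, 400, 381,
           399, 415, 428, 422, 396, 372, 410, 419, 386, 390]

# Assumed class layout: width 10, first class starting at 360.
table = frequency_table(minutes, start=360, width=10)

print("Class      f   Rel.f   Cum.f  Histogram")
for lower, upper, freq, rel, cum in table:
    print(f"{lower}-{upper}  {freq:>3}  {rel:>6.3f}  {cum:>6}  {'*' * freq}")
```

The same function handles the ungrouped car survey in question 3 if it is called with `start=0` and `width=1`, since each class then holds a single value.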
