FDS QB
FDS QB
Unit 1 - Introduction
Part A
1. What is Data Science?
2. What is Big Data?
3. Compare Data Science and Big Data.
4. List out and define the characteristics of big data.
5. List out any 4 benefits of big data.
6. List out the facets of data.
7. Write a brief answer about structured data.
8. Compare Structured and Unstructured data.
9. Briefly explain Graph based data.
10. Compare Natural language data and Machine generated data.
11. List out the seven steps in Data Science process.
12. Briefly discuss about the Data Exploratory process.
13. What is a model and how do you build it?
14. Differentiate Data Mining and Data Warehousing.
15. Problems to find Mean, Median, Mode, Standard Deviation, Variance, Range, IQR.
(i) Find the mean, median, mode and range for the given data:
44, 79, 94, 43, 53, 65, 87, 90, 70, 69, 65, 89, 85, 53, 47, 61, 27, 80
(ii) The weights, in grams, of seven sweet potatoes are 260, 225, 205, 240, 232, 205, and 214.
What is the median weight?
(iii) Find the variance for the following data: 11, 13, 14, 15, 19, 22, 24, and 26
(iv) The heights in inches of the students in your class are as follows: 58, 58, 59, 60, 62, 64,
64, 65, 66, 66, 66, 66, 68, 68, 69, 70, 71, 72, 72, 74, 75, and 77. Calculate the Variance and
Standard Deviation for the given data.
Part B
1. Briefly explain the facets of data.
2. Explain in detail about the steps involved in Data Science process.
3. What is Data mining? Explain Data mining architecture.
4. Discuss about the Central tendency measures, with their formulae and examples.
5. Define Dispersion of data. What are the parameters included to find the dispersion of data?
List out their respective formulae and explain with examples.
6. Define IQR. Find IQR for the following sets of data.
(i) 4, 12, 1, 8, 9, 43, 21, 11, 15, 6, 31, 17, 10, 2, 5.
(ii) 42, 51, 62, 47, 38, 50, 54, 43, 59, 45.
7. Explain in detail about Data Warehousing with its architecture and working.
8. What is Exploratory Data Analysis? “Exploratory data analysis is important to find the
properties of data along with the traditional statistical measures”. Justify.
9.