MEENAKSHI RAMASWAMY ENGINEERING COLLEGE, b What is relative frequency distribution? Evaluate 1 C303.
2 K5
THATHANUR – 621804. GRE scores for a group of graduate school are 3
INTERNAL ASSESMENT TEST-I follows:
Department of Computer Science and Engineering(AIML) GRE Score Frequency
725-749 1
CS3352 – Foundations of Data Science
700-724 3
675-699 14
Year & Semester: II& III Date:04.09.2024 650-774 30
Duration: 03.00 Hours Total Marks: 100 625-649 34
600-624 42
Answer ALL Questions Part A: 09X 2 = 18 Marks 575-599 30
01 Define data science and big data. 2 C303.1 K1 550-574 27
C303.1 K2 525-549 13
02 Give an overview of common error? 2 500-524 4
C303.2 K2 475-499 2
03 Explain the types of data. 2 Total 200
04 Differentiate discrete and continuous variables. 2 C303.2 K2
05 Define regression towards the mean. 2 C303.3 K1 13 a i)Calculate the correlation coefficient for the heights 6 C303.3 K5
06 Define correlation coefficient. 2 C303.3 K1 in inches of father’s(x) and their son’s(y) with the
07 2 C303.4 K1 data presented below.
List the attributes of a Numpy array. Give an example.
X 66 68 67 70 71 72 72
08 Summarize some built-in Pandas aggregations? 2 C303.4 K2 Y 68 70 69 72 72 72 74 7
09 What is the purpose of errorbar function in Matlplotlib? 2 C303.5 K1 ii)Explain how the least squares equation.
10 2 C303.5 K1 (Or)
How [Link] function differs from [Link] function?
b i)Explain scatter plot. 6 C303.3 K5
ii)Describe range and variance. 7
Part B: 4 X 8 = 32 Marks 14 a Elaborate about aggregation and Grouping functions. 1 C303.4 K6
11 a i)Explain the diff facets of data with example. 7 C303.1 K5 (Or) 3
ii) Explore the various data science process. 6 b Explain the following in python---
(Or) i)Data indexing 6
C303.4 K5
b i) What is a data warehouse? outline the architecture 7 C303.1 K5 ii)operation on missing data 7
of a data warehouse with a diagram?
ii) Explain in detail about the cleansing, integrating, 6 15 a Explain about various visualization charts like line
transforming data and build a model. plots and histograms, text and annotation using 1
C303.5 K5
matplotlib with an example. (Or) 3
12 a i)Express each of the following scores as a Z score: 7 C303.2 K6
First, Mary’s intelligence quotient is 135, given a b Outline any two three-dimensional plotting in 1
mean of 100 standard deviation 15. Second, Mary matplotlib with an example. C303.5 K5
3
obtained a score of 470 in the Competitive
Part C: 1 X 15 = 15 Marks
Examination conducted in April 2022, given a mean
16 a Describe in detail about pivot table. (Or) 1 C303.4 K5
of 500 and a standard deviation of 100. 6
5
ii)Discuss the types of variable. b Discuss the detail about the mean, median, mode,
1
(Or) variance, standard deviation and skewness. C303.1 K6
5
NBA CO-ORDINATOR HOD VP PRINCIPAL