Data Science Foundations Assessment Test

The document is an internal assessment test for a computer science course on data science, containing various questions related to data concepts, statistical measures, and Python programming. It includes both theoretical questions and practical tasks, such as calculating correlation coefficients and explaining data visualization techniques. The test is structured into three parts with a total of 100 marks.

Uploaded by chandru D

MEENAKSHI RAMASWAMY ENGINEERING COLLEGE,
THATHANUR – 621804.
INTERNAL ASSESSMENT TEST-I
Department of Computer Science and Engineering (AIML)
CS3352 – Foundations of Data Science

Year & Semester: II & III    Date: 04.09.2024
Duration: 03.00 Hours    Total Marks: 100

Answer ALL Questions

Part A: 09 X 2 = 18 Marks

01 Define data science and big data.    2    C303.1    K1
02 Give an overview of common errors.    2    C303.1    K2
03 Explain the types of data.    2    C303.2    K2
04 Differentiate discrete and continuous variables.    2    C303.2    K2
05 Define regression towards the mean.    2    C303.3    K1
06 Define correlation coefficient.    2    C303.3    K1
07 List the attributes of a NumPy array. Give an example.    2    C303.4    K1
08 Summarize some built-in Pandas aggregations.    2    C303.4    K2
09 What is the purpose of the errorbar function in Matplotlib?    2    C303.5    K1
10 How does the [Link] function differ from the [Link] function?    2    C303.5    K1

Part B: 4 X 8 = 32 Marks

11 a i) Explain the different facets of data with an example.    7    C303.1    K5
     ii) Explore the various data science processes.    6
   (Or)
   b i) What is a data warehouse? Outline the architecture of a data warehouse with a diagram.    7    C303.1    K5
     ii) Explain in detail cleansing, integrating, and transforming data, and building a model.    6

12 a i) Express each of the following scores as a Z score: First, Mary's intelligence quotient is 135, given a mean of 100 and a standard deviation of 15. Second, Mary obtained a score of 470 in the Competitive Examination conducted in April 2022, given a mean of 500 and a standard deviation of 100.    7    C303.2    K6
     ii) Discuss the types of variable.    6
   (Or)
   b What is a relative frequency distribution? Evaluate it for the GRE scores of a group of graduate school applicants, given as follows:    13    C303.2    K5

     GRE Score    Frequency
     725-749        1
     700-724        3
     675-699       14
     650-674       30
     625-649       34
     600-624       42
     575-599       30
     550-574       27
     525-549       13
     500-524        4
     475-499        2
     Total        200

13 a i) Calculate the correlation coefficient for the heights in inches of fathers (x) and their sons (y) with the data presented below.    6    C303.3    K5

     X    66  68  67  70  71  72  72
     Y    68  70  69  72  72  72  74

     ii) Explain the least squares equation.    7
   (Or)
   b i) Explain scatter plot.    6    C303.3    K5
     ii) Describe range and variance.    7

14 a Elaborate on aggregation and grouping functions.    13    C303.4    K6
   (Or)
   b Explain the following in Python:    C303.4    K5
     i) Data indexing    6
     ii) Operations on missing data    7

15 a Explain various visualization charts such as line plots and histograms, and text and annotation, using Matplotlib with an example.    13    C303.5    K5
   (Or)
   b Outline any two types of three-dimensional plotting in Matplotlib with an example.    13    C303.5    K5

Part C: 1 X 15 = 15 Marks

16 a Describe pivot tables in detail.    15    C303.4    K5
   (Or)
   b Discuss in detail the mean, median, mode, variance, standard deviation and skewness.    15    C303.1    K6

NBA CO-ORDINATOR HOD VP PRINCIPAL
