Study Question FDS Model

Uploaded by

mancymithin

CS3352-FOUNDATIONS OF DATA SCIENCE

(REGULATIONS 2021)
Answer ALL Questions
PART A-(10x2=20 Marks)
1) How are missing values in a dataset treated during data analysis?
2) Identify and write down the various data analytics challenges faced in the conventional data system.
3) Will treating categorical variables as continuous variables result in a better predictive model?
Justify your answer.
4) Classify the below list of data into their types.
5) What kind of relationship is it? Explain briefly.
6) Define Multiple Regression.
7) Create a DataFrame with key and data pairs.
8) Summarize some built-in Pandas aggregations.
9) What is the purpose of the errorbar function in Matplotlib? Give an example.
10) How does the plt.scatter function differ from the plt.plot function?
PART B-(5x13=65 Marks)

11.a) Explain in detail cleansing, integrating, and transforming data, and building a model.
Or
b.i) Explain the different facets of data and the challenges in processing them.

12.a.i) What is a frequency distribution? Customers who have purchased a particular product
rated the usability of the product on a 10-point scale, ranging from 1 (poor) to 10 (excellent),
as follows:
Construct a frequency distribution for the above data. (5)
ii) Assume the SAT math scores approximate a normal curve with a mean of 500 and a standard
deviation of 100. Sketch a normal curve and shade in the target area(s) described by each of the
following statements:
a) (2)
b) (2)
c) (2)
d) Convert to Z scores and find the target areas specific to the above values. (2)
Or
b) What are the measures most appropriate for describing variability? Explain.
13.a.i) Explain Scatter plot. (6)
ii) Explain, with an example, how the least squares equation is used to minimize the total of all
squared prediction errors. (7)
Or
b.i) What is the philosophy of logistic regression? What kind of model is it? What does logistic
regression predict? Tabulate the cardinal differences between linear and logistic regression.
(7)
ii) Explain the correlation coefficient. (6)
14.a.i) Explain the following in Python: a) Data indexing (6) b) Operations on missing data. (7)
ii) What is broadcasting? Explain the rules of broadcasting with an example. (6)
Or
b) What is an aggregate function? Elaborate on aggregate functions in NumPy.
15.a) Explain various visualization charts such as line plots, scatter plots, and histograms using
Matplotlib with an example.
Or
b) How are text and image annotations done using Python? Give an example of your own with
appropriate Python code.
PART C-(1x15=15 Marks)
16.a) Consider the following dataset with one response variable y and two predictor variables x1
and x2.
Fit a multiple linear regression model to this dataset.
Or
b) Describe pivot tables in detail.
____________________________