0% found this document useful (0 votes)
59 views2 pages

CMPN8 Applied Data scienceDLOC V

This document contains information about a paper for an applied data science course, including the subject code and course name. It discusses using data to help make decisions.

Uploaded by

SANIKA KOLTE
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
59 views2 pages

CMPN8 Applied Data scienceDLOC V

This document contains information about a paper for an applied data science course, including the subject code and course name. It discusses using data to help make decisions.

Uploaded by

SANIKA KOLTE
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

5B

28

1
F2

BD
2B
Paper / Subject Code: 52774 / Applied Data Science (DLOC - V)

3A

03
B3

0B
28

D1
F2
91

5
2B
3A

03
B3
95

BB
28

D1
B0

F2
91

0
2B
3A
B3
5

BB
3C

28
B0

F2
91
2E

0
Total Marks 80

B
3A
B3
5
C
ED

82
09
3
(3 Hours)

F2
91
2E

A2
B
B6

2B
B3
5
NB

C
ED

23
35

28
B0

91
2E
1) Question number 1 is compulsory

3F
6
10

3A
B

95
3C
D

1B
BD
2) Attempt any three out of the remaining five questions.

35

B0

F2
E
6
0

9
3) Assume suitable data if necessary and justify the assumptions.

2
0B

5B

3
1

95
3C
ED

1B
D
2B
4) Figures to the right indicate full marks

B0
3
B

2E

F
6
10

59
B

B
28

B3
C
D
0

09
5

E3
2B

6E
3A

03

91
BB
Q1 20

CB
Attempt any four

D2
5B
28

D1

95
F2

B0
a)

E3
6E
A
Explain in brief the objectives of Data Exploration

B0
03
B3

B
2
3

D2
B

5B
8

D1
F2

3C
91

0
b) Explain in brief the taxonomy of time series forecasting

A2

6E
03
B3
95

BB

2E

CB
82
23

5B
1
c) What are the outliers in the dataset? State the reasons for the outliers
B0

ED
0
2

BD
3F
59

E3
2B
3A

03
3C

B6
B
9

occurring in the dataset

D2
B
8

1
B0

2
91
2E

0
2

BD
3F

35
2B

6E
3A
5
C

d) Explain validation techniques bootstrap and cross-validation


ED

10
1B
9

B
3

5B
8
B0

F2
2E

0
2

BD
B6

59

e) State the importance of Data Visualization. State the purpose of scatter plots,

2B
3A

03
B3
C
ED

09
35

0B
3

D1
F2
91
2E

quartile plots, bubble charts, density chart

2
CB
B6
10

2B
3A
B3
95

BB
D
BD

35

28
E

F2
Q2 a) Given data of 10 companies. Find out the type of correlation between 10 91
2E

B0
CB
6
10
0B

3A
B

3
95
D

advertisement expenses and sales volume using Karl Pearson’s coefficient

82
B
BD

3
2B

F2
03

91
E

A2
B
6

of correlation method
0B

5B
28

B3
1

95
3C
ED
BD

23
2B
3A

0
3

91
E

CB

3F
B6
10

D2
0B
28

95
F2

1B
D

E3

1 2 3 4 5 6 7 8 9 10
2B

Company
E
3A

B0
03
B3

BB

B6

59
D2
28

D1
F2

C
91

09
35

3
2B

6E
3A
B3

Advt 11 13 14 16 16 15 15 14 13 13
95

CB
10

D2
B

5B
28
B0

F2
91

expenses
D

E3
B

E
3A

03
B3
95

B
3C

B6
82

D2
0B

D1
B0

F2
91
2E

35

Sales 50 50 55 60 65 65 65 60 60 50
2B

6E
3A
3
5

BB
3C
ED

10
1B
09

volume
5B
8
2
2E

B0
2

BD
CB

3F
9

3A

03
95
D

82
1B

B
E3
6E

D1
B0

F2

B0
2
9
D2

3A

b) Explain the data science process in detail 10


5B

B3
95

BB
3C

82
6E

B0

F2
03

91
2E

B0
A2
5B

B3
D1

95
3C
D

82

Q3 a) Explain the density-based outlier detection approach 10


23
6E

0
03

91
2E

A2
CB

b) Explain SMOTE in detail 10


5B

B3
D1

95
D

23
E3
6E

0
03

91
BB

3F
D2
5B
D1

95
3C
B0

Q4 a) Explain the working of the Auto Regressive Integrated Moving Average 10


1B
6E

B0
03
B

E
82

59

Model
D2
0B

5B
D1

3C
A2

09
2B

6E

The data given shows salary packages (in lakhs) offered after a campus 10
03

b)
B

CB
D2
0B

5B
28

interview. Find the coefficient of skewness using Bowley’s method.


D1

E3
2B

6E
3A

03
BB

D2
5B
28

D1
2

B0

Salary 4-8 8-12 12-16 16-20 20-24


3F

6E
3A

03
BB
82
1B

No of Candidates 4 10 15 8 3
5B
D1
2

B0
A2
3F
59

03
BB
82
B

23

D1
91

B0
A2
F
B3
95

BB
82
23
B0

91

B0
A2
F

28759 Page 1 of 2
B3
5
3C

82
09

23
91
2E

A2
CB

F
B3
5
09

23
E3

91
CB

3F

ED2E3CB09591B3F23A282B0BBD1035B6
D2

95
2E 59 A2 D1 ED
3C 1B 82 03 2E
3F B0 5B 3C
D2
B0
95 23A BB 6E B0
E3 91
B 28 D1 D 2E 95
CB 3F 2B 03 91
09 2 0B 5B 3C B3
5 91
3A B D1 6E B0
95
F2
3A
28 D2
B3 2B 03 91 28

Q6
Q5
CB F 5B E3 B3
09 23 0B CB 2B
5 B 6E F2 0B

28759
A2 D 09

a)
91 a)

b)
b)
82 D1
2E 5 3A BD
B3 03 91 28
F 23
B0
BB 5B 3C B3 2B 10
35
95 A2 6E B0 F2 0B
91 D1 D 95 3A BD B6
B3 82 03 2E 91B 28 ED
F B0 5B 3C 3 2B 10 2E

product.
23 BB 6E B0 F2 0B 35 3C
A2 D1 D2 95 3A B0 B6
82 03 9 ED BD 95
Food B
3F B0 5B 1B E3
2B 1 03
28 2E 91 Food A
23
A2 BB
06E 3FCB
2 0 5 B 3 C B3
D1 95 3A BB 6
children as follows

D2
D E B0 F2 52
49

82 03 E 91 2 D 9 3A
B0 5B 3C 82 10 2E 59
BB B
B3 B 0 3 3 1 28
6E 0 F2 B 5B C B3 2B
decomposition technique

D1 95 3A B 6
55
53

D2
E 2 E D
B0
9
F2
3 0B

prediction recommendation
03 91 8 D1 5 A BD
5B 3C B3 2B 03 2E 91 28
6E B0 F2 0B 5 B6 3 CB B 3F 2B 10
35
95 3A B 0
52
51

D2
E3 91 2 D ED 09 23 B B B6
B3 82 10 2 E 59 A2 D ED
CB F B0 35 3 CB
1B 8 2 1 0 2E
09 23 BB B6 3F B0 35 3C
E 0 2 B
53
52

59 A2 6
B. (Given t-value at alpha=0.05 is 2.365)

1B D1 D2 95 3A BB B0
3F
82 03 E 91 2 D ED 95

Page 2 of 2
B0 5B 3C B3 82 10 2E 91
23 B B B 0 3 3
BD 6E 0 F2 B 5B C B3
50
47

A2 6
82 10 D2 95 3A B E B0
9
F2
91 28 D1 D 5
______________________
B0 35 E3 B3 2E 91
2B 03
BB B6
E
CB
0 F2 0 5B 3C B3

ED2E3CB09591B3F23A282B0BBD1035B6
B
54
50

D1 D2 95 3A BB 6 09 F2
03 E 3
91
B 28 D1 ED
2 5 3A
5B C 3 2B 03 E 91 28
0
54
52

6E B0 F2 B 5B 3C B3
D2 95 3A BD 6E B0 F2
E3 91 28 1 D 2E 9 5 3A
B3 2B 03 91
53
53

CB 28
09 F2 0 B 5B 3C B3 2B
0
Paper / Subject Code: 52774 / Applied Data Science (DLOC - V)

59 3A BD 6E B0 F2
1B 28 1 D2 95 3A
3F 2B 03 E3 91 28
5B C
Examine the significance of the increase in weight of children due to food
the following results of the increase in weight (lbs) we observed in 8

23 0B B3 2B
A2 BD 6E B0 F2 0B
D2 95 3A BD
82 10 9
Explain how predictive modelling can be applied to the House price 10
Explain how the time-series approach is used to forecast the demand for a 10
In certain food experiment to compare two types of baby foods A and B, 10
What are the attributes of time series decomposition? Explain the classical 10

B0 35 E3
CB
1B 28
2B 1
BB B6 3F 0
D1 ED 09 23 BB
03 2E 59 A2 D1
5B 3C 1B 82
6E B0 3 F2 B 0
03
5B
D2 95 3A BB
E3 91 28 D1
CB B3 2B 03
F 5

You might also like