CSC413 - 356 - 153 CSC413 Dec 15
CSC413 - 356 - 153 CSC413 Dec 15
USN 2 B V
(06 marks)
2 a. I. why is tree pruning useful in decision tree induction?
II. What is the drawback of using separate tuples to evaluate pruning?
(06 marks)
Page 1
b. Find the frequent itemsets for the data given below using FP growth, minimum
support is 3.
Tid Itemset
100 {f,a,c,d,g,i ,m,p}
200 {a,b,c,f,l,m,o}
300 {b,f,h,j,o,w}
400 {b,c,k,s,p}
500 {a,f,c,e,l,p,m,n} (08 marks)
c. Explain Naves Bayesian classification with an example.
(06 marks)
3 a. Explain closed frequent item set. How is it different from frequent item set? (06 marks)
b. The following table consists of training data from the All electronics cutomer
data base.
Tid Age Income Student Credit Class=
Rating Buys_Computer
1. Youth High No Fair No
2. Youth High No Excellent No
3. Middle High No Fair Yes
4. Senior Medium No Fair Yes
5. Senior Low Yes Fair Yes
6. Senior Low Yes Excellent No
7. Middle Low Yes Excellent Yes
8. Youth Medium No Fair No
9. Youth Low Yes Fair Yes (08 marks)
10. Senior Medium Yes Fair Yes
11. Youth Medium Yes Excellent Yes
12. Middle Medium No Excellent Yes
13. Middle High Yes Fair Yes
14. Senior Medium No Excellent No
check whether the person X buys computer using Baysean classification.
x= {age=youth, income=medium, student=yes, credit-rating=fair}
c Following table shows mid-term and final grade marks obtained for students. Use
method of least squares and find the equation for prediction of students grade
based on students mid-term. Predict the grade of student who secured 86 in mid-
term exam.
Mid-Term Final
Grade (06 marks)
72 84
50 63
81 77
74 78
94 90
85 75
Page 2
UNIT-II
4 a. How K-Medoid algorithm overcomes disadvantage of K-means clustering.
Illustrate with an example.
(06 marks)
b. Design dashboard for monitoring university student’s performance. (08 marks)
c Can the terms BI and Data warehouse used interchangingly? Justify your answer. (06 marks)
5 a. Differentiate between ERP and BI. (06 marks)
b. What is metadata? Explain with suitable example. (08 marks)
c. Differentiate between power user and casual user. (06 marks)
6 a. Write KPI for measuring performance of educational institution. (06 marks)
b. Relational table below has patients described by binary variables. Gender is (06 marks)
asymmetric variable. Find the distance between 3 pairs of patients Jack, Mary and
Jim.
Name Gender Fever Cough Test1 Test2 Test3 Test4
Jack, M Y N P N N N
Mary F Y N P N P N
Jim. M Y Y N N N N
c Perform k-means clustering on the following 8 points. (08 marks)
A1(2,10), A2(2,5), A3(8,4)
B1(5,8), B2(7,5), B3(6,4)
C1(1,2), C2(4,9)
Distance is Euclidian distance A1, B1 and C1 are initial clusters. show
• Three clusters centres after first round of execution
• Final three clusters
UNIT-III
Page 3
Department:. COMPUTER SCIENCE AND ENGINEERING Sem: 7 Sub-Name:
Datamining and Business Analytics Sub-Code: CSC413 Faculty Name: Shantala Giraddi
Q.No Blooms Learning Course Learning a-k Criteria PI codes Marks
Levels (LL) Objectives(CLO’s)
1: a L2 CO1 1 1.4.3 8
b L3 CO2 2 2.1.4 6
c L3 CO3 3 3.2.2 6
2: a L2 CO3 3 3.2.2 6
b L3 CO2 2 2.1.4 8
c L2 CO3 3 3.2.2 6
3: a L2 CO2 2 2.1..4 6
b L3 CO3 3 3.2.2 6
c L3 CO3 3 3.2.2 8
4: a L2 CO6 3 3.2.2 6
b L2 CO3 2 2.1.3 8
c L3 CO6 2 2.1.3 6
5: a L3 CO5 2 2.1.3 8
b L2 CO6 2 2.1.3 6
c L2 CO5 2 2.1.3 6
6: a L2 CO6 2 2.1.3 6
b L3 CO3 3 3.2.2 6
c L2 CO4 2 2.1.3 8
7: a L2 CO5 2 2.1.3 10
b L2 CO5 2 2.1.3 10
8: a L2 CO4 2 2.1.3 10
b L2 CO4 2 2.1.3 10
Page 4
K.L.E Society’s
B.V.Bhomaraddi College of Engineering & Technology, Hubli-580 031
Examination Section
Semester End Examination Question Paper Review
Set I / Set II / Set III/ External
(Strike off the not Applicable one)
Programme. BE Course: Dataming and Business Analytics
Course Code: CSC413 Duration: 3 Hrs Semester: VII
Self Review Expert
Criterion (Yes/No/NA/ Review
Number (Yes/No/NA/
Number
1] Whether the following details are mentioned correctly on YES
the
Header of the question paper (Exam month and year etc
up to instructions)?
2] Whether the question paper covers the entire syllabus YES
(unit wise) as announced in the scheme of SEE at the end
of prescribed syllabus for this course?
3] Whether the pattern of question paper is in accordance YES
with the model question paper?
4] Whether marks distribution is proper for all the questions YES
and sub questions?
5] Whether the question paper has all the required data and YES
figures? If figures exist, mention the number of figures in
the paper.
6] Mention the time required for an average student to 180
answer this paper (in minutes)
7] How many corrections you have made in the print copy of NIL
the question paper (typographical errors etc)?
8] Whether the scheme is ready along with the paper? YES
9] Whether the scheme contains marks splitting along with YES
points?
10] a) How many numerical problems are there in the
question paper?
b) How many worked out solutions exist in the scheme?
11] Is the Softcopy previewed for printing & verified for YES
corrections?
12] Would you like to do modifications to any of the NA *Yes/No
questions? (only for reviewer)
Page 5
Reviewer’s Signature
Reviewer’s
Name Shantala
Date of Review Giraddi
To,
The Controller of Examinations
B.V.B College of Engineering & Technology, Hubli.
Sir,
After scrutinizing I Recommend No/ The Following (Strike out not applicable)
corrections for this paper. The details are as follows:
Department:__________________________ Course-
_____________________
Paper Code:___________________________ Course
Code:________________
Correction Unit Question & Existing Question Suggested Change Reasons for Change
No No Sub Questions
Page 6
Date:______________ Signature of Scrutinizer
Scrutinizer:_____________
Name of the Scrutinizer:_
Page 7