Theory Assignment
*Bloom’s taxonomy levels: Knowledge (L1), Comprehension (L2), Application (L3), Analysis (L4),
Evaluation (L5), Creation (L6)
1. For a two-word query, the postings lists are [2, 5, 7, 13, 16, 18, 23, 32, 39, 56, 59, 63,
68, 76, 80] and [12, 56]. Show, step by step, how many comparisons are needed to find the
intersection of these two postings lists using skip pointers with a skip length of P.
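The comparison count in question 1 depends on how the skip pointers are laid out, so it can help to trace the merge programmatically. The following is a minimal sketch, not a model answer. It assumes a skip length of 4 (the question only names it P), skip pointers on every 4th posting starting from the first, and a counting convention in which each test between two current docIDs and each test of a skip target counts as one comparison. The names `has_skip` and `intersect_with_skips` are hypothetical.

```python
def has_skip(index, length, skip):
    """True if the posting at `index` carries a skip pointer (assumed layout)."""
    return skip > 1 and index % skip == 0 and index + skip < length


def intersect_with_skips(p1, p2, skip1, skip2):
    """Intersect two sorted postings lists and count docID comparisons."""
    answer, comparisons = [], 0
    i = j = 0
    while i < len(p1) and j < len(p2):
        comparisons += 1  # compare the two current docIDs
        if p1[i] == p2[j]:
            answer.append(p1[i])
            i += 1
            j += 1
        elif p1[i] < p2[j]:
            skipped = False
            while has_skip(i, len(p1), skip1):
                comparisons += 1  # compare the skip target with p2[j]
                if p1[i + skip1] > p2[j]:
                    break  # skipping would overshoot
                i += skip1
                skipped = True
            if not skipped:
                i += 1
        else:
            skipped = False
            while has_skip(j, len(p2), skip2):
                comparisons += 1  # compare the skip target with p1[i]
                if p2[j + skip2] > p1[i]:
                    break
                j += skip2
                skipped = True
            if not skipped:
                j += 1
    return answer, comparisons


long_list = [2, 5, 7, 13, 16, 18, 23, 32, 39, 56, 59, 63, 68, 76, 80]
short_list = [12, 56]
# Skip length 4 is an assumption; the two-element list is too short to hold skips.
result, count = intersect_with_skips(long_list, short_list, 4, 4)
print(result, count)
```

Changing the skip length, the placement of the pointers, or the counting convention changes the total, so the written answer should state which convention it uses.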
2. For the given document stored in the data warehouse, compress the words by applying
each of the following preprocessing techniques separately:
i. Normalization
ii. Stemming (use the Porter stemmer)
iii. Stop words removal
Information retrieval is the activity of obtaining
information resources relevant to an information need from
a collection of information resources. Searches can be
based on full text or other content based indexing.
Automated information retrieval systems are used to reduce
what has been called "information overload". Many
universities and public libraries use IR systems to provide
access to books, journals and other documents. Web search
engines are the most visible IR applications.
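The preprocessing steps in question 2 can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the expected answer: normalization is taken to mean lowercasing and keeping alphabetic tokens only, the stop list `STOP_WORDS` is a small hypothetical one chosen for this passage, and the stemming helper relies on NLTK's `PorterStemmer`, which has to be installed separately.

```python
import re

# Hypothetical, deliberately small stop list; real lists are much longer.
STOP_WORDS = {
    "a", "an", "and", "are", "be", "been", "can", "from", "has",
    "is", "of", "on", "or", "other", "the", "to", "what",
}


def normalize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def remove_stop_words(tokens):
    """Drop every token that appears in the stop list."""
    return [t for t in tokens if t not in STOP_WORDS]


def porter_stem(tokens):
    """Stem each token with NLTK's Porter stemmer (requires the nltk package)."""
    from nltk.stem import PorterStemmer
    stemmer = PorterStemmer()
    return [stemmer.stem(t) for t in tokens]


sentence = "Searches can be based on full text or other content based indexing."
tokens = normalize(sentence)
content_tokens = remove_stop_words(tokens)
print(tokens)
print(content_tokens)
```

Each function works on the output of `normalize`, so the three techniques can be applied separately, as the question asks, and the vocabulary size before and after each step can be compared.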
Rank  Q1  Q2
1     X   -
2     -   X
3     -   -
4     X   X
5     X   -
6     -   X
7     -   X
8     -   -
9     -   X
10    -   X
10. Based on the data below, estimate a multinomial Naive Bayes classifier and apply the
classifier to the test document. Calculate the probability that the classifier assigns the test
document to c = China or.
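As a reminder of what has to be estimated in question 10, the usual multinomial Naive Bayes quantities are written out here. The notation is an assumption: N_c is the number of training documents in class c, N the total number of training documents, T_{ct} the number of occurrences of term t in the training documents of class c, V the vocabulary, and t_1, ..., t_{n_d} the tokens of the test document d. Add-one (Laplace) smoothing is also assumed, since the question does not name a smoothing method.

```latex
\begin{align*}
\hat{P}(c) &= \frac{N_c}{N} \\
\hat{P}(t \mid c) &= \frac{T_{ct} + 1}{\sum_{t' \in V} T_{ct'} + |V|} \\
\hat{P}(c \mid d) &\propto \hat{P}(c) \prod_{k=1}^{n_d} \hat{P}(t_k \mid c)
\end{align*}
```

The class with the larger value of the last expression is the one the classifier assigns to d. Dividing each value by the sum over all classes turns the scores into probabilities.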