Data Mining IMP Objective Questions_Sep 2023

The document contains a series of objective questions and fill-in-the-blank statements related to data mining concepts, algorithms, and techniques. It covers topics such as the apriori algorithm, predictive models, data cleaning, and data transformation. Additionally, it includes assignment questions that require explanations of data mining functionalities and algorithms.

Uploaded by

Rks Teja

DATA MINING

Objective questions
1 Extracting knowledge from large amounts of data is called ____________________. [ ]
(a) Warehousing (b) Data mining
(c) Database (d) Cluster
2 The basic idea of the Apriori algorithm is to generate ________ item sets of a [ ]
particular size and then scan the database.
(a) candidate (b) primary
(c) secondary (d) superkey
3 ________ is the best-known association rule algorithm and is used in most [ ]
commercial products.
(a) Apriori algorithm (b) Partition algorithm
(c) Distributed algorithm (d) Pincer-search algorithm
4 The Apriori frequent itemset discovery algorithm moves _______ in the lattice. [ ]
(a) upward (b) downward
(c) breadthwise (d) both upward and downward
5 A capability of data mining is to build ___________ models. [ ]
(a) retrospective (b) interrogative
(c) predictive (d) imperative
6 Which of the following is a predictive model? [ ]
(a) Clustering. (b) Regression.
(c) Summarization. (d) Association rules.
7 ___________________ is a summarization of the general characteristics or features of a [ ]
target class of data.
(a) Data Classification (b) Data Characterization
(c) Data discrimination (d) Data selection
8 Any subset of a frequent set is a frequent set. This is ___________. [ ]
(a) Upward closure property (b) Downward closure property
(c) Maximal frequent set. (d) Border set.
9 All sets of items whose support is greater than the user-specified minimum support [ ]
are called ___________________.
(a) border set (b) maximal frequent set
(c) Frequent set. (d) Lattice.
10 Any superset of an infrequent set is an infrequent set. This is _______. [ ]
(a) Maximal frequent set (b) Border set.
(c) Upward closure property (d) Downward closure property.
11 The absolute number of transactions supporting X in T is called the _____. [ ]
(a) confidence. (b) support.
(c) support count. (d) None of the above.
12 Extreme values that occur infrequently are called _____________. [ ]
(a) dimensionality reduction (b) outliers
(c) rare values (d) All of the above
13 Overfitting occurs when a model _________. [ ]
(a) does fit in future states. (b) does not fit in future states.
(c) does fit in current state. (d) does not fit in current state.

Faculty: Mr. D. Krishna, Associate Professor, CSE Dept


14 The Apriori algorithm is otherwise called the ________________. [ ]
(a) width-wise algorithm (b) level-wise algorithm
(c) pincer-search algorithm (d) FP growth algorithm
15 Users of data mining systems can be classified into ____________ categories. [ ]
(a) 1 (b) 2
(c) 3 (d) 4
16 Removing duplicate records is a process called _____________. [ ]
(a) recovery (b) data cleaning
(c) data cleansing (d) data pruning
17 The strategic value of data mining is ______________. [ ]
(a) cost-sensitive (b) work-sensitive
(c) time-sensitive (d) technical-sensitive
18 Most partitioning methods cluster objects based on _________. [ ]
(a) number of clusters (b) distance between objects
(c) number of objects in each class (d) learning rate
19 Discovery of cross-sales opportunities is called ________________. [ ]
(a) segmentation. (b) visualization.
(c) correction. (d) association.
20 Which of the following is a descriptive model? [ ]
(a) Classification. (b) Regression.
(c) Sequence discovery. (d) Association rules.
21 A telecommunication company wants to segment its customers into distinct groups [ ]
in order to send appropriate subscription offers. This is an example of
(a) Supervised learning (b) Unsupervised learning
(c) Data extraction (d) None of the above
22 Which one of the following refers to querying unstructured textual data? [ ]
(a) Information access (b) Information update
(c) Information retrieval (d) Information manipulation
23 Identifying outliers and smoothing out noisy data comes under _________. [ ]
(a) Data transformation (b) Data reduction
(c) Data cleaning (d) Data integration
24 ______________ are extreme values that deviate from the other observations in the data. [ ]
(a) FD values (b) Na values
(c) Missing values (d) Outliers
25 ____________________ is a data preprocessing technique that combines data from [ ]
multiple heterogeneous data sources into a coherent data store.
(a) Data transformation (b) Data reduction
(c) Data cleaning (d) Data integration
26 The following is an example of the __________ type of data. [ ]
Example: Height and Sales
(a) Nominal data (b) Ordinal data
(c) Numerical data (d) Binary data
27 Which of the following issues is considered before investing in data mining? [ ]
(a) Functionality (b) Vendor consideration
(c) Compatibility (d) All of the above
28 Quantitative data is also called __________. [ ]
(a) Numerical data (b) Continuous data
(c) Binary data (d) None
29 The grouping of patients based on their medical records can be considered as [ ]
(a) Regression (b) Classification
(c) Both (d) None of the above
30 Self-organizing maps can also be considered an instance of the _________ type of [ ]
learning.
(a) Supervised learning (b) Unsupervised learning
(c) Missing data imputation (d) None of the above
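
Several of the questions above (2, 4, 8 to 11 and 14) describe how Apriori works: it generates candidate item sets level by level, prunes them using the downward closure property, and scans the transactions to obtain support counts. The following is a minimal sketch of that idea in Python, not a reference implementation. The function name `apriori`, the parameter `min_support_count`, and the sample transactions are assumptions made for illustration. The sketch keeps item sets whose support count is at least the threshold; if a strict "greater than" reading of question 9 is wanted, change `>=` to `>`.

```python
from itertools import combinations


def apriori(transactions, min_support_count):
    """Return {itemset: support_count} for every frequent itemset.

    Illustrative level-wise search; names and threshold semantics are assumptions.
    """
    transactions = [frozenset(t) for t in transactions]

    def support_count(itemset):
        # Absolute number of transactions that contain every item of the itemset.
        return sum(1 for t in transactions if itemset <= t)

    # Level 1: frequent single items.
    items = {item for t in transactions for item in t}
    current = {frozenset([item]) for item in items}
    current = {c for c in current if support_count(c) >= min_support_count}

    frequent = {}
    k = 1
    while current:
        for itemset in current:
            frequent[itemset] = support_count(itemset)
        k += 1
        # Join step: merge pairs of frequent (k-1)-itemsets into k-item candidates.
        candidates = {a | b for a in current for b in current if len(a | b) == k}
        # Prune step (downward closure): every (k-1)-subset must itself be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in current for s in combinations(c, k - 1))
        }
        # Scan step: keep only candidates that reach the minimum support count.
        current = {c for c in candidates if support_count(c) >= min_support_count}
    return frequent


# Hypothetical market-basket data, used only to exercise the sketch.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]
frequent_sets = apriori(baskets, min_support_count=2)
for itemset, count in sorted(frequent_sets.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), count)
```

In this sample, {milk, butter} appears in only one basket, so the three-item candidate {bread, milk, butter} is pruned without being counted. That is the saving the downward closure property provides.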

Fill in the Blanks


1. AOI stands for _______________________. (Attribute-Oriented Induction)
2. Removing duplicate records is a process called ___________________. (Data cleaning)
3. A ________________ allows data to be modeled and viewed in multiple dimensions.
(Data cube)
4. A ________________ contains a subset of corporate-wide data that is of value to a
specific group of users. (Data mart)
5. The left-hand side of an association rule is called the ______________. (Antecedent)
6. _______________ is an essential process where intelligent methods are
applied to extract data patterns. (Data mining)
7. _________________ is the estimate of the strength of the implication of the rule.
(Confidence)
8. _____________ is a rule-based machine learning technique used to find
frequent patterns in a data set. (Association rule mining)
9. A __________ is a subject-oriented, integrated, time-variant, nonvolatile collection of
data in support of management decisions. (Data warehouse)
10. ______________ is the task of discovering interesting patterns from large amounts of
data. (Data mining)
11. Converting data from different sources into a common format for processing is called
______________________. (Transformation)
12. The ________________ algorithm uses frequent itemsets to generate
association rules. (Apriori)
13. The Equivalence Class Transformation (ECLAT) algorithm uses a ___________
technique to find frequent item sets in a transaction database. (Depth-first search)
14. FP-growth stands for _________. (Frequent pattern growth)
15. _______________ involves scaling all values for a given attribute to
make them fall within a small specified range. (Normalization)
16. Reducing the number of attributes to solve the high dimensionality problem is called
___________________. (Dimensionality reduction)

17. The value that indicates how often transactions in D that support X also support Y is called
__________________. (Confidence)
18. The right-hand side of an association rule is called the ______________. (Consequent)
19. The _______________ clustering technique needs the merging approach. (Hierarchical)
20. The full form of KDD is __________________. (Knowledge Discovery in Databases)
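
Two of the definitions above lend themselves to a short worked example: confidence of a rule X -> Y (items 7 and 17) and normalization (item 15). The sketch below is illustrative only. It assumes the usual textbook formula confidence(X -> Y) = support_count(X ∪ Y) / support_count(X), and it uses min-max scaling as one common form of normalization. The function names and sample data are hypothetical.

```python
def support_count(itemset, transactions):
    # Absolute number of transactions that contain every item of the itemset.
    return sum(1 for t in transactions if set(itemset) <= set(t))


def confidence(antecedent, consequent, transactions):
    """Confidence of the rule antecedent -> consequent."""
    x = set(antecedent)
    xy = x | set(consequent)
    count_x = support_count(x, transactions)
    if count_x == 0:
        raise ValueError("antecedent never occurs; confidence is undefined")
    return support_count(xy, transactions) / count_x


def min_max_normalize(values, new_min=0.0, new_max=1.0):
    """Scale values linearly so they fall within [new_min, new_max]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values are identical; map them to the lower bound (an assumption).
        return [new_min for _ in values]
    return [new_min + (v - lo) / (hi - lo) * (new_max - new_min) for v in values]


# Hypothetical data, used only to exercise the sketch.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]
print(confidence({"butter"}, {"bread"}, baskets))  # 2 of 2 butter baskets contain bread -> 1.0
print(min_max_normalize([10, 20, 30]))             # [0.0, 0.5, 1.0]
```

Here {butter} is the antecedent and {bread} the consequent. Swapping them gives a different confidence (2 of 3 bread baskets contain butter), which shows that the measure is not symmetric.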

Assignment Questions of Data Mining

1. List and describe the five primitives for specifying a data mining task.
2. Explain data mining functionalities.
3. What is data mining? Explain data mining as a step in the process of
knowledge discovery.
4. What are the applications of association rule mining?
5. Discuss constraint-based association mining.
6. Compare the FP-growth algorithm and the Apriori algorithm.
7. Explain the Apriori algorithm with an example. Give the limitations of the
Apriori algorithm.
8. Explain the following terms with examples:
i) Frequent pattern ii) Closed frequent pattern
iii) Maximal frequent pattern
9. Explain classification and prediction with an example.
10. Explain decision tree induction.
