0% found this document useful (0 votes)
4 views

DM imp bits

The document consists of a series of questions related to data mining concepts, techniques, and algorithms. It covers topics such as types of learning, data cleaning, classification, and the application of data mining in various fields. Additionally, it includes specific algorithms like Apriori and FP-growth, as well as principles like the support and confidence of association rules.

Uploaded by

kaviloki0405
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
4 views

DM imp bits

The document consists of a series of questions related to data mining concepts, techniques, and algorithms. It covers topics such as types of learning, data cleaning, classification, and the application of data mining in various fields. Additionally, it includes specific algorithms like Apriori and FP-growth, as well as principles like the support and confidence of association rules.

Uploaded by

kaviloki0405
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 4

1.

Which of the following refers to the problem of finding abstracted patterns in


the unlabelled data? [ ]
A) Supervised learning B) Unsupervised learning
C) Reinforcement learning D) Hybrid learning
2. Which of the following is an essential process in which the intelligent
methods are applied to extract data patterns? [ ]
A) Warehousing B) Data Mining C) Text Mining D) Data Selection
3. The self-organizing maps can also be considered as the instance of
_________ type of learning. [ ]
A) Supervised learning B) Unsupervised learning C) data imputation D) None
4. Which of the following can be considered as the classification or mapping of
a set or class with some predefined group or classes? [ ]
A) Dataset B) Data Characterization C) Data Sub Structure D) Data
Discrimination
5. Which one of the following statement is correct about the data cleaning?
A) Process of data B) Correct data C) Inconsistent data D) None [ ]
6. Classification of the data mining system involves____________ [ ]
A) Database technology B) Information Science
C) Machine Learning D) All of the above
7. The issues like efficiency, scalability of data mining algorithms comes
under_______
A) Performance issues B) Diverse data types issues [ ]
C) Mining Methodology D) None of the above
8. Which one of the following can be considered as the correct application of
the data mining? [ ]
A) Fraud detection B) Corporate Analysis
C) Management & Market Analysis D) All of the above
9. Which one of the following correctly refers to the class study in data
cauterization?
A) Final Class B) Study Class C) Target Class D) Both A & C [ ]
___________ technique can be used to reduce the size of the candidate K – itemsets in
11.
Apriori algorithm.
a) Pruning b) Cleaning c) Hash based d) Partitioning [ ]
12. __________ algorithm mines frequent itemsets without candidate generation.
a) Apriori b) FP growth c) FP tree d) maxpattern [ ]
In a decision tree ___________ node has exactly one incoming edge and two or more
13
outgoing edges.
a) root b) leaf c) internal d) terminal [ ]
___________ is constructed by integrating the data from multiple heterogeneous
14.
sources.
b) Data
a) Data Mining c) Itemsets d) Association Rules [ ]
Warehouse
_________ is the task of identifying observations whose characteristics are significantly
15.
different from the rest of the data.
a) Predictive modelling c) Association analysis [ ]
b) Cluster analysis d) Anomaly detection
16. ________ method sorts data and partition into equal- frequency buckets or bins.
a) Binning b) Regression c) Clustering d) None [ ]
17. Dimensionality reduction usually refers to reduction in_____________.
a) Number of attributes c) Missing Values
b) Number of objects d) Outliers [ ]

_________ is a task of learning a target function that maps each attribute set to one of
18.
the predefined class label.
a) Reduction c) Classification
[ ]
b) Transformation d) None
10. Which of the following refers to the sequence of pattern that occurs
frequently?
A) Frequent sub-sequence B) Frequent Sub-structure C) Cluster D) None

19.Which of the following fields typically make use of data mining techniques?
9.
a) Business b) Engineering c) Medicine d) All [ ]
120 ________ frequent itemset is defined as a frequent itemset for which none of its
immediate supersets are frequent.
a) Maximal b) Open c) Closed d) Ordinal [ ]
21The term “DMQL” stands for
____________________________________________.
22. Self-organizing map is an example of
______________________________Learning.
23. State whether True or False: Data warehouse is generally updated in real-
time_____.
24. OLTP stand for___________________________
____________________________.
25. Small logical units where data warehouses hold large amounts of data is
known
as_________________________________.
26. The classification or mapping of a class using a predefined class or group is
called__________________________.
27. ______________ means the description and trends or model regularities for
those
objects whose behaviour would change eventually over time.
28. ______________ is the initial steps concerned in the process of knowledge
discovery.
29. Multiple number of data sources is combined in which step of the
knowledge
discovery________________.
30. “Handling the rational and complex types of data” comes under the ___
31.An association rule is X→Y, find support and confidence_____________________.

32.The class labels of training data is unknown in ____________ learning.

33_________ is task of discovering interesting patterns from large amounts of data.

34.In Baye’s Theorem P(H|X) = __________________________.


35. The objective of ___________ tasks is to predict the value of a particular attribute based
on the values of other attributes.
36___________ analysis is used to discover patterns that describe strongly associated
features in the data.
37___________ method is used to organize similar values into groups or clusters.

38________ routines attempt to fill in missing values, smooth out noise, identify outliers
and correct inconsistencies in the data.
39___________ principle states that if an item set is frequent, then all of its subsets must
also be frequent.
40__________ algorithm generates frequent itemsets from an FP-tree by exploring the tree
in a bottom-up fashion.

You might also like