Data Mining

The document contains a series of multiple-choice questions related to data mining, machine learning, and knowledge discovery. It covers topics such as types of learning, clustering techniques, data cleaning, and the processes involved in data mining. Each question tests knowledge on specific concepts and definitions within the field.

Uploaded by mukil.msc
Copyright © All Rights Reserved

1) Which of the following refers to the problem of finding abstracted patterns (or structures) in
unlabeled data?

a. Supervised learning
b. Unsupervised learning
c. Hybrid learning
d. Reinforcement learning

2) Which one of the following refers to querying unstructured textual data?

a. Information access
b. Information update
c. Information retrieval
d. Information manipulation

3) Which of the following can be considered as the correct process of Data Mining?

a. Infrastructure, Exploration, Analysis, Interpretation, Exploitation
b. Exploration, Infrastructure, Analysis, Interpretation, Exploitation
c. Exploration, Infrastructure, Interpretation, Analysis, Exploitation
d. Exploration, Infrastructure, Analysis, Exploitation, Interpretation

4) Which of the following is an essential process in which intelligent methods are applied to
extract data patterns?

a. Warehousing
b. Data Mining
c. Text Mining
d. Data Selection

5) What does KDD stand for in data mining?

a. Knowledge Discovery Database
b. Knowledge Discovery Data
c. Knowledge Data definition
d. Knowledge data house

6) Adaptive system management refers to:

a. The science of making machines perform tasks that would require intelligence when
performed by humans.
b. A computational procedure that takes some values as input and produces some values as
the output.
c. The use of machine learning techniques, in which programs learn from their past experience
and adapt themselves to new conditions or situations.
d. All of the above.

7) For what purpose do analysis tools pre-compute summaries of huge amounts of data?

a. In order to maintain consistency
b. For authentication
c. For data access
d. To obtain query responses

8) What are the functions of Data Mining?

a. Association and correlation analysis, classification
b. Prediction and characterization
c. Cluster analysis and Evolution analysis
d. All of the above

9) In the following given diagram, which type of clustering is used?

a. Hierarchical
b. Naive Bayes
c. Partitional
d. None of the above

10) Which of the following statements is incorrect about hierarchical clustering?

a. Hierarchical clustering is also known as HCA (hierarchical cluster analysis)
b. The choice of an appropriate metric can influence the shape of the clusters
c. In general, both splits and merges are determined in a greedy manner
d. All of the above

11) Which one of the following can be considered the final output of hierarchical
clustering?

a. A tree that displays how close things are to each other
b. Assignment of each point to clusters
c. Final estimates of cluster centroids
d. None of the above

12) Which one of the following statements about K-means clustering is incorrect?

a. The goal of k-means clustering is to partition n observations into k clusters
b. K-means clustering can be defined as a method of vector quantization
c. The nearest neighbor is the same as the K-means
d. All of the above
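
The partitioning idea named in question 12 can be illustrated with a minimal sketch of k-means on one-dimensional data. It assumes plain Python and hand-picked starting centroids; the function name `kmeans_1d` and the sample values are hypothetical and chosen only for illustration.

```python
def kmeans_1d(points, centroids, iterations=10):
    """Partition points into len(centroids) clusters (Lloyd's algorithm, 1-D)."""
    clusters = []
    for _ in range(iterations):
        # Assignment step: each point joins the cluster of its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)), key=lambda i: abs(p - centroids[i]))
            clusters[nearest].append(p)
        # Update step: each centroid moves to the mean of its cluster
        # (an empty cluster keeps its previous centroid).
        new_centroids = [
            sum(c) / len(c) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
        if new_centroids == centroids:
            break  # assignments can no longer change
        centroids = new_centroids
    return centroids, clusters


# Hypothetical sample data: two visibly separate groups.
centroids, clusters = kmeans_1d([1.0, 2.0, 3.0, 10.0, 11.0, 12.0], [0.0, 5.0])
print(centroids, clusters)
```

Each observation ends up in exactly one of the k groups, and every group is summarized by a single representative value, its centroid.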

13) Which of the following statements about hierarchical clustering is incorrect?

a. Hierarchical clustering can primarily be used for the aim of exploration
b. Hierarchical clustering should not be primarily used for the aim of exploration
c. Both A and B
d. None of the above

14) Which one of the following clustering techniques needs a merging approach?

a. Partitional
b. Naïve Bayes
c. Hierarchical
d. Both A and C
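
Questions 10 to 14 refer to greedy merges and a tree-shaped result. A minimal sketch of bottom-up (agglomerative) clustering with single linkage on one-dimensional data shows what a merge history looks like. The function name `agglomerative_1d` and the sample values are hypothetical, and real implementations use more efficient data structures.

```python
def agglomerative_1d(points):
    """Single-linkage agglomerative clustering; returns the merge history."""
    clusters = [[p] for p in sorted(points)]
    merges = []
    while len(clusters) > 1:
        # Greedy step: find the pair of clusters with the smallest gap.
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                gap = min(abs(a - b) for a in clusters[i] for b in clusters[j])
                if best is None or gap < best[0]:
                    best = (gap, i, j)
        gap, i, j = best
        # Record which two clusters were joined and at what distance.
        merges.append((clusters[i], clusters[j], gap))
        merged = clusters[i] + clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return merges


# Hypothetical sample data.
history = agglomerative_1d([1, 2, 6, 7, 20])
for left, right, gap in history:
    print(left, "+", right, "at distance", gap)
```

Read from first merge to last, the history describes a tree: nearby points join early, and distant ones join only near the root.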

15) Self-organizing maps can also be considered an instance of _________ type of learning.

a. Supervised learning
b. Unsupervised learning
c. Missing data imputation
d. Both A & C

Suppose one wants to predict the number of newborns from the size of the stork population by
performing supervised learning. Which of the following does this task exemplify?

a. Structural equation modeling
b. Clustering
c. Regression
d. Classification
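
One way a numeric prediction like the stork scenario could be set up is an ordinary least-squares line fit. The sketch below assumes a single input variable; the function name `fit_line` and the data values are invented purely for illustration and are not taken from the source.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope is the covariance of x and y divided by the variance of x.
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variance = sum((x - mean_x) ** 2 for x in xs)
    slope = covariance / variance
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical labeled examples: (stork population, number of newborns).
storks = [10, 20, 30, 40]
newborns = [25, 45, 65, 85]

slope, intercept = fit_line(storks, newborns)
predicted = slope * 50 + intercept  # prediction for an unseen population of 50
print(slope, intercept, predicted)
```

The labeled pairs are what make this supervised: the model is fitted to known outcomes and then applied to a new input.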

18) Which of the following statements is true about classification?

a. It is a measure of accuracy
b. It is a subdivision of a set
c. It is the task of assigning a classification
d. None of the above

19) Which of the following statements is correct about data mining?

a. It can be referred to as the procedure of mining knowledge from data
b. Data mining can be defined as the procedure of extracting information from a set of data
c. The procedure of data mining also involves several other processes like data cleaning, data
transformation, and data integration
d. All of the above

20) In data mining, how many categories of functions are included?

a. 5
b. 4
c. 2
d. 3

21) Which of the following can be considered the classification or mapping of a set or class to
some predefined group or classes?

a. Data set
b. Data Characterization
c. Data Sub Structure
d. Data Discrimination

22) The analysis performed to uncover interesting statistical correlations between associated
attribute-value pairs is known as the _______.

a. Mining of association
b. Mining of correlation
c. Mining of clusters
d. All of the above

23) Which one of the following can be defined as the data object which does not comply with the
general behavior (or the model of available data)?

a. Evaluation Analysis
b. Outlier Analysis
c. Classification
d. Prediction
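
A data object that "does not comply with the general behavior" can be made concrete with a simple rule of thumb: flag values that lie more than a chosen number of standard deviations from the mean. This is only one of several possible criteria. The function name `flag_outliers`, the threshold of 2.0, and the sample readings are assumptions made for this sketch.

```python
import statistics


def flag_outliers(values, threshold=2.0):
    """Return the values lying more than `threshold` standard deviations from the mean."""
    mean = statistics.mean(values)
    spread = statistics.pstdev(values)  # population standard deviation
    if spread == 0:
        return []  # all values identical, nothing deviates
    return [v for v in values if abs(v - mean) / spread > threshold]


# Hypothetical readings with one value far from the rest.
readings = [10, 11, 9, 10, 12, 50]
outliers = flag_outliers(readings)
print(outliers)
```

A single extreme value also inflates the mean and the standard deviation, so on small samples a median-based rule may be more robust.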

24) Which one of the following statements is not correct about data cleaning?

a. It refers to the process of data cleaning
b. It refers to the transformation of wrong data into correct data
c. It refers to correcting inconsistent data
d. All of the above

25) The classification of data mining systems involves:

a. Database technology
b. Information Science
c. Machine learning
d. All of the above

26) In order to integrate heterogeneous databases, how many types of approaches are there in data
warehousing?

a. 3
b. 4
c. 5
d. 2

27) Which one of the following correctly defines the term cluster?

a. Group of similar objects that differ significantly from other objects
b. Symbolic representation of facts or ideas from which information can potentially be
extracted
c. Operations on a database to transform or simplify data in order to prepare it for a
machine-learning algorithm
d. All of the above

28) Which one of the following refers to a binary attribute?

a. This takes only two values. In general, these values will be 0 and 1, and they can be coded
as one bit
b. The natural environment of a certain species
c. Systems that can be used without knowledge of internal operations
d. All of the above

29) Which of the following correctly refers to data selection?

a. A subject-oriented integrated time-variant non-volatile collection of data in support of
management
b. The actual discovery phase of a knowledge discovery process
c. The stage of selecting the right data for a KDD process
d. All of the above

30) Which of the following correctly defines the term "Discovery"?

a. It is hidden within a database and can only be recovered if one is given certain clues (an
example is encrypted information).
b. An extremely complex molecule that occurs in human chromosomes and that carries genetic
information in the form of genes.
c. It is the process of extracting implicit, previously unknown, and potentially useful
information from data
d. None of the above

31) The Euclidean distance measure can also be defined as ___________

a. The process of finding a solution for a problem simply by enumerating all possible solutions
according to some predefined order and then testing them
b. The distance between two points as calculated using the Pythagorean theorem
c. A stage of the KDD process in which new data is added to the existing selection.
d. All of the above
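
For reference, the Euclidean distance between two points is the square root of the sum of squared coordinate differences. A minimal sketch follows; the function name `euclidean_distance` is chosen for illustration, and on Python 3.8 or later the standard library's `math.dist` computes the same quantity.

```python
import math


def euclidean_distance(p, q):
    """Straight-line distance between two points with the same number of coordinates."""
    if len(p) != len(q):
        raise ValueError("points must have the same number of coordinates")
    # Sum the squared differences per coordinate, then take the square root.
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


# The 3-4-5 right triangle: legs of length 3 and 4 give a hypotenuse of 5.
print(euclidean_distance((0, 0), (3, 4)))
```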

32) Which one of the following can be considered a correct application of data mining?

a. Fraud detection
b. Corporate Analysis & Risk management
c. Management and market analysis
d. All of the above

33) Which of the following is used as the first step in the knowledge discovery
process?

a. Data selection
b. Data cleaning
c. Data transformation
d. Data integration

34) Which of the following refers to the step of the knowledge discovery process in which
several data sources are combined?

a. Data selection
b. Data cleaning
c. Data transformation
d. Data integration
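
Combining several data sources, as described in the question above, can be sketched as a join on a shared key. The record layout, the field names (`id`, `customer_id`, `amount`), and the function name `combine_sources` are hypothetical; real pipelines typically also have to reconcile conflicting schemas and values.

```python
def combine_sources(customers, orders):
    """Attach order amounts from one source to customer records from another."""
    # Index the first source by its key, copying each record so the input is not mutated.
    by_id = {c["id"]: dict(c, orders=[]) for c in customers}
    for order in orders:
        record = by_id.get(order["customer_id"])
        if record is not None:  # skip orders that reference unknown customers
            record["orders"].append(order["amount"])
    return list(by_id.values())


# Two hypothetical sources that share a customer identifier.
customers = [{"id": 1, "name": "Ana"}, {"id": 2, "name": "Raj"}]
orders = [
    {"customer_id": 1, "amount": 30},
    {"customer_id": 1, "amount": 12},
    {"customer_id": 3, "amount": 99},
]

combined = combine_sources(customers, orders)
print(combined)
```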

35) Which of the following correctly refers to the term "Data Independence"?

a. It means that the programs are not dependent on the logical
attributes
b. It refers to data that is defined separately, not included in the
program
c. It means that the programs are totally dependent on the physical
attributes of data
d. Both A and C
