Question Bank of Advanced Dbms
Question Bank of Advanced Dbms
DECISION TREES
UNSUPERVISED LEARNING
1. What is the main difference between unsupervised learning and supervised learning?
2. What is a cluster of examples? Define some mechanisms to describe the clusters.
3. How can we measure the distance between an example and a cluster?
4. Summarize the main benefits of cluster analysis.
5. Write the pseudo code for clustering algorithm k-means.
6. What would be the consequence if we did not normalize the training set? Write down
the simple normalization formula.
7. Describe some methods for the initialization of k-means. What are the main
consequences of good or bad initialization?
8. Explain the principle of hierarchical application of k-means.
9. Write the basic algorithm of hierarchical aggregation.
10. What kind of clusters cannot be detected by the k-means algorithm?
11. What distance metric is used in hierarchical aggregation? What are the advantages
and disadvantages of this metric?
12. Describe the principle of the hierarchical-aggregation approach to clustering.