A Exercises Solutions
A Exercises Solutions
EXERCISES SOLUTIONS I
Dr. Aso Mohammad Darwesh
Exercise 1
2
How Data Mining help a business man to develop his marketing? Data Mining can be used in business: To predict the benefits of a new customers To find the relation between sold products To predict the price of products in tomorrow's market To prepare a new offer
Data Mining - 4rth class UHD Aso M. Darwesh
Exercise 2
3
Suppose you are the scientific administrator of an ELearning site. Your site collect, in average, the answers of 137 students per day. You have a black-box that analyses these answers. Your school decide to make some modification in the site to extend his capacity. How can Data Mining help you in preparing your proposal?
Aso M. Darwesh
Solution to exercise 2
4
You can use Data Mining to Group students that make the same mistakes in the same questions Find related questions Find the equi-importance questions
Aso M. Darwesh
Exercise 3
5
Data Mining can improve decision making, how? Decision making bases on the knowledge-base of decider about the subject. As Data mining is the process of extracting hidden information in Datasets, so the knowledge base of the decider is enriched and then we can predict more accurately
Aso M. Darwesh
Exercise 4
6
What are the major types of Data Mining operations? - Supervised Learning - Unsupervised Learning
Describe Data Mining Tasks: - Prediction Methods: use some variables to predict unknown or future values of other variables - Description Methods: find human-interpretable patterns that describe the data
Data Mining - 4rth class UHD Aso M. Darwesh
Exercise 4
7
Suppose you are a software engineer at big mobile phone operator. You have to design a client database. This database contains, Name, Date of birth, place of birth, marital status, ID-card, adress, email, telephon number. And you have also mobile numbers from 111XXXXX to 999XXXXX in another databse that you have to choose one for each new client.Explain the possible type(s) for all fields of your databases.
Data Mining - 4rth class UHD Aso M. Darwesh
Exercise 4
8
Name: Date of birth: Place of birth: Marital status: ID-card: Adress: Email: Telephon number: Mobile numbers from 111XXXXX to 999XXXXX:
Data Mining - 4rth class UHD Aso M. Darwesh
Exercise 5
9
Use average / mode to find the missing values in the dataset (next page)
Aso M. Darwesh
Exercise 5 contd
10
Outlook sunny ? overcast rainy rainy rainy overcast sunny sunny rainy sunny overcast overcast rainy
Temperature 85 80 83 70 ? 65 64 72 69 75 75 72 81 71
Humidity 85 90 86 96 80 70 65 95 70 80 70 ? 75 91
Aso M. Darwesh
Windy FALSE TRUE FALSE FALSE FALSE TRUE TRUE ? FALSE FALSE TRUE TRUE FALSE TRUE
PlayGolf NO NO YES YES YES NO YES NO YES YES YES YES YES NO
Solution to exercise 5
11
Outlook sunny rainy overcast rainy rainy rainy overcast sunny sunny rainy sunny overcast overcast rainy
Temperature 85 80 83 70 72 65 64 72 69 75 75 72 81 71
Humidity 85 90 86 96 80 70 65 95 70 80 70 80 75 91
Aso M. Darwesh
Windy FALSE TRUE FALSE FALSE FALSE TRUE TRUE FALSE FALSE FALSE TRUE TRUE FALSE TRUE
PlayGolf NO NO YES YES YES NO YES NO YES YES YES YES YES NO
Binning
12
Equal-width binning
3, 4, 7, 8, 9, 13, 15, 21, 21, 24, 25, 26, 28, 29, 34,35
Width = Width = (35 3) =2 16 ( Max Min) N
Aso M. Darwesh
Equal-depth binning
13
Sorted price values 4, 8, 9, 15, 21, 21, 24, 25, 26, 28, 29, 34
Partition into three (equi-depth) bins Bin 1: 4, 8, 9, 15 Bin 2: 21, 21, 24, 25 Bin 3: 26, 28, 29, 34