0% found this document useful (0 votes)
253 views

Week 1 - 2 - Its 665 Intro To Weka

This document provides instructions for a lab exercise introducing the data mining tool WEKA. It includes 4 exercises: 1. Analyzing weather data to find min, max, average values and frequencies for variables. 2. Loading a labor dataset and identifying the number of instances, attributes, and values for the "vacation" attribute. 3. Downloading and preparing a COVID-19 hotspot dataset for analysis in WEKA. 4. Downloading a COVID-19 case dataset related to a religious gathering, preparing it for WEKA, and identifying potential knowledge that could be discovered.

Uploaded by

reen vote
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
253 views

Week 1 - 2 - Its 665 Intro To Weka

This document provides instructions for a lab exercise introducing the data mining tool WEKA. It includes 4 exercises: 1. Analyzing weather data to find min, max, average values and frequencies for variables. 2. Loading a labor dataset and identifying the number of instances, attributes, and values for the "vacation" attribute. 3. Downloading and preparing a COVID-19 hotspot dataset for analysis in WEKA. 4. Downloading a COVID-19 case dataset related to a religious gathering, preparing it for WEKA, and identifying potential knowledge that could be discovered.

Uploaded by

reen vote
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 4

ITS665/ISP565 DATA MINING 2021

LAB EXERCISE

INTRODUCTION INTO WEKA


Exercise 1:
i. Using weather.numeric dataset, find the min, max and average for each variable.
a) Temperature

b) Humidity

ii. Using weather.numeric dataset, find the frequencies for each value for each variable.
a) Outlook

Prepared and Updated by Dr Sofianita / Dr Shuzlina Semester Oct. 2021


ITS665/ISP565 DATA MINING 2021

b) Windy

Exercise 2:
No Activity
Load the labor.arff file into WEKA. The file is in the data folder of WEKA. This dataset classifies
individuals described by a set of attributes.
1. How many instances and attributes contained in the dataset?

Number of 57
instances
Number of 17
attributes

2. Identify type of attribute for vacation, the values of vacation and the number of instances for each
value.
Type of attribute: Nominal
Values of vacation Below average Average Generous
Number of instances 18 17 16

3. For the third value of attribute vacation, give the count of each class.

Vacation =
Class=good – 16
Class=bad – 2

Prepared and Updated by Dr Sofianita / Dr Shuzlina Semester Oct. 2021


ITS665/ISP565 DATA MINING 2021

Exercise 3:
i. Go to the following link: http://www.data.gov.my/data/ms_MY/dataset/senarai-lokaliti-hotspot-
wabak-denggi-di-malaysia
ii. Download the dataset
iii. Try to prepare or process the data until it can be uploaded to WEKA

Note: If you still unable to do it, see this link: https://youtu.be/itixU0jIX3Q

Prepared and Updated by Dr Sofianita / Dr Shuzlina Semester Oct. 2021


ITS665/ISP565 DATA MINING 2021

Exercise 4:
i. Find any related data of COVID19 from the Internet. State the URL for that website.
https://data.world/erhanazrai/httpsdocsgooglecomspreadsheetsd15a43eb68lt7ggk9vavy

ii. Download the dataset.


iii. Prepare the dataset until it can be uploaded to WEKA. Report the problems that you encounter and
briefly explain how you solve it.
Dataset- TABLIGH (Taburan Kes Kluster Tabligh Sri Petaling)

There is no problem encountered for this dataset.

iv. Identify the knowledge that you think can be discovered from this dataset.

The number of confirmed positive cases of covid 19 from the Sri Petaling tabligh cluster from different
states.

Azrai, E. (2020). Covid-19 Malaysia - dataset by erhanazrai. Retrieved 27 October 2021, from
https://data.world/erhanazrai/httpsdocsgooglecomspreadsheetsd15a43eb68lt7ggk9vavy

Prepared and Updated by Dr Sofianita / Dr Shuzlina Semester Oct. 2021

You might also like