
DWDM LAB Manual SVEC-16

This document describes two experiments conducted in Weka to preprocess data and create a decision tree. The first experiment uses various filters to handle missing data through marking, removing, and imputing missing values. The second experiment trains a decision tree classifier on bank data using the J48 algorithm to classify customers.

Uploaded by

Pottli Siddhu


Title: WEKA SVEC/CSE/EXPT-DWDM

SREE VIDYANIKETHAN ENGINEERING COLLEGE(AUTONOMOUS)

Sree Sainath Nagar, A. Rangampet – 517 102

Department of Computer Science and Engineering

III B. Tech – II Semester
DATA WAREHOUSING AND DATA MINING LAB (16BT61531)

WEKA - WEEK 1
Aim:
To preprocess data in Weka with simple experiments:
a) Handling missing data (both nominal and numerical)
b) All types of normalization (min-max, z-score, decimal scaling)
c) Sampling
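
The three normalization schemes named in the aim can be illustrated outside Weka with a minimal Python sketch. It uses the usual textbook definitions; the function names and the sample values are hypothetical and are not taken from the lab data.

```python
import statistics

def min_max(values, new_min=0.0, new_max=1.0):
    """Rescale values linearly into [new_min, new_max]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values are equal, so map everything to the lower bound.
        return [new_min for _ in values]
    return [(v - lo) / (hi - lo) * (new_max - new_min) + new_min for v in values]

def z_score(values):
    """Centre on the mean and divide by the population standard deviation."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)
    if std == 0:
        return [0.0 for _ in values]
    return [(v - mean) / std for v in values]

def decimal_scaling(values):
    """Divide by the smallest power of ten that makes every |value| < 1."""
    largest = max(abs(v) for v in values)
    j = 0
    while largest / (10 ** j) >= 1:
        j += 1
    return [v / (10 ** j) for v in values]

# Hypothetical sample values, for illustration only.
values = [200, 300, 400, 600, 1000]
scaled_min_max = min_max(values)
scaled_z = z_score(values)
scaled_decimal = decimal_scaling(values)
print(scaled_min_max)
print(scaled_decimal)
```

In this sketch min-max maps the smallest value to 0 and the largest to 1, while decimal scaling only shifts the decimal point, so the relative spacing of the values is preserved.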
DESCRIPTION:

A) Mark Missing Values

1. Open the Weka Explorer.

2. Load the Pima Indians onset of diabetes dataset.

3. Click the “Choose” button for the Filter and select NumericCleaner; it is under unsupervised.attribute.NumericCleaner.

Weka Select Numeric Cleaner Data Filter

4. Click on the filter to configure it.

5. Set attributeIndices to 6, the index of the mass attribute.

6. Set minThreshold to 0.1E-8 (close to zero), which is the minimum value allowed for the attribute.

7. Set minDefault to NaN, which is unknown and will replace values below the threshold.
8. Click the “OK” button on the filter configuration.

9. Click the “Apply” button to apply the filter.

Click “mass” in the “attributes” pane and review the details of the “selected attribute”. Notice that the 11 attribute values that were formerly set to 0 are now marked as Missing.

Weka Missing Data Marked

In this example we marked values below a threshold as missing.
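
The idea behind the thresholding steps above can be sketched in plain Python. This is not Weka's implementation; it only mimics the effect of minThreshold and minDefault on a single column. The rows are hypothetical toy values, and the column index is 0-based here, whereas Weka counts attributes from 1.

```python
import math

def mark_missing_below(rows, column, min_threshold, min_default=math.nan):
    """Return a copy of rows in which values of `column` below
    min_threshold are replaced by min_default (NaN marks 'missing')."""
    cleaned = []
    for row in rows:
        new_row = list(row)  # copy so the input rows stay unchanged
        if new_row[column] < min_threshold:
            new_row[column] = min_default
        cleaned.append(new_row)
    return cleaned

# Hypothetical toy rows (not the real diabetes data): [plas, mass]
rows = [[148, 33.6], [85, 0.0], [183, 23.3], [89, 0.0]]
marked = mark_missing_below(rows, column=1, min_threshold=0.1e-8)
missing_count = sum(1 for r in marked if math.isnan(r[1]))
print(missing_count)
```

A body mass index of 0 is not a plausible measurement, which is why treating it as missing is more honest than keeping it as a number.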

You could just as easily mark them with a specific numerical value. You could also
mark values as missing between an upper and lower range of values.
Next, let’s look at how we can remove instances with missing values from our
dataset.
Remove Missing Data
Now that you know how to mark missing values in your data, you need to learn
how to handle them.
A simple way to handle missing data is to remove those instances that have one
or more missing values.
You can do this in Weka using the RemoveWithValues filter.
Continuing on from the above recipe to mark missing values, you can remove
instances with missing values as follows:
1. Click the “Choose” button for the Filter and select RemoveWithValues; it is
under unsupervised.instance.RemoveWithValues.
Weka Select RemoveWithValues Data Filter

2. Click on the filter to configure it.

3. Set attributeIndex to 6, the index of the mass attribute.
4. Set matchMissingValues to “True”.
5. Click the “OK” button to use the configuration for the filter.
6. Click the “Apply” button to apply the filter.
Click “mass” in the “attributes” section and review the details of the “selected
attribute”.
Notice that the 11 instances whose mass value was marked Missing have been removed
from the dataset.
Weka Missing Values Removed

Note, you can undo this operation by clicking the “Undo” button.
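
As a rough illustration of what the removal step does to the data, the following Python sketch drops every row whose value in one column is missing. It assumes missing values are stored as NaN and uses hypothetical toy rows; it is not Weka's code.

```python
import math

def remove_rows_with_missing(rows, column):
    """Keep only the rows whose value in `column` is not NaN."""
    return [row for row in rows if not math.isnan(row[column])]

# Hypothetical toy rows: [plas, mass], with NaN standing for 'missing'.
rows = [[148, 33.6], [85, math.nan], [183, 23.3], [89, math.nan]]
kept = remove_rows_with_missing(rows, column=1)
print(len(rows), "->", len(kept))
```

Removal is simple but shrinks the dataset, so it is mainly reasonable when only a small share of instances is affected.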
Impute Missing Values
Instances with missing values do not have to be removed; you can replace
the missing values with some other value.
This is called imputing missing values.
It is common to impute missing values with the mean of the numerical
distribution. You can do this easily in Weka using the ReplaceMissingValues
filter, which replaces missing numeric values with the mean and missing
nominal values with the mode.
Continuing on from the first recipe above to mark missing values, you can
impute the missing values as follows:
1. Click the “Choose” button for the Filter and select ReplaceMissingValues;
it is under unsupervised.attribute.ReplaceMissingValues.
Weka ReplaceMissingValues Data Filter

2. Click the “Apply” button to apply the filter to your dataset.

Click “mass” in the “attributes” section and review the details of the “selected attribute”.

Notice that the 11 attribute values that were marked Missing have been set to the mean value of the distribution.

Weka Imputed Values
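
Mean imputation for a single numeric column can be sketched in Python as follows. This is a simplified illustration under stated assumptions: missing values are stored as NaN, only one column is treated, and the rows are hypothetical. It does not handle nominal attributes.

```python
import math

def impute_mean(rows, column):
    """Return a copy of rows in which NaN values of `column` are
    replaced by the mean of the observed (non-NaN) values."""
    observed = [row[column] for row in rows if not math.isnan(row[column])]
    if not observed:
        # Nothing to compute a mean from; return an unchanged copy.
        return [list(row) for row in rows]
    mean = sum(observed) / len(observed)
    return [
        [mean if (i == column and math.isnan(v)) else v for i, v in enumerate(row)]
        for row in rows
    ]

# Hypothetical toy rows: [plas, mass], with NaN standing for 'missing'.
rows = [[148, 30.0], [85, math.nan], [183, 20.0], [89, math.nan]]
imputed = impute_mean(rows, column=1)
print(imputed)
```

Unlike removal, imputation keeps every instance, at the cost of pulling the imputed values toward the centre of the distribution.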

EXPERIMENT-2

Aim: To create a decision tree by training on a data set using the Weka mining tool.

Tools/Apparatus: Weka mining tool.

mbinations of values in the historical data.

Procedure:

1) Open Weka GUI Chooser.

2) Select EXPLORER present in Applications.

3) Select Preprocess Tab.

4) Click “Open file” and browse to the file “bank.csv”, which is already stored on the system.

5) Go to the Classify tab.

6) Click the “Choose” button to pick the classifier. The C4.5 algorithm is used here; its Java implementation in Weka is named J48.

7) Select J48 under trees.

9) Select the test option “Use training set”.

10) If needed, select the class attribute.

11) Click Start.

12) The output details now appear in the Classifier output pane.

13) Right-click the entry in the result list and select the “Visualize tree” option.
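
To give a feel for how C4.5 chooses the attribute to split on, the following Python sketch computes the gain ratio of nominal attributes on a tiny table. It is a simplified illustration only: numeric attributes, missing values, and pruning, which C4.5 and J48 also deal with, are left out. The attribute names and records are hypothetical and are not taken from bank.csv.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())

def gain_ratio(rows, attribute, target):
    """Information gain of splitting on `attribute`, divided by the
    entropy of the split itself (the split information)."""
    labels = [row[target] for row in rows]
    groups = {}
    for row in rows:
        groups.setdefault(row[attribute], []).append(row[target])
    total = len(rows)
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    info_gain = entropy(labels) - remainder
    split_info = entropy([row[attribute] for row in rows])
    return info_gain / split_info if split_info > 0 else 0.0

# Hypothetical customer records; "pep" plays the role of the class attribute.
rows = [
    {"married": "yes", "car": "yes", "pep": "no"},
    {"married": "yes", "car": "no", "pep": "no"},
    {"married": "no", "car": "yes", "pep": "yes"},
    {"married": "no", "car": "no", "pep": "yes"},
]
best = max(["married", "car"], key=lambda a: gain_ratio(rows, a, "pep"))
print(best)
```

In this toy table "married" separates the two classes perfectly while "car" carries no information about the class, so the root of the tree would test "married".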

Sample output:
The decision tree constructed by using the implemented C4.5 algorithm
