Weka Tool
Weka Tool
Weka is a free open-source software with a range of built-in machine learning algorithms that
you can access through a graphical user interface! “
WEKA stands for Waikato Environment for Knowledge Analysis and was developed at the
University of Waikato, New Zealand.
Weka has multiple built-in functions for implementing a wide range of machine learning
algorithms from linear regression to neural network. This allows you to deploy the most complex
of algorithms on your dataset at just a click of a button! Not only this, Weka gives support for
accessing some of the most common machine learning library algorithms of Python and R!
With Weka you can preprocess the data, classify the data, cluster the data and even visualize the
data! This you can do on different formats of data files like ARFF, CSV, C4.5, and JSON. Weka
even allows you to add filters to your dataset through which you can normalize your data,
standardize it, interchange features between nominal and numeric values, and what not!
A Decision Tree is a categorization approach made up of three parts: the root node, the branch
(edge or link), and the leaf node. The root node contains the test condition for various attributes,
the branch node represents all possible outcomes in the test, and the leaf nodes contain the label of
the category to which it corresponds. The root node is located at the beginning of the tree, often
known as the tree’s top. Classification and regression challenges can be helped by decision trees.
Classification and Regression Trees are newer terms for decision trees (CART). They work by
constructing a tree to analyze a piece of data, beginning at the root and progressing to the leaves
(roots) until a forecast can be produced. The method of developing a decision tree involves
greedily selecting the optimal split point to make predictions and then repeating the process until
the tree reaches a fixed depth.
Following the construction of the tree, it is pruned to increase the model’s capacity to generalize
to new data.
C4.5 generates an algorithm for generating a decision tree (an extension of ID3). It is often
referred to as a statistical classifier. A database is required for decision tree categorization. The
vote dataset described in the preceding article will be used.
Follow the steps below to load the dataset and perform classification in Weka:
3) Go to the “Classify” tab for classifying the unclassified data. Click on the “Choose” button.
From this, select “trees -> J48”. Let us also have a quick look at other options in the Choose
button:
4) Click on Start Button. The classifier output will be seen on the Right-hand panel. It shows the
run information in the panel as:
· The number of leaves and the size of the tree describes the decision tree.
1.Select height weight csv file from the “Open file” under the preprocess tab option.
2.Go to Classify tab and select Naivebayes Multinomial algorithm and click on start button.
You can easily save a trained model to file in the Weka Explorer interface.
1. Right click on the result item for your model in the “Result list” on the “Classify” tab.
3. Select a location and enter a filename such as “heightwt”, click the “Save button.