11
AIM:
i) Downloading and/or installation of WEKA data mining toolkit
Note: if weka installation package don’t have java included in it then compatible version of the
java has to be installed initially as given below.
Step 1: Select appropriate JAVA version (Here we are using:jdk1.8). Operate the executable file
of welcome window to the installation wizard of Development kit ,click next button
Step 2: Custom setup window to select path and click next.
Step 3: In the destination folder window we check java file stored location and click next.
III [Link] II Sem Data Mining-Lab GEC
12
Step 4: Installation process completion window is displayed and click close.
Weka –Installation:
Go to the Weka website, [Link] and download the
[Link] the left hand side, click on the link that says download. Select the appropriate link
corresponding to the version of the software based on your operating system and whether or
not you already have Java VM running on your [Link] the self extracting executable to
the disk and double-click on it to install weka.
Step 1: Welcome setup wizard window is viewed where next button has to be clicked
Step2: In the license agreement window accept the agreement with “I Agree” button.
III [Link] II Sem Data Mining-Lab GEC
13
Step 3: The component window is displayed where the type of installation must be “Full” and
click on next.
Step 4: In the installation window to select the path where the software loads can be selected
through Browse button to the destination folder and click next button.
Step 5: To add to the list of programs from the start menu check the create shortcut and press
the Install button
III [Link] II Sem Data Mining-Lab GEC
14
Step 6: completion of installation is indicated in the window where the next button is selected.
Step 7: The setup wizard with the start up of Weka is chosen and finish the process.
Step 8: An warning window tells about the feature of package manager, click ok.
III [Link] II Sem Data Mining-Lab GEC
15
Step 9: Weka home page is viewed.
Step 10: The installation process is completed fully.
2. Understand the feature of Weka toolkit
The Weka GUI Chooser (class [Link]) provides a starting point for launching
Weka’s main GUI applications and supporting tools. If one prefers a MDI (“multiple document
interface”) appearance, then this is provided by an alternative launcher called “Main” (class
[Link]). The GUI Chooser consists of four buttons—one for each of the four major
Weka applications—and four menus.
III [Link] II Sem Data Mining-Lab GEC
16
The buttons can be used to start the following applications:
• Explorer An environment for exploring data with WEKA (the rest of this documentation deals
with this application in more detail).
• Experimenter An environment for performing experiments and conduct ing statistical tests
between learning schemes.
• KnowledgeFlow This environment supports essentially the same functions as the Explorer but
with a drag-and-drop interface. One advantage is that it supports incremental learning.
• SimpleCLI Provides a simple command-line interface that allows direct execution of WEKA
commands for operating systems that do not provide their own command line interface.
The menu consists of four sections:
1. Program
• LogWindow Opens a log window that captures all that is printed to stdout or stderr.
Useful for environments like MS Windows, where WEKA is normally not started from a
terminal.
• Exit Closes WEKA.
2. Visualization
• Plot For plotting a 2D plot of a dataset.
• ROC Displays a previously saved ROC curve.
• TreeVisualizer For displaying directed graphs, e.g., a decision tree.
• GraphVisualizer Visualizes XML BIF or DOT format graphs, e.g., for Bayesian networks.
• BoundaryVisualizer Allows the visualization of classifier decision boundaries in two
dimensions.
3. Tools
III [Link] II Sem Data Mining-Lab GEC
17
• ArffViewer An MDI application for viewing ARFF files in spread- sheet format.
• SqlViewer Represents an SQL worksheet, for querying databases via JDBC.
• Bayes net editor An application for editing, visualizing and learn- ing Bayes nets.
4. Help
• Weka homepage Opens a browser window with WEKA’s home-page.
• HOWTOs, code snippets, etc. The general WekaWiki [2], con-taining lots of examples
and HOWTOs around the development and use of WEKA.
• Weka on SourceforgeWEKA’s project homepage on [Link].
• SystemInfo Lists some internals about the Java/WEKA environment, e.g., the
CLASSPATH.
3. Navigate the options available in the WEKA (ex. Select attributes panel, preprocess panel,
classify panel, cluster panel, associate panel and visualize panel).
Explorer
1. The user interface
1.1 Section Tabs
At the very top of the window, just below the title bar, is a row of tabs. When the Explorer is
first started only the first tab is active; the others are greyed out. This is because it is necessary
to open (and potentially pre-process) a data set before starting to explore the data.
The tabs are as follows:
1. Preprocess. Choose and modify the data being acted on.
2. Classify. Train and test learning schemes that classify or perform regression.
3. Cluster. Learn clusters for the data.
4. Associate. Learn association rules for the data.
5. Select attributes. Select the most relevant attributes in the data.
6. Visualize. View an interactive 2D plot of the data.
1.2 Status Box
The status box appears at the very bottom of the window. It displays messages that keep you
informed about what’s going on. For example, if the Explorer is busy loading a file, the status
box will say that.
III [Link] II Sem Data Mining-Lab GEC