0% found this document useful (0 votes)
348 views

Syllabus - Data Mining Solution With Weka

The document provides an overview of a 5-day course on using the open-source data mining tool WEKA. The course aims to teach attendees how to use WEKA's algorithms and components to recognize patterns in data, prepare and visualize data, perform classification, association rule mining, clustering, and compare results. Key algorithms covered include J48 for decision trees, Apriori for association rules, K-means for clustering. The course also covers automating data mining workflows using WEKA's KnowledgeFlow component.

Uploaded by

Jaya Wijaya
Copyright
© Attribution Non-Commercial (BY-NC)
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
348 views

Syllabus - Data Mining Solution With Weka

The document provides an overview of a 5-day course on using the open-source data mining tool WEKA. The course aims to teach attendees how to use WEKA's algorithms and components to recognize patterns in data, prepare and visualize data, perform classification, association rule mining, clustering, and compare results. Key algorithms covered include J48 for decision trees, Apriori for association rules, K-means for clustering. The course also covers automating data mining workflows using WEKA's KnowledgeFlow component.

Uploaded by

Jaya Wijaya
Copyright
© Attribution Non-Commercial (BY-NC)
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 5

Data Mining Solution with WEKA

Course ID : PHI-BI-004 Platform : Windows http://www.phi-Integration.com

Data Mining Solution with WEKA

Data Mining Solution with WEKA


I. Introduction
With the growing complexities data in terms of size and dictionary structures, organizations are challenge to dig deeply into the data to find useful information or to gain hidden predictive informations from it. Using a powerful analysis or OLAP solution is not enough since it will depends on a good analysist and usually takes time. Data mining application with advanced algorithm is the answer to the problem of finding useful hidden pattern. WEKA is the open source machine learning tool which has evolved into a practical data cost efficient mining tool. With a growing powerful algorithms and features, WEKAs adoptions into mainstream is undoubtable. Combining with statistical skills, using data mining application likes WEKA requires the knowledge of problem domain and which algorithms better cope with them. This course is designed for this.

PHI-Integration.com

Page : 2 / 5

Data Mining Solution with WEKA

II.

Who Should Attend ?


This course is designed for Business Analysts, IT developer, Database Administrator and those who new and interest to data mining and predictive analysis.

III.

Objectives
At the completion of this course, attendee should be able to : Understand the practical usage of data mining. Use Weka tools to recognize and use algorithm. Recognize suitable algorithms to solve problem. Able to automate the data mining flow.

IV. V.

Course Duration
5 days / 30 hours

Course Prerequisites
Basic understanding one of several popular DBMS (Oracle, SQL Server, MySQL, etc) and Structured Query Language (SQL). Basic understanding of statictic concepts (data sampling, standard deviation, etc).

VI.

Course Resources
PC or Laptop with minium of 2GHz CPU, 1 GB of RAM, DVD Drive and 2 GB of available hard disk space. Softwares : Windows XP Pro Operating System Java Runtime Environment (JRE) MySQL 5.0 Database Server Open Office Calc. WEKA Data Mining Application.

PHI-Integration.com

Page : 3 / 5

Data Mining Solution with WEKA

VII.

Course Outline
1. Data Mining and WEKA Data Mining. Machine Learning. Supervised and Unsupervised Learning. Data Mining Algorithms. WEKA as machine learning application. Dataset Sample. 2. Weka Application and Components Installation. Weka Components Explorer Experimenter KnowledgeFlow Simple CLI 3. Preprocess (Data Preparation) Weka Explorer s Preprocess Dataset, Instance, Attribute, Class Attribute and Data Types Data Input Format ARFF (Attribute-Relation File Format) CSV (Comma Separated Value) Database Tables Access Data Population Statistics Practice : Opening and Exploring ARFF Data Practice : Data Transformation Practice : MySQL Database Access Filtering Practice : Removing and Discretize Attributes Using Filter 4. Data Visualization Visualize Data and Data Mining Results Practice: Explore Data Visualization Types Jitter to Help Visualize Nodes Practice : Using Jitter with Heavily Populated Data Missing Value

PHI-Integration.com

Page : 4 / 5

Data Mining Solution with WEKA

5. Classification What is Classsification ? Classfication Types : Decision Tree Decision Tree Algorithm : J48 Practice : Pattern classification using J48 User Classifier Practical : Decision Tree dengan User Classifier Exporting and Importing Object Model as Pattern Result 6. Association Rules Association Rules to find sequence or dependency data. Association Rule algorithm : Apriopri. Practice : Using Apriori to solve market basket analysis problem. 7. Clustering Finite categorization with Clustering. What is K-Mean Clustering ? K-Mean implementation using SimpleKMean algorithm. Practice : Cluster heavily populated data with many attributes. 8. Experimenter Experimenter to compare several Object Model predictions. Object Model and experiment configurations. Practice : Compare results on several object models from previous exercises. 9. Simple CLI SimpleCLI and Java Command. Benefits of using SimpleCLI. Practice : Using SimpleCLI to generate Decision Tree Object Model. 10. KnowledgeFlow KnowledgeFlow as automation tool to do data acquisition, pattern recognition and producing prediction result. Calling knowledgeflow from Java command line interface. Practice : Design and calling knowledgeflow.

PHI-Integration.com

Page : 5 / 5

You might also like