Weka: Data Warehousing and Data Mining Lab Manual, Week 9
WEKA: DATA MINING AND MACHINE LEARNING TOOL
09 Nov, 2020
What is WEKA?
WEKA stands for Waikato Environment for Knowledge Analysis.
It is a data mining and machine learning tool
developed by the Department of Computer Science,
University of Waikato, New Zealand.
The weka is also a bird found only on the islands of
New Zealand.
Explorer
The graphical user interface
Section Tabs
At the very top of the window, just below the title bar, is a row of tabs. When the
Explorer is first started only the first tab is active; the others are grayed out. This is
because it is necessary to open (and potentially pre-process) a data set before
starting to explore the data.
The tabs are as follows:
1. Preprocess. Choose and modify the data being acted on.
2. Classify. Train and test learning schemes that classify or perform regression.
3. Cluster. Learn clusters for the data.
4. Associate. Learn association rules for the data.
5. Select attributes. Select the most relevant attributes in the data.
6. Visualize. View an interactive 2D plot of the data.
Once the tabs are active, clicking on them switches between different screens, on
which the respective actions can be performed. The bottom area of the window
(including the status box, the log button, and the Weka bird) stays visible
regardless of which section you are in. The Explorer can be easily extended with
custom tabs.
Steps:
1) Open Start -> Programs -> Accessories -> Notepad.
2) In Notepad, type the following training data set for the Employee table.
@relation employee
@attribute name {x,y,z,a,b}
@attribute id numeric
@attribute salary {low,medium,high}
@attribute exp numeric
@attribute gender {male,female}
@attribute phone numeric
@data
x,101,low,2,male,250311
y,102,high,3,female,251665
z,103,medium,1,male,240238
a,104,low,5,female,200200
b,105,high,2,male,240240
3) Save the file with the .arff extension (for example, Employee.arff).
4) Minimize the Notepad window, then open Start -> Programs -> weka-3-4.
5) Click on weka-3-4; the Weka dialog box is displayed on the screen.
6) The dialog box offers four modes; click on Explorer.
7) The Explorer shows many options. Click on 'Open file' and select the .arff file.
8) Click on the Edit button, which displays the employee table in Weka.
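The file typed in step 2 has three parts: an @relation line, one @attribute line per column, and the rows after @data. As a rough illustration of how that layout is read, here is a minimal Python sketch. It is not Weka's own loader: parse_arff is a hypothetical helper that handles only simple dense files like this one (no quoted values, no sparse rows, no date or string attributes).

```python
# Minimal reader for simple, dense ARFF text (illustrative only, not Weka's loader).
ARFF_TEXT = """\
@relation employee
@attribute name {x,y,z,a,b}
@attribute id numeric
@attribute salary {low,medium,high}
@attribute exp numeric
@attribute gender {male,female}
@attribute phone numeric
@data
x,101,low,2,male,250311
y,102,high,3,female,251665
z,103,medium,1,male,240238
a,104,low,5,female,200200
b,105,high,2,male,240240
"""


def parse_arff(text):
    """Return (relation, attributes, rows) for a simple dense ARFF string."""
    relation, attributes, rows = None, [], []
    in_data = False
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("%"):  # skip blank lines and comments
            continue
        lower = line.lower()
        if in_data:
            values = [v.strip() for v in line.split(",")]
            row = []
            for (_, kind), value in zip(attributes, values):
                # Numeric attributes become floats; nominal values stay as text.
                row.append(float(value) if kind == "numeric" else value)
            rows.append(row)
        elif lower.startswith("@relation"):
            relation = line.split(None, 1)[1]
        elif lower.startswith("@attribute"):
            _, name, spec = line.split(None, 2)
            if spec.startswith("{"):
                # Nominal attribute: keep the list of allowed labels.
                kind = [v.strip() for v in spec.strip("{}").split(",")]
            else:
                kind = spec.lower()
            attributes.append((name, kind))
        elif lower.startswith("@data"):
            in_data = True
    return relation, attributes, rows


relation, attributes, rows = parse_arff(ARFF_TEXT)
print(relation, len(attributes), len(rows))  # employee 6 5
```

The parsed table has the same six attributes and five rows that the Edit button displays in the Explorer.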
Apply the following pre-processing techniques to the training data set of the Employee table:
1) Add
2) Remove
3) Normalization
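Of these three techniques, normalization is the one that changes values rather than columns: Weka's unsupervised Normalize filter rescales each numeric attribute to the range [0, 1] by default. The Python sketch below shows the underlying min-max arithmetic on the exp column of the employee data. The function name min_max_normalize is hypothetical, and the handling of a constant column is a choice made for this sketch.

```python
def min_max_normalize(values):
    """Scale numbers linearly so the minimum maps to 0.0 and the maximum to 1.0."""
    lo, hi = min(values), max(values)
    if hi == lo:  # constant column: avoid dividing by zero (choice of this sketch)
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


exp = [2, 3, 1, 5, 2]  # the exp column of the employee data
normalized_exp = min_max_normalize(exp)
print(normalized_exp)  # [0.25, 0.5, 0.0, 1.0, 0.25]
```

Here the smallest experience value (1) maps to 0.0 and the largest (5) maps to 1.0; every other value lands proportionally in between.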
Procedure (Add):
1) Open Start -> Programs -> Weka-3-4 -> Weka-3-4.
2) Click on Explorer.
3) Click on Open file.
4) Select the Employee.arff file and click on Open.
5) Click on the Choose button and expand the filters folder.
6) Under filters there are two groups: supervised and unsupervised.
7) Expand unsupervised.
8) Under attribute, select the Add filter.
9) Click on the filter name shown next to the Choose button; a new window opens.
10) In that window, enter the attribute index, type, date format, and nominal label values for the new Address attribute.
11) Click on OK.
12) Press the Apply button; a new attribute is added to the Employee table.
13) Save the file.
14) Click on the Edit button; it shows the new Employee table in Weka.
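The effect of the Add filter can be sketched outside Weka as follows. Weka's Add filter creates the new attribute with every value missing, and the sketch mirrors that by filling the new column with None. The helper add_attribute and the address labels city_a and city_b are hypothetical, since the manual does not list the label values to use.

```python
def add_attribute(attributes, rows, name, kind, index=None):
    """Return new (attributes, rows) with one extra attribute, all values missing."""
    if index is None:
        index = len(attributes)  # default: append as the last attribute
    new_attributes = attributes[:index] + [(name, kind)] + attributes[index:]
    new_rows = [row[:index] + [None] + row[index:] for row in rows]
    return new_attributes, new_rows


# A two-row excerpt of the employee data, kept short for illustration.
attributes = [
    ("name", ["x", "y", "z", "a", "b"]),
    ("id", "numeric"),
    ("salary", ["low", "medium", "high"]),
]
rows = [
    ["x", 101, "low"],
    ["y", 102, "high"],
]

# "city_a" and "city_b" are placeholder labels, not values from the manual.
new_attributes, new_rows = add_attribute(
    attributes, rows, "address", ["city_a", "city_b"]
)
print([name for name, _ in new_attributes])  # ['name', 'id', 'salary', 'address']
print(new_rows[0])                           # ['x', 101, 'low', None]
```

The new column starts out empty, so its cells still have to be filled in afterwards, for example through the Edit dialog.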
Procedure (Remove):
1) Open Start -> Programs -> Weka-3-4 -> Weka-3-4.
2) Click on Explorer.
3) Click on Open file.
4) Select the Employee.arff file and click on Open.
5) Click on the Choose button and expand the filters folder.
6) Under filters there are two groups: supervised and unsupervised.
7) Expand unsupervised.
8) Under attribute, select the Remove filter.
9) Select the attributes salary and gender to remove.
10) Click the Remove button and then Save.
11) Click on the Edit button; it shows the new Employee table in Weka.
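As a rough picture of what removing salary and gender does to the table, the following Python sketch drops the named columns from an in-memory copy of the employee data. The helper remove_attributes is hypothetical and selects columns by name, whereas Weka's Remove filter is configured with attribute indices.

```python
def remove_attributes(attributes, rows, names_to_remove):
    """Return new (attributes, rows) without the named attributes."""
    keep = [i for i, (name, _) in enumerate(attributes)
            if name not in names_to_remove]
    kept_attributes = [attributes[i] for i in keep]
    kept_rows = [[row[i] for i in keep] for row in rows]
    return kept_attributes, kept_rows


attributes = [
    ("name", ["x", "y", "z", "a", "b"]),
    ("id", "numeric"),
    ("salary", ["low", "medium", "high"]),
    ("exp", "numeric"),
    ("gender", ["male", "female"]),
    ("phone", "numeric"),
]
rows = [
    ["x", 101, "low", 2, "male", 250311],
    ["y", 102, "high", 3, "female", 251665],
    ["z", 103, "medium", 1, "male", 240238],
    ["a", 104, "low", 5, "female", 200200],
    ["b", 105, "high", 2, "male", 240240],
]

kept_attributes, kept_rows = remove_attributes(
    attributes, rows, {"salary", "gender"}
)
print([name for name, _ in kept_attributes])  # ['name', 'id', 'exp', 'phone']
print(kept_rows[0])                           # ['x', 101, 2, 250311]
```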
If the data set contains missing values, use Weka to
impute them (that is, to fill in the missing
values).
Use Weka to discretize the data set if your classifier
works only on discretized data.
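One common imputation rule, and the one used by Weka's unsupervised ReplaceMissingValues filter, is to fill a numeric attribute with its mean and a nominal attribute with its mode. The Python sketch below applies that rule to two columns of the employee data. The missing entries (None) are introduced here purely for illustration, since the original table is complete, and impute_column is a hypothetical helper.

```python
from collections import Counter


def impute_column(values, numeric):
    """Fill None entries with the mean (numeric) or the mode (nominal)."""
    present = [v for v in values if v is not None]
    if not present:  # nothing to learn from: return the column unchanged
        return list(values)
    if numeric:
        fill = sum(present) / len(present)
    else:
        fill = Counter(present).most_common(1)[0][0]
    return [fill if v is None else v for v in values]


# Employee columns with the second row's values blanked out for illustration.
exp = [2, None, 1, 5, 2]
gender = ["male", None, "male", "female", "male"]

filled_exp = impute_column(exp, numeric=True)
filled_gender = impute_column(gender, numeric=False)
print(filled_exp)     # [2, 2.5, 1, 5, 2]
print(filled_gender)  # ['male', 'male', 'male', 'female', 'male']
```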
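A simple way to discretize a numeric attribute is equal-width binning, which is also the default mode of Weka's unsupervised Discretize filter. The Python sketch below splits the exp column into three bins of equal width. The helper equal_width_bins and the choice of three bins are assumptions made for illustration, and the treatment of values that fall exactly on a bin boundary is a choice of this sketch that may differ from Weka's.

```python
def equal_width_bins(values, n_bins):
    """Map each number to a bin index 0..n_bins-1 using equal-width intervals."""
    lo, hi = min(values), max(values)
    if hi == lo:  # constant column: everything falls into a single bin
        return [0 for _ in values]
    width = (hi - lo) / n_bins
    # The maximum would otherwise land in bin n_bins, so clamp it to the last bin.
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


exp = [2, 3, 1, 5, 2]  # the exp column of the employee data
exp_bins = equal_width_bins(exp, 3)
print(exp_bins)        # [0, 1, 0, 2, 0]
```

With the range 1 to 5 split into three intervals, experience values of 1 and 2 share the lowest bin, 3 falls in the middle bin, and 5 falls in the highest.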