没有合适的资源?快使用搜索试试~ 我知道了~
weka使用介绍机器学习算法
5星 · 超过95%的资源 需积分: 9 13 下载量 95 浏览量
2011-03-07
11:32:15
上传
评论
收藏 416KB PDF 举报
温馨提示
试读
22页
Weka 是实施数据挖掘任务所需的各种机器学习算法的合集。这些算法既可以直接应用到某数据集上,也可以在你自己设计的Java程序调用它们。Weka 包含了下列工具:数据预处理,分类,回归,聚类,关联规则,以及可视化。另外也可以在Weka 的基础上开发新的机器学习。
资源推荐
资源详情
资源评论
WEKA Explorer User Guide
for Version 3-5-8
Richard Kirkby
Eibe Frank
Peter Reutemann
July 14, 2008
c
2002-2008 University of Waikato
Contents
1 Launching WEKA 2
2 The WEKA Explorer 4
2.1 Section Tabs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
2.2 Status Box . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
2.3 Log Button . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
2.4 WEKA Status Icon . . . . . . . . . . . . . . . . . . . . . . . . . . 5
2.5 Graphical output . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
3 Preprocessing 6
3.1 Loading Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
3.2 The Curre nt Relation . . . . . . . . . . . . . . . . . . . . . . . . 6
3.3 Working With Attributes . . . . . . . . . . . . . . . . . . . . . . 7
3.4 Working With Filters . . . . . . . . . . . . . . . . . . . . . . . . 8
4 Classification 10
4.1 Selecting a Classifier . . . . . . . . . . . . . . . . . . . . . . . . . 10
4.2 Test O ptio ns . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
4.3 The Class Attribute . . . . . . . . . . . . . . . . . . . . . . . . . 11
4.4 Training a Classifier . . . . . . . . . . . . . . . . . . . . . . . . . 12
4.5 The Class ifier Output Text . . . . . . . . . . . . . . . . . . . . . 12
4.6 The Result List . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
5 Clustering 14
5.1 Selecting a Clusterer . . . . . . . . . . . . . . . . . . . . . . . . . 14
5.2 Cluster Modes . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
5.3 Ignoring Attributes . . . . . . . . . . . . . . . . . . . . . . . . . . 14
5.4 Working with Filters . . . . . . . . . . . . . . . . . . . . . . . . . 15
5.5 Learning Clusters . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
6 Associating 16
6.1 Setting Up . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
6.2 Learning Associations . . . . . . . . . . . . . . . . . . . . . . . . 16
7 Selecting Attributes 17
7.1 Searching and Evaluating . . . . . . . . . . . . . . . . . . . . . . 17
7.2 Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
7.3 Performing Selection . . . . . . . . . . . . . . . . . . . . . . . . . 17
8 Visualizing 19
8.1 The scatter plot matrix . . . . . . . . . . . . . . . . . . . . . . . 19
8.2 Selecting an individual 2D scatter plot . . . . . . . . . . . . . . . 19
8.3 Selecting Instances . . . . . . . . . . . . . . . . . . . . . . . . . . 20
1
1 Launching WEKA
The new menu-driven GUI in WEKA (class weka.gui.Main) succeeds the old
GUI Choo ser (class weka.gui.GUIChooser). Its MDI (“multiple document in-
terface”) appearance makes it easier to keep track of all the open windows. If
one prefers an SDI (“single document interface”) driven layout, one can invoke
this with option -gui sdi on the commandline.
The menu consists of six sections:
1. Program
• LogWindow Opens a log window that captures all that is printed
to stdout or stderr. Useful for e nvironments like MS Windows,
where WEKA is normally not started from a terminal.
• Exit Closes WEKA.
2. Applications Lists the main applications within WEKA.
• Explorer An environment for exploring data with WEKA (the
rest o f this documentation deals with this application in more
detail).
• Experimenter An environment for performing experiments and
conducting statistical tests between learning schemes.
• KnowledgeFlow This environment supports essentially the same
functions as the Ex plorer but with a dr ag-and-drop interface. One
advantage is that it supports incremental learning.
• SimpleCLI Provides a simple command-line interface that allows
direct execution of WEKA commands for operating systems that
do not provide their own command line interface.
3. Tools O ther useful a pplications.
• ArffViewer An MDI application for viewing ARFF files in
spreadsheet format.
• SqlViewer represents an SQL worksheet, for querying databases
via JDBC.
4. Visualization Ways of vis ualizing data with WEKA.
• Plot For plotting a 2D plot of a dataset.
• ROC Displays a previously saved ROC curve.
2
• TreeVisualizer For displaying directed graphs, e.g., a decision
tree.
• GraphVisualizer Visualizes XML BIF or DOT format graphs,
e.g., for Bayesian networks.
• BoundaryVisualizer Allows the visualization o f class ifier deci-
sion boundaries in two dimensions.
5. Windows All open windows are listed here.
• Minimize Minimizes all current windows.
• Restore Restores all minimized windows again.
6. Help Online resourc es for WEKA can be found here.
• Weka homepage Opens a browser window with WEKA’s home-
page.
• Online documentation Directs to the WekaDoc Wiki [4].
• HOWTOs, code snippets, etc. The general WekaWiki [3], con-
taining lots of examples and HOWTOs around the development and
use of WEKA.
• Weka on Sourceforge WEKA’s project homepage on Sourceforge.net.
• SystemInfo Lists some internals about the Java /WEKA environ-
ment, e.g., the CLASSPATH.
• About The infamous “About” box.
To make it easy for the user to add new functionality to the menu with-
out having to modify the code of WEKA itself, the GUI now offers a plugin
mechanism for such add-ons. Due to the inherent dynamic class discovery, plu-
gins only need to implement the weka.gui.MainMenuExtension interface and
WEKA notified of the package they reside in to be displayed in the menu un-
der “Extensio ns” (this extra menu appear s automatically as soon as extensions
are discovered). More details can be found in the Wiki article “Extensions for
Weka’s main GUI” [6].
If you launch WEKA from a terminal window, some text begins scrolling
in the terminal. Ignore this text unless something goes wrong, in which case it
can help in tracking down the cause (the LogWindow from the Program menu
displays that information as well).
This User Manual, which is also available online on the WekaDoc Wiki [4],
fo c uses on using the Explorer but does not explain the individual data prepro-
cessing tools a nd learning algorithms in WEKA. For more information on the
various filters and learning methods in WEKA, see the book Data Mining [2].
3
2 The WEKA Explorer
2.1 Section Tabs
At the very top of the window, just be low the title bar, is a row of tabs. When
the Explorer is first started only the first tab is active; the others are greyed
out. This is because it is necessary to open (and potentially pre- process) a data
set before starting to explore the data.
The tabs are as follows:
1. Preprocess. Choose and modify the data being acted on.
2. Classify. Train and test lear ning schemes that classify or perform regres-
sion.
3. Cluster. Learn clusters for the data.
4. Associate. Lear n association rules for the data.
5. Select attributes. Select the most relevant attributes in the data.
6. Visualize. View an interactive 2D plot of the data.
Once the tabs are active, clicking on them flicks between different scree ns , on
which the respec tive actions can be performed. The bottom area of the window
(including the status box, the log button, and the Weka bird) stays visible
regardless of which section you are in.
The Explorer can be eas ily extended with custom tabs. The Wiki article
“Adding tabs in the Explorer” [7] explains this in detail.
2.2 Status Box
The status box appears at the very bottom of the window. It displays messages
that keep you infor med about what’s g oing o n. For example, if the Explorer is
busy loading a file, the status box will say that.
TIP—right-clicking the mouse anywhere inside the status box brings up a
little menu. The menu gives two options:
1. Memory information. Display in the log box the amount of memory
available to WEKA.
2. Run garbage collector. For c e the Java garbage collector to sea rch for
memory that is no longer needed and free it up, allowing more memory
for new tasks. Note that the garbage collector is constantly running as a
background task anyway.
2.3 Log Button
Clicking on this button brings up a se parate window containing a scro llable text
field. Each line of text is stamped with the time it was entered into the log. As
you pe rform actions in WEKA, the log keeps a reco rd of what has happened.
For people using the command line or the SimpleCLI, the log now also contains
the full setup strings for classification, clustering, attribute selection, etc., so
4
剩余21页未读,继续阅读
资源评论
- thetonyapple2013-05-18还好。。不过想找的是详细介绍weka里面的算法的书额?请问哪里有啊?
- 奥格瑞泽-孟2013-05-09比较详细的资料
xuwei1989404
- 粉丝: 0
- 资源: 3
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功