Machine Learning 1: 81-106, 1986
© 1986 Kluwer Academic Publishers, Boston - Manufactured in The Netherlands

Induction of Decision Trees

J.R. QUINLAN (munnari!nswitgould.oz!quinlan@seismo.css.gov)
Centre for Advanced Computing Sciences, New South Wales Institute of Technology, Sydney 2007, Australia

(Received August 1, 1985)
Key words: classification, induction, decision trees, information theory, knowledge acquisition, expert systems
Abstract. The technology for building knowledge-based systems by inductive inference from examples has been demonstrated successfully in several practical applications. This paper summarizes an approach to synthesizing decision trees that has been used in a variety of systems, and it describes one such system, ID3, in detail. Results from recent studies show ways in which the methodology can be modified to deal with information that is noisy and/or incomplete. A reported shortcoming of the basic algorithm is discussed and two means of overcoming it are compared. The paper concludes with illustrations of current research directions.
1. Introduction

Since artificial intelligence first achieved recognition as a discipline in the mid 1950's, machine learning has been a central research area. Two reasons can be given for this prominence. The ability to learn is a hallmark of intelligent behavior, so any attempt to understand intelligence as a phenomenon must include an understanding of learning. More concretely, learning provides a potential methodology for building high-performance systems.

Research on learning is made up of diverse subfields. At one extreme there are adaptive systems that monitor their own performance and attempt to improve it by adjusting internal parameters. This approach, characteristic of a large proportion of the early learning work, produced self-improving programs for playing games (Samuel, 1967), balancing poles (Michie, 1982), solving problems (Quinlan, 1969) and many other domains. A quite different approach sees learning as the acquisition of structured knowledge in the form of concepts (Hunt, 1962; Winston, 1975), discrimination nets (Feigenbaum and Simon, 1963), or production rules (Buchanan, 1978).
The practical importance of machine learning of this latter kind has been underlined by the advent of knowledge-based expert systems. As their name suggests, these systems are powered by knowledge that is represented explicitly rather than being implicit in algorithms. The knowledge needed to drive the pioneering expert systems was codified through protracted interaction between a domain specialist and a knowledge engineer. While the typical rate of knowledge elucidation by this method is a few rules per man day, an expert system for a complex task may require hundreds or even thousands of such rules. It is obvious that the interview approach to knowledge acquisition cannot keep pace with the burgeoning demand for expert systems; Feigenbaum (1981) terms this the 'bottleneck' problem. This perception has stimulated the investigation of machine learning methods as a means of explicating knowledge (Michie, 1983).
This paper focusses on one microcosm of machine learning and on a family of learning systems that have been used to build knowledge-based systems of a simple kind. Section 2 outlines the features of this family and introduces its members. All these systems address the same task of inducing decision trees from examples. After a more complete specification of this task, one system (ID3) is described in detail in Section 4. Sections 5 and 6 present extensions to ID3 that enable it to cope with noisy and incomplete information. A review of a central facet of the induction algorithm reveals possible improvements that are set out in Section 7. The paper concludes with two novel initiatives that give some idea of the directions in which the family may grow.
2. The TDIDT family of learning systems

Carbonell, Michalski and Mitchell (1983) identify three principal dimensions along which machine learning systems can be classified:

• the underlying learning strategies used;
• the representation of knowledge acquired by the system; and
• the application domain of the system.
This paper is concerned with a family of learning systems that have strong common bonds in these dimensions. Taking these features in reverse order, the application domain of these systems is not limited to any particular area of intellectual activity such as Chemistry or Chess; they can be applied to any such area. While they are thus general-purpose systems, the applications that they address all involve classification. The product of learning is a piece of procedural knowledge that can assign a hitherto-unseen object to one of a specified number of disjoint classes. Examples of classification tasks are:
1. the diagnosis of a medical condition from symptoms, in which the classes could be either the various disease states or the possible therapies;
2. determining the game-theoretic value of a chess position, with the classes won for white, lost for white, and drawn; and
3. deciding from atmospheric observations whether a severe thunderstorm is unlikely, possible or probable.
It might appear that classification tasks are only a minuscule subset of procedural tasks, but even activities such as robot planning can be recast as classification problems (Dechter and Michie, 1985).
The members of this family are sharply characterized by their representation of acquired knowledge as decision trees. This is a relatively simple knowledge formalism that lacks the expressive power of semantic networks or other first-order representations. As a consequence of this simplicity, the learning methodologies used in the TDIDT family are considerably less complex than those employed in systems that can express the results of their learning in a more powerful language. Nevertheless, it is still possible to generate knowledge in the form of decision trees that is capable of solving difficult problems of practical significance.
The underlying strategy is non-incremental learning from examples. The systems are presented with a set of cases relevant to a classification task and develop a decision tree from the top down, guided by frequency information in the examples but not by the particular order in which the examples are given. This contrasts with incremental methods such as that employed in MARVIN (Sammut, 1985), in which a dialog is carried on with an instructor to 'debug' partially correct concepts, and that used by Winston (1975), in which examples are analyzed one at a time, each producing a small change in the developing concept; in both of these systems, the order in which examples are presented is most important.
The systems described here search for patterns in the given examples and so must be able to examine and re-examine all of them at many stages during learning. Other well-known programs that share this data-driven approach include BACON (Langley, Bradshaw and Simon, 1983) and INDUCE (Michalski, 1980).
In summary, then, the systems described here develop decision trees for classification tasks. These trees are constructed beginning with the root of the tree and proceeding down to its leaves. The family's palindromic name emphasizes that its members carry out the Top-Down Induction of Decision Trees.
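The skeleton of this top-down scheme is easy to state. The Python sketch below is illustrative only, not code from any of the systems named here; the (test, branches) tree representation and the pluggable choose_attribute criterion are assumptions of the sketch.

    from collections import Counter

    def majority_class(examples):
        """Most frequent class label among (values, label) pairs."""
        return Counter(label for _, label in examples).most_common(1)[0][0]

    def form_tree(examples, attributes, choose_attribute):
        """Grow a decision tree from the root down.

        examples         -- list of (values, label) pairs; values maps
                            attribute name -> discrete value
        attributes       -- attributes still available for testing
        choose_attribute -- the selection criterion a particular family
                            member uses, e.g. ID3's information measure
        """
        classes = {label for _, label in examples}
        if len(classes) == 1:            # all examples in one class: a leaf
            return classes.pop()
        if not attributes:               # no tests left: take majority class
            return majority_class(examples)
        best = choose_attribute(examples, attributes)
        branches = {}
        for v in {values[best] for values, _ in examples}:
            subset = [(values, label) for values, label in examples
                      if values[best] == v]
            rest = [a for a in attributes if a != best]
            branches[v] = form_tree(subset, rest, choose_attribute)
        return (best, branches)          # internal node: test plus subtrees

Note that the examples are consulted as a whole at every level; nothing in the procedure depends on the order in which they were presented.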
The example objects from which a classification rule is developed are known only through their values of a set of properties or attributes, and the decision trees in turn are expressed in terms of these same attributes. The examples themselves can be assembled in two ways. They might come from an existing database that forms a history of observations, such as patient records in some area of medicine that have accumulated at a diagnosis center. Objects of this kind give a reliable statistical picture but, since they are not organized in any way, they may be redundant or omit uncommon cases that have not been encountered during the period of record-keeping. On the other hand, the objects might be a carefully culled set of tutorial examples prepared by a domain expert, each with some particular relevance to a complete and correct classification rule. The expert might take pains to avoid redundancy and to include examples of rare cases. While the family of systems will deal with collections of either kind in a satisfactory way, it should be mentioned that earlier TDIDT systems were designed with the 'historical record' approach in mind, but all systems described here are now often used with tutorial sets (Michie, 1985).

[Figure 1. The TDIDT family tree.]
Figure 1 shows a family tree of the TDIDT systems. The patriarch of this family is Hunt's Concept Learning System framework (Hunt, Marin and Stone, 1966). CLS constructs a decision tree that attempts to minimize the cost of classifying an object. This cost has components of two types: the measurement cost of determining the value of property A exhibited by the object, and the misclassification cost of deciding that the object belongs to class J when its real class is K. CLS uses a lookahead strategy similar to minimax. At each stage, CLS explores the space of possible decision trees to a fixed depth, chooses an action to minimize cost in this limited space, then moves one level down in the tree. Depending on the depth of lookahead chosen, CLS can require a substantial amount of computation, but has been able to unearth subtle patterns in the objects shown to it.
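Hunt's own formulation is not reproduced here, but the flavour of the fixed-depth search can be suggested in a few lines. The sketch below is one interpretation of the description just given, with assumed cost tables measure_cost[a] and misclass_cost[j][k] (the cost of deciding class j when the real class is k); it estimates expected cost per object.

    def leaf_cost(examples, misclass_cost):
        """Cheapest expected per-object cost of assigning a single class."""
        classes = {label for _, label in examples}
        return min(sum(misclass_cost[j][label] for _, label in examples)
                   for j in classes) / len(examples)

    def lookahead_cost(examples, attributes, depth, measure_cost, misclass_cost):
        """Least expected cost reachable within `depth` further tests."""
        best = leaf_cost(examples, misclass_cost)     # option: stop testing now
        if depth == 0 or not attributes:
            return best
        for a in attributes:
            rest = [x for x in attributes if x != a]
            cost = measure_cost[a]                    # pay to measure attribute a
            for v in {values[a] for values, _ in examples}:
                subset = [(values, label) for values, label in examples
                          if values[a] == v]
                cost += (len(subset) / len(examples)) * lookahead_cost(
                    subset, rest, depth - 1, measure_cost, misclass_cost)
            best = min(best, cost)
        return best

    # CLS would install at the current node the test whose estimate is lowest,
    # then repeat the whole exploration one level further down the tree.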
ID3 (Quinlan, 1979, 1983a) is one of a series of programs developed from CLS in response to a challenging induction task posed by Donald Michie, viz. to decide from pattern-based features alone whether a particular chess position in the King-Rook vs King-Knight endgame is lost for the Knight's side in a fixed number of ply. A full description of ID3 appears in Section 4, so it is sufficient to note here that it embeds a tree-building method in an iterative outer shell, and replaces the cost-driven lookahead of CLS with an information-driven evaluation function.
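That evaluation function is developed in Section 4; as an orientation, the sketch below gives the familiar two-class information-gain measure commonly associated with ID3. The function names and the '+'/'-' labelling are assumptions of this sketch.

    from math import log2

    def info(p, n):
        """Expected information of a set with p positive and n negative members."""
        return sum(-f * log2(f) for f in (p / (p + n), n / (p + n)) if f > 0)

    def gain(examples, attribute):
        """Information gained by first testing `attribute` on (values, label)
        pairs whose labels are '+' or '-'."""
        p = sum(1 for _, label in examples if label == '+')
        n = len(examples) - p
        remainder = 0.0
        for v in {values[attribute] for values, _ in examples}:
            subset = [label for values, label in examples
                      if values[attribute] == v]
            pv = subset.count('+')
            # weight each branch's information by the fraction of examples it gets
            remainder += (len(subset) / (p + n)) * info(pv, len(subset) - pv)
        return info(p, n) - remainder

A criterion of this kind, supplied as choose_attribute to the tree-growing skeleton sketched in Section 2, selects at each node the attribute whose test gains the most information.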
ACLS (Paterson and Niblett, 1983) is a generalization of ID3. CLS and ID3 both require that each property used to describe objects has only values from a specified set. In addition to properties of this type, ACLS permits properties that have unrestricted integer values. The capacity to deal with attributes of this kind has allowed ACLS to be applied to difficult tasks such as image recognition (Shepherd, 1983).
ASSISTANT (Kononenko, Bratko and Roskar, 1984) also acknowledges ID3 as its direct ancestor. It differs from ID3 in many ways, some of which are discussed in detail in later sections. ASSISTANT further generalizes on the integer-valued attributes of ACLS by permitting attributes with continuous (real) values. Rather than insisting that the classes be disjoint, ASSISTANT allows them to form a hierarchy, so that one class may be a finer division of another. ASSISTANT does not form a decision tree iteratively in the manner of ID3, but does include algorithms for choosing a 'good' training set from the objects available. ASSISTANT has been used in several medical domains with promising results.
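The mechanism for admitting real-valued attributes is not spelled out above; one common device, sketched here as an assumption rather than as a description of ASSISTANT itself, is to convert such an attribute into a binary test against a threshold, trying the midpoints between adjacent observed values as candidate cuts.

    def candidate_thresholds(values):
        """Midpoints between adjacent distinct sorted values of a real attribute."""
        distinct = sorted(set(values))
        return [(a + b) / 2 for a, b in zip(distinct, distinct[1:])]

    def best_threshold(examples, attribute, split_score):
        """Choose the cut maximizing split_score (a hypothetical scoring
        function, e.g. the information gained by the derived binary
        attribute 'value <= threshold')."""
        observed = [values[attribute] for values, _ in examples]
        return max(candidate_thresholds(observed),
                   key=lambda t: split_score(examples, attribute, t))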
The bottom-most three systems in the figure are commercial derivatives of ACLS. While they do not significantly advance the underlying theory, they incorporate many user-friendly innovations and utilities that expedite the task of generating and using decision trees. They all have industrial successes to their credit. Westinghouse Electric's Water Reactor Division, for example, points to a fuel-enrichment application in which the company was able to boost revenue by 'more than ten million dollars per annum' through the use of one of them.¹
3. The induction task

We now give a more precise statement of the induction task. The basis is a universe of objects that are described in terms of a collection of attributes. Each attribute measures some important feature of an object and will be limited here to taking a (usually small) set of discrete, mutually exclusive values.
For example, if the objects were Saturday mornings and the classification task involved the weather, attributes might be

    outlook, with values {sunny, overcast, rain}
    temperature, with values {cool, mild, hot}
    humidity, with values {high, normal}
    windy, with values {true, false}
Taken together, the attributes provide a zeroth-order language for characterizing objects in the universe. A particular Saturday morning might be described as

    outlook: overcast
    temperature: cool
    humidity: normal
    windy: false
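Such a description is naturally rendered as a set of attribute-value pairs. The fragment below is an assumed encoding, not one prescribed by the paper: the universe's attributes and the morning above written directly as Python dictionaries.

    attributes = {
        'outlook':     {'sunny', 'overcast', 'rain'},
        'temperature': {'cool', 'mild', 'hot'},
        'humidity':    {'high', 'normal'},
        'windy':       {'true', 'false'},
    }

    saturday_morning = {
        'outlook':     'overcast',
        'temperature': 'cool',
        'humidity':    'normal',
        'windy':       'false',
    }

    # each value must come from its attribute's small, discrete value set
    assert all(saturday_morning[a] in legal for a, legal in attributes.items())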
¹ Letter cited in the journal Expert Systems (January, 1985), p. 20.