Journal of Machine Learning Research ? (2008) 1-?? Submitted 4/08; Published ??/08
BNT STRUCTURE LEARNING PACKAGE:
Documentation and Experiments
Olivier C.H. FRANCOIS Francois.Olivier.C.H@gmail.com
Sustainable Urban Environments Research Division, URS building, Room 3n38,
Whiteknights, PO Box 219, Reading, RG6 6AW, UK
http://ofrancois.tuxfamily.org
Philippe LERAY Philippe.Leray@univ-nantes.fr
Ecole Polytechnique de l’Université de Nantes,
Laboratoire d’Informatique de Nantes Atlantique,
Rue Christian Pauc, BP 50609, 44306 Nantes Cedex 3
http://www.polytech.univ-nantes.fr/COD/?Pages_personnelles:Philippe_Leray
Editor: Leslie Pack Kaelbling
Abstract
Bayesian networks are a formalism for probabilistic reasoning that has become increasingly popular for tasks such as classification in data mining. In some situations the structure of the Bayesian network can be provided by an expert; if not, recovering it automatically from a database of cases is an NP-hard problem, notably because of the size of the search space. Over the last decade, numerous methods have been introduced to learn the network structure automatically, either by restricting the search space or by exploring it with heuristics. Most of these methods handle completely observed data, but some can also deal with incomplete data. The Bayes Net Toolbox for Matlab, introduced by Murphy (2004), offers functions for both using and learning Bayesian networks, but its structure learning facilities are not state of the art. This is why we propose the SLP package.
Keywords: Bayesian Networks, Structure Learning, Classification
1. Introduction
Bayesian networks are probabilistic graphical models introduced by Kim and Pearl (1987), Lauritzen and Spiegelhalter (1988), Jensen (1996), and Jordan (1998).
Definition 1. $B = (G, \theta)$ is a discrete Bayesian network (or belief network) if $G = (X, E)$ is a directed acyclic graph (DAG) whose nodes represent a set of random variables $X = \{X_1, \dots, X_n\}$, and if $\theta_i = [P(X_i \mid X_{Pa(X_i)})]$ is the matrix containing the conditional probability of node $i$ given the state of its parents $Pa(X_i)$.
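As an illustration of Definition 1, consider the classic four-node sprinkler network (Cloudy, Sprinkler, Rain, WetGrass). With the Bayes Net Toolbox, such a network can be encoded by giving $G$ as an adjacency matrix and each $\theta_i$ as a tabular conditional probability distribution; the following is a minimal sketch, assuming BNT is installed and on the Matlab path, with the conditional probability values of the standard BNT tutorial example.

% G: Cloudy -> Sprinkler, Cloudy -> Rain, Sprinkler -> WetGrass, Rain -> WetGrass
N = 4;  C = 1;  S = 2;  R = 3;  W = 4;
dag = zeros(N, N);
dag(C, [S R]) = 1;  dag(S, W) = 1;  dag(R, W) = 1;
node_sizes = 2 * ones(1, N);        % all variables are binary
bnet = mk_bnet(dag, node_sizes, 'names', {'C', 'S', 'R', 'W'});
% theta_i: tabular CPDs P(X_i | X_Pa(X_i))
bnet.CPD{C} = tabular_CPD(bnet, C, [0.5 0.5]);
bnet.CPD{S} = tabular_CPD(bnet, S, [0.5 0.9 0.5 0.1]);
bnet.CPD{R} = tabular_CPD(bnet, R, [0.8 0.2 0.2 0.8]);
bnet.CPD{W} = tabular_CPD(bnet, W, [1 0.1 0.1 0.01 0 0.9 0.9 0.99]);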
A Bayesian network B represents a probability distribution over X which admits the
following joint distribution decomposition:
\[
P(X_1, X_2, \dots, X_n) = \prod_{i=1}^{n} P(X_i \mid X_{Pa(X_i)}) \qquad (1)
\]
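For instance, for the sprinkler network encoded above, decomposition (1) reads
\[
P(C, S, R, W) = P(C)\, P(S \mid C)\, P(R \mid C)\, P(W \mid S, R),
\]
so the joint distribution over the $2^4$ configurations is specified by only $1 + 2 + 2 + 4 = 9$ independent parameters instead of $2^4 - 1 = 15$.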
This decomposition enables the use of powerful inference algorithms, thanks to which Bayesian networks have become simple modeling and reasoning tools when the situation is