灰太狼优化进化内核极限学习机：在破产预测中的应用资源-CSDN文库

189 浏览量 2021-03-27 14:22:24 上传评论收藏 2.48MB PDF 举报

这篇文章的主题是关于一种名为“灰太狼优化进化内核极限学习机（Grey Wolf Optimization Evolving Kernel Extreme Learning Machine，简称GWO-KELM）”的新模型在破产预测领域的应用。该研究由Mingjing Wang、Huiling Chen等人撰写，发表在2017年的《工程应用人工智能》杂志上。知识点一：内核极限学习机（Kernel Extreme Learning Machine，KELM）内核极限学习机是一种用于分类和回归任务的机器学习方法，它是基于单隐藏层前馈神经网络（Single-hidden Layer Feedforward Neural Networks，SLFNs）的一个改进。KELM通过引入核函数，能有效地解决非线性问题，且训练速度快，泛化能力强。知识点二：参数调优（Parameter Tuning）在机器学习模型中，参数调优是指寻找模型最优参数的过程，以期达到最佳的性能表现。参数的选择对模型的准确性和效率有着决定性的影响，因此调优是一个重要的步骤。调优方法包括网格搜索（Grid Search）、遗传算法（Genetic Algorithm）、粒子群优化算法（Particle Swarm Optimization）等。知识点三：灰太狼优化算法（Grey Wolf Optimization，GWO）灰太狼优化算法是一种模拟自然界灰狼社会等级和狩猎行为的新型群智能算法。在该算法中，灰狼群的领导者是“阿尔法狼”，其下是“贝塔狼”和“德尔塔狼”，它们构成了群体的统治层。算法通过模拟灰狼群捕猎过程中的追踪、围攻和攻击等行为，用于解决优化问题。知识点四：破产预测（Bankruptcy Prediction）破产预测是财务分析领域的一个重要问题，它涉及到预测企业是否会发生财务失败（破产）。利用机器学习和数据挖掘技术进行破产预测，可以提前发出预警信号，帮助投资者、债权人和管理者做出更好的决策。知识点五：10折交叉验证（10-Fold Cross Validation） 10折交叉验证是一种统计分析方法，用于评估数据模型的泛化能力。在10折交叉验证中，数据集被随机分为10个子集，用9个子集训练模型，剩下的1个子集用来测试，这个过程重复10次，每次使用不同的测试集。通过10次验证的平均结果，可以估算模型在未知数据上的表现。知识点六：受试者工作特征曲线（Receiver Operating Characteristic Curve，ROC Curve） ROC曲线是反映敏感性和特异性连续变量的评价指标，它通过将真正率（True Positive Rate）和假正率（False Positive Rate）之间的关系绘制为曲线来展示分类器的性能。曲线下面积（Area Under the Curve，AUC）越大，分类器的分类能力越好。知识点七：研究论文中的实验设计与结果分析文章在破产预测的研究中，将提出的GWO-KELM模型与其他三种KELM方法（基于粒子群优化、遗传算法和网格搜索技术的KELM）以及极限学习机、改进的极限学习机、支持向量机和随机森林等其他模型进行了对比实验。实验基于两组真实数据集，并采用10折交叉验证分析方法。结果显示，GWO-KELM模型在分类准确性、第一类错误率、第二类错误率、ROC曲线下的面积以及计算时间等方面均优于其他方法，从而验证了该模型在破产预测任务中的优越性和高效性。从上述内容可以看出，这篇文章通过提出一种创新的参数调优策略，结合了灰太狼优化算法和内核极限学习机，开发出了一种新的破产预测模型。这项研究对于金融风险评估领域具有重要的实际应用价值，同时也为后续相关领域的研究提供了新的思路和方法。

资源推荐

资源详情

资源评论

Grey wolf optimization evolving kernel extreme learning machine:

Application to bankruptcy prediction

Mingjing Wang

, Huiling Chen

, Huaizhong Li

, Zhennao Cai

, Xuehua Zhao

Changfei Tong

, Jun Li

, Xin Xu

College of Physics and Electronic Information Engineering, Wenzhou University, 325035 Wenzhou, China

Department of Computing, Lishui University, Lishui 323000, Zhejiang, China

School of Digital Media, Shenzhen Institute of Information Technology, Shenzhen 518172, China

Electric Power Research Institute, State Grid Jilin Electric Power Company Limited, Changchun 130021, China

article info

Article history:

Received 13 May 2016

Received in revised form

17 February 2017

Accepted 9 May 2017

Keywords:

Kernel extreme learning machine

Parameter tuning

Grey wolf optimization

Bankruptcy prediction

abstract

This study proposes a new kernel extreme learning machine (KELM) parameter tuning strategy using a

novel swarm intelligence algorithm called grey wolf optimization (GWO). GWO, which simulates the

social hierarchy and hunting behavior of grey wolves in nature, is adopted to construct an effective KELM

model for bankruptcy prediction. The derived model GWO-KELM is rigorously compared with three

competitive KELM methods, which are typical in a comprehensive set of methods including particle

swarm optimization-based KELM, genetic algorithm-based KELM, grid-search technique-based KELM,

extreme learning machine, improved extreme learning machine, support vector machines and random

forest, on two real-life datasets via 10-fold cross validation analysis. Results obtained clearly conﬁrm the

superiority of the developed model in terms of classiﬁcation accuracy (training, validation, test), Type I

error, Type II error, area under the receiver operating characteristic curve (AUC) criterion as well as

computational time. Therefore, the proposed GWO-KELM prediction model is promising to serve as a

powerful early warning tool with excellent performance for bankruptcy prediction.

1. Introduction

Due to ﬁnancial crisis all over the world, company bankruptcy

predication attracts signi ﬁcant attention for ﬁnancial institutions.

It is important for enterprises to build a trustworthy and accurate

early warning system to predicate potential risk of company's

bankruptcy beforehand.

Bankruptcy predication generally forms a binary classiﬁcation

that needs to be resolved in a rational approach. The output result

generated from the classiﬁcation models has two types, namely,

type 1 represents a company with bankruptcy and type 0 other-

wise. Input values of the classiﬁcation models are often ﬁnancial

statistic ratios derived from credible ﬁnancial statements in the

real enterprises. So far, considerable amount of classiﬁcation

models based on different domain knowledge has been proposed

for bankruptcy prediction. In general, the proposed predication

models can be classiﬁed as statistical approaches or artiﬁcial in-

telligence methods (AI).

A great deal of typical statistical approaches that are

constructed for bankruptcy prediction models apply simple uni-

variate analysis (Beaver, 1966), multivariate discriminant analysis

(Altman, 1968), logistic regression (Ohlson, 1980) and factor ana-

lysis (West, 1985). Recently, AI methods are drawing more atten-

tion for failure prediction. Approaches that are based on the AI

means, such as artiﬁcial neural networks (ANN) (Atiya, 2001a),

support vector machines (SVM) (Min and Lee, 2005; Shin et al.,

2005), k-nearest neighbor (KNN) approach (Chen et al., 2011c),

Bayesian network models (Sarkar and Sriram, 2001; Sun and

Shenoy, 2007), extreme learning machine and ensemble methods

(Fedorova et al., 2013; Abellán and Mantas, 2014), as well as dif-

ferent hybrid approaches, have been widely used in ﬁnancial area.

Reddy and Ravi (2013) constructed two novel kernels based soft

computing techniques for classiﬁcation task. The experimental

results indicated that the proposed approaches could perform well

for bankruptcy prediction. Sharma et al. (2013) successfully pro-

posed a hybrid algorithm based on ant colony optimization and

Nelder-Mead simplex for training neural networks with an appli-

cation to bankruptcy prediction. Paramjeet and Ravi (2011) mod-

iﬁed bacterial foraging technique to train wavelet neural network

in order to predict bankruptcy in banks. A hybrid approach based

on differential evolution and radial basis function network

(DERBF) proposed by Naveen et al. (2010) was applied to

Contents lists available at ScienceDirect

journal homepage: www. elsevier.com/locate/engappai

Engineering Applications of Artiﬁcial Intelligence

http://dx.doi.org/10.1016/j.engappai.2017.05.003

Corresponding author.

E-mail address: chenhuiling.jlu@gmail.com (H. Chen).

Engineering Applications of Artiﬁcial Intelligence 63 (2017) 54–68

bankruptcy prediction. The results showed that DERBF had a good

performance of generalization on bank bankruptcy datasets.

Chauhan et al. (2009) employed differential evolution algorithm to

train wavelet neural network (DEWNN), predicting the bankruptcy

in bank s. The results on the four bankruptcy datasets revealed that

the DEWNN was obviously superior to other existed methods. Ravi

and Pramodh (2008) proposed a new architecture called principal

component neural network (PCNN) applied to bankruptcy pre-

diction problem in commercial banks. It is inferred that the pro-

posed PCNN hybrids outperformed other classiﬁers on the bank-

ruptcy dataset. A new neural network architecture kernel principal

component neural network (KPCNN) trained by threshold ac-

cepting was presented in Ravisankar and Ravi (2009). Its applica-

tion to bankruptcy prediction in banks reveled that KPCNN yields

comparable results with all the techniques. Vasu and Ravi (2011)

proposed new principal component analysis-wavelet neural net-

work hybrid (PCATAWNN) architecture trained by threshold ac-

cepting algorithm to predict bankruptcy in banks. The experi-

mental results showed that the PCATAWNN could convincingly

outperformed other techniques in terms of area under ROC curve

(AUC) in 10-fold cross-validation. In all of the employed methods,

ANN (Tsai and Wu, 2008; Atiya, 2001b; Zhang et al., 1999) has

become more and more popular for ﬁnancial prediction, thanks to

its prominent ability to capture the nonlinearity relationship that

exists between different features in real data set. Nevertheless, it is

worth to point out that traditional ANN learning methods, such as

the back-propagation approach, are based on the gradient descent

strategy which may result in local optimum. Furthermore, it is

generally required that a fair amount of network parameters be

tuned.

In order to avoid ANN's drawbacks, Huang et al. proposed a

new machine learning paradigm named extreme learning ma-

chine (ELM) (Huang et al., 2006). ELM is a representative learning

model of neural network named after single hidden layer feed-

forward neural networks (SLFNs). The hidden biases and input

weights in this method can be randomly generated, and the output

weights are mathematically determined using Moore-Penrose

(MP) generalized inverse. It is well-known that the universal ap-

proximation can reﬂect the approximation capabilities of the

neural networks. The approximation capabilities of multilayer

feedforward networks were proved by Hornik (1991), namely, no-

constant bounded continuous activation functions and continuous

mappings could be approximated in measure by neural networks.

Leshno et al. (1993) advocated that continuous functions could be

approximated by feedforward networks with a non-polynomial

activation function. Guang-Bin and Babri (1998) proposed that

SLFNs with N hidden nodes and almost nonlinear activation

function could exactly learn N distinct observations. Due to its

classiﬁcation performance, ELM has been adopted in ﬁelds such as

image classiﬁcation (Cao et al., 2016a; Jun et al., 2011), disease

diagnosis (Chen et al., 2015; Zhang et al., 2007), and engineering

application (Cao et al., 2016b, 2015 ). In addition, methods based on

ELM have also been widely applied in ﬁ nancial areas such as

bankruptcy prediction (Yu et al., 2014), corporate life cycle pre-

diction (Lin et al., 2013) and corporate credit ratings (Zhong et al.,

2014). One limitation of ELM, nevertheless, is that the randomly

assigned input weights can increase the variations of accuracies

obtained by classi ﬁers in multiple trials. In order to overcome this

limitation, Huang et al. (2012) proposes an extension version of

ELM, namely, kernel extreme learning machine (KELM), whose

connection weights between hidden layers and input are not ne-

cessary. Compared with ELM, KELM can achieve comparative or

more excellent property with faster training speed and much ea-

sier implementation in applications such as hyperspectral remote-

sensing image classiﬁcation (Pal et al., 2013; Chen et al., 2014),

activity recognition (Deng et al., 2014), 2-D proﬁles reconstruction

(Liu et al., 2014), disease diagnosis (Chen et al., 2016) and fault

diagnosis (Jiang et al., 2014).

We recently applied the KELM to bankruptcy prediction's issue

(Zhao et al., 2017), and obtained better performance than other

ﬁve competitive approaches including SVM, ELM, random forest

(RF), particle swarm optimization boosted fuzzy KNN, and Logit

model on the same real data set. Nevertheless, it should be noticed

that the two signiﬁcant parameters in KELM with RBF kernel are

kernel penalty parameter C and bandwidth

.Ccontrols the trade-

off between the model complexity and the ﬁtting error mini-

mization, while

deﬁnes the non-linear mapping from the input

space to some high-dimensional feature space. Several studies

have illustrated that these two parameters have an important ef-

fect on KELM's performance, similar to that in SVM. Thus, these

two key parameters must be properly set prior to its application to

realistic problems. These parameters are traditionally obtained

using the grid-search method whose main drawback, however, is

that it is easy to be trapped in a local optimum. Presently, it has

been shown that biologically-inspired methods (such as the ge-

netic algorithm (Liu et al., 2014), particle swarm optimization

(PSO) (Zhang and Yuan, 2015), and artiﬁcial bee colony (Ma et al.,

2016 ) are more likely to ﬁnd the global-best solution than the

grid-search method. As a new member in the nature-inspired

methods, Grey Wolf Optimizer (GWO) (Mirjalili et al., 2014) mi-

mics the social hierarchy and hunting behavior of grey wolves in

nature. The main traits of GWO are social hierarchy, encircling

prey, hunting, attacking prey (exploitation), and search for prey

(exploration).

Due to its good search ability, GWO has been applied in a var-

ious ﬁelds. Muangkote et al. (2014) used the GWO with improve-

ments to training q-Gaussian Radial Basis Functional-link nets

neural networks. The experimental result indicated that the pro-

posed algorithm obtained competitive performance comparing

with other meta-heuristic methods. Komaki and Kayvanfar (2015)

successfully applied GWO for the two-stage assembly ﬂow shop

scheduling problem with release time to greatly improve the ef-

ﬁciency. Sulaiman et al. (2015) used GWO to solve optimal reactive

power dispatch problem. Mirjalili (2015) employed GWO to train

multi-layer perceptron and eight standard datasets including ﬁve

classiﬁcation and three function-approximation datasets ware

evaluated. The results demonstrated that a high level of accuracy

in classiﬁcation and approximation of the proposed trainer could

be obtained. However, to the best of our knowledge, the potential

of GWO has not been explored to ﬁne tune the optimal parameters

appeared in KELM. Therefore, this study aims at exploring the

GWO technique's ability to address KELM's model selection pro-

blem for classiﬁcation, and further applying the resulted model

GWO-KELM to successfully and effectively predict company

bankruptcy. For veriﬁcation purpose, the effectiveness and efﬁ-

ciency of the proposed GWO-KELM is compared against the

common methods such as grid-search optimized KELM (GS-

KELM), genetic algorithm optimized KELM (GA-KELM), particle

swarm optimization optimized KELM (PSO-KELM) and other four

advanced machine learning methods including original ELM, self-

adaptive evolutionary extreme learning machine proposed by Cao

et al. (2012) (SaE-ELM), SVM and RF on the real-life ﬁnancial da-

taset. All methods are compared in terms of the training accuracy,

the validation accuracy and test accuracy, Type I error, Type II error

and the area under the receive operating characteristic curve

(AUC) criterion. For the stability of the results, the cross validation

(CV) strategy is also adopted including external 10-fold CV and the

inner 5-fold CV. The experimental results show that our proposed

methodology, GWO-KELM, performs better when compared with

some other well-known common methods. The main contribution

of this study can be summarized as follows:

M. Wang et al. / Engineering Applications of Artiﬁcial Intelligence 63 (2017) 54–68 55

a) A new nature-inspired method, GWO is successfully employed

to resolve the parameters optimization for KELM for the ﬁrst

time.

b) A potential model, GWO-KELM, is successfully applied to

bankruptcy prediction with the purpose of being treated as a

potential early warning tool for bankruptcy in the ﬁnancial

ﬁeld.

c) The proposed GWO-KELM tends to achieve better classiﬁca-

tion, and generate more stable and robust results in dis-

criminating the bankrupt companies from the healthy ones

when compared to several other methods.

The remainder of this paper is structured as following: Section

2 presents a brief description of GWO. Section 3 explains the de-

tailed implementation of the GWO-KELM methodology. In Section

4, the details of the experimental designs are elaborated. The ex-

perimental results are presented in Section 5. Conclusions are ﬁ-

nally summarized in Section 6.

2. Grey wolf optimization (GWO)

Recently, a new swarm intelligence optimization algorithm

called GWO was introduced by Mirjalili et al. (2014). This creative

algorithm actually simulates the social hierarchy and hunting be-

havior of grey wolves in nature. For modeling the social hierarchy

behavior of grey wolf, the group is divided into four parts: alpha

(

), beta (

), delta (

), and omega (

) as shown in Fig. 1.

is considered to be the best ﬁttest solution followed by

and

, respectively, and the rest of solutions are belonging to the

The ﬁrst three ﬁttest wolves that are closest to the prey are

and

who guide

to search prey in promising search areas.

During encircling grey, the wolves update their position sur-

rounding

,or

as shown in Eqs. (1) and (2):

→

⋅

→

−

→

()

DCXXt

→

(+ )=

→

−

→

⋅

→

()

Xt X AD1

where t is the current iteration,

→

(

)

indicates the current position

of prey and

→

(

means the current position of a wolf.

→

is the

distance between wolves and prey, and coefﬁcient vectors

→

and

→

are mathematically given as follows.

→

→→

−

→

()

Aara2

→

()

Cr2

where

→

and

→

are two vectors generating between [0, 1] ran-

domly, the element of

→

is linearly decreasing from 2 to 0 at each

process of iteration. In the GWO algorithm,

, and

are always

assumed to be likely near the position of the prey. During the

process of hunting, the ﬁrst three best solutions achieved so far in

terms of

, and

are saved and remained, then the remaining

wolves such as

which are capable of re-position according to the

ﬁrst three best wolves. The positions of wolves are updated ac-

cording to the Eqs. (5)–(11):

→

⋅

→

−

→

()

αα

DCXX

→

⋅

→

−

→

()

ββ

DCXX

→

⋅

→

−

→

()

δδ

DCXX

()

→

−

→

⋅

→

()

αα

XXAD

()

→

−

→

⋅

→

()

ββ

XXAD

()

→

−

→

⋅

→

()

δδ

XXAD

→

(+ )=

→

()

XXX

123

where

→

shows the position of

→

indicates the position of

→

shows the position of

→

shows the position of current solu-

tion,

→

and

→

are vectors generating randomly. The approx-

imate distance between the current solution and

and

are

calculated according the Eqs. (5)–(7). Eqs. (8)–(11) calculate the

ﬁnal position of the current solution after deﬁning the distance.

Where

→

and

→

are random vectors, and t indicates the

number of iteration. As we may see from the above equations, the

step size of

wolves running after

, and

are deﬁned by Eqs.

Fig. 1. The social hierarchy of grey wolves.

M. Wang et al. / Engineering Applications of Artiﬁcial Intelligence 63 (2017) 54–6856

剩余14页未读，继续阅读

评论收藏

内容反馈

weixin_38601499

粉丝: 2
资源: 938

灰太狼优化进化内核极限学习机：在破产预测中的应用

灰太狼优化器，用于无人作战飞机路径规划

SaDE_ELM_SaDE-ELM_极限学习机_进化算法预测_elm优化_ELM预测_源码.zip

SaDE_ELM_SaDE-ELM_极限学习机_进化算法预测_elm优化_ELM预测.zip

【ELM回归预测】基于差分进化算法优化极限学习机实现数据回归预测附matlab代码(DE-ELM)+.zip

【ELM回归预测】基于差分进化算法优化极限学习机实现数据回归预测附matlab代码(DE-ELM)+运行结果.zip

极端学习机：理论和应用

基于灰狼算法优化核极限学习机GWO-KELM时间序列预测，GWO-KELM时间序列预测，matlab代码 模型评价指标包括:

灰狼算法(GWO)优化核极限学习机(KELM)的分类预测，多特征输入模型 GWO-KELM分类预测模型 多特征输入单输出的二

基于灰狼算法(GWO)优化混合核极限学习机HKELM回归预测， GWO-HKELM数据回归预测，多变量输入模型 优化参数为H

【DELM预测】基于灰狼算法改进深度学习极限学习机实现数据预测附matlab代码.zip

改进的增强拉格朗日函数，带有改进的灰太狼优化算法，用于约束优化问题

锅拍灰太狼图片素材

js实现锅打灰太狼小游戏

锅打灰太狼（源码，内含图片）

灰太狼快递单打印软件V935.rar

灰狼算法(GWO)优化极限学习机ELM回归预测,GWO-ELM回归预测，多变量输入模型 评价指标包括:R2、MAE、MSE、

基于GWO-HKELM灰狼算法优化混合核极限学习机多变量回归预测（matlab完整源码和数据）

基于深度学习的企业破产预测研究.pdf

灰狼优化算法优化极限学习机(GWO-ELM)回归预测（Matlab完整源码和数据)

游戏 喜羊羊大战灰太狼源代码

锅拍灰太狼js代码实现

HTML5锅打灰太狼网页版游戏.zip

狂拍灰太狼练习源码和素材.rar

HTML5锅打灰太狼网页版游戏源码 HTML5pottoplaythewebversion.rar

MATLAB实现GWO-ELM灰狼优化算法优化极限学习机时间序列预测（完整源码和数据)

灰狼算法(GWO)优化极限学习机(ELM)的分类预测，多特征输入模型 GWO-ELM分类预测模型 多特征输入单输出的二分类及

最新资源

基于灰狼算法优化核极限学习机GWO-KELM时间序列预测，GWO-KELM时间序列预测，matlab代码模型评价指标包括:

灰狼算法(GWO)优化核极限学习机(KELM)的分类预测，多特征输入模型 GWO-KELM分类预测模型多特征输入单输出的二

基于灰狼算法(GWO)优化混合核极限学习机HKELM回归预测， GWO-HKELM数据回归预测，多变量输入模型优化参数为H

灰狼算法(GWO)优化极限学习机ELM回归预测,GWO-ELM回归预测，多变量输入模型评价指标包括:R2、MAE、MSE、

游戏喜羊羊大战灰太狼源代码

灰狼算法(GWO)优化极限学习机(ELM)的分类预测，多特征输入模型 GWO-ELM分类预测模型多特征输入单输出的二分类及