562 M.J. Beynon, M.J. Peel / Omega 29 (2001) 561–576
not need preliminary or additional information about data,
such as probability distributions in statistics, basic probabil-
ity assignment in the Dempster Shafer theory of evidence,
or grade of membership of the value of possibility in fuzzy
set theory’ [20, p. 89].
RST incorporates the use of indiscernibility (equivalence)
relations to approximate sets of objects by upper and lower
set approximations and, as noted by Slowinski and Zopouni-
dis [21, p. 79], ‘it is a formal framework for discovering
deterministic and non-deterministic rules from a given rep-
resentation of knowledge ... [it] ... assumes knowledge rep-
resentation in a decision table form which is a special case
of an information system’. Initial RST applications focused
on medical diagnosis, drug research and process control
[22,23], but more recently it has been extended to cover
credit fraud detection, stock market rule-generation, market
research, climate change and the development of expert sys-
tems for the NASA space centre [24,25,20].
Slowinski and Zopounidis [21] also investigated the use
of RST to assess the risk of a Greek bank’s clients (rms) in
terms of granting nance. Although they did not examine the
predictive accuracy of the RST rules, they did conclude [21,
p. 39] that (based on nancial ratios and other rm-specic
variables), RST ‘is a useful tool for discovering a prefer-
ential attitude of the decision maker in multi-attribute sort-
ing problems’. More recently, Dimitras et al. [26, p. 278]
reported that (on the basis of nancial ratios) a rough set
approach to predicting between failed and non-failed Greek
rms ‘was generally better than those obtained by classical
discriminant and logit models’. A limitation of these studies
is that the continuous data used to derive the rough set rules,
have been discretised (a requirement of RST) with the aid
of a selected ‘expert’. Clearly dierent experts may proer
dierent views and the operational costs and complexities
of using RST (and related techniques) will increase when
there is over-reliance on an expert. In this context An et
al. [27, p. 647] have stated that ‘It has to be emphasised ...
that the question of how to optimally discretise the attribute
(variable) values, is unsolved, and a subject of on-going
research’. This paper therefore employs a new (and more
objective) discretisation method, namely the FUSINTER
technique. However, the motivation for data discretisation
extends beyond the requirements of RST, to include dis-
cretising data of an imprecise quality (‘noisy’ data). The
ability to formulate rules from interval data (via discreti-
sation) may also facilitate a more informed understanding
of the interaction of the characteristics of objects. In this
context, it is of interest to note that, even with regard to
traditional statistical estimators (logit=discriminant analy-
sis), it has recently been advocated that continuous vari-
ables (nancial ratios) should be rank-transformed to im-
prove their distributional properties in a failure prediction
setting [28].
A further RST innovation has been the development by
Ziarko [29] of a variable precision rough sets (VPRS) model,
which incorporates probabilistic decision rules. This is an
important extension, since as noted by Kattan and Cooper
[30, p. 468], when discussing computer based decision tech-
niques in a corporate failure setting, ‘In real world decision
making, the patterns of classes often overlap, suggesting
that predictor information may be incomplete... This lack of
information results in probabilistic decision making, where
perfect prediction accuracy is not expected’.
An et al. [27] applied VPRS (which they termed
‘Enhanced RST’) to generating probabilistic rules to pre-
dict the demand for water. Relative to the traditional rough
set approach, VPRS has the additional desirable property of
allowing for partial classication compared to the complete
classication required by RST. More specically, when
an object is classied using RST it is assumed that there
is complete certainty that it is a correct classication. In
contrast, VPRS facilitates a degree of condence in clas-
sication, invoking a more informed analysis of the data,
which is achieved through the use of a majority inclusion
relation [29].
This paper extends previous work by providing an em-
pirical exposition of VPRS, where we present the results of
an experiment which applies VPRS rules to the corporate
failure decision. In addition, we mitigate the impact of us-
ing the subjective views of an expert (as employed in previ-
ous studies) to discretise the data, by utilising the sophisti-
cated FUSINTER discretisation technique which is applied
to a selection of attributes (variables) relating to companies’
nancial and non-nancial characteristics. The discretised
data, in conjunction with other nominal attributes, are then
used in this new VPRS framework to identify rules to clas-
sify companies in a failure setting.
To facilitate a comparison of our experimental VPRS re-
sults with those of existing techniques, we present the pre-
dictive ability of classical statistical methods—logit anal-
ysis and MDA—together with two more closely related
non-parametric decision-tree methods, RPA and the Elysee
method, which utilises ordinal discriminant analysis (see
[15,31], for an exposition of these methods). However in
the spirit of previous experimental research—and more par-
ticularly the previous failure prediction study of Frydman
et al. [15, p. 239], who concluded that ‘we feel that the at-
tributes of new techniques like RPA can be presented and
evaluated in a rigorous framework without the necessity of
proving its absolute superiority over existing procedures’—
the comparative classication results are not meant to be
denitive, but rather to illustrate the potential of VPRS. In
this context, research on the criteria to select the most e-
cacious and parsimonious set of VPRS rules (for predictive
purposes) is still in its infancy [27].
The remainder of the paper is organised as follows: The
next section gives a brief exposition of the VPRS method
and a discussion of the FUSINTER discretisation method.
The results of the empirical experiments are then reported,
including a discussion of the predictive ability of VPRS
relative to other existing parametric and non-parametric
methods.