Semi-supervised learning is an important method that uses
an unlabelled dataset to improve the final result of the learning
algorithm. Many semi-supervised learning methods have been
proposed in BCI. Tu et al. [14] used the unlabelled dataset to
generate regular items for the spatial filter, which makes the
model more robust and adaptive to the testing set. Jiannan
Meng et al. [15] first developed a base model from the tagged
dataset and then applied it to the unlabelled dataset. Based on
the results and linear projection score of the unlabelled dataset,
they produced a new model. Chen et al. [16] designed a new
self-training method to select the most effective unlabelled
data for the extended training set. C ompared with classic
methods, these methods improve the classification perfor-
mance with a shorter calibration time. However, the improve-
ment is not obvious. Moreover, these methods focus on the
classifica tion of two types of movements. These methods
would be better evaluated in a four category motion imagery
situation.
In addition to the semi-supervised approach, other methods
have also been proposed to improve accuracy when using a
small training set. Lotte et al. [13, 17] designed a method of
generating artificial signals from an existing training set to
construct a large datas et. Lotte’s method obtained a much
better result than the semi-supervised study. However, this
result was also obtained using the two-motor classification
approach.
Limited by the size of the training dataset, the current
method that directly reduces the calibration time using the
user ’s own training data does not perform well. It is very
difficult to achieve good training results just by using a
small number of samples (such as 10 samples per type of
action). Therefore, using data from other subjects to im-
prove t he recognition result is an importa nt alternative
way of training the BCI.
The manner in which to take advantage of datasets from
other subjects is of increasing interest to researchers. The two
main ways to do so are through knowledge transfer and data
fusion. Knowledge transfer refers to the transfer of knowledge
from an existing dataset to new individuals. In the BCI sys-
tem, compared with the traditional common spatial patter
(CSP), the regularized common spatial pattern (RCSP) is an
effective method of knowledge transfer. The RCSP reduces
the over-fitting of the CSP filters based on the existing knowl-
edge [18–20]. Lu et al. [21] directly used the covariance of
EEG data from multiple subjects to develop a regularization
term. H. Kang et al. [22] adopted a weighted average method
to develop CSP regularization terms. Lotte et al. [13, 23, 24]
evaluated the distance between the covariance of EEG data
from different individuals using the Riemann distance and
designed the regular term of the RCSP based on this distance.
Tu et al. [25] calculated the CSP filters from multiple subjects
and applied a cross-validation method to obtain the best one
from the CSP filters from other people.
In addition to knowledge transfer, data fusion is another
way to use data from multiple subjects. The two main types
of fusion methods are the pooled data method and the ensem-
ble method. The pooled data method establishes the training
set directly from the data of multiple subjects. The ensemble
method establishes a model for the data from each subject and
then assembles these models into a final model. The key prob-
lem with each of these approaches is how to combine the
results of these models. Fazli et al. [26] divided the EEG signal
from multiple subjects into different frequency bands and then
trained the classification models for each of the frequency
bands. By choosing the optimal frequency bands for each
dataset, they established a user-independent motion recogni-
tion model. Tu et al. [27] designed a meta-classifier for the
results of each dataset and then assembled the outputs of the
models from multiple subjects.
However, these methods only take data from other subjects
as a reference, constrain criteria for the new subject’smodel,
or directly establish a user-independent model with a dataset
from other subjects. The former methods have a shortage of
insufficient information mining, while the latter methods ig-
nore the essen tial differences among different in dividuals.
Moreover, these methods are all evaluated in a two-category
motor classification problem. Once the type of motion in-
creases, these methods can no longer address the noises and
biases. To solve these problems, this paper focuses on the
analysis of the differences in the EEG signals from different
su
bjects. We eliminated the differences as much as possible
and exploited the commonalities between the EEG signals
from different individuals. When people execute or imagine
a motion, the corresponding area in the brain is activated.
These event-related phenomena present variations on specific
frequencies of the ongoing EEG activity. These variations
may either be a decrease or an increase in power in the given
frequency bands. The former case is called event-related
desynchronization (ERD) and the latter is called event-
related synchronization (ERS) [28]. In our research, the dif-
ference between the EEG signals from each user was divided
into the following: 1) the mismatch of the channel position for
different subjects; 2) the intensity of the difference of ERD/
ERS, and 3) the frequency band differences of ERD/ERS.
This study focused on eliminating the first two differences.
The commonly used bandpass filter effectively reduces the
impact of ERD/ERS band differences. Furthermore, for the
purpose of making full use of the data from multiple subjects,
we did not use each dataset separately but combined datasets
from multiple subjects and the data from the target user to-
gether. Hence, the final model has both the target user’sown
characteristics and the characteristics of multiple other sub-
jects. In addition, a confidence score for the fusion of models
from different users was developed. The confidence score was
based on the linear projection and projection variance of the
LDA. By using the developed confidence score, the final
Med Biol Eng Comput