to improve the performance of recognition, very deep networks were verified in [29]. Chan et
al. [9] proposed a simple yet powerful deep network using multi-layered principal component
analysis to enrich the discriminative ability during feature exacting. This algorithm was
considered as a baseline for deep-learning based face recognition on some famous face
databases, e.g., AR, extend Yale-B, CMU PIE and LFW datasets. These deep-learning based
approaches provide end-to-end solutions for big data application scenario. However, there are
two shortcomings for deep learning based face recognition algorithms: GPU cluster which is
essential for deep-learning optimization and large-scale date resources are always not available
for general researchers [2, 3, 17]. On the other hand, designing and optimizing these complex
networks are always time-consuming and labor-intensive. Above two shortcomings limit the
large scale extensive applications of deep-learning based approaches, especially in resource-
limited scenarios, e.g., mobile computing, autonomous robots.
In fact, developing efficient and accurate face recognition algorithms are still challenging
tasks. Especially, when the face images under complex conditions e.g., facial disguise, noise,
and expression or illumination changes, performances of feature extraction and classification
are both degraded. E. Candes et al. [8] theoretically proved that low rank minimization can be
used to rectify the defects in an observation matrix by decomposing it into two parts: low-rank
clear part and noise part. Thus this low-rank part of observation matrix would play more
important role in recognition against noise. Du et al. [11] proposed a low-rank constrained
sparse representation algorithm to enhance both feature representation and classification
abilities. However, it suffered from high computational complexity of low-rank and sparse
representation. Considering discriminations of traditional classifiers, e.g., K-Nearest Neighbor
(KNN) [34] and Support Vector Machine (SVM) [12], they are in efficient computing
manners, but just with limited accuracy. Some complex models including deep-learning based
approaches, e.g. FaceIDs [29, 30] and PCANet [9], achieve impressive performance, but they
are time-consuming to train their complex networks. Recently, a novel type of single hidden
layer feed forward network, called Bextreme learning machine (ELM)^ offers a fast training
paradigm for machine learning with strong generalization ability. ELM looks reasonable for
accelerating training process in face recognition scenario. But [19] had pointed out that even
though the ELM had advantages of low computational complexity and generalization ability,
its prediction accuracy was sensitive to the noise in the input data. Thus, when the training or
test data is noisy, the prediction performance of ELM drops dramatically.
To promote recognition performance with both high efficiency and accuracy [44], in this
paper, we propose a novel low-rank supported extreme learning machine termed LSELM,
aiming at extracting robust features, with fast learning manner and real application-oriented.
By the merit of low-rank recovery, the contaminated input images can be exactly separated
into clear inherent content and noise part. With those low-rank parts, the outliers would be
transferred into their inherent contents which bring discriminative ability of feature represen-
tation. Furthermore, ELM is utilized to accelerate training phase to get fast training results.
Consequently, by the low-rank content supported, the novel LSELM is robust to input noise
and time-efficient for face recognition. The main contributions of the proposed approach can
be summarized as follow: (1) We use low-rank recovery to decompose the input images into
inherent part and noise part, including variable illumination, disguise, and expression changes.
The low-rank part brings robustness of feature representation. (2) Low-rank supported extreme
learning machine, not only promote the robust representation ability, but also in an efficient
training manner. It ensures that the computational complexity of the proposed algorithm is
lower than other traditional approaches. (3) The proposed three-layered architecture is easy to
Multimed Tools Appl (2018) 77:11219–11240 11221