Traffic Sign Recognition Using Kernel Extreme
Learning Machines With Deep Perceptual Features
Yujun Zeng, Xin Xu, Senior Member, IEEE, Dayong Shen, Yuqiang Fang, and Zhipeng Xiao
Abstract—Traffic sign recognition plays an important role in autonomous vehicles as well as advanced driver assistance systems. Although various methods have been developed, it is still difficult for state-of-the-art algorithms to obtain high recognition precision at low computational cost. In this paper, based on an investigation of the influence that color spaces have on the representation learning of convolutional neural networks, a novel traffic sign recognition approach called DP-KELM is proposed, which uses a kernel-based extreme learning machine (KELM) classifier with deep perceptual features. Unlike previous approaches, the representation learning process in DP-KELM is carried out in the perceptual Lab color space. Based on the learned deep perceptual features, a kernel-based ELM classifier is trained with high computational efficiency and good generalization performance. In experiments on the German traffic sign recognition benchmark, the proposed method is shown to achieve higher precision than most state-of-the-art approaches. In particular, compared with the hinge loss stochastic gradient descent method, which has the highest precision, the proposed method achieves a comparable recognition rate at a significantly lower computational cost.
Index Terms—Traffic sign recognition, convolutional neural network, extreme learning machine, kernel, color space, Lab.
I. INTRODUCTION
DRIVEN by the development of driver assistance systems (DAS) and autonomous vehicles, traffic sign recognition (TSR) has received considerable attention, since it is necessary to automatically provide timely information about traffic signs for safe driving [1]. TSR is also beneficial for tasks such as traffic sign monitoring and maintenance. In the past decade, traffic sign recognition has become an important research topic not only in intelligent transportation systems but also in the pattern recognition community. Factors such as changeable viewpoints, motion blur, partial occlusion, color distortion, and contrast degradation make TSR a challenging problem.
Manuscript received November 9, 2015; revised August 2, 2016; accepted
September 17, 2016. This work was supported in part by the National
Natural Science Foundation of China (NSFC) under Grant 91220301 and
Grant 61375050 and in part by the Joint Innovation Foundation between NSFC
and Chinese Automobile Industry under Grant U1564214. The Associate
Editor for this paper was J. Zhang.
Y. Zeng, X. Xu, Y. Fang, and Z. Xiao are with the College of Mechatronics
and Automation, National University of Defense Technology, Changsha
410073, China (e-mail: xinxu@nudt.edu.cn).
D. Shen is with the College of Information Systems and Management,
National University of Defense Technology, Changsha 410073, China.
Color versions of one or more of the figures in this paper are available
online at http://ieeexplore.ieee.org.
Digital Object Identifier 10.1109/TITS.2016.2614916
As in other typical pattern recognition tasks, the accuracy of traffic sign recognition depends mainly on the feature extractor and the classifier. Earlier TSR methods generally share a similar scheme that combines hand-crafted features with conventional classifiers. Even though many hand-crafted features have been designed and integrated with classifiers such as support vector machines (SVMs) [2], [3], random forests [4], and extreme learning machines (ELMs) [5], it remains difficult to deal with the increasing diversity and variability of traffic signs, and the recognition performance is far from satisfactory. Vondrick et al. [6] demonstrated that hand-crafted features such as histograms of oriented gradients (HOG) [7] are not discriminative enough: samples from different classes can often appear similar in the hand-crafted feature space.
With the growth of massive databases and high-performance computing hardware (e.g., graphics processing units, GPUs), deep neural networks (DNNs) [8]–[10] have gradually demonstrated outstanding feature-learning capabilities. In contrast with hand-crafted features, deep features are learned automatically and can better capture the essential information contained in massive data. As a consequence, DNN-based methods have obtained state-of-the-art results in many pattern classification tasks.
As a representative DNN model, the convolutional neural network (CNN) was inspired by research on biological visual systems [11]. Representative CNN models were proposed by Fukushima [12] and LeCun [13], both of which tried to imitate the perceptual mechanism of the human visual cortex and could learn more discriminative features. Recently, CNNs have been used to tackle TSR tasks and some promising results have been obtained [14]–[17]. However, a CNN is a deep multi-layer neural network that is usually trained by the back-propagation (BP) algorithm, and the local minima problem of BP limits the generalization capability of the fully connected layers. Moreover, to obtain state-of-the-art performance, CNN-based algorithms usually suffer from a huge computational burden because they require either a rather deep single CNN or an ensemble of multiple CNNs. Besides, current CNN-based TSR approaches usually process images in the RGB color space, where issues such as information coupling and non-uniform color distribution can have a negative effect on the feature learning process of the CNN.
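As a concrete illustration of the alternative preprocessing considered here, the sketch below converts an RGB traffic sign image into the perceptual Lab color space using scikit-image. The library choice and the channel rescaling constants are illustrative assumptions, not the exact pipeline of this paper.

```python
import numpy as np
from skimage import color  # assumed dependency for the RGB-to-Lab conversion


def rgb_to_lab(rgb_image):
    """Convert an H x W x 3 uint8 RGB image to the Lab color space.

    A minimal sketch: rgb2lab returns L in [0, 100] and a, b roughly in
    [-128, 127]; the rescaling below (an illustrative choice) brings the
    channels to comparable ranges before feeding them to a CNN.
    """
    lab = color.rgb2lab(rgb_image.astype(np.float64) / 255.0)
    lab[..., 0] /= 100.0   # lightness channel L
    lab[..., 1:] /= 128.0  # chromatic channels a and b
    return lab
```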
In this paper, based on an investigation of the influence that color spaces have on the representation learning of CNNs, we propose a novel TSR method named DP-KELM, which uses a kernel ELM classifier with deep perceptual (DP) features.
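To make the classification stage concrete, the following sketch implements the standard kernel ELM formulation with a Gaussian kernel, in which the output weights have the closed-form solution alpha = (I/C + Omega)^{-1} T. The kernel choice and the hyperparameter values (gamma, C) are illustrative assumptions rather than the settings used in this work.

```python
import numpy as np


def rbf_kernel(A, B, gamma=0.1):
    """Gaussian (RBF) kernel matrix between the rows of A and the rows of B."""
    sq_dists = (A ** 2).sum(axis=1)[:, None] + (B ** 2).sum(axis=1)[None, :] - 2.0 * A @ B.T
    return np.exp(-gamma * sq_dists)


def kelm_train(features, targets, C=100.0, gamma=0.1):
    """Closed-form KELM training: alpha = (I/C + Omega)^{-1} T.

    features: N x d matrix of deep perceptual features
    targets:  N x c one-hot class label matrix
    """
    n = features.shape[0]
    omega = rbf_kernel(features, features, gamma)
    alpha = np.linalg.solve(np.eye(n) / C + omega, targets)
    return alpha


def kelm_predict(train_features, alpha, test_features, gamma=0.1):
    """Predict class indices as the argmax of K(x, X_train) @ alpha."""
    k = rbf_kernel(test_features, train_features, gamma)
    return np.argmax(k @ alpha, axis=1)
```

Because training reduces to a single regularized linear solve in the kernel space, no iterative back-propagation over the classifier is needed, which is the source of the computational efficiency claimed above.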