Robust Vehicle Detection in Aerial Images Based on Image Spatial Pyramid Detection Model
Xianghui Li¹ and Xinde Li*²
Abstract— Vehicle detection in high-resolution aerial images obtained by unmanned aerial vehicles (UAVs) has wide application in traffic surveillance. Recently, many detectors based on convolutional neural networks (CNNs) have achieved great success in object detection. However, they find it difficult to perform efficiently on aerial images, because the significant variation in target size caused by altitude changes of the UAV platform makes precise localization challenging. To improve detection performance on aerial images, we propose an Image Spatial Pyramid Detection Model (ISPDM), which mainly consists of two stages. In the first stage, we divide the image into several patches and select some of them with an image patch selection process. In the second stage, we utilize YOLOv3 to detect vehicles in the original image along with the selected patches and obtain the final result with an integrated decision-making algorithm. Finally, the superiority of the proposed algorithm is demonstrated through extensive experiments comparing it with other solutions for vehicle detection in high-resolution aerial images.
I. INTRODUCTION
Vehicle detection in aerial images has become increasingly popular in traffic surveillance [1]–[5] because of the fast and flexible deployment of unmanned aerial vehicles (UAVs). Traditional vehicle detection algorithms are mainly based on handcrafted descriptors, including Local Binary Patterns (LBP), Haar features, the Scale-Invariant Feature Transform (SIFT) and the Histogram of Oriented Gradients (HOG) [6]–[10]. The authors of [7] proposed a boosting HOG descriptor to characterize vehicle shape and appearance and utilized a linear Support Vector Machine (SVM) to distinguish vehicles from the background. Moranduzzo and Melgani [10] utilized different descriptors to conduct vehicle detection in UAV imagery and observed that the integration of the original SIFT features with color and morphological features gave the best performance. A combination of multiple features including HOG, LBP and opponent histograms was proposed in [9] to detect cars in aerial images.
However, the complex background makes it difficult for handcrafted features to characterize objects precisely. Recently these features have been outperformed by convolutional neural networks (CNNs) [4]. Region-based detectors in the R-CNN family, such as Faster R-CNN [11]–[13], obtain higher precision but are more time-consuming than SSD [14] and the YOLO series [15]–[17]. Although these methods achieve state-of-the-art performance in object detection, it is difficult for them to achieve the same performance on aerial images because of the significant variation in target size: these detectors struggle with the precise localization of vehicles at different scales.

¹Xianghui Li is with the Key Laboratory of Measurement and Control of CSE, Ministry of Education, School of Automation, Southeast University, Nanjing 210096, China (e-mail: 230149424@seu.edu.cn).
²Xinde Li is the corresponding author and is with the Key Laboratory of Measurement and Control of CSE, Ministry of Education, School of Automation, and also with the School of Cyber Science and Engineering, Southeast University, Nanjing 210096, China (e-mail: xindeli@seu.edu.cn).
In this paper, we propose an Image Spatial Pyramid Detection Model (ISPDM) to detect vehicles in UAV imagery with high detection accuracy. The framework of ISPDM, shown in Figure 1, mainly consists of two stages. To improve the detection of vehicles at multiple scales, in the first stage we propose an image spatial pyramid that selects the image patches likely to contain vehicles and feeds them to the detection model. In the second stage, an integrated decision-making algorithm fuses the detections on the different image patches to obtain the final results.
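The overall two-stage flow can be sketched in a few lines of Python. The helper names are our own, and pooling all boxes into one list is a stand-in for the paper's integrated decision-making algorithm, not the algorithm itself:

```python
def shift_boxes(boxes, dx, dy):
    """Translate patch-local boxes (x1, y1, x2, y2) back into
    full-image coordinates by adding the patch offset."""
    return [(x1 + dx, y1 + dy, x2 + dx, y2 + dy) for (x1, y1, x2, y2) in boxes]

def ispdm_detect(image, selected_patches, detector):
    """Stage-2 sketch: run `detector` on the original image and on each
    selected patch, then pool every box into one candidate list.  ISPDM
    fuses these candidates with an integrated decision-making algorithm;
    here we simply take their union."""
    boxes = list(detector(image))
    for patch, (dx, dy) in selected_patches:
        boxes.extend(shift_boxes(detector(patch), dx, dy))
    return boxes
```

Here `detector` would wrap a YOLOv3 forward pass that returns boxes in the coordinates of the image or patch it was given.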
II. CONSTRUCTION OF IMAGE SPATIAL
PYRAMID
To achieve better performance in detecting vehicles at different scales, we construct an image spatial pyramid with two layers. The first layer consists of the original image, while in the second layer the image is divided into $n$ patches, so that the second layer can be represented as $X_P = \{x_i\}\ (i = 1, \cdots, n)$. Detection on the original image finds the relatively large vehicles, while detection on the image patches localizes the relatively small vehicles.
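As a concrete sketch, the two pyramid layers can be built by slicing the image into an $n_x \times n_y$ grid. The function below is our own illustration: it assumes the image is a NumPy array and takes the grid size as given, letting the last row and column absorb any remainder pixels:

```python
import numpy as np

def build_pyramid(image, nx, ny):
    """Return the two pyramid layers: the original image, and a list of
    (patch, (x_offset, y_offset)) pairs covering it in an nx-by-ny grid."""
    h, w = image.shape[:2]
    pw, ph = w // nx, h // ny  # nominal patch width/height
    patches = []
    for j in range(ny):
        for i in range(nx):
            x0, y0 = i * pw, j * ph
            x1 = w if i == nx - 1 else (i + 1) * pw  # last column keeps remainder
            y1 = h if j == ny - 1 else (j + 1) * ph  # last row keeps remainder
            patches.append((image[y0:y1, x0:x1], (x0, y0)))
    return image, patches
```

The stored offsets let detections on each patch be mapped back into the coordinates of the original image.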
However, some patches contain no objects, and conducting vehicle detection on them would cost extra computation. Therefore, SURF (Speeded-Up Robust Features) [18] is utilized to remove the patches that, with high probability, contain few vehicles. As shown in Figure 2, the original image is first divided into $n$ patches, where $n = n_x \times n_y$ can be calculated by (1):
\[
\begin{cases}
n_x = \dfrac{I_{width}}{\mathrm{floor}(I_{width} \div i_{width})}\\[4pt]
n_y = \dfrac{I_{height}}{\mathrm{floor}(I_{height} \div i_{height})}
\end{cases}
\tag{1}
\]
where $n_x$ and $n_y$ refer to the number of segmented rows and columns respectively, $I_{width}$ and $I_{height}$ refer to the width and height of the input image, $i_{width}$ and $i_{height}$ refer to the width and height of the input layer of the object detection neural network, and $\mathrm{floor}(\cdot)$ rounds down to the nearest integer. Typically, if $n \le 1$, the image is not segmented and there is only one layer.
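The SURF-based patch filtering described above can be sketched as a simple keypoint-count threshold. Both the threshold value and the generic `count_keypoints` callback are illustrative assumptions: in practice the callback would wrap a SURF detector (e.g. `cv2.xfeatures2d.SURF_create` from opencv-contrib) and return the number of keypoints found in a patch:

```python
def select_patches(patches, count_keypoints, min_keypoints=10):
    """Discard patches unlikely to contain vehicles.  `patches` holds
    (patch, offset) pairs; a patch is kept only if its keypoint count
    reaches the (illustrative) threshold `min_keypoints`."""
    return [(patch, offset) for (patch, offset) in patches
            if count_keypoints(patch) >= min_keypoints]
```

Only the surviving patches are then passed to the detector, saving the computation that empty patches would otherwise cost.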
After the image is divided into n patches, the SURF
feature is extracted in the original image. The amount of
2019 IEEE 4th International Conference on Advanced Robotics and Mechatronics (ICARM)
978-1-7281-0064-7/19/$31.00 ©2019 IEEE