ADrivingBehaviorRecognitionModelwithBi-LSTM.pdf


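### Driving Behavior Recognition Model: Combining Bi-LSTM with a Multi-Scale Convolutional Neural Network

#### Abstract

This paper presents a driving behavior recognition model that combines Bi-LSTM (bi-directional long short-term memory) with a Multi-Scale Convolutional Neural Network (MSCNN). In autonomous driving, accurately perceiving the behavior of surrounding vehicles is essential for the ego-vehicle to make reasonable decisions. To this end, the authors propose a neural network model based on trajectory information.

#### Background

When building a safe and reliable autonomous driving system or Advanced Driver Assistance System (ADAS), perceiving the driving behavior of other vehicles in real time is important for analyzing how the traffic scene evolves and for making reasonable decisions. For example, accurately perceiving the braking or lane-changing behavior of vehicles ahead helps predict potentially dangerous events. Accurate driving behavior recognition can also assist path planning and motion decisions, and it can serve as high-level semantic information that supports trajectory prediction for vehicles or pedestrians.

#### Method

The model has two core components: a Bi-LSTM module and an MSCNN module. The Bi-LSTM module processes the time series in both directions, so the representation of each time step can draw on both earlier and later points of the observed trajectory, which is key to capturing the temporal characteristics of driving behavior. The MSCNN module automatically extracts high-level features that encode rich spatial and temporal information, giving a more complete picture of driving behavior patterns.

#### Model Architecture

- **Input**: the trajectory sequence of one vehicle.
- **Bi-LSTM module**: processes the input trajectory sequence and produces a feature that carries temporal and directional information.
- **MSCNN module**: extracts features automatically with multi-scale convolution kernels to capture behavior patterns at different time scales.
- **Feature fusion**: fuses the features produced by the Bi-LSTM module and the MSCNN module for the final behavior classification.
- **Output**: the behavior category predicted from the fused feature.

#### Experimental Results

The model was evaluated on the public BLVD dataset and achieved satisfying results. This suggests that combining Bi-LSTM and MSCNN can recognize driving behavior effectively without relying on hand-crafted features, providing an important basis for decision-making in autonomous driving systems.

#### Discussion

1. **Model advantages**:
   - **Adaptive feature extraction**: the MSCNN module learns and extracts complex features from trajectory data automatically, with no manual feature design, which improves generalization.
   - **Bi-directional temporal information**: the Bi-LSTM module lets the model use both earlier and later parts of the trajectory sequence, strengthening its grasp of how a driving behavior evolves.
2. **Application scenarios**:
   - **Safety warning systems**: accurately recognizing the behavior of vehicles ahead, such as emergency braking or sudden lane changes, allows the driver to be warned in advance and improves driving safety.
   - **Intelligent traffic management**: recognizing driving behavior patterns can help traffic authorities plan road resources and optimize traffic flow control strategies.
3. **Future work**:
   - **Multi-modal fusion**: explore combining visual information (such as camera images) with trajectory data to further improve model performance.
   - **Complex scenes**: strengthen the model's ability to handle complex traffic environments, such as multi-lane intersections and highway exits.

#### Conclusion

The proposed driving behavior recognition model, which combines Bi-LSTM with MSCNN, has clear application value in autonomous driving. It can recognize complex driving behaviors accurately, extends well to other settings, and may become an important component of future intelligent transportation systems.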
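The multi-scale extraction and feature-fusion steps in the architecture outline above can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation. The kernel sizes, the fixed averaging kernels (standing in for learned convolution filters), the global max pooling, and fusion by concatenation are all assumptions made for illustration, and the names `conv1d_valid`, `multi_scale_features` and `fuse` are hypothetical.

```python
from typing import List, Sequence


def conv1d_valid(signal: Sequence[float], kernel: Sequence[float]) -> List[float]:
    """Slide `kernel` over `signal` with no padding (cross-correlation, as CNN layers do)."""
    k = len(kernel)
    if k == 0 or k > len(signal):
        raise ValueError("kernel must be non-empty and no longer than the signal")
    return [
        sum(signal[i + j] * kernel[j] for j in range(k))
        for i in range(len(signal) - k + 1)
    ]


def multi_scale_features(
    signal: Sequence[float], kernel_sizes: Sequence[int] = (3, 5, 7)
) -> List[float]:
    """Return one global-max-pooled response per kernel size.

    Assumption: fixed averaging kernels stand in for the learned filters of a
    real MSCNN so that the example runs without a deep learning library.
    """
    features = []
    for k in kernel_sizes:
        kernel = [1.0 / k] * k
        features.append(max(conv1d_valid(signal, kernel)))
    return features


def fuse(sequence_feature: Sequence[float], multi_scale_feature: Sequence[float]) -> List[float]:
    """Fuse two feature vectors by concatenation (an assumed fusion operator)."""
    return list(sequence_feature) + list(multi_scale_feature)


if __name__ == "__main__":
    # Hypothetical per-step lateral displacement of one agent, in metres.
    lateral_steps = [0.0, 0.0, 0.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0]
    mscnn_like = multi_scale_features(lateral_steps)
    # Placeholder for the feature vector a Bi-LSTM branch would produce.
    bilstm_like = [0.1, -0.2]
    print(fuse(bilstm_like, mscnn_like))
```

With averaging kernels, a short, sharp movement gives its largest response at the smallest kernel size, while a movement spread evenly over many steps responds about equally at every size. That difference is the intuition behind looking at several temporal scales at once.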


A Driving Behavior Recognition Model with Bi-LSTM and Multi-Scale CNN

He Zhang, Zhixiong Nan*, Tao Yang, Yifan Liu and Nanning Zheng

Abstract— In autonomous driving, perceiving the driving behaviors of surrounding agents is important for the ego-vehicle to make a reasonable decision. In this paper, we propose a neural network model based on trajectory information for driving behavior recognition. Unlike existing trajectory-based methods that recognize the driving behavior using hand-crafted features or by directly encoding the trajectory, our model involves a Multi-Scale Convolutional Neural Network (MSCNN) module to automatically extract high-level features that are expected to encode rich spatial and temporal information. Given a trajectory sequence of an agent as the input, the Bi-directional Long Short-Term Memory (Bi-LSTM) module and the MSCNN module first process the input separately, generating two features; the two features are then fused to classify the behavior of the agent. We evaluate the proposed model on the public BLVD dataset, achieving satisfactory performance.

I. INTRODUCTION

Understanding complex traffic scenarios has recently been widely studied in the autonomous driving community [1]. When constructing a safe and reliable autonomous driving system or Advanced Driver Assistance System (ADAS), it is necessary to perceive the driving behavior of other agents around the autonomous vehicle in real time, in order to analyze the dynamic evolution of the traffic scene and then make a reasonable decision. For example, sensing the braking behavior and the lane-changing behavior of vehicles in front of the autonomous vehicle is significant for predicting possible dangerous events. Meanwhile, accurate recognition of driving behavior can not only assist path planning and motion decisions but also serve as high-level semantics to assist trajectory prediction of vehicles or pedestrians [2], [3]. In this paper, we focus on accurately identifying interactive behavior in the traffic environment, where interactive behavior refers to the movement status of surrounding traffic agents (vehicles, pedestrians, riders, etc.) relative to the ego-vehicle [4]. The driving behavior categories of vehicles around the ego-vehicle are shown in Fig. 1.

Fig. 1. Driving behavior categories

*Corresponding author: Zhixiong Nan nzx2018@xjtu.edu.cn
The authors are with the Institute of Artificial Intelligence and Robotics, Xi’an Jiaotong University, Xi’an, China

In the autonomous driving environment, the trajectory sequence is considered relatively reliable and valuable information for modeling traffic agent behaviors. Due to the complexity and dynamics of real traffic environments, it is challenging to classify the driving behavior. The main challenges are three-fold: 1) Generally, each kind of driving event has a different temporal duration. If we use a big window to split the trajectory into training samples, there may exist multiple kinds of behaviors in a sample; 2) For a fixed temporal window, the number and the behavior types of agents around the ego-vehicle are highly dynamic; 3) There exists a severe imbalance in behavioral data, and only limited training samples are available for most anomalous behavior categories.
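The first challenge can be made concrete with a small sketch. It splits a sequence of per-frame behavior labels into sliding windows and reports the fraction of windows that contain more than one behavior. The label names, sequence length, window sizes and the helpers `split_into_windows` and `mixed_fraction` are hypothetical and chosen only for illustration; they are not taken from the BLVD dataset or from the paper.

```python
from typing import List, Sequence


def split_into_windows(labels: Sequence[str], window: int, stride: int = 1) -> List[List[str]]:
    """Return every full window of `window` consecutive frame labels."""
    if window <= 0 or stride <= 0:
        raise ValueError("window and stride must be positive")
    return [
        list(labels[start:start + window])
        for start in range(0, len(labels) - window + 1, stride)
    ]


def mixed_fraction(labels: Sequence[str], window: int, stride: int = 1) -> float:
    """Fraction of windows that contain more than one behavior label."""
    windows = split_into_windows(labels, window, stride)
    if not windows:
        return 0.0
    mixed = sum(1 for w in windows if len(set(w)) > 1)
    return mixed / len(windows)


# Hypothetical per-frame labels: a short lane change inside straight driving.
frame_labels = ["straight"] * 6 + ["lane_change"] * 3 + ["straight"] * 6

if __name__ == "__main__":
    for size in (3, 8):
        print(size, mixed_fraction(frame_labels, size))
```

In this toy sequence, a window of 3 frames yields mixed samples only near the two behavior boundaries, whereas every window of 8 frames overlaps both behaviors, so no sample has a single clean label.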

Recent progress in Lidar, GPS and visual vehicle detection technologies allows collecting accurate and robust trajectory data, which makes it possible to leverage data-driven methods for the driving behavior recognition task. Existing trajectory-based methods can be generally divided into two types: one constructs a classifier based on hand-crafted features [5]–[10]; the other directly models the trajectory sequence to obtain dynamic evolution rules and then performs the behavior classification [2], [3], [11]. However, both have drawbacks. The former requires domain knowledge to design manual features and generally needs to select different features for different datasets, leading to a lack of generalization across different scenarios. The drawback of the latter is that the original information included in trajectory points may be insufficient, which may lead to under-fitting of the model.
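To make the first type of method concrete, the sketch below computes a few hand-crafted features from a trajectory of (x, y) points. The specific features (mean and maximum speed, net heading change, lateral displacement), the units and the function name `handcrafted_features` are assumptions for illustration; they are not the features used in [5]–[10].

```python
import math
from typing import Dict, Sequence, Tuple


def handcrafted_features(points: Sequence[Tuple[float, float]], dt: float) -> Dict[str, float]:
    """Summarise a trajectory with a few manually designed statistics.

    points: (x, y) positions in metres, sampled every `dt` seconds (assumed units).
    """
    if len(points) < 2 or dt <= 0:
        raise ValueError("need at least two points and a positive time step")

    speeds, headings = [], []
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        dx, dy = x1 - x0, y1 - y0
        speeds.append(math.hypot(dx, dy) / dt)
        headings.append(math.atan2(dy, dx))

    # Wrap the heading difference into (-pi, pi] so a turn across +/-pi stays small.
    raw_change = headings[-1] - headings[0]
    net_heading_change = math.atan2(math.sin(raw_change), math.cos(raw_change))

    return {
        "mean_speed": sum(speeds) / len(speeds),
        "max_speed": max(speeds),
        "net_heading_change": net_heading_change,
        "lateral_displacement": points[-1][1] - points[0][1],
    }


if __name__ == "__main__":
    # Hypothetical trajectory: straight along x, then drifting towards +y.
    print(handcrafted_features([(0, 0), (1, 0), (2, 0), (3, 1)], dt=1.0))
```

Every entry in the returned dictionary encodes a design decision about what matters for a behavior, which is why such feature sets tend to need re-selection when the dataset or scenario changes.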

To overcome the drawbacks of these conventional methods, we propose a neural network model to recognize the driving behavior of surrounding agents. Unlike existing trajectory-based methods that recognize the driving behavior using hand-crafted features or directly encoding the

arXiv:2103.00801v1 [cs.CV] 1 Mar 2021
