Multilevel framework to handle object occlusions for real-time tracking


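Real-time tracking is an important technique in computer vision, widely used in video surveillance, autonomous driving and human-computer interaction. In monocular traffic image sequences, object occlusion is a common problem that seriously degrades tracking accuracy and stability. Occlusion can be caused by other objects, by changes in scene layout or by camera motion, and it leaves the target partly or wholly invisible.

The paper proposes a multilevel framework to handle object occlusion in real-time tracking. The motivation is that different occlusion segmentation methods perform differently, so the authors use a situation-driven approach that combines the strengths of several methods. Occlusions are classified into four categories according to the foreground situation and handled by a multilevel framework. At the intra-frame level, an image segmentation algorithm based on convex hull analysis segments the occlusion; it is built on the compactness ratio and interior distance ratio of the foreground. At the tracking level, an online sample-based classification algorithm is used. Training samples are extracted from the frames before the occlusion and testing samples are extracted from the current frame by an adaptive searching strategy, so that occlusion segmentation becomes an online classification of the testing samples. This algorithm relies on the similarity and coherence of the target's properties between consecutive frames. Experiments on video sequences show good performance under different conditions at low computational cost.

The paper opens by noting that accurate multi-target detection and tracking in video surveillance systems is the foundation of high-level behaviour analysis and event understanding, and that it provides parameters such as target classification, target trajectory, traffic flow, traffic density and lane occupancy rate. Such information is useful to road users and can potentially provide immediate help in emergencies and data to support real-time decisions. The paper stresses the importance of multi-target tracking and occlusion handling in video surveillance and uses experimental data to show that the method is feasible in practice. The multilevel framework handles different types of occlusion and adapts to different tracking scenarios.

The paper also discusses how the visible part of a target, the occluded region, the occlusion type and the change in target properties before and after occlusion relate to one another. Classifying occlusions and handling each class within one framework makes the effect of occlusion on tracking easier to understand and predict, so that tracking stays continuous and accurate while occlusion occurs. The paper cites related literature to position the work within the field, states its background and goals, and explains the theoretical basis and implementation steps of the method. It also points to future directions, such as improving the efficiency and accuracy of the occlusion handling algorithm and applying it to more complex real-world scenes.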


IET Image Processing

Research Article

Multilevel framework to handle object occlusions for real-time tracking

ISSN 1751-9659

Received on 17th March 2016

Revised 24th May 2016

Accepted on 17th June 2016

doi: 10.1049/iet-ipr.2016.0176

www.ietdl.org

Yingfeng Cai, Hai Wang, Xiaobo Chen, Long Chen

Automotive Engineering Research Institute, Jiangsu University, Zhenjiang 212013, People's Republic of China

School of Automotive and Traffic Engineering, Jiangsu University, Zhenjiang 212013, People's Republic of China

E-mail: wanghai1019@163.com

Abstract: This study proposes an efficient method to handle the object occlusions seen in monocular traffic image sequences. The motivation is that different methods perform differently in occlusion segmentation, and the authors’ idea is to use a situation-driven approach that aggregates different methods to obtain good performance. The study classifies occlusion into four categories according to the foreground situation and applies a multilevel occlusion handling framework. First, an image segmentation algorithm based on convex hull analysis is used for intra-frame level occlusion segmentation. This algorithm is built on the compactness ratio and interior distance ratio of the foreground. Second, an online sample-based classification algorithm is used for tracking level occlusion segmentation. Training samples are extracted from the historical frames before occlusion, and testing samples are extracted from the current frame by an adaptive searching strategy. The segmentation of occlusion is thus transformed into the online classification of testing samples. This algorithm is built on the similarity and coherence of the target's properties between consecutive frames. Experiments on video sequences illustrate the good performance of the proposed method under different conditions with low computational cost.

1 Introduction

In video surveillance systems, accurate multi-target detection and tracking is a key foundation of high-level behaviour analysis and event understanding [1]. It provides rich parameters such as target classification, target trajectory, traffic flow, traffic density and lane occupancy rate. Such information is useful to road users and can potentially reduce traffic congestion and enhance road safety [2, 3]. However, in monocular systems, the loss of depth information in the projection of a 3D scene onto a 2D image plane may cause occlusion between objects. In these cases, trackers may well miss the objects, which in turn can lead to wrong estimates of traffic parameters. Hence, occlusion detection and handling methods are usually applied to improve the accuracy of multiple-object tracking [4–6].
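The depth loss described above can be made concrete with a minimal pinhole-projection sketch. The `project` helper, the unit focal length and the two sample points are illustrative assumptions, not taken from the paper. Two points at different depths on the same viewing ray land on the same image coordinates, so in the image the nearer object covers the farther one.

```python
def project(point, focal_length=1.0):
    """Project a 3D camera-frame point (x, y, z) onto the 2D image plane.

    Ideal pinhole model: u = f * x / z, v = f * y / z. The depth z is
    divided out and cannot be recovered from (u, v) alone.
    """
    x, y, z = point
    return (focal_length * x / z, focal_length * y / z)


# Hypothetical points: `far` lies on the same viewing ray as `near`,
# at twice the depth.
near = (1.0, 0.5, 10.0)
far = (2.0, 1.0, 20.0)

print(project(near))
print(project(far))
```

Both calls give the same image coordinates, which is why a monocular tracker needs extra cues (shape, appearance, motion history) to separate overlapping objects.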

1.1 Related work

The 3D/2.5D model has long received great attention in occlusion handling. Pang et al. [7] presented a 2.5D model to handle vehicle occlusion. They added an axis perpendicular to the lane vanishing point to the 2D imaging plane and named the result a 2.5D model. Occluded vehicles can be separated by contour searching even in congested traffic scenes. Song and Nevatia [8] integrated a 3D vehicle model, the camera calibration result and ground plane knowledge into a Bayesian framework to detect and track occluded vehicles by searching for a maximum a posteriori solution. Lou et al. [9] proposed an online method to detect and eliminate occlusion with a 3D vehicle model. The pose of the 3D vehicle model was decomposed into translation and rotation. An improved extended Kalman filter was also proposed to track and predict vehicle motion with a precise kinematics model.

The feature model is a popular method to handle occlusion in region-based tracking. Features such as Gabor, colour, edge and corner features are usually used to represent the object. They are then fed into a classifier or reasoning model to recognise objects. Such a model works well when the matching features remain visible in the occluded objects during tracking. Kanhere and Birchfield [10] used a feature matching method to segment occlusions. Feature points are detected and tracked through the image sequence, and 3D vehicles are reconstructed from these points and a relative height constraint. Gentile et al. [11] studied the relationship between tracking performance and different features of the target. According to this relationship, the most influential features are selected and the object is divided into small blocks. By tracking these blocks, the occluded target can be tracked successfully. Zhu et al. [12] proposed an online sample-based occlusion handling framework based on the similarity of local colour, texture and spatial features in sequential frames. This paper is partly inspired by Zhu's method. Compared with Zhu's method, ours embeds the online sample-based step in a multilevel framework and has lower computational cost.

The part-based model has recently attracted many researchers as a way to recognise objects. Combined with reasoning models or grammar models, the part-based model handles partial occlusion efficiently. Niknejad et al. [13] proposed a two-layer classifier using deformable part models (DPMs) and a conditional random field to detect occluded vehicles. This framework works well in urban environments. Li et al. proposed an AND-OR graph method [14] and an AND-OR graph and hybrid image templates method [15] to detect front-view and rear-view vehicles. Li's methods work well in congested traffic conditions, but they focus on two views of vehicles and cannot deal with person-vehicle and inter-person occlusions. Tian et al. [16] proposed a vehicle detection algorithm based on DPM and object detection grammars. Occlusion is handled using the DPM results and specific occlusion grammars. Like Li's methods, Tian's method only considers two views of vehicles.

In addition, statistical and reasoning models have also been proposed to solve the occlusion problem. Zhang et al. [17] proposed a multilevel framework to handle vehicle occlusion. Their framework consists of three levels: the intraframe, interframe and tracking levels. A convex hull analysis method, an optical flow method and a bidirectional reasoning method are used at the corresponding levels to solve the occlusion problem. These methods can deal with moving vehicles but might fail for stopped ones in congested traffic scenes. Jung et al. [5] and Veeraraghavan et al. [18] used occlusion-reasoning methods to detect and separate occlusions based on trajectory matching with a priori knowledge. Huang and Liao [19] treated the object as a set of moving masses, and occlusions are separated by the standardised variance difference of the masses' motion vectors. However, it is difficult to determine the segmentation threshold for the variance. Kamijo et al. [20] divided the object into a number of 8 × 8 image blocks. Based on the spatial correlation of neighbouring pixels within a block and the temporal correlation between neighbouring blocks, a Markov random model is used to decide which object an occluded block belongs to.

Table 1 compares typical methods for dealing with occlusion [21]. Generally speaking, each of the current methods has its own advantages and disadvantages. For example, the 3D/2.5D model achieves good segmentation accuracy but at a large computational cost, and it depends heavily on spatial constraints. The feature model usually needs no a priori knowledge and can segment well, but it is hard to ensure that the chosen features remain effective in the occluded objects. The statistical model performs well in occlusion segmentation, but its parameters are difficult to estimate in practice. The reasoning model makes full use of the feature relationship between frames but depends too much on a priori knowledge.

1.2 Proposed method

On the basis of the situation-driven concept, a multilevel occlusion handling framework is proposed in this paper. Our method combines the feature model and the reasoning model according to the foreground situation, which comprises the objects' types and the occlusion degree. The flowchart of the proposed framework is shown in Fig. 1. The reasoning model is used for occlusion classification and the feature model for occlusion segmentation.

First, occlusion is classified into four categories based on the objects' types and the occlusion degree during the tracking procedure. Fig. 2 illustrates the categories: inter-vehicle weak occlusion, inter-vehicle strong occlusion, person-vehicle occlusion and inter-person occlusion. At the intra-frame level, inter-vehicle weak occlusion is segmented by a ‘cutting line’ method based on the compactness ratio and interior distance ratio of the convex hull of the vehicles. At the tracking level, inter-vehicle strong occlusion, person-vehicle occlusion and inter-person occlusion are handled by an online sampling and classification (OSC) method based on the similarity of the objects’ local colour, texture and relative spatial features between consecutive frames. An adaptive online searching method is used for the different occlusion types.
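The routing of the four categories to the two levels, and the kind of convex hull measurement the intra-frame level relies on, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation. The names `Occlusion`, `handler_for` and `compactness_ratio` are hypothetical. The compactness ratio is assumed here to be the foreground area divided by the area of its convex hull, and the paper's own definition may differ. The interior distance ratio and the cutting line itself are not shown.

```python
from enum import Enum


class Occlusion(Enum):
    INTER_VEHICLE_WEAK = "inter-vehicle weak"
    INTER_VEHICLE_STRONG = "inter-vehicle strong"
    PERSON_VEHICLE = "person-vehicle"
    INTER_PERSON = "inter-person"


def handler_for(category):
    """Map an occlusion category to the level that handles it, as in the text."""
    if category is Occlusion.INTER_VEHICLE_WEAK:
        return "intra-frame level: cutting line on the convex hull"
    return "tracking level: online sampling and classification (OSC)"


def convex_hull(points):
    """Andrew's monotone chain; hull vertices in counter-clockwise order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts

    def cross(o, a, b):
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]


def polygon_area(vertices):
    """Shoelace formula for a simple polygon given as ordered vertices."""
    total = 0.0
    for i, (x1, y1) in enumerate(vertices):
        x2, y2 = vertices[(i + 1) % len(vertices)]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


def compactness_ratio(outline):
    """ASSUMED definition: foreground area / area of its convex hull."""
    return polygon_area(outline) / polygon_area(convex_hull(outline))


# Hypothetical foreground outlines: one box-like vehicle, and an L-shaped
# blob such as two partly overlapping vehicles might produce.
single = [(0, 0), (4, 0), (4, 2), (0, 2)]
merged = [(0, 0), (4, 0), (4, 2), (2, 2), (2, 4), (0, 4)]

print(compactness_ratio(single))            # 1.0
print(round(compactness_ratio(merged), 3))  # 0.857
print(handler_for(Occlusion.INTER_VEHICLE_WEAK))
```

Under this assumed definition, a single convex blob scores 1.0 and a merged blob with a concave notch scores lower, which is the sort of cue that can flag a weak inter-vehicle occlusion before a cutting line is searched for.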

Our work is inspired by the effectiveness of image segmentation methods for rigid objects and the strength of the online sample-based occlusion segmentation method [12]. In [12], an online sample-based tracking method is proposed. Its biggest advantage is that it does not depend on depth information about the scene or on prior models of the objects. However, the method is time-consuming when the foreground area is large. In our method, we integrate the online sample-based method into our multilevel framework to handle tracking-level occlusions. Moreover, we propose an adaptive training sample selection strategy to reduce the computational cost of the algorithm.
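The idea of turning occlusion segmentation into online classification can be sketched as follows. This excerpt does not specify the classifier or the adaptive sample selection, so the sketch substitutes a plain nearest-neighbour rule as a stand-in. The function names and the three-component feature vectors (a colour value, a texture value and a relative position) are hypothetical.

```python
import math


def nearest_label(sample, training):
    """Label of the training sample closest to `sample` (Euclidean distance)."""
    best_label, best_dist = None, math.inf
    for features, label in training:
        dist = math.dist(sample, features)
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label


def segment_occlusion(test_samples, training):
    """Assign every test sample from the occluded frame to an object."""
    return [nearest_label(sample, training) for sample in test_samples]


# Training samples: taken from frames BEFORE the occlusion, when objects
# A and B were still separate and their labels were known.
training = [
    ((0.10, 0.30, 0.20), "A"),
    ((0.12, 0.28, 0.25), "A"),
    ((0.80, 0.70, 0.75), "B"),
    ((0.78, 0.72, 0.80), "B"),
]

# Testing samples: taken from the merged foreground in the current frame.
test_samples = [(0.11, 0.31, 0.30), (0.79, 0.69, 0.70)]

print(segment_occlusion(test_samples, training))  # ['A', 'B']
```

The cost of this loop grows with the number of training and testing samples, which is consistent with the remark that the online sample-based method is slow for large foregrounds and with the motivation for selecting training samples adaptively.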

The characteristics of this paper are: (i) a multilevel occlusion segmentation framework is proposed that combines the feature model and the reasoning model; (ii) an improved online sample-based occlusion segmentation method is proposed at the tracking level. Based on these characteristics, the proposed method has the following advantages. (i) By relaxing the smooth motion assumption, the method can deal with non-smooth position changes between consecutive frames. (ii) By extracting instantaneous spatial features rather than fixed prior spatial models, different kinds of objects can be tracked simultaneously and efficiently. (iii) By using an adaptive online searching method, the method avoids large computational cost. Hence it is more suitable for multi-object tracking in real-time traffic applications.

The rest of this paper is organised as follows. Section 2 explains our approach in detail. Experimental results are given in Section 3, followed by the conclusion in Section 4.

Table 1 Comparisons of different methods to deal with occlusion

| Ref.                        | Image size | Average processing time per frame | True positive rate, % | Description                                    |
|-----------------------------|------------|-----------------------------------|-----------------------|------------------------------------------------|
| Pang et al. [7]             | 320 × 240  | 0.98 s                            | 94.5                  | 2.5D model                                     |
| Song and Nevatia [8]        | 720 × 480  | 3 s                               | 94.3                  | 3D model                                       |
| Kanhere and Birchfield [10] | 320 × 240  | 32 ms                             | 95.5                  | feature points tracking, motion cue reasoning  |
| Zhang et al. [17]           | 320 × 240  | <0.16 s                           | 94.1                  | convex hull analysis, motion vector reasoning  |
| Huang and Liao [19]         | 320 × 240  | n/a                               | 93.9                  | motion vector reasoning                        |
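To relate the per-frame times in Table 1 to real-time operation, they can be converted to frame rates. The short sketch below does only that arithmetic. The dictionary and function names are illustrative, the entry for Zhang et al. [17] is an upper bound on time (so a lower bound on frame rate), and Huang and Liao [19] is omitted because no time is listed.

```python
# Average processing time per frame, in seconds, as listed in Table 1.
seconds_per_frame = {
    "Pang et al. [7]": 0.98,
    "Song and Nevatia [8]": 3.0,
    "Kanhere and Birchfield [10]": 0.032,
    "Zhang et al. [17]": 0.16,  # listed as "<0.16 s", so fps is a lower bound
}


def frames_per_second(seconds):
    """Frame rate implied by a per-frame processing time."""
    return 1.0 / seconds


for ref, seconds in seconds_per_frame.items():
    print(f"{ref}: {frames_per_second(seconds):.2f} fps")
```

The reported times were measured at different image sizes and presumably on different hardware, so the resulting frame rates are indicative only.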

Fig. 1 Flowchart of the proposed framework

Fig. 2 Different occlusions
(a) Inter-vehicle weak occlusion, (b) Inter-vehicle strong occlusion, (c) Person-vehicle occlusion, (d) Inter-person occlusion
