Shape and Motion from Image Streams: a Factorization Method—Part 3
Detection and Tracking of Point Features
Technical Report CMU-CS-91-132
Carlo Tomasi Takeo Kanade
April 1991
Abstract
The factorization method described in this series of reports requires an algorithm to track the motion of features in an
image stream. Given the small inter-frame displacement made possible by the factorization approach, the best tracking
method turns out to be the one proposed by Lucas and Kanade in 1981.
The method defines the measure of match between fixed-size feature windows in the past and current frame as the
sum of squared intensity differences over the windows. The displacement is then defined as the one that minimizes
this sum. For small motions, a linearization of the image intensities leads to a Newton-Raphson style minimization.
In this report, after rederiving the method in a physically intuitive way, we answer the crucial question of how to
choose the feature windows that are best suited for tracking. Our selection criterion is based directly on the definition
of the tracking algorithm, and expresses how well a feature can be tracked. As a result, the criterion is optimal by
construction.
We show by experiment that the performance of both the selection and the tracking algorithm is adequate for our
factorization method, and we address the issue of how to detect occlusions. In the conclusion, we point out specific
open questions for future research.
Chapter 1
Introduction
The factorization method introduced in reports 1 and 2 of this series [12] [13] requires the selection and tracking of
features in an image stream. In this report we address the issues involved, and present our algorithm.
In general, two basic questions must be answered: how to select the features, and how to track them from frame
to frame. We base our solution to the tracking problem on a previous result by Lucas and Kanade [6], who proposed a
method for registering two images for stereo matching.
Their approach is to minimize the sum of squared intensity differences between a past and a current window.
Because of the small inter-frame motion, the current window can be approximated by a translation of the old one.
Furthermore, for the same reason, the image intensities in the translated window can be written as those in the original
window plus a residue term that depends almost linearly on the translation vector. As a result of these approximations,
one can write a linear 2 × 2 system whose unknown is the displacement vector between the two windows.
In practice, these approximations introduce errors, but a few iterations of the basic solution step suffice to converge.
The result is a simple, fast, and accurate registration method.
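As a sketch (not code from the report), the 2 × 2 system described above can be written out concretely. The function below performs one Newton-Raphson style step: it builds the coefficient matrix from the spatial gradients over the window and the right-hand side from the intensity difference, then solves for the displacement. Sign and gradient conventions vary between derivations; this one follows the model J(x) = I(x − d) used later in chapter 2.

```python
import numpy as np

def lk_step(I, J, gx, gy, w):
    """One Newton-Raphson style step of Lucas-Kanade registration.

    I, J : past and current feature windows (2-D arrays)
    gx, gy : spatial intensity gradients over the window
    w : weighting function over the window (e.g. all ones)

    Solves the 2x2 linear system G d = e for the displacement d,
    where G is built from outer products of the gradients and e
    from the intensity difference between the windows.
    """
    G = np.array([[np.sum(w * gx * gx), np.sum(w * gx * gy)],
                  [np.sum(w * gx * gy), np.sum(w * gy * gy)]])
    diff = I - J
    e = np.array([np.sum(w * diff * gx), np.sum(w * diff * gy)])
    return np.linalg.solve(G, e)

# Synthetic check: a quadratic intensity pattern shifted by d = (0.3, -0.2).
ys, xs = np.mgrid[-5:6, -5:6]
I = xs**2 + 3 * ys**2
J = (xs - 0.3)**2 + 3 * (ys + 0.2)**2   # I sampled at x - d
d = lk_step(I, J, 2 * xs, 6 * ys, np.ones_like(I, dtype=float))
```

On a constant-gradient window G is singular, which already hints at the selection criterion of chapter 4: the window must contain enough texture for the system to be well conditioned.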
The first question posed above, however, was left unanswered in [6]: how to select the windows that are suitable
for accurate tracking. In the literature, several definitions of a "good feature" have been proposed, based on an a priori
notion of what constitutes an "interesting" window. For example, Moravec and Thorpe propose to use windows with
high standard deviations in the spatial intensity profile [8], [11]; Marr, Poggio, and Ullman prefer zero crossings of
the Laplacian of the image intensity [7]; and Kitchen, Rosenfeld, Dreschler, and Nagel define corner features based on
first and second derivatives of the image intensity function [5], [2].
In contrast with these selection criteria, which are defined independently of the registration algorithm, we show in
this report that a criterion can be derived that explicitly optimizes the tracking performance. In other words, we define
a feature to be good if it can be tracked well.
In this report, we first pose the problem (chapter 2), and rederive the equations of Lucas and Kanade in a physically
intuitive way (chapter 3). Chapter 4 introduces the selection criterion. We then show by experiment (chapter 5) that
the performance of both selector and tracker is satisfactory in a wide variety of situations, and discuss the problem of
detecting feature occlusion. Finally, in chapter 6, we close with a discussion of the suitability of this approach to our
factorization method for the computation of shape and motion, and point out directions for further research.
Chapter 2
Feature Tracking
As the camera moves, the patterns of image intensities change in a complex way. In general, any function of three
variables I(x, y, t), where the space variables x and y as well as the time variable t are discrete and suitably bounded,
can represent an image sequence. However, images taken at near time instants are usually strongly related to each
other, because they refer to the same scene taken from only slightly different viewpoints.
We usually express this correlation by saying that there are patterns that move in an image stream. Formally, this
means that the function I(x, y, t) is not arbitrary, but satisfies the following property:
I(x, y, t + τ ) = I(x − ξ, y − η, t) ; (2.1)
in plain English, a later image taken at time t + τ can be obtained by moving every point in the current image, taken
at time t, by a suitable amount. The amount of motion d = (ξ, η) is called the displacement of the point at x = (x, y)
between time instants t and t + τ , and is in general a function of x, y, t, and τ.
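As a small illustration (not from the report), property (2.1) can be checked numerically for the idealized case of a uniform integer-pixel translation, using NumPy's wrap-around shift:

```python
import numpy as np

rng = np.random.default_rng(0)
frame_t = rng.random((8, 8))      # image I(., ., t)
xi, eta = 2, 1                    # displacement d = (xi, eta)

# Later frame obeying (2.1): I(x, y, t + tau) = I(x - xi, y - eta, t).
# np.roll shifts rows by eta and columns by xi, with wrap-around.
frame_t_tau = np.roll(frame_t, shift=(eta, xi), axis=(0, 1))
```

The wrap-around translation is of course an idealization; as the following paragraphs note, real images violate (2.1) at occluding boundaries and where appearance depends on viewpoint.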
Even in a static environment under a constant lighting, the property described by equation (2.1) is violated in many
situations. For instance, at occluding boundaries, points do not just move within the image, but appear and disappear.
Furthermore, the photometric appearance of a region on a visible surface changes when reflectivity is a function of the
viewpoint.
However, the invariant (2.1) is by and large satisfied at surface markings, and away from occluding contours. At
locations where the image intensity changes abruptly with x and y, the point of change remains well defined even in
spite of small variations of overall brightness around it.
Surface markings abound in natural scenes, and are not infrequent in man-made environments. In our experiments,
we found that markings are often sufficient to obtain both good motion estimates and relatively dense shape results.
As a consequence, this report is essentially concerned with surface markings.
The Approach
An important problem in finding the displacement d of a point from one frame to the next is that a single pixel cannot
be tracked, unless it has a very distinctive brightness with respect to all of its neighbors. In fact, the value of the pixel
can both change due to noise, and be confused with adjacent pixels. As a consequence, it is often hard or impossible
to determine where the pixel went in the subsequent frame, based only on local information.
Because of these problems, we do not track single pixels, but windows of pixels, and we look for windows that
contain sufficient texture. In chapter 4, we give a definition of what sufficient texture is for reliable feature tracking.
Unfortunately, different points within a window may behave differently. The corresponding three-dimensional
surface may be very slanted, and the intensity pattern in it can become warped from one frame to the next. Or the
window may be along an occluding boundary, so that points move at different velocities, and may even disappear or
appear anew.
This is a problem in two ways. First, how do we know that we are following the same window, if its contents change
over time? Second, if we measure "the" displacement of the window, how are the different velocities combined to give
the one resulting vector? Our solution to the first problem is residue monitoring: we keep checking that the appearance
of a window has not changed too much. If it has, we discard the window.
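Residue monitoring can be sketched as a simple threshold test (a minimal illustration under assumed names; the report does not prescribe a particular threshold or residue normalization):

```python
import numpy as np

def keep_feature(ref_window, cur_window, max_residue):
    """Residue monitoring: compare the current appearance of a tracked
    window against its reference appearance, and signal that the
    feature should be discarded when the mean squared intensity
    difference grows too large (e.g. due to occlusion)."""
    residue = np.mean((ref_window - cur_window) ** 2)
    return residue <= max_residue

ref = np.zeros((7, 7))
unchanged = keep_feature(ref, ref, max_residue=0.01)       # keeps the feature
occluded = keep_feature(ref, ref + 0.5, max_residue=0.01)  # discards it
```

The choice of `max_residue` is application-dependent; chapter 5 of the report discusses occlusion detection by experiment.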
The second problem could in principle be solved as follows: rather than describing window changes as simple
translations, we can model the changes as a more complex transformation, such as an affine map. In this way, different
velocities can be associated to different points of the window.
This approach was proposed already in [6], and was recently explored in a more general setting in [10]. We feel,
however, that in cases where the world is known to be rigid the danger of over-parametrizing the system outweighs
the advantages of a richer model. More parameters to estimate require the use of larger windows to constrain the
parameters sufficiently. On the other hand, using small windows implies that only few parameters can be estimated
reliably, but also alleviates the problems mentioned above.
We therefore choose to estimate only two parameters (the displacement vector) for small windows. Any discrepancy
between successive windows that cannot be explained by a translation is considered to be error, and the
displacement vector is chosen so as to minimize this residue error.
Formally, if we redefine J(x) = I(x, y, t + τ) and I(x − d) = I(x − ξ, y − η, t), where the time variable has
been dropped for brevity, our local image model is
J(x) = I(x − d) + n(x) , (2.2)
where n is noise.
The displacement vector d is then chosen so as to minimize the residue error defined by the following double
integral over the given window W:
ε = ∫_W [I(x − d) − J(x)]² w dx . (2.3)
In this expression, w is a weighting function. In the simplest case, w could be set to 1. Alternatively, w could be a
Gaussian-like function, to emphasize the central area of the window. The weighting function w could also depend on
the image intensity pattern: the relation (3.3) holds for planar patches, and w could be chosen, as suggested in [6], to
de-emphasize regions of high curvature.
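For an integer candidate displacement, the integral (2.3) reduces to a weighted sum of squared differences over the window. The sketch below (illustrative names, not the report's code) samples the previous frame at x − d and compares it against the current window:

```python
import numpy as np

def ssd_residue(prev, cur, top, left, size, d, w=None):
    """Discrete counterpart of the residue integral (2.3) for an
    integer candidate displacement d = (xi, eta): the weighted sum of
    squared differences between the previous frame sampled at x - d
    and the current window of side `size` anchored at (top, left).
    `w` is the weighting function (uniform when None)."""
    xi, eta = d
    J_w = cur[top:top + size, left:left + size]
    I_w = prev[top - eta:top - eta + size, left - xi:left - xi + size]
    if w is None:
        w = np.ones((size, size))
    return np.sum(w * (I_w - J_w) ** 2)

rng = np.random.default_rng(1)
prev = rng.random((16, 16))
cur = np.roll(prev, shift=(1, 2), axis=(0, 1))  # true displacement (2, 1)
best = ssd_residue(prev, cur, top=5, left=5, size=4, d=(2, 1))
wrong = ssd_residue(prev, cur, top=5, left=5, size=4, d=(0, 0))
```

A Gaussian-like array could be passed as `w` to emphasize the central area of the window, as suggested in the text; the linearization of chapter 3 avoids evaluating this residue exhaustively over all candidate displacements.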
Several ways have been proposed in the literature to minimize this residue (see [1] for a survey). When the
displacement d is much smaller than the window size, the linearization method presented in [6] is the most efficient
way to proceed.
In the next chapter, we rederive this method, and explain it in a physically intuitive way. Then, in chapter 4, we
show that the registration idea can be extended also to selecting good features to track. As a consequence, feature
selection is no longer based on an arbitrary criterion for deciding what constitutes a feature. Rather, a good feature is
defined as one that can be tracked well, in a precise mathematical sense.