没有合适的资源?快使用搜索试试~ 我知道了~
LK金字塔光流法实现——英文
3星 · 超过75%的资源 需积分: 50 21 下载量 102 浏览量
2012-12-13
17:42:40
上传
评论
收藏 188KB PDF 举报
温馨提示
试读
9页
英文版描述opencv实现的光流算法Pyramidal Implementation of the Lucas Kanade Feature Tracker Description of the algorithm
资源推荐
资源详情
资源评论
Pyramidal Implementation of the Lucas Kanade Feature Tracker
Description of the algorithm
Jean-Yves Bouguet
Intel Corporation
Microprocessor Research Labs
jean-yves.bouguet@intel.com
1 Problem Statement
Let I and J be two 2D grayscaled images. The two quantities I(x) = I(x, y) and J(x) = J(x, y) are then the
grayscale value of the two images are the location x = [x y]
T
, where x and y are the two pixel coordinates of a generic
image point x. The image I will sometimes be referenced as the first image, and the image J as the second image.
For practical issues, the images I and J are discret function (or arrays), and the upper left corner pixel coordinate
vector is [0 0]
T
. Let n
x
and n
y
be the width and height of the two images. Then the lower right pixel coordinate
vector is [n
x
− 1 n
y
− 1]
T
.
Consider an image point u = [u
x
u
y
]
T
on the first image I. The goal of feature tracking is to find the location
v = u + d = [u
x
+ d
x
u
y
+ d
y
]
T
on the second image J such as I(u) and J(v) are “similar”. The vector d = [d
x
d
y
]
T
is the image velocity at x, also known as the optical flow at x. Because of the aperture problem, it is essential to
define the notion of similarity in a 2D neighborhood sense. Let ω
x
and ω
y
two integers. We define the image velocity
d as being the vector that minimizes the residual function defined as follows:
(d) = (d
x
, d
y
) =
u
x
+ω
x
X
x=u
x
−ω
x
u
y
+ω
y
X
y=u
y
−ω
y
(I(x, y) − J(x + d
x
, y + d
y
))
2
. (1)
Observe that following that defintion, the similarity function is measured on a image neighborhood of size (2ω
x
+
1) × (2ω
y
+ 1). This neighborhood will be also called integration window. Typical values for ω
x
and ω
y
are 2,3,4,5,6,7
pixels.
2 Description of the tracking algorithm
The two key components to any feature tracker are accuracy and robustness. The accuracy component relates to
the local sub-pixel accuracy attached to tracking. Intuitively, a small integration window would be preferable in order
not to “smooth out” the details contained in the images (i.e. small values of ω
x
and ω
y
). That is especially required
at occluding areas in the images where two patchs potentially move with very different velocities.
The robustness component relates to sensitivity of tracking with respect to changes of lighting, size of image
motion,... In particular, in oder to handle large motions, it is intuively preferable to pick a large integration window.
Indeed, considering only equation 1, it is preferable to have d
x
≤ ω
x
and d
y
≤ ω
y
(unless some prior matching
information is available). There is therefore a natural tradeoff between local accuracy and robustness when choosing
the integration window size. In provide to provide a solution to that problem, we propose a pyramidal implementation
of the classical Lucas-Kanade algorithm. An iterative implementation of the Lucas-Kanade optical flow computation
provides sufficient local tracking accuracy.
2.1 Image pyramid representation
Let us define the pyramid representsation of a generic image I of size n
x
× n
y
. Let I
0
= I be the “zero
th
” level
image. This image is essentially the highest resolution image (the raw image). The image width and height at that
level are defined as n
0
x
= n
x
and n
0
y
= n
y
. The pyramid representation is then built in a recursive fashion: compute
I
1
from I
0
, then compute I
2
from I
1
, and so on... Let L = 1, 2, . . . be a generic pyramidal level, and let I
L−1
be
the image at level L − 1. Denote n
L−1
x
and n
L−1
y
the width and height of I
L−1
. The image I
L−1
is then defined as
follows:
I
L
(x, y) =
1
4
I
L−1
(2x, 2y) +
1
8
I
L−1
(2x − 1, 2y) + I
L−1
(2x + 1, 2y) + I
L−1
(2x, 2y − 1) + I
L−1
(2x, 2y + 1)
+
1
16
I
L−1
(2x − 1, 2y − 1) + I
L−1
(2x + 1, 2y + 1) + I
L−1
(2x − 1, 2y + 1) + I
L−1
(2x + 1, 2y + 1)
.
(2)
1
资源评论
- ForeverCust2013-12-05并不全面,不是我想要的代码。不过有点可以参考的东西。
watchit2008
- 粉丝: 0
- 资源: 3
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
最新资源
- 前端开发-什么是前端开发-关于前端开发的一些相关介绍
- Sora AI-关于文生视频的使用场景说明
- suno AI文生视频的相关教程和介绍使用
- 什么是后端开发-关于后端开发的一些小介绍分享
- Jurassic Pack Vol. II Dinosaurs 侏罗纪包卷恐龙二号Unity游戏模型资源unitypackage
- Jurassic Pack Vol. III Dinosaurs 侏罗纪包卷恐龙三号Unity游戏模型资源unitypackag
- Ultimate Seating Controller 终极座椅控制器Unity游戏开发插件资源unitypackage
- 什么是人工智能-关于人工智能的相关介绍说明
- Figma Converter for Unity适用Unity的Figma转换器Unity游戏开发插件unitypackage
- Creepy Animatronic Anims 令人毛骨悚然的电子动画Unity游戏动画插件资源unitypackage
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功