形状上下文Python_形状上下文算法资源-CSDN文库

共49个文件

pdf：16个

png：13个

py：6个

python

形状上下文

1星需积分: 39 11 浏览量 2018-12-02 22:19:42 上传评论 4 收藏 14.41MB ZIP 举报

资源推荐

资源详情

资源评论

收起资源包目录

Python-Shape-Context-master.zip （49个子文件）

Python-Shape-Context-master

munkres.pyc 24KB

9M.png 1KB

test.jpg 2KB

SC.py 7KB

9M2.png 564B

utils.pyc 6KB

A2.png 10KB

test_captcha.py 3KB

AM2.png 6KB

9.png 13KB

utils.py 5KB

BM2.png 19KB

LAPJV

LAPJV.p 4KB

GNRL.H 515B

SYSTEM.CPP 516B

LAP.CPP 10KB

LAPMAIN.CPP 2KB

LAP.H 746B

SYSTEM.H 351B

munkres.py 24KB

info

2009_ijra_bk.pdf 1.87MB

thayananthan_cvpr03.pdf 2.6MB

ShapeContextSlides.pdf 590KB

54.pdf 968KB

TPS.pdf 771KB

10.1.1.18.8852.pdf 943KB

A_shortest_augmenting_path_algorithm_for_dense_and_sparse_linear_assignment_problems.pdf 802KB

ShapeContexts425.pdf 749KB

mori-gimpy.pdf 408KB

TPS2.pdf 1.83MB

algorythm.pdf 943KB

10.1.1.112.2716.pdf 870KB

mori-cvpr01.pdf 376KB

Chapter 8 - Dense Matrix Algorithms.pdf 1.11MB

13_diplaros.pdf 592KB

FaceDetection.pdf 294KB

SC.pyc 8KB

README 83B

BM.png 19KB

test3.jpg 4KB

AM.png 15KB

test2.jpg 2KB

B.png 14KB

D.png 9KB

AM3.png 15KB

test_single.py 3KB

test.png 3KB

SC2.py 7KB

A.png 9KB

Shape Context and Chamfer Matching in Cluttered Scenes

A. Thayananthan



B. Stenger



P. H. S. Torr



R. Cipolla



University of Cambridge



Microsoft Research Ltd.

Department of Engineering 7 JJ Thompson Avenue

Cambridge, CB2 1PZ, UK Cambridge, CB3 OFB, UK



at315|bdrs2|cipolla



@eng.cam.ac.uk philtorr@microsoft.com

Abstract

This paper compares two methods for object localization

from contours: shape context and chamfer matching of tem-

plates. In the light of our experiments, we suggest improve-

ments to the shape context: Shape contexts are used to ﬁnd

corresponding features between model and image. In real

images it is shown that the shape context is highly inﬂu-

enced by clutter, furthermore even when the object is cor-

rectly localized, the feature correspondence may be poor.

We show that the robustness of shape matching can be in-

creased by including a ﬁgural continuity constraint. The

combined shape and continuity cost is minimized using

the Viterbi algorithm on features sequentially around the

contour, resulting in improved localization and correspon-

dence. Our algorithm can be generally applied to any fea-

ture based shape matching method.

Chamfer matching correlates model templates with the

distance transform of the edge image. This can be done

efﬁciently using a coarse-to-ﬁne search over the transfor-

mation parameters. The method is robust in clutter, how-

ever multiple templates are needed to handle scale, rotation

and shape variation. We compare both methods for locat-

ing hand shapes in cluttered images, and applied to word

recognition in EZ-Gimpy images.

1. Introduction

People use multiple visual cues to recognize objects, such

as object color, texture and shape. In the absence of color

and texture information, we can mostly still recognize ob-

jects by their geometry alone, for example in line drawings.

Groupinglowlevel features to segment the object is by itself

a hard problem. A common approach is, therefore, to use a

prototype shape, and search for it in the image. This leads

to the task of shape matching, which has numerous appli-

cations, such as object localization, image retrieval, model

registration, and tracking. One way to represent a shape

is by a set number of feature points, for example Canny

edges. In order to match two shapes, point correspondences

on the two shapes have to be established. Subsequently a

transformation which aligns the two shapes can be found.

The type of transformation depends on the particular set-

ting. Two examples are 2D afﬁne transforms, and non-

rigid thin-plate spline transformations. The two problems

of ﬁnding correspondences and estimating the transforma-

tion are tightly coupled: The better the correspondences are

known, the better the transformation can be estimated, and

vice versa. Therefore, many methods are based on an it-

erated two-step algorithm, alternating estimation of corre-

spondence and transformation.

In the next section, we review existing work on shape

based and chamfer matching. The two methods are ex-

plained brieﬂy in section 2, and we outline some of the

problems that arise when applied to scenes with cluttered

background in section 3. In section 4 we show how shape

context matching can be signiﬁcantly improved by using

a continuity constraint. The dynamic programming algo-

rithm used for optimization readily generalizes to any other

type of feature. Section 5 shows experimental results on

two types of data, images of hands, and words on textured

background.

1.1. Previous Work

Belongie et al. [3] have introduced the shape context de-

scriptor, which characterizes a particular point location on

the shape. This descriptor is the histogram of the relative

polar coordinates of all other points. Corresponding points

on two different shapes have a similar relative position in

each shape, and will ideally have a similar shape context.

Shape context matching has been applied to a variety of ob-

ject recognition problems [3, 13]. The background clutter

in these applications was usually limited.

Sullivan and Carlsson [17] use a topology-based shape

descriptor to ﬁnd correspondences. The topological type

of all combinations of four points is recorded in a voting

matrix, and one-to-one correspondences are found using a

greedy algorithm. The examples shown did not contain

signiﬁcant clutter. While their topological descriptor has

higher discriminative power than the shape context, com-

puting the descriptor for all combinations of four points

is of complexity



(



number of points), and is sig-

niﬁcantly slower than computing shape contexts, which is

of complexity



. Both methods use shape descrip-

tors without enforcing any continuity constraint, resulting

in a number of incorrect correspondences. This shortcom-

ing may sometimes be compensated by iterative alignment

and recomputation of the shape descriptor. However, this

is computationally expensive, and it would be desirable to

obtain good correspondences in the ﬁrst step.

Chamfer matching was ﬁrst proposed by Barrow et

al. [2] and improved versions have been used for object

recognition and contour alignment. Borgefors [5] intro-

duced hierarchical chamfer matching, in which a coarse-

to-ﬁne search is performed using a resolution pyramid of

the image. Olson and Huttenlocher [15] use a template hi-

erarchy to recognize three dimensional objects from differ-

ent views. They also demonstrate the importance of using

oriented edge information for Hausdorff matching, which is

closely related to chamfer matching. Gavrila [9] uses cham-

fer matching to detect pedestrian shapes in real time. In this

case a template hierarchy is used to handle shape variation.

When a single template is used, chamfer matching can-

not handle large shape variations. Either multiple templates

have to be used, or, if the initial localization is good, the

shapes can subsequently be aligned using point registra-

tion. A standard method for point registration is the It-

erated Closest Point (ICP) algorithm [4, 6], where corre-

spondences are found using a nearest-neighbor assignment,

and the transformation is estimated by minimizing the ge-

ometric error between point pairs. ICP is fast and con-

verges to a local minimum. However, it requires a good

initial alignment of model and image. A number of im-

proved point registration methods have been developed re-

cently [7, 8, 11]. Fitzgibbon [8] introduced a version of

the ICP algorithm which combines the correspondence and

the alignment steps within the structure of the Levenberg-

Marquardt algorithm.

2. Methods

In this section we explain the two methods of shape context

matching and chamfer matching.

2.1. Shape Context Matching

The shape context descriptor for a point on the shape is

a histogram of the relative polar coordinates of all other

points on the shape [3]. Point correspondences between

two shapes are found by minimizing the point matching

costs, which is the





test statistic for histograms. Glob-

ally optimal correspondences are found by minimizing the

sum of the individual matching costs. This is solved with a

(a) (b) (c) (d)

(e) (f)

Figure 1: Point correspondences found with shape con-

texts. Shape contexts can be used to ﬁnd corresponding

points on similar shapes in uncluttered scenes. (a,b) Im-

ages of two pairs of scissors. (e) Connections between cor-

responding points. (c,d) Images of a hand and a 3D hand

model. (f) Corresponding points between edge map of (c)

and projected contours of (d). For visual clarity not all cor-

respondences are shown.

bi-partite graph matching algorithm, enforcing one-to-one

point matching. igure 1 shows point correspondences be-

tween different shapes which were found using the shape

context descriptor. The shape context descriptor has the fol-

lowing invariance properties.

1. Translation: The shape context descriptor is inher-

ently translation invariant as it is based on relative point lo-

cations.

2. Scale: For clutter-free images the descriptor can be

made scale invariant by normalizing the radial distances by

the mean (or median) distance between all point pairs.

3. Rotation: It can be made rotation invariant by rotat-

ing the coordinate system at each point so that the posi-

tive



-axis is aligned with the tangent vector. However, this

reduces the discriminative power of the descriptor signiﬁ-

cantly, and is therefore not used here.

4. Shape variation: The shape context is robust towards

slight shape variations. When points in the shape vary a lot,

the discrete binning effect will leadto larger matching costs,

and wrong matches.

5. Few outliers: Points with a ﬁnal matching cost larger

than a threshold value



are classiﬁed as outliers. Additional

‘dummy’ points with the cost



are introduced to make the

number of points on the two shapes equal, and the points

matched to these dummy points are also classiﬁed as out-

liers. A common way to increase the robustness towards

outliers is to use knowledge from the model and only use

those bins for computing the matching cost which are non-

empty for the model point.

2.2. Chamfer Matching

The similarity between two shapes can be measured us-

ing their chamfer distance. Given the two point sets







and

!"

$#&%'(

%)

, the chamfer distance function

is the mean of the distances between each point,

+



and

its closest point in

,.-0/1

(

32)!4576



9:<;=?>@BA

C)DE;GFIHH

&JLK%

HBHNM

(1)

The symmetric chamfer distance is obtained by adding

-0/1

(

O!P2QR

. The chamfer distance between two shapes

can be efﬁciently computed using a distance transform

(DT). This transformation takes a binary feature image as

input, and assigns to each pixel in the image the distance

to its nearest feature. The distance between a template and

an edge map can then be computed as the mean of the DT

values at the template point coordinates. The matching can

be made more robust by using the mean of the thresholded

distance

,S-0/1

(PT U

32)!45V6



9:0;=

>XW'Y

>@BA

;GF

HH

[JLK%

HBH

2\^]

(2)

where

is the threshold value. This reduces the effect of

outliers and missing edges.

Chamfer matching as proposed by Barrow et al. [2] re-

quires a good initialization of the template. In the hierar-

chical chamfer matching algorithm [5], candidate template

locations are found using by hierarchical search using a res-

olution pyramid of the image. Subsequently an aligning

transform for these candidate matches is estimated. Mul-

tiple templates are used to ﬁnd three dimensional objects

in an image [9, 15]. In our experiments we use templates

which are generated by projecting a 3D hand model.

After the detection step, the best matching model is

aligned by estimating the intrinsic parameters of this 3D

model. Levenberg-Marquardt optimization is used for

alignment, as described in [8], using the same chamfer cost

function in the transformation step as in the search step of

the algorithm.

3. Problems With Methods in Clutter

There are, however, problems with the techniques in the

presence of background clutter, which are described in the

following section.

3.1. Shape Context

It turns out that using the shape context in cluttered scenes

is unreliable. It is difﬁcult to recover the scale parame-

ter, since normalizing the radial distances by the mean or

median point distances will no longer work. Object and

non-object points close to the object are hard to distinguish

on the basis of their shape context alone. Points which are

close to each other on the model shape are often matched to

points which are far away from each other in the image. The

iterative nature of the algorithm may sometimes be able to

compensate for this shortcoming, improving the point cor-

respondences in each step. Another approach could be, if

some of the correspondences are correct, to identify out-

liers in the alignment phase of the iteration process using

a robust estimation scheme, e.g. RANSAC. Outliers can

then be excluded from the next shape context computation.

However, shape deformations cannot be handled easily this

way.

3.2. Chamfer Matching

When using a single template, chamfer matching cannot

handle large shape variations. The chamfer distance is not

invariant towards translation, rotation or scale. Further-

more, the number of templates needed increases with ob-

ject complexity. Each of these cases has to be handled by

matching with different templates. In order to match a large

number of templates efﬁciently, tree-based search methods

have been suggested, where a large number of hypotheses

can be eliminated at an early stage [9]. In scenes with clut-

tered background the chamfer cost function (2) will typi-

cally have several local minima. In order to make a deci-

sion about the object location, orientation and scale, it may

be necessary to use a subsequent veriﬁcation stage [9].

4. Proposed Improvements for Shape

Context Matching

This section describes two methods of improving the ro-

bustness of point matching using shape contexts.

4.1. Using Edge Orientation

Accordingto [9, 15] multiple feature images can be used, by

dividing edge points into discrete sets based on the edge ori-

entation. The same idea can be applied to the shape context

by only matching points with similar gradient orientation.

Figure 2 shows an example of estimating point correspon-

dences when using single versus multiple features. Using

multiple edge features increases the discrimination power

of the shape context, and generally leads to improved re-

sults. However, as can be seen in ﬁgure 2, also with multi-

ple features, incorrect matches can occur (points on middle

ﬁnger are mapped to ring ﬁnger). Note that in cases where

评论收藏

内容反馈

把杯子倒进水里面

2020-07-27

https://github.com/creotiv/Python-Shape-Context/

韩龙科技

粉丝: 4
资源: 18

形状上下文Python

形状上下文

shape-context-ocr：“形状上下文”是一个形状描述符，用于捕获形状轮廓上其他点的相对位置，并用于识别字符

Shape Context

shape context形状上下文

基于形状上下文的形状匹配

形状上下文相似度匹配算法

IDSC内部距离形状上下文

经典的形状上下文英文原版

形状上下文在验证码识别中的应用

基于H-EMD 的形状上下文特征形状匹配方法

基于形状上下文和粒子滤波的多目标跟踪

matlab光照模型代码-facial:面部的

matlab嘴部检测代码-Robust-Facial-Landmark:鲁棒的面部地标检测

基于ShapeContext的形状匹配方法的改进

DFT的matlab源代码-shape-context-matching:使用形状上下文获取形状模板

2D_3D shape recognition using shape context

Matlab中shapecontext源代码-sc_demo.rar

Learning-to-Group:用于3D形状的零镜头分割框架

使用图像分类-OpenCV和SVM：使用机器学习进行图像处理和分类：使用Open CV和SVM机器学习模型进行图像分类

math_reader:手写数学表达式识别器

context-mover-distance-and-barycenters:AISTATS 2020论文随附的代码

Seaborn中文用户指南.docx

克莱尔沃

Japa:只是另一个处理API-受处理启发的Java 2D图形库

PCT:Jittor实施PCT

最新资源