Image Segmentation and Edge Detection: BSDS500 Dataset (edge detection dataset resource, CSDN Wenku)


1099 files in total

mat: 515

jpg: 505

txt: 27



Contour Detection and Hierarchical Image Segmentation

Pablo Arbeláez, Member, IEEE, Michael Maire, Member, IEEE, Charless Fowlkes, Member, IEEE, and Jitendra Malik, Fellow, IEEE.

Abstract: This paper investigates two fundamental problems in computer vision: contour detection and image segmentation. We present state-of-the-art algorithms for both of these tasks. Our contour detector combines multiple local cues into a globalization framework based on spectral clustering. Our segmentation algorithm consists of generic machinery for transforming the output of any contour detector into a hierarchical region tree. In this manner, we reduce the problem of image segmentation to that of contour detection. Extensive experimental evaluation demonstrates that both our contour detection and segmentation methods significantly outperform competing algorithms. The automatically generated hierarchical segmentations can be interactively refined by user-specified annotations. Computation at multiple image resolutions provides a means of coupling our system to recognition applications.

1 INTRODUCTION

This paper presents a unified approach to contour detection and image segmentation. Contributions include:

• A high performance contour detector, combining local and global image information.

• A method to transform any contour signal into a hierarchy of regions while preserving contour quality.

• Extensive quantitative evaluation and the release of a new annotated dataset.

Figures 1 and 2 summarize our main results. The two figures represent the evaluation of multiple contour detection (Figure 1) and image segmentation (Figure 2) algorithms on the Berkeley Segmentation Dataset (BSDS300) [1], using the precision-recall framework introduced in [2]. This benchmark operates by comparing machine generated contours to human ground-truth data (Figure 3) and allows evaluation of segmentations in the same framework by regarding region boundaries as contours.

Especially noteworthy in Figure 1 is the contour detector gPb, which compares favorably with other leading techniques, providing equal or better precision for most choices of recall. In Figure 2, gPb-owt-ucm provides universally better performance than alternative segmentation algorithms. We introduced the gPb and gPb-owt-ucm algorithms in [3] and [4], respectively. This paper offers comprehensive versions of these algorithms, motivation behind their design, and additional experiments which support our basic claims.

We begin with a review of the extensive literature on

contour detection and image segmentation in Section 2.

• P. Arbeláez and J. Malik are with the Department of Electrical Engineering and Computer Science, University of California at Berkeley, Berkeley, CA 94720. E-mail: {arbelaez,malik}@eecs.berkeley.edu

• M. Maire is with the Department of Electrical Engineering, California Institute of Technology, Pasadena, CA 91125. E-mail: mmaire@caltech.edu

• C. Fowlkes is with the Department of Computer Science, University of California at Irvine, Irvine, CA 92697. E-mail: fowlkes@ics.uci.edu

Section 3 covers the development of the gPb contour detector. We couple multiscale local brightness, color, and texture cues to a powerful globalization framework using spectral clustering. The local cues, computed by applying oriented gradient operators at every location in the image, define an affinity matrix representing the similarity between pixels. From this matrix, we derive a generalized eigenproblem and solve for a fixed number of eigenvectors which encode contour information. Using a classifier to recombine this signal with the local cues, we obtain a large improvement over alternative globalization schemes built on top of similar cues.

To produce high-quality image segmentations, we link this contour detector with a generic grouping algorithm described in Section 4 and consisting of two steps. First, we introduce a new image transformation called the Oriented Watershed Transform for constructing a set of initial regions from an oriented contour signal. Second, using an agglomerative clustering procedure, we form these regions into a hierarchy which can be represented by an Ultrametric Contour Map, the real-valued image obtained by weighting each boundary by its scale of disappearance. We provide experiments on the BSDS300 as well as the BSDS500, a superset newly released here.

Although the precision-recall framework [2] has found widespread use for evaluating contour detectors, considerable effort has also gone into developing metrics to directly measure the quality of regions produced by segmentation algorithms. Noteworthy examples include the Probabilistic Rand Index, introduced in this context by [5], the Variation of Information [6], [7], and the Segmentation Covering criteria used in the PASCAL challenge [8]. We consider all of these metrics and demonstrate that gPb-owt-ucm delivers an across-the-board improvement over existing algorithms.

Sections 5 and 6 explore ways of connecting our purely bottom-up contour and segmentation machinery to sources of top-down knowledge. In Section 5, this knowledge source is a human. Our hierarchical region trees serve as a natural starting point for interactive segmentation. With minimal annotation, a user can correct errors in the automatic segmentation and pull out objects of interest from the image. In Section 6, we target top-down object detection algorithms and show how to create multiscale contour and region output tailored to match the scales of interest to the object detector.

Though much remains to be done to take full advantage of segmentation as an intermediate processing layer, recent work has produced payoffs from this endeavor [9], [10], [11], [12], [13]. In particular, our gPb-owt-ucm segmentation algorithm has found use in optical flow [14] and object recognition [15], [16] applications.

[Figure 1: precision-recall plot (horizontal axis: Recall, vertical axis: Precision) with iso-F curves. Legend, ranked by maximum F-measure:]

[F = 0.79] Human
[F = 0.70] gPb
[F = 0.68] Multiscale - Ren (2008)
[F = 0.66] BEL - Dollar, Tu, Belongie (2006)
[F = 0.66] Mairal, Leordeanu, Bach, Herbert, Ponce (2008)
[F = 0.65] Min Cover - Felzenszwalb, McAllester (2006)
[F = 0.65] Pb - Martin, Fowlkes, Malik (2004)
[F = 0.64] Untangling Cycles - Zhu, Song, Shi (2007)
[F = 0.64] CRF - Ren, Fowlkes, Malik (2005)
[F = 0.58] Canny (1986)
[F = 0.56] Perona, Malik (1990)
[F = 0.50] Hildreth, Marr (1980)
[F = 0.48] Prewitt (1970)
[F = 0.48] Sobel (1968)
[F = 0.47] Roberts (1965)

Fig. 1. Evaluation of contour detectors on the Berkeley Segmentation Dataset (BSDS300) Benchmark [2]. Leading contour detection approaches are ranked according to their maximum F-measure ($\frac{2 \cdot \mathrm{Precision} \cdot \mathrm{Recall}}{\mathrm{Precision} + \mathrm{Recall}}$) with respect to human ground-truth boundaries. Iso-F curves are shown in green. Our gPb detector [3] performs significantly better than other algorithms [2], [17], [18], [19], [20], [21], [22], [23], [24], [25], [26], [27], [28] across almost the entire operating regime. Average agreement between human subjects is indicated by the green dot.
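The F-measure used to rank detectors in Figure 1 is the harmonic mean of precision and recall, maximized over the operating points of a detector. The following is a minimal sketch of that computation; the function names and the sample precision-recall points are hypothetical and are not taken from the benchmark code.

```python
def f_measure(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall; defined as 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2.0 * precision * recall / (precision + recall)


def max_f_measure(pr_points):
    """Best F-measure over a list of (precision, recall) operating points."""
    return max(f_measure(p, r) for p, r in pr_points)


# Hypothetical operating points, e.g. from thresholding a detector's output
# at several levels. These numbers are illustrative only.
curve = [(0.9, 0.3), (0.8, 0.6), (0.6, 0.8), (0.4, 0.95)]
best = max_f_measure(curve)
```

Reporting a single maximum over the curve is what allows the figure legend to summarize each method with one number.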

2 PREVIOUS WORK

The problems of contour detection and segmentation are related, but not identical. In general, contour detectors offer no guarantee that they will produce closed contours and hence do not necessarily provide a partition of the image into regions. But, one can always recover closed contours from regions in the form of their boundaries. As an accomplishment here, Section 4 shows how to do the reverse and recover regions from a contour detector. Historically, however, there have been different lines of approach to these two problems, which we now review.

[Figure 2: precision-recall plot (horizontal axis: Recall, vertical axis: Precision) with iso-F curves. Legend, ranked by maximum F-measure:]

[F = 0.79] Human
[F = 0.71] gPb-owt-ucm
[F = 0.67] UCM - Arbelaez (2006)
[F = 0.63] Mean Shift - Comaniciu, Meer (2002)
[F = 0.62] Normalized Cuts - Cour, Benezit, Shi (2005)
[F = 0.58] Canny-owt-ucm
[F = 0.58] Felzenszwalb, Huttenlocher (2004)
[F = 0.58] Av. Diss. - Bertelli, Sumengen, Manjunath, Gibou (2008)
[F = 0.56] SWA - Sharon, Galun, Sharon, Basri, Brandt (2006)
[F = 0.55] ChanVese - Bertelli, Sumengen, Manjunath, Gibou (2008)
[F = 0.55] Donoser, Urschler, Hirzer, Bischof (2009)
[F = 0.53] Yang, Wright, Ma, Sastry (2007)

Fig. 2. Evaluation of segmentation algorithms on the BSDS300 Benchmark. Paired with our gPb contour detector as input, our hierarchical segmentation algorithm gPb-owt-ucm [4] produces regions whose boundaries match ground-truth better than those produced by other methods [7], [29], [30], [31], [32], [33], [34], [35].

Fig. 3. Berkeley Segmentation Dataset [1]. Top to Bottom: Image and ground-truth segment boundaries hand-drawn by three different human subjects. The BSDS300 consists of 200 training and 100 test images, each with multiple ground-truth segmentations. The BSDS500 uses the BSDS300 as training and adds 200 new test images.

2.1 Contours

Early approaches to contour detection aim at quantifying the presence of a boundary at a given image location through local measurements. The Roberts [17], Sobel [18], and Prewitt [19] operators detect edges by convolving a grayscale image with local derivative filters. Marr and Hildreth [20] use zero crossings of the Laplacian of Gaussian operator. The Canny detector [22] also models edges as sharp discontinuities in the brightness channel, adding non-maximum suppression and hysteresis thresholding steps. A richer description can be obtained by considering the response of the image to a family of filters of different scales and orientations. An example is the Oriented Energy approach [21], [36], [37], which uses quadrature pairs of even and odd symmetric filters. Lindeberg [38] proposes a filter-based method with an automatic scale selection mechanism.
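As an illustration of the local derivative filters mentioned above, the sketch below applies the standard 3x3 Sobel kernels to a toy grayscale image using NumPy only. The helper names and the toy image are assumptions made for this example; the filtering is written as cross-correlation, which differs from convolution only in the sign of the response and therefore gives the same gradient magnitude.

```python
import numpy as np

# Standard 3x3 Sobel kernels for horizontal and vertical derivatives.
SOBEL_X = np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], dtype=float)
SOBEL_Y = SOBEL_X.T


def filter3x3(image, kernel):
    """Cross-correlate a 2D image with a 3x3 kernel (valid region only)."""
    h, w = image.shape
    out = np.zeros((h - 2, w - 2))
    for dy in range(3):
        for dx in range(3):
            out += kernel[dy, dx] * image[dy:dy + h - 2, dx:dx + w - 2]
    return out


def sobel_magnitude(image):
    """Gradient magnitude from the two Sobel responses."""
    gx = filter3x3(image, SOBEL_X)
    gy = filter3x3(image, SOBEL_Y)
    return np.hypot(gx, gy)


# Toy image: dark left half, bright right half, i.e. a single vertical edge.
img = np.zeros((5, 6))
img[:, 3:] = 1.0
mag = sobel_magnitude(img)  # the response peaks in the columns next to the edge
```

Such a detector responds to any brightness discontinuity, which is why the later approaches in this section add color, texture, and learned cue combination.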

More recent local approaches take into account color and texture information and make use of learning techniques for cue combination [2], [26], [27]. Martin et al. [2] define gradient operators for brightness, color, and texture channels, and use them as input to a logistic regression classifier for predicting edge strength. Rather than rely on such hand-crafted features, Dollar et al. [27] propose a Boosted Edge Learning (BEL) algorithm which attempts to learn an edge classifier in the form of a probabilistic boosting tree [39] from thousands of simple features computed on image patches. An advantage of this approach is that it may be possible to handle cues such as parallelism and completion in the initial classification stage. Mairal et al. [26] create both generic and class-specific edge detectors by learning discriminative sparse representations of local image patches. For each class, they learn a discriminative dictionary and use the reconstruction error obtained with each dictionary as feature input to a final classifier.

The large range of scales at which objects may appear in the image remains a concern for these modern local approaches. Ren [28] finds benefit in combining information from multiple scales of the local operators developed by [2]. Additional localization and relative contrast cues, defined in terms of the multiscale detector output, are fed to the boundary classifier. For each scale, the localization cue captures the distance from a pixel to the nearest peak response. The relative contrast cue normalizes each pixel in terms of the local neighborhood.

An orthogonal line of work in contour detection focuses primarily on another level of processing, globalization, that utilizes local detector output. The simplest such algorithms link together high-gradient edge fragments in order to identify extended, smooth contours [40], [41], [42]. More advanced globalization stages are the distinguishing characteristics of several of the recent high-performance methods benchmarked in Figure 1, including our own, which share as a common feature their use of the local edge detection operators of [2].

Ren et al. [23] use the Conditional Random Fields (CRF) framework to enforce curvilinear continuity of contours. They compute a constrained Delaunay triangulation (CDT) on top of locally detected contours, yielding a graph consisting of the detected contours along with the new "completion" edges introduced by the triangulation. The CDT is scale-invariant and tends to fill short gaps in the detected contours. By associating a random variable with each contour and each completion edge, they define a CRF with edge potentials in terms of detector response and vertex potentials in terms of junction type and continuation smoothness. They use loopy belief propagation [43] to compute expectations.

Felzenszwalb and McAllester [25] use a different strategy for extracting salient smooth curves from the output of a local contour detector. They consider the set of short oriented line segments that connect pixels in the image to their neighboring pixels. Each such segment is either part of a curve or is a background segment. They assume curves are drawn from a Markov process, the prior distribution on curves favors few per scene, and detector responses are conditionally independent given the labeling of line segments. Finding the optimal line segment labeling then translates into a general weighted min-cover problem in which the elements being covered are the line segments themselves and the objects covering them are drawn from the set of all possible curves and all possible background line segments. Since this problem is NP-hard, an approximate solution is found using a greedy "cost per pixel" heuristic.

Zhu et al. [24] also start with the output of [2] and create a weighted edgel graph, where the weights measure directed collinearity between neighboring edgels. They propose detecting closed topological cycles in this graph by considering the complex eigenvectors of the normalized random walk matrix. This procedure extracts both closed contours and smooth curves, as edgel chains are allowed to loop back at their termination points.

2.2 Regions

A broad family of approaches to segmentation involves integrating features such as brightness, color, or texture over local image patches and then clustering those features based on, e.g., fitting mixture models [7], [44], mode-finding [34], or graph partitioning [32], [45], [46], [47]. Three algorithms in this category appear to be the most widely used as sources of image segments in recent applications, due to a combination of reasonable performance and publicly available implementations.

The graph based region merging algorithm advocated by Felzenszwalb and Huttenlocher (Felz-Hutt) [32] attempts to partition image pixels into components such that the resulting segmentation is neither too coarse nor too fine. Given a graph in which pixels are nodes and edge weights measure the dissimilarity between nodes (e.g. color differences), each node is initially placed in its own component. Define the internal difference of a component Int(R) as the largest weight in the minimum spanning tree of R. Considering edges in non-decreasing order by weight, each step of the algorithm merges components $R_1$ and $R_2$ connected by the current edge if the edge weight is less than:

$$\min\bigl(\mathrm{Int}(R_1) + \tau(R_1),\ \mathrm{Int}(R_2) + \tau(R_2)\bigr) \tag{1}$$

where $\tau(R) = k/|R|$. $k$ is a scale parameter that can be used to set a preference for component size.
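The merging rule in (1) can be sketched with a union-find structure that tracks, for each component, its size and its internal difference Int(R). This is a minimal illustration on an abstract edge list, not the authors' implementation; the class and function names and the toy graph are assumptions. Because edges are visited in non-decreasing order, the weight of the edge that triggers a merge is the largest minimum-spanning-tree weight of the merged component.

```python
class DisjointSet:
    """Union-find that also stores component size and internal difference."""

    def __init__(self, n):
        self.parent = list(range(n))
        self.size = [1] * n
        self.internal = [0.0] * n  # Int(R): largest MST edge weight inside R

    def find(self, x):
        while self.parent[x] != x:
            self.parent[x] = self.parent[self.parent[x]]  # path halving
            x = self.parent[x]
        return x

    def union(self, a, b, weight):
        """Merge the components rooted at a and b through an edge of given weight."""
        if self.size[a] < self.size[b]:
            a, b = b, a
        self.parent[b] = a
        self.size[a] += self.size[b]
        self.internal[a] = weight


def felz_hutt(num_nodes, edges, k):
    """edges: iterable of (weight, u, v). Returns one root label per node."""
    ds = DisjointSet(num_nodes)
    for w, u, v in sorted(edges):  # non-decreasing order by weight
        ru, rv = ds.find(u), ds.find(v)
        if ru == rv:
            continue
        threshold = min(ds.internal[ru] + k / ds.size[ru],
                        ds.internal[rv] + k / ds.size[rv])
        if w < threshold:  # "less than", following the text above
            ds.union(ru, rv, w)
    return [ds.find(i) for i in range(num_nodes)]


# Toy 1D "image": intensities 0, 1, 2, 10, 11, 12 with neighbor edges
# weighted by absolute intensity difference.
edges = [(1, 0, 1), (1, 1, 2), (8, 2, 3), (1, 3, 4), (1, 4, 5)]
labels = felz_hutt(6, edges, k=3.0)
```

With k = 3 the weight-8 edge exceeds both thresholds (1 + 3/3 = 2), so the two groups of three nodes stay separate; a larger k would merge them, which is how k sets a preference for component size.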

The Mean Shift algorithm [34] offers an alternative clustering framework. Here, pixels are represented in the joint spatial-range domain by concatenating their spatial coordinates and color values into a single vector. Applying mean shift filtering in this domain yields a convergence point for each pixel. Regions are formed by grouping together all pixels whose convergence points are closer than $h_s$ in the spatial domain and $h_r$ in the range domain, where $h_s$ and $h_r$ are respective bandwidth parameters. Additional merging can also be performed to enforce a constraint on minimum region area.

Spectral graph theory [48], and in particular the Normalized Cuts criterion [45], [46], provides a way of integrating global image information into the grouping process. In this framework, given an affinity matrix $W$ whose entries encode the similarity between pixels, one defines diagonal matrix $D_{ii} = \sum_j W_{ij}$ and solves for the generalized eigenvectors of the linear system:

$$(D - W)v = \lambda D v \tag{2}$$

Traditionally, after this step, K-means clustering is applied to obtain a segmentation into regions. This approach often breaks uniform regions where the eigenvectors have smooth gradients. One solution is to reweight the affinity matrix [47]; others have proposed alternative graph partitioning formulations [49], [50], [51].
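The generalized eigenproblem (2) can be solved on a small dense affinity matrix by converting it to an ordinary symmetric eigenproblem. The sketch below is an illustration under stated assumptions: the function name and the six-node toy affinity matrix are hypothetical, every node is assumed to have positive degree, and a dense solver is used where real images would require sparse methods.

```python
import numpy as np


def ncut_eigenvectors(W, num_vectors):
    """Smallest generalized eigenpairs of (D - W) v = lambda D v.

    Uses the substitution v = D^{-1/2} z, which turns the problem into the
    symmetric eigenproblem D^{-1/2} (D - W) D^{-1/2} z = lambda z.
    """
    d = W.sum(axis=1)  # D_ii = sum_j W_ij, assumed strictly positive
    d_inv_sqrt = 1.0 / np.sqrt(d)
    L_sym = np.eye(len(d)) - d_inv_sqrt[:, None] * W * d_inv_sqrt[None, :]
    eigvals, eigvecs = np.linalg.eigh(L_sym)  # eigenvalues in ascending order
    V = d_inv_sqrt[:, None] * eigvecs  # map back to generalized eigenvectors
    return eigvals[:num_vectors], V[:, :num_vectors]


# Toy affinity: two groups of three nodes, strong within-group affinity (1.0)
# and weak cross-group affinity (0.01).
W = np.full((6, 6), 0.01)
W[:3, :3] = 1.0
W[3:, 3:] = 1.0
np.fill_diagonal(W, 0.0)

eigvals, V = ncut_eigenvectors(W, num_vectors=2)
labels = (V[:, 1] > 0).astype(int)  # sign of the second eigenvector
```

Here thresholding the second eigenvector at zero stands in for the K-means step described above; on this toy graph it recovers the two groups.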

A recent variant of Normalized Cuts for image segmentation is the Multiscale Normalized Cuts (NCuts) approach of Cour et al. [33]. The fact that W must be sparse, in order to avoid a prohibitively expensive computation, limits the naive implementation to using only local pixel affinities. Cour et al. solve this limitation by computing sparse affinity matrices at multiple scales, setting up cross-scale constraints, and deriving a new eigenproblem for this constrained multiscale cut.

Sharon et al. [31] propose an alternative to improve the computational efficiency of Normalized Cuts. This approach, inspired by algebraic multigrid, iteratively coarsens the original graph by selecting a subset of nodes such that each variable on the fine level is strongly coupled to one on the coarse level. The same merging strategy is adopted in [52], where the strong coupling of a subset S of the graph nodes V is formalized as:

$$\frac{\sum_{j \in S} p_{ij}}{\sum_{j \in V} p_{ij}} > \psi \quad \forall i \in V - S \tag{3}$$

where $\psi$ is a constant and $p_{ij}$ the probability of merging $i$ and $j$, estimated from brightness and texture similarity.

Many approaches to image segmentation fall into a different category than those covered so far, relying on the formulation of the problem in a variational framework. An example is the model proposed by Mumford and Shah [53], where the segmentation of an observed image $u_0$ is given by the minimization of the functional:

$$F(u, C) = \int_{\Omega} (u - u_0)^2 \, dx + \mu \int_{\Omega \setminus C} |\nabla(u)|^2 \, dx + \nu |C| \tag{4}$$

where $u$ is piecewise smooth in $\Omega \setminus C$ and $\mu$, $\nu$ are weighting parameters. Theoretical properties of this model can be found in, e.g. [53], [54]. Several algorithms have been developed to minimize the energy (4) or its simplified version, where $u$ is piecewise constant in $\Omega \setminus C$. Koepfler et al. [55] proposed a region merging method for this purpose. Chan and Vese [56], [57] follow a different approach, expressing (4) in the level set formalism of Osher and Sethian [58], [59]. Bertelli et al. [30] extend this approach to more general cost functions based on pairwise pixel similarities. Recently, Pock et al. [60] proposed to solve a convex relaxation of (4), thus obtaining robustness to initialization. Donoser et al. [29] subdivide the problem into several figure/ground segmentations, each initialized using low-level saliency and solved by minimizing an energy based on Total Variation.

2.3 Benchmarks

Though much of the extensive literature on contour detection predates its development, the BSDS [2] has since found wide acceptance as a benchmark for this task [23], [24], [25], [26], [27], [28], [35], [61]. The standard for evaluating segmentation algorithms is less clear.

One option is to regard the segment boundaries as contours and evaluate them as such. However, a methodology that directly measures the quality of the segments is also desirable. Some types of errors, e.g. a missing pixel in the boundary between two regions, may not be reflected in the boundary benchmark, but can have substantial consequences for segmentation quality, e.g. incorrectly merging large regions. One might argue that the boundary benchmark favors contour detectors over segmentation methods, since the former are not burdened with the constraint of producing closed curves. We therefore also consider various region-based metrics.

2.3.1 Variation of Information

The Variation of Information metric was introduced for the purpose of clustering comparison [6]. It measures the distance between two segmentations in terms of their average conditional entropy given by:

$$VI(S, S') = H(S) + H(S') - 2I(S, S') \tag{5}$$

where $H$ and $I$ represent respectively the entropies and mutual information between two clusterings of data $S$ and $S'$. In our case, these clusterings are test and ground-truth segmentations. Although $VI$ possesses some interesting theoretical properties [6], its perceptual meaning and applicability in the presence of several ground-truth segmentations remain unclear.
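Equation (5) can be evaluated directly from the joint label histogram of two segmentations. The following is a minimal NumPy sketch; the function names are assumptions, entropies are measured in nats (natural logarithm), and label maps may be any integer arrays of equal size.

```python
import numpy as np


def entropy(labels):
    """Shannon entropy (in nats) of the label distribution."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return float(-(p * np.log(p)).sum())


def mutual_information(a, b):
    """Mutual information (in nats) between two labelings of the same pixels."""
    a = np.asarray(a).ravel()
    b = np.asarray(b).ravel()
    _, ai = np.unique(a, return_inverse=True)
    _, bi = np.unique(b, return_inverse=True)
    joint = np.zeros((ai.max() + 1, bi.max() + 1))
    np.add.at(joint, (ai, bi), 1)  # joint label histogram
    joint /= joint.sum()
    pa = joint.sum(axis=1, keepdims=True)
    pb = joint.sum(axis=0, keepdims=True)
    nz = joint > 0
    return float((joint[nz] * np.log(joint[nz] / (pa @ pb)[nz])).sum())


def variation_of_information(a, b):
    """VI(S, S') = H(S) + H(S') - 2 I(S, S')."""
    return entropy(a) + entropy(b) - 2.0 * mutual_information(a, b)


seg = [0, 0, 1, 1]
same = [5, 5, 7, 7]      # identical partition under different label names
crossed = [0, 1, 0, 1]   # statistically independent of seg
```

VI is zero for identical partitions regardless of label names, and for the two independent two-way splits above it equals 2 ln 2.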

2.3.2 Rand Index

Originally, the Rand Index [62] was introduced for general clustering evaluation. It operates by comparing the compatibility of assignments between pairs of elements in the clusters. The Rand Index between test and ground-truth segmentations $S$ and $G$ is given by the sum of the number of pairs of pixels that have the same label in $S$ and $G$ and those that have different labels in both segmentations, divided by the total number of pairs of pixels. Variants of the Rand Index have been proposed [5], [7] for dealing with the case of multiple ground-truth segmentations. Given a set of ground-truth segmentations $\{G_k\}$, the Probabilistic Rand Index is defined as:

$$PRI(S, \{G_k\}) = \frac{1}{T} \sum_{i<j} \bigl[ c_{ij} p_{ij} + (1 - c_{ij})(1 - p_{ij}) \bigr] \tag{6}$$

where $c_{ij}$ is the event that pixels $i$ and $j$ have the same label and $p_{ij}$ its probability. $T$ is the total number of pixel pairs. Using the sample mean to estimate $p_{ij}$, (6) amounts to averaging the Rand Index among different ground-truth segmentations. The $PRI$ has been reported to suffer from a small dynamic range [5], [7], and its values across images and algorithms are often similar. In [5], this drawback is addressed by normalization with an empirical estimation of its expected value.
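A brute-force sketch of the Rand Index and of (6) follows, with $p_{ij}$ estimated by the sample mean over the ground-truth segmentations. It enumerates all pixel pairs explicitly, so it is only practical for very small inputs; the function names and the toy labelings are assumptions made for illustration.

```python
import numpy as np


def same_label_pairs(seg):
    """Boolean vector over all pixel pairs i < j: True where labels agree."""
    flat = np.asarray(seg).ravel()
    i, j = np.triu_indices(flat.size, k=1)
    return flat[i] == flat[j]


def rand_index(seg, gt):
    """Fraction of pixel pairs on which the two segmentations agree."""
    return float(np.mean(same_label_pairs(seg) == same_label_pairs(gt)))


def probabilistic_rand_index(seg, ground_truths):
    """PRI with p_ij estimated as the sample mean over ground truths."""
    c = same_label_pairs(seg).astype(float)
    p = np.mean([same_label_pairs(g) for g in ground_truths], axis=0)
    return float(np.mean(c * p + (1.0 - c) * (1.0 - p)))


seg = [0, 0, 1, 1]
gt_a = [0, 0, 1, 1]
gt_b = [0, 0, 0, 1]
pri = probabilistic_rand_index(seg, [gt_a, gt_b])
```

On this toy input the Rand Index is 1.0 against the first ground truth and 0.5 against the second, and the PRI equals their average, 0.75, matching the remark that (6) reduces to averaging the Rand Index.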

2.3.3 Segmentation Covering

The overlap between two regions $R$ and $R'$, defined as:

$$O(R, R') = \frac{|R \cap R'|}{|R \cup R'|} \tag{7}$$

has been used for the evaluation of the pixel-wise classification task in recognition [8], [11]. We define the covering of a segmentation $S$ by a segmentation $S'$ as:

$$C(S' \rightarrow S) = \frac{1}{N} \sum_{R \in S} |R| \cdot \max_{R' \in S'} O(R, R') \tag{8}$$

where $N$ denotes the total number of pixels in the image.

Similarly, the covering of a machine segmentation $S$ by a family of ground-truth segmentations $\{G_i\}$ is defined by first covering $S$ separately with each human segmentation $G_i$, and then averaging over the different humans. To achieve perfect covering the machine segmentation must explain all of the human data. We can then define two quality descriptors for regions: the covering of $S$ by $\{G_i\}$ and the covering of $\{G_i\}$ by $S$.

3 CONTOUR DETECTION

As a starting point for contour detection, we consider the work of Martin et al. [2], who define a function Pb(x, y, θ) that predicts the posterior probability of a boundary with orientation θ at each image pixel (x, y) by measuring the difference in local image brightness, color, and texture channels. In this section, we review these cues, introduce our own multiscale version of the Pb detector, and describe the new globalization method we run on top of this multiscale local detector.

[Figure 4: histogram panels titled "Upper Half-Disc Histogram" and "Lower Half-Disc Histogram", each with a horizontal axis from 0 to 1.]

Fig. 4. Oriented gradient of histograms. Given an intensity image, consider a circular disc centered at each pixel and split by a diameter at angle θ. We compute histograms of intensity values in each half-disc and output the $\chi^2$ distance between them as the gradient magnitude. The blue and red distributions shown in the middle panel are the histograms of the pixel brightness values in the blue and red regions, respectively, in the left image. The right panel shows an example result for a disc of radius 5 pixels at orientation $\theta = \pi/4$ after applying a second-order Savitzky-Golay smoothing filter to the raw histogram difference output. Note that the left panel displays a larger disc (radius 50 pixels) for illustrative purposes.

3.1 Brightness, Color, Texture Gradients

The basic building block of the Pb contour detector is the computation of an oriented gradient signal G(x, y, θ) from an intensity image I. This computation proceeds by placing a circular disc at location (x, y) split into two half-discs by a diameter at angle θ. For each half-disc, we histogram the intensity values of the pixels of I covered by it. The gradient magnitude G at location (x, y) is defined by the $\chi^2$ distance between the two half-disc histograms $g$ and $h$:

$$\chi^2(g, h) = \frac{1}{2} \sum_i \frac{(g(i) - h(i))^2}{g(i) + h(i)} \tag{9}$$

We then apply second-order Savitzky-Golay filtering [63] to enhance local maxima and smooth out multiple detection peaks in the direction orthogonal to θ. This is equivalent to fitting a cylindrical parabola, whose axis is oriented along direction θ, to a local 2D window surrounding each pixel and replacing the response at the pixel with that estimated by the fit.

Figure 4 shows an example. This computation is motivated by the intuition that contours correspond to image discontinuities and histograms provide a robust mechanism for modeling the content of an image region. A strong oriented gradient response means a pixel is likely to lie on the boundary between two distinct regions.
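The half-disc histogram comparison of Section 3.1 can be sketched for a single pixel and orientation as follows. This is a simplified illustration under stated assumptions: intensities are taken to lie in [0, 1], the bin count and function names are hypothetical, pixels lying exactly on the dividing diameter are skipped, and the Savitzky-Golay smoothing step is omitted.

```python
import numpy as np


def chi2_distance(g, h):
    """Chi-squared distance between two histograms, skipping empty bins."""
    denom = g + h
    valid = denom > 0
    return 0.5 * float(np.sum((g[valid] - h[valid]) ** 2 / denom[valid]))


def oriented_gradient(image, x, y, theta, radius=5, bins=8):
    """Chi-squared distance between the two half-disc histograms at (x, y)."""
    h, w = image.shape
    ys, xs = np.mgrid[0:h, 0:w]
    dx, dy = xs - x, ys - y
    in_disc = dx ** 2 + dy ** 2 <= radius ** 2
    # Signed distance from each pixel to the diameter at angle theta;
    # its sign selects the half-disc.
    side = -dx * np.sin(theta) + dy * np.cos(theta)
    eps = 1e-9  # pixels on the diameter itself belong to neither half
    hists = []
    for half in (in_disc & (side > eps), in_disc & (side < -eps)):
        counts, _ = np.histogram(image[half], bins=bins, range=(0.0, 1.0))
        hists.append(counts / max(counts.sum(), 1))  # normalize to sum 1
    return chi2_distance(hists[0], hists[1])


# Toy image with a vertical step edge at column 10.
img = np.zeros((21, 21))
img[:, 10:] = 1.0
across = oriented_gradient(img, x=10, y=10, theta=np.pi / 2)  # halves left/right
along = oriented_gradient(img, x=10, y=10, theta=0.0)         # halves top/bottom
```

A diameter aligned with the edge separates dark from bright pixels and yields the maximum distance of 1, while a diameter perpendicular to it gives two identical histograms and a response of 0.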

The Pb detector combines the oriented gradient signals obtained from transforming an input image into four separate feature channels and processing each channel independently. The first three correspond to the channels of the CIE Lab colorspace, which we refer to
