基于Randomwalker的图像分割资源-CSDN文库

需积分: 10 124 浏览量 2009-07-01 08:06:23 上传评论收藏 3.23MB PDF 举报

### 基于Random Walker的图像分割技术 #### 概述本文介绍了一种新颖的交互式多标签图像分割方法——基于随机游走(Random Walker)的图像分割算法。该算法利用用户定义或预定义的少量标记像素来计算其余未标记像素最有可能到达的已标记像素的概率，并根据这一概率为每个像素分配相应的标签。这种方法不仅能快速获得高质量的图像分割结果，还具有理论上的严谨性和实际应用中的灵活性。 #### 随机游走在图像分割中的应用 1. **基础概念**：随机游走是一种数学模型，用于模拟物体在空间中随机移动的过程。在图像分割领域，这种模型被用来预测一个像素最可能归属于哪个区域。 2. **算法流程**： - **初始化**：首先由用户或预处理阶段为图像中的某些像素分配特定的标签（如前景、背景等）。 - **概率计算**：对于每个未标记的像素，计算一个随机游走从该像素出发首次到达已标记像素的概率。 - **分割决策**：将每个未标记像素分配给使得其概率最大的已标记像素所代表的类别。 3. **优势特点**： - **快速计算**：通过求解稀疏、对称正定线性方程组实现快速计算。 - **灵活编辑**：允许用户对结果进行快速修改，适应不同需求。 - **直观分割**：能够直观地生成高质量的分割结果。 - **任意分割能力**：通过足够的交互操作可以产生任意形状的分割区域。 #### 理论基础与潜在联系 1. **离散势理论**：该算法基于离散势理论，利用组合算子和连续势理论的原则在图上构建模型，从而适用于任意维度和任意图结构。 2. **电网络类比**：算法与电网络理论紧密相连，可以将图像分割问题转换为电网络中的电流流动问题。 3. **拉普拉斯方程**：随机游走算法中的核心问题可以表述为解决一个离散的拉普拉斯方程，即寻找满足特定边界条件的调和函数。 #### 技术细节 1. **离散空间表示**：算法在离散空间（即图结构）中进行建模，利用组合算子来近似连续空间中的操作符。 2. **拉普拉斯矩阵**：算法的核心在于求解一个与图像对应的图的拉普拉斯矩阵相关的线性方程组。 3. **边界条件**：用户定义的标记像素作为边界条件，确保算法能够找到合理的分割结果。 #### 应用场景与未来展望 1. **医学影像分析**：在医学影像中识别特定组织或器官。 2. **自动驾驶**：道路标志、行人和障碍物的检测与识别。 3. **遥感图像处理**：土地覆盖分类和变化检测。 4. **视频监控**：动态目标的跟踪与识别。基于随机游走的图像分割算法不仅理论基础扎实，而且在实际应用中展现出极高的效率和灵活性，是当前图像分割领域一项重要的技术进展。随着计算机视觉技术的不断发展，未来有望在更多领域发挥重要作用。

资源推荐

资源详情

资源评论

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 28, NO. 11, NOV. 2006 1

Random Walks for Image Segmentation

Leo Grady

Abstract— A novel method is proposed for performing multi-

label, interactive image segmentation. Given a small number

of pixels with user-deﬁned (or pre-deﬁned) labels, one can

analytically and quickly determine the probability that a random

walker starting at each unlabeled pixel will ﬁrst reach one of

the pre-labeled pixels. By assigning each pixel to the label for

which the greatest probability is calculated, a high-quality image

segmentation may be obtained. Theoretical properties of this

algorithm are developed along with the corresponding connec-

tions to discrete potential theory and electrical circuits. This

algorithm is formulated in discrete space (i.e., on a graph) using

combinatorial analogues of standard operators and principles

from continuous potential theory, allowing it to be applied in

arbitrary dimension on arbitrary graphs.

Index Terms—Image segmentation, interactive segmentation,

graph theory, random walks, combinatorial Dirichlet problem,

harmonic functions, Laplace equation, graph cuts, boundary

completion

I. INTRODUCTION

MAGE segmentation has often been deﬁned as the problem

of localizing regions of an image relative to content (e.g.,

image homogeneity). However, recent image segmentation

approaches have provided interactive methods that implicitly

deﬁne the segmentation problem relative to a particular task

of content localization. This approach to image segmentation

requires user (or preprocessor) guidance of the segmentation

algorithm to deﬁne the desired content to be extracted.

A practical interactive segmentation algorithm must provide

four qualities: 1) Fast computation, 2) Fast editing, 3) An

ability to produce an arbitrary segmentation with enough

interaction, 4) Intuitive segmentations. The random walker al-

gorithm introduced here exhibits all of these desired qualities.

We note that this algorithm was ﬁrst presented in a shortened

form as a conference paper [1]. The random walker algorithm

requires the solution of a sparse, symmetric positive-deﬁnite

system of linear equations which may be solved quickly

through a variety of methods. The algorithm may perform fast

editing by using the previous solution as the initialization of

an iterative matrix solver. An arbitrary segmentation may also

be achieved through enough user interaction.

In this paper, we present a novel approach to K-way image

segmentation given user-deﬁned seeds indicating regions of

the image belonging to K objects. Each seed speciﬁes a

location with a user-deﬁned label. The algorithm labels an

unseeded pixel by resolving the question: Given a random

walker starting at this location, what is the probability that

Published in: IEEE Trans. on Pattern Analysis and Machine Intelligence,

Vol. 28, No. 11, pp. 1768–1783, Nov., 2006

Leo Grady is with Siemens Corporate Research, Department of Imaging

and Visualization, 755 College Road East, Princeton, NJ 08540, E-mail:

Leo.Grady@siemens.com

it ﬁrst reaches each of the K seed points? It will be shown

that this calculation may be performed exactly without the

simulation of a random walk. By performing this calculation,

we assign a K-tuple vector to each pixel that speciﬁes the

probability that a random walker starting from each unseeded

pixel will ﬁrst reach each of the K seed points. A ﬁnal

segmentation may be derived from these K-tuples by selecting

for each pixel the most probable seed destination for a random

walker. By biasing the random walker to avoid crossing sharp

intensity gradients, a quality segmentation is obtained that

respects object boundaries (including weak boundaries). In a

uniform image (e.g., all black) or, as will be proved in Section

IV, an image of pure noise, a segmentation will be obtained

that roughly corresponds to Voronoi cells for each set of seed

points. We term this segmentation the neutral segmentation

since the image is neutral (i.e., devoid of meaningful content).

In our approach, we treat an image (or volume) as a purely

discrete object — a graph with a ﬁxed number of vertices and

edges. Each edge is assigned a real-valued weight correspond-

ing to the likelihood that a random walker will cross that edge

(e.g., a weight of zero means that the walker may not move

along that edge). The advantage of formulating the problem on

a graph is that purely combinatorial operators may be used that

require no discretization and therefore incur no discretization

errors or ambiguities. Formulation of the algorithm on a graph

also allows the application of the algorithm to surface meshes

or space-variant images [2], [3]. Regardless of the dimensions

of the data, we will use the term pixel throughout this paper to

refer to the basic picture element in the context of its intensity

values. In contrast, the term node will be used in the context

of a graph-theoretical discussion.

Although the present algorithm is motivated in terms of

random walks, an adequate sampling from this distribution

would be completely infeasible for segmentation problems of

interest. Fortunately, it has been previously established [4], [5]

that the probability a random walker ﬁrst reaches a seed point

exactly equals the solution to the Dirichlet problem [6] with

boundary conditions at the locations of the seed points and

the seed point in question ﬁxed to unity while the others are

set to zero. For a popular account of this connection, see [7].

The development of a fully discrete calculus [8] has allowed

for the connection between random walks on graphs [9] and

discrete potential theory [10] to be made completely explicit

[5]. The solution to the combinatorial Dirichlet problem on

an arbitrary graph is given exactly by the distribution of

electric potentials on the nodes of an electrical circuit with

resistors representing the inverse of the weights (i.e., the

weights represent conductance) and the “boundary conditions”

given by voltage sources ﬁxing the electric potential at the

“boundary nodes”.

In light of the connection between random walks on graphs

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 28, NO. 11, NOV. 2006 2

and discrete potential theory, one may calculate the probability,

, that a random walker starting at pixel v

ﬁrst reaches

a seed with label s, by solving the circuit theory problem

that corresponds to a combinatorial analog of the Dirichlet

problem [5]. Ground (i.e., ﬁx the potential to zero) all seed

points belonging to labels other than s and establish a unit

voltage source with ground that ﬁxes the s-labeled seeds to

have a unit potential. The electric potentials established at

each unseeded node provide the probabilities that a walker

originating from that node will ﬁrst reach the seed with label

s. These electric potentials may be calculated through the

solution of a system of sparse linear equations, as described in

section III-G. The full K-tuple may be calculated by ﬁnding

the potentials established through switching “on” (providing

a unit voltage source to) each labeled collection of nodes

and “off” (grounding) the remaining labeled nodes. Therefore,

K −1 systems of linear equations must be solved. By linearity

(i.e., the principle of superposition in circuit theory), the

potentials so calculated must sum to unity. This allows us to

avoid solving for one of the systems by subtracting the sum

of the calculated potentials from unity to ﬁnd the last entry in

the full K-tuple. A function that solves the Dirichlet problem

for a given set of boundary conditions is known as harmonic.

Figure I illustrates the harmonic functions (and subsequent

segmentation) obtained for a 4 × 4 graph with unit weights in

the presence of three seeds with different labels.

Additional properties of our approach that will be estab-

lished in Section IV-C include:

1) Each segment is guaranteed to be connected to seed

points with the same label, i.e., there are no isolated

regions of a particular label that contain no seed points.

2) The K-tuple of probabilities at each pixel is equal to the

weighted average of the K-tuples of neighboring pixels,

with the weights given by walker biases.

3) The solution for the potentials is unique.

4) The expected segmentation for an image of pure noise,

given by independent, equal-mean, random variables, is

equal to that obtained in the neutral segmentation.

A rich tradition of work in image segmentation has focused

on the establishment of appropriate image (object) models

and the development of algorithms focused on ﬁnding the

parameters for these models (e.g., [11]). For example, the

FRAME model of [12] provides a method for both synthesis

and analysis of image textures. A different line of research

in computer vision has ﬁrst established the desired behavior

of an algorithm and then set out to identify a PDE or other

physical process that exhibits the desired behavior. In such

approaches, an image is typically viewed as a domain with

material properties (metric) induced by the image content upon

which the PDE or other physical process is simulated. Notable

examples of research along this second line of work include

anisotropic diffusion for image ﬁltering [13] and normalized

cuts for image segmentation [14]. In such approaches, the

primary focus is typically on the characteristic behavior of

the process and the manner in which the image content

induces a metric is left as a task-speciﬁc question (e.g.,

this information may come from intensity gradients, color

gradients or texture gradients, as appropriate to the particular

problem). The present random walker approach follows from

this second tradition in computer vision in which desirable

behavioral properties of an interactive segmentation algorithm

are identiﬁed and a particular physical process is proposed

that exhibits the required characteristics. In this case, the

characteristics that we try to capture in an interactive seg-

mentation algorithm are: 1) Location of weak (or missing)

boundaries, 2) Noise robustness, 3) Ability to identify multiple

objects simultaneously, 4) Fast computation (and editing), 5)

Avoidance of small/trivial solutions (i.e., an avoidance of a

“small cut” phenomenon).

This paper is organized as follows. Section II reviews the

relationship of this work to previous approaches. Section III

gives a simple weighting function, derives the set of linear

equations that must be solved and provides implementation de-

tails. Section IV establishes theoretical properties and Section

V examines behavioral properties of the algorithm. Section VI

provides segmentation results and we conclude in Section VII

with a summary of the algorithm presented and a discussion

of future work.

II. PRIOR WORK

Image segmentation is a vast topic. Therefore, we limit

our review to supervised and/or graph-based algorithms. Ad-

ditional work on random walks and combinatorial harmonic

functions will also be discussed.

A. Supervised segmentation

Supervised segmentation algorithms typically operate under

one of two paradigms for guidance: 1) Speciﬁcation of pieces

of the boundary of the desired object or a nearby complete

boundary that evolves to the desired boundary, 2) Speciﬁcation

of a small set of pixels belonging to the desired object and

(possibly) a set of pixels belonging to the background. We

note also that any of the automatic segmentation algorithms

might be considered supervised by subsequent user selection

of the desired segment. However, if the desired object is

not a complete segment, a secondary clustering/segmentation

algorithm must be employed to split or merge the automatic

segments.

The intelligent scissors algorithm [15] treats the image as

a graph where each pixel is associated with a node and a

connectivity structure is imposed. This technique requires the

user to place points along the boundary of the desired object.

Dijkstra’s algorithm is then used to compute the shortest path

between the user-deﬁned points and this path is treated as

the object boundary. The algorithm is simple to implement,

very fast and may be used to obtain an arbitrary boundary

with enough points. Unfortunately, a low-contrast or noisy

boundary may require the speciﬁcation of many points and

the algorithm is inapplicable to 3D boundaries.

Although the family of active contours and level sets is

large [16], a user is generally asked to place a contour near

the desired boundary and the algorithm evolves the boundary

to a local energy minimum. Many different terms in the energy

functional may be used to achieve different effects or employ

剩余16页未读，继续阅读

评论收藏

内容反馈

dongshidu

粉丝: 0
资源: 4

基于Random walker 的图像分割

最新资源

基于Random walker 的图像分割

多随机游走 图像协同分割

Random walks for image segmentation.zip_coolhpo_random_random wa

Random Walker 图像分割 Matlab 源代码

基于概率图谱和Random Walker的肝脏三维分割算法 (2012年)

基于grabcut的图像分割

基于k-mean的图像分割

随机游走matlab代码-SubMarkov-Random-Walk-for-Image-Segmentation-:用于图像分割的SubMa

基于随机游走的图像分割matlab代码

基于图割的图像分割

基于MRF的图像分割源码（VC）

基于GrabCut图像分割

基于马尔科夫的图像分割

基于贝叶斯的图像分割（MATLAB源码和数据文件以及PPT详解）基于经

图像分割 基于图的图像分割

基于超像素的Graph_Based图像分割算法.rar_Graph-Based 分割_graph-based_图像分割_基于超像

基于数据驱动的马尔可夫链的图像分割

各种图像分割的代码

图像分割、图像分割介绍、图像分割资料打包

图像分割方法概述

GraphSeg.zip_image segmentation_图像分割 图论_图论 分割_图论图像分割_图论图像处理

图像分割边缘检测源代码

ICM--MRF--matlab.rar_MRF 图像分割_MRF 算法_SAR MATLAB_SAR 图像_sar图像分割

图论 图像分割

Qt 5实现串口调试助手 （源工程文件、0积分下载）

【SystemVerilog】路科验证V2学习笔记（全600页）.pdf

最新资源

多随机游走图像协同分割

图像分割基于图的图像分割

GraphSeg.zip_image segmentation_图像分割图论_图论分割_图论图像分割_图论图像处理

图论图像分割

Qt 5实现串口调试助手（源工程文件、0积分下载）