ELAS algorithm paper + library files + VS2015 x64 implementation - CSDN Library

3 files in total

ZIP: 2

PDF: 1

5 stars · rated above 95% of resources · points required: 36 · 106 views · uploaded 2018-10-19 19:32:59 · 12.81 MB ZIP

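**Description** This page introduces the ELAS algorithm and its use in stereo matching, with reference to the three bundled files: "Efficient Large-Scale Stereo Matching.pdf" (the original ELAS paper), "libelas.zip" (the algorithm's source library), and "ELAS-x64.zip" (a VS2015 x64 project). Stereo matching is a core problem in computer vision: it recovers the 3D geometry of a scene from correspondences between two or more images. ELAS (Efficient LArge-scale Stereo) was proposed by Andreas Geiger, Martin Roser, and Raquel Urtasun as an efficient method for binocular stereo matching of high-resolution images.

The core idea of ELAS is to build a prior on the disparities from a sparse set of 'support points' that can be matched robustly. Support points are matched over the full disparity range, then filtered by a left-right consistency check and a test on the ratio between the best and second-best match. A Delaunay triangulation over the support points induces a piecewise linear prior, which restricts the disparity search for the remaining pixels to plausible regions. This yields accurate dense disparity maps without global optimization; the method also determines the disparity range automatically and is easy to parallelize.

"Efficient Large-Scale Stereo Matching.pdf" is the original ELAS paper. It describes the design of the algorithm, its probabilistic model, and its experimental evaluation on large-scale versions of the Middlebury benchmark images and on real-world datasets.

"libelas.zip" contains the source code of the ELAS library. The code is the key to understanding how the algorithm is implemented in practice, including its data structures, matching cost computation, and the triangulation of support points. It is a valuable resource for anyone who wants to use or modify ELAS in their own project.

"ELAS-x64.zip" is a Visual Studio 2015 x64 project, which lets developers build and run ELAS directly on Windows and step through the code with the Visual Studio debugger.

Together, the paper, the library, and the project give researchers and developers what they need to study ELAS and apply it in practice.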


ELAS原文+库文件+VS2015 x64实现.zip (ELAS paper + library files + VS2015 x64 implementation; 3 files)

libelas.zip 8.15MB

Efficient Large-Scale Stereo Matching.pdf 3.9MB

ELAS-x64.zip 807KB

Efficient Large-Scale Stereo Matching

Andreas Geiger, Martin Roser, and Raquel Urtasun

Dep. of Measurement and Control, Karlsruhe Institute of Technology

Toyota Technological Institute at Chicago

geiger@kit.edu, martin.roser@kit.edu, rurtasun@ttic.edu

Abstract. In this paper we propose a novel approach to binocular stereo for fast matching of high-resolution images. Our approach builds a prior on the disparities by forming a triangulation on a set of support points which can be robustly matched, reducing the matching ambiguities of the remaining points. This allows for efficient exploitation of the disparity search space, yielding accurate dense reconstruction without the need for global optimization. Moreover, our method automatically determines the disparity range and can be easily parallelized. We demonstrate the effectiveness of our approach on the large-scale Middlebury benchmark, and show that state-of-the-art performance can be achieved with significant speedups. Computing the left and right disparity maps for a one Megapixel image pair takes about one second on a single CPU core.

1 Introduction

Estimating depth from binocular imagery is a core subject in low-level vision as it is an important building block in many domains such as multi-view reconstruction. In order to be of practical use for applications such as autonomous driving, disparity estimation methods should run at speeds similar to other low-level visual processing techniques, e.g. edge extraction or interest point detection. Since depth errors increase quadratically with the distance [1], high-resolution images are needed to obtain accurate 3D representations. While the benefits of high resolution imagery are already exploited exhaustively in structure-from-motion, object recognition and scene classification, only few binocular stereo methods deal efficiently with large images.
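The quadratic growth of depth error with distance follows from the standard rectified-stereo relation between depth and disparity. A short worked derivation, assuming a focal length $f$ (in pixels), a baseline $b$, a disparity $d$ and a disparity error $\delta d$ (symbols introduced here for illustration, not taken from the paper):

```latex
z = \frac{f\,b}{d}
\quad\Longrightarrow\quad
\left|\frac{\partial z}{\partial d}\right| = \frac{f\,b}{d^{2}} = \frac{z^{2}}{f\,b}
\quad\Longrightarrow\quad
\delta z \approx \frac{z^{2}}{f\,b}\,\delta d
```

For a fixed disparity error, doubling the distance therefore roughly quadruples the depth error, while a higher image resolution (a larger $f$ in pixels) reduces it.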

Stereo algorithms based on local correspondences [2, 3] are typically fast, but require an adequate choice of window size. As illustrated in Fig. 1 this leads to a trade-off between low matching ratios for small window sizes and border bleeding artifacts for larger ones. As a consequence, poorly-textured and ambiguous surfaces cannot be matched consistently.

Algorithms based on global correspondences [4–9] overcome some of the aforementioned problems by imposing smoothness constraints on the disparities in the form of regularized energy functions. Since optimizing such MRF-based energy functions is in general NP-hard, a variety of approximation algorithms have been proposed, e.g., graph cuts [4, 5] or belief propagation [6]. However, even on low-resolution imagery, they generally require large computational efforts and high memory capacities. For example, storing all messages of a one Megapixel image pair requires more than 3 GB of RAM [10]. In these approaches, the disparity range usually has to be known in advance, and a good choice of the regularization parameters is crucial. Furthermore, when increasing image resolution, the widely used priors based on binary potentials fail to reconstruct poorly-textured and slanted surfaces, as they favor fronto-parallel planes. Recently developed methods based on higher-order cliques [7] overcome these problems, but are even more computationally demanding.

Fig. 1. Low-textured areas often pose problems to stereo algorithms. Using local methods one faces the trade-off between low matching ratios (top-right, window size 5 × 5) and border bleeding effects (bottom-left, window size 25 × 25). Our method is able to combine small window sizes with high matching ratios (bottom-right).

In this paper we propose a generative probabilistic model for stereo matching, called ELAS (Efficient LArge-scale Stereo), which allows for dense matching with small aggregation windows by reducing ambiguities on the correspondences. Our approach builds a prior over the disparity space by forming a triangulation on a set of robustly matched correspondences, named ‘support points’. Since our prior is piecewise linear, we do not suffer in the presence of poorly-textured and slanted surfaces. This results in an efficient algorithm that reduces the search space and can be easily parallelized. As demonstrated in our experiments, our method is able to achieve state-of-the-art performance with significant speedups of up to three orders of magnitude when compared to prevalent approaches; we obtain 300 MDE/s (million disparity evaluations per second) on a single CPU core.

2 Related work

In the past few years much progress has been made towards solving the stereo problem, as evidenced by the excellent overview of Scharstein et al. [2]. Local methods typically aggregate image statistics in a small window, thus imposing smoothness implicitly. Optimization is usually performed using a winner-takes-all strategy, which selects for each pixel the disparity with the smallest value under some distance metric [2]. Weber et al. [3] achieved real-time performance using the Census transform and a GPU implementation. However, as illustrated by Fig. 1, traditional local methods [11] often suffer from border bleeding effects or struggle with correspondence ambiguities. Approaches based on adaptive support windows [12, 13] adjust the window size or adapt the pixel weighting within a fixed-size window to improve performance, especially close to border discontinuities. Unfortunately, since for each pixel many weight factors have to be computed, these methods are much slower than fixed-window ones [13].

C++ source code, Matlab wrappers and videos are online at http://www.cvlibs.net
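The winner-takes-all strategy used by local methods can be sketched as follows. This is a minimal illustration, assuming rectified grayscale images stored as lists of rows and a sum-of-absolute-differences window cost; the function names and the choice of cost are assumptions made here, not taken from the paper or from libelas.

```python
def sad_cost(left, right, row, col, d, half):
    """Sum of absolute differences between the window centred at (row, col)
    in the left image and the window centred at (row, col - d) in the right.
    The caller must keep both windows inside the images."""
    total = 0
    for dr in range(-half, half + 1):
        for dc in range(-half, half + 1):
            total += abs(left[row + dr][col + dc] - right[row + dr][col + dc - d])
    return total


def wta_disparity(left, right, row, col, max_disp, half=1):
    """Winner-takes-all: return the disparity with the smallest window cost."""
    best_d, best_cost = 0, float("inf")
    for d in range(max_disp + 1):
        if col - d - half < 0:  # window would leave the right image
            break
        cost = sad_cost(left, right, row, col, d, half)
        if cost < best_cost:
            best_d, best_cost = d, cost
    return best_d
```

The window half-size `half` is the parameter behind the trade-off shown in Fig. 1: small windows leave many pixels ambiguous, large windows bleed across object borders.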

Dense and accurate matching can be obtained by global methods, which enforce smoothness explicitly by minimizing an MRF-based energy function which can be decomposed as the sum of a data fitting term and a regularization term. Since for most energies of practical use such an optimization is NP-hard, approximate algorithms have been proposed, e.g. graph-cuts [4, 5], belief propagation [6]. Klaus et al. [14] extend global methods to use mean-shift color segmentation, followed by belief propagation on super-pixels. In [15], a parallel VLSI hardware design for belief propagation that achieves real time performance on VGA imagery was proposed. The application of global methods to high-resolution images is, however, limited by their high computational and memory requirements, especially in the presence of large disparity ranges. Furthermore, models based on binary potentials between pixels favor fronto-parallel surfaces which leads to errors in low-textured slanted surfaces. Higher order cliques can overcome these problems [7], but they are even more computationally demanding.

Hirschmüller proposed semi-global matching [16], an approach which extends polynomial time 1D scan-line methods to propagate information along 16 orientations. While reducing streaking artifacts and improving accuracy compared to traditional methods based on dynamic programming, computational complexity increases with the number of computed paths. ‘Ground control points’ are used in [17] to improve the occlusion cost sensitivity of dynamic programming algorithms. In [18, 19] disparities are ‘grown’ from a small set of initial correspondence seeds. Though these methods produce accurate results and can be faster than global approaches, they do not provide dense matching and struggle with textureless and distorted image areas. Approaches to reduce the search space have been investigated for global stereo methods [10, 20]. However, they mainly focus on memory requirements and start with a full search using local methods first. Furthermore, the use of graph-cuts imposes high computational costs particularly for large-scale imagery.

In contrast, in this paper we propose a Bayesian approach to stereo matching that is able to compute accurate disparity maps of high resolution images at frame rates close to real time without the need for global optimization. The remainder of this paper is structured as follows: In Section 3 we describe our approach to efficient large-scale stereo matching. Experimental results on real-world datasets and comparisons to a variety of other methods on large-scale versions of the Middlebury benchmark images are reported in Section 4. Finally, Section 5 gives our conclusions and future work.

3 Efficient Large-Scale Stereo Matching

In this section we describe our approach to efficient stereo matching of high-resolution images. Our method is inspired by the observation that, although many stereo correspondences are highly ambiguous, some of them can be robustly matched. Assuming piecewise smooth disparities, such reliable ‘support points’ contain valuable prior information for the estimation of the remaining ambiguous disparities. Our approach proceeds as follows: First, the disparities of a sparse set of support points are computed using a full disparity range. The image coordinates of the support points are then used to create a 2D mesh via Delaunay triangulation. A prior is computed to disambiguate the matching problem, making the process efficient by restricting the search to plausible regions. In particular, this prior is formed by computing a piecewise linear function induced by the support point disparities and the triangulated mesh. For simplicity of the presentation, we will assume rectified input images, such that correspondences are restricted to the same line in both images.
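The piecewise linear prior can be illustrated for a single triangle of the mesh. The sketch below assumes the Delaunay triangulation has already been computed elsewhere and only fits the plane through one triangle's three support points; the function names, the `radius` parameter and the clipping to `max_disp` are hypothetical choices made here to show how such a prior narrows the search, not the paper's exact procedure.

```python
def plane_from_support_points(s1, s2, s3):
    """Fit d = a*u + b*v + c through three support points given as (u, v, d)."""
    (u1, v1, d1), (u2, v2, d2), (u3, v3, d3) = s1, s2, s3
    det = (u2 - u1) * (v3 - v1) - (u3 - u1) * (v2 - v1)
    if det == 0:
        raise ValueError("support points are collinear")
    # Cramer's rule on the 2x2 system for the plane slopes a and b.
    a = ((d2 - d1) * (v3 - v1) - (d3 - d1) * (v2 - v1)) / det
    b = ((u2 - u1) * (d3 - d1) - (u3 - u1) * (d2 - d1)) / det
    c = d1 - a * u1 - b * v1
    return a, b, c


def prior_mean(plane, u, v):
    """Disparity predicted by the triangle's plane at pixel (u, v)."""
    a, b, c = plane
    return a * u + b * v + c


def candidate_disparities(mean, radius, max_disp):
    """Integer disparities within `radius` of the prior mean, clipped to [0, max_disp]."""
    centre = round(mean)
    lo = max(0, centre - radius)
    hi = min(max_disp, centre + radius)
    return list(range(lo, hi + 1))
```

Because each triangle carries its own plane, a slanted surface gets a slanted prior instead of being pulled towards a fronto-parallel one, and only a few disparities per pixel need to be evaluated.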

3.1 Support Points

As support points, we denote pixels which can be robustly matched due to their texture and uniqueness. While a variety of methods for obtaining stable correspondences are available [17, 21, 22], we find that matching support points on a regular grid using the $\ell_1$ distance between vectors formed by concatenating the horizontal and vertical Sobel filter responses of 9 × 9 pixel windows to be both efficient and effective. In all of our experiments we used Sobel masks of size 3 × 3 and a grid with fixed step-size of 5 pixels. A large disparity search range of half the input image width was employed to impose no restrictions on the disparities. We also experimented with sparse interest point descriptors such as SURF [23], but found that they did not improve matching accuracy while being slower to compute.
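A literal reading of this matching step can be sketched as below: 3 × 3 Sobel responses are concatenated over a square window and compared with the ℓ1 distance along the same image row. This is a simplified sketch under stated assumptions (images as lists of rows, all windows inside the image, hypothetical function names); the actual libelas descriptor layout may differ.

```python
def sobel(img, r, c):
    """Horizontal and vertical 3x3 Sobel responses at pixel (r, c)."""
    gx = (img[r - 1][c + 1] + 2 * img[r][c + 1] + img[r + 1][c + 1]
          - img[r - 1][c - 1] - 2 * img[r][c - 1] - img[r + 1][c - 1])
    gy = (img[r + 1][c - 1] + 2 * img[r + 1][c] + img[r + 1][c + 1]
          - img[r - 1][c - 1] - 2 * img[r - 1][c] - img[r - 1][c + 1])
    return gx, gy


def descriptor(img, r, c, half=4):
    """Concatenate Sobel responses over a (2*half+1)^2 window (9x9 for half=4)."""
    desc = []
    for dr in range(-half, half + 1):
        for dc in range(-half, half + 1):
            desc.extend(sobel(img, r + dr, c + dc))
    return desc


def l1(a, b):
    """l1 distance between two equally long vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def matching_costs(left, right, r, c, max_disp, half=4):
    """Cost of matching left pixel (r, c) to right pixel (r, c - d), d = 0..max_disp."""
    ref = descriptor(left, r, c, half)
    return [l1(ref, descriptor(right, r, c - d, half)) for d in range(max_disp + 1)]
```

Evaluating `matching_costs` only at grid positions (every 5 pixels in the paper's experiments) keeps the full-range search affordable.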

For robustness we impose consistency, i.e., correspondences are retained only if they can be matched from left-to-right and right-to-left. To get rid of ambiguous matches, we eliminate all points whose ratio between the best and the second best match exceeds a fixed threshold, τ = 0.9. Spurious mismatches are removed by deleting all points which exhibit disparity values dissimilar from all surrounding support points. To cover the full image, we add additional support points at the image corners whose disparities are taken to be the ones of their nearest neighbors.
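The two filters described above, the ratio test and the left-right consistency check, can be sketched as follows. The sketch assumes a list of matching costs indexed by disparity (at least two entries) and per-row disparity lists for both images; the function names and the `tol` parameter are assumptions introduced here for illustration.

```python
def best_two(costs):
    """Return (best_disparity, best_cost, second_best_cost) for a cost list
    indexed by disparity. Requires at least two entries."""
    order = sorted(range(len(costs)), key=lambda d: costs[d])
    return order[0], costs[order[0]], costs[order[1]]


def passes_ratio_test(costs, tau=0.9):
    """Keep a match only if best_cost / second_best_cost does not exceed tau."""
    _, best, second = best_two(costs)
    if second == 0:
        return False  # two perfect matches: maximally ambiguous
    return best / second <= tau


def is_consistent(disp_left, disp_right, col, tol=0):
    """Left-right check on one image row: the left pixel at `col` maps to the
    right pixel at col - d, which must map back with the same disparity."""
    d = disp_left[col]
    target = col - d
    if target < 0 or target >= len(disp_right):
        return False
    return abs(disp_right[target] - d) <= tol
```

A ratio close to 1 means the best and second-best candidates are almost equally good, which is exactly the ambiguity the threshold τ = 0.9 is meant to reject.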

3.2 Generative Model for Stereo Matching

We now describe our probabilistic generative model which, given a reference image and the support points, can be used to draw samples from the other image. More formally, let $S = \{s_1, ..., s_M\}$ be a set of robustly matched support points. Each support point, $s_m = (u_m, v_m, d_m)^T$, is defined as the concatenation of its image coordinates, $(u_m, v_m) \in \mathbb{N}^2$, and its disparity, $d_m \in \mathbb{N}$. Let $O = \{o_1, ..., o_N\}$ be a set of image observations, with each observation $o_n = (u_n, v_n, f_n)^T$ formed as the concatenation of its image coordinates, $(u_n, v_n) \in \mathbb{N}^2$, and a feature vector, $f_n \in \mathbb{R}^Q$, e.g., the pixel’s intensity or a low-dimensional descriptor computed from a small neighborhood. We denote $o_n^{(l)}$ and $o_n^{(r)}$ as the observations in the left and right image respectively. Without loss of generality, in the following we consider the left image as the reference image.

Fig. 2. Illustration of the sampling process. Panels: (a) sampling process and graphical model, (b) left image, (c) sample mean, (d) right image. (a) Graphical model and sampling process: Given support points $\{s_1, ..., s_M\}$ and an observation in the left image $o_n^{(l)}$, a disparity $d_n$ is drawn. Given the observation on the left image and the disparity, we can draw an observation in the right image $o_n^{(r)}$. (c) Repeating this process 100 times for each pixel and (d) computing the mean results in a blurred version of the right image.

Assuming that the observations $\{o_n^{(l)}, o_n^{(r)}\}$ and support points $S$ are conditionally independent given their disparities $d_n$, the joint distribution factorizes

$$p(d_n, o_n^{(l)}, o_n^{(r)}, S) \propto p(d_n \mid S, o_n^{(l)})\, p(o_n^{(r)} \mid o_n^{(l)}, d_n) \tag{1}$$

with $p(d_n \mid S, o_n^{(l)})$ the prior and $p(o_n^{(r)} \mid o_n^{(l)}, d_n)$ the image likelihood. The graphical model of our approach is depicted in Fig. 2(a). In particular, we take the prior to be proportional to a combination of a uniform distribution and a sam-


ganleiboy

2019-05-14

Thanks to the author. I opened your solution in Win10 + VS2017, and it compiled and ran successfully in Debug x64 mode.

$南山种豆$

Followers: 232
Resources: 15
