VLFeat image processing library (VLFeat install / Python resources)

Uploaded 2012-09-12 18:45:36 · 34 views · 9.76 MB · GZ archive

1782 files in total

png: 439

html: 427

m: 295

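**VLFeat image processing library in detail**

VLFeat is an open-source, cross-platform image processing library written in C, focused on feature detection and description algorithms for computer vision. It is developed by Andrea Vedaldi and Brian Fulkerson and is a widely used tool for researchers and developers working on image analysis. The release packaged here is 0.9.14; the binary package `vlfeat-0.9.14-bin.tar.gz` contains precompiled libraries and utilities, so it can be deployed quickly on several operating systems.

### Main functional modules

1. **SIFT (Scale-Invariant Feature Transform)**: SIFT is the well-known feature detection and description method proposed by David Lowe, which extracts scale-invariant features from an image. VLFeat implements SIFT, including scale-space extremum detection, keypoint localization, orientation assignment, and descriptor computation.
2. **SURF (Speeded Up Robust Features)**: SURF is a fast and robust feature detection and description method proposed by Herbert Bay et al. It is considerably faster to compute than SIFT while keeping good matching performance. VLFeat itself does not provide a SURF implementation; it is mentioned here only for comparison with SIFT.
3. **MSER (Maximally Stable Extremal Regions)**: MSER is a region detector (not an edge detector) that finds regions which remain stable across intensity thresholds. It is often used in object detection and image segmentation.
4. **HOG (Histogram of Oriented Gradients)**: HOG is a feature descriptor for object detection that captures shape information by computing and accumulating histograms of gradient orientations over local image regions.
5. **K-means clustering**: VLFeat provides K-means implementations (see `kmeans.c`, `ikmeans.c`, and `hikmeans.c` in the file listing) for unsupervised clustering and vector quantization.
6. **Image pyramids**: Used to process an image at several scales, and the basis of scale-invariant detectors such as SIFT.
7. **Data structures and algorithms**: Supporting components that the file listing shows, such as SVM training with the PEGASOS solver (`pegasos.c`), kd-trees (`kdtree.c`), and math utilities (`mathop.c`), which underpin the higher-level functions.

### Application scenarios

VLFeat is widely used in computer vision and machine learning, for example:

- **Object recognition and detection**: Recognizing and detecting objects with SIFT or HOG features.
- **Image matching and stitching**: Searching for correspondences between images with SIFT features, for example to stitch panoramas.
- **Image classification**: Classifying images from extracted features with machine learning models such as SVMs.
- **Video analysis**: Detecting and tracking objects in a video stream using feature extraction and matching.
- **Robot vision**: Recognizing environmental features for robot navigation and obstacle avoidance.

### Usage and integration

VLFeat provides command-line tools and a C API, so its functions can be integrated into other projects. C and C++ developers include the VLFeat headers and link against the library. The archive also contains MATLAB code (the 295 `.m` files). Python users rely on third-party bindings rather than an interface shipped with the library.

### Summary

VLFeat provides several key feature detection and description methods and is a useful resource for computer vision research and application development, in both academic and industrial projects.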


VLFeat image processing library (1782 files)

sift.1 5KB

mser.1 4KB

vlfeat.7 3KB

aib 10KB

aib 9KB

aib 8KB

vlfeat.bib 3KB

sift.c 71KB

kmeans.c 52KB

generic.c 32KB

mser.c 29KB

sift.c 26KB

dsift.c 24KB

imopv.c 23KB

kdtree.c 23KB

aib.c 21KB

vl_alldist2.c 17KB

homkermap.c 17KB

mser.c 17KB

vl_sift.c 14KB

host.c 14KB

quickshift.c 14KB

slic.c 14KB

pgm.c 14KB

pegasos.c 13KB

mathop.c 12KB

stringop.c 11KB

vl_imsmooth.c 10KB

vl_mser.c 9KB

getopt_long.c 9KB

vl_kmeans.c 9KB

rodrigues.c 9KB

lbp.c 9KB

vl_pegasos.c 9KB

vl_dsift.c 8KB

vl_ubcmatch.c 8KB

mathop_sse2.c 8KB

imopv_sse2.c 7KB

hikmeans.c 7KB

vl_aib.c 7KB

vl_siftdescriptor.c 7KB

vl_localmax.c 7KB

random.c 6KB

ikmeans.c 6KB

vl_inthist.c 6KB

vl_ihashsum.c 6KB

vl_hikmeanspush.c 6KB

vl_imwbackwardmx.c 6KB

vl_hikmeans.c 6KB

vl_alldist.c 6KB

array.c 6KB

vl_erfill.c 6KB

vl_homkermap.c 5KB

vl_kdtreequery.c 5KB

vl_twister.c 5KB

vl_kdtreebuild.c 5KB

vl_ikmeans.c 4KB

vl_ihashfind.c 4KB

vl_slic.c 4KB

vl_aibhist.c 4KB

test_heap-def.c 4KB

vl_imdisttf.c 4KB

vl_binsum.c 4KB

test_stringop.c 4KB

vl_quickshift.c 4KB

vl_ikmeanspush.c 4KB

vl_samplinthist.c 3KB

vl_imintegral.c 3KB

test_imopv.c 3KB

vl_lbp.c 2KB

test_getopt_long.c 2KB

vl_irodr.c 2KB

vl_rodr.c 2KB

test_threads.c 2KB

vl_tpsumx.c 2KB

vl_binsearch.c 2KB

vl_version.c 2KB

test_mathop.c 2KB

test_vec_comp.c 2KB

aib.c 1KB

test_mathop_abs.c 1KB

vl_simdctrl.c 985B

vl_getpid.c 852B

test_nan.c 823B

test_qsort-def.c 785B

test_rand.c 722B

test_host.c 419B

doxygen.conf 73KB

COPYING 1KB

doxygen.css 18KB

doxygen.css 15KB

web.css 7KB

pygmentize.css 3KB

tabs.css 1KB

xhtml1.dcl 7KB

vl_binsum.def 8KB


/** @file sift.c
 ** @brief SIFT - Definition
 ** @author Andrea Vedaldi
 **/

/*
Copyright (C) 2007-12 Andrea Vedaldi and Brian Fulkerson.
All rights reserved.

This file is part of the VLFeat library and is made available under
the terms of the BSD license (see the COPYING file).
*/

/**
@page sift Scale Invariant Feature Transform (SIFT)
@author Andrea Vedaldi

@par "Credits:" Many people have contributed with suggestions and bug reports. Although the following list is certainly incomplete, we would like to thank: Wei Dong, Loic, Giuseppe, Liu, Erwin, P. Ivanov, and Q. S. Luo.

@ref sift.h implements a @ref sift-usage "SIFT filter object", a reusable object to extract SIFT features @cite{lowe99object} from one or multiple images.

- @ref sift-intro
  - @ref sift-intro-detector
  - @ref sift-intro-descriptor
  - @ref sift-intro-extensions
- @ref sift-usage
- @ref sift-tech
  - @ref sift-tech-ss
  - @ref sift-tech-detector
    - @ref sift-tech-detector-peak
    - @ref sift-tech-detector-edge
    - @ref sift-tech-detector-orientation
  - @ref sift-tech-descriptor
    - @ref sift-tech-descriptor-can
    - @ref sift-tech-descriptor-image
    - @ref sift-tech-descriptor-std

@section sift-intro Overview

A SIFT feature is a selected image region (also called keypoint) with an associated descriptor. Keypoints are extracted by the @ref sift-intro-detector "SIFT detector" and their descriptors are computed by the @ref sift-intro-descriptor "SIFT descriptor". It is also common to use the SIFT detector on its own (i.e. computing the keypoints without descriptors) or the SIFT descriptor on its own (i.e. computing descriptors of custom keypoints).

@subsection sift-intro-detector SIFT detector

@sa @ref sift-tech-ss "Scale space technical details", @ref sift-tech-detector "Detector technical details"

A SIFT keypoint is a circular image region with an orientation.
It is described by a geometric frame of four parameters: the keypoint center coordinates @e x and @e y, its @e scale (the radius of the region), and its @e orientation (an angle expressed in radians). The SIFT detector uses as keypoints image structures which resemble “blobs”. By searching for blobs at multiple scales and positions, the SIFT detector is invariant (or, more accurately, covariant) to translation, rotations, and rescaling of the image.

The keypoint orientation is also determined from the local image appearance and is covariant to image rotations. Depending on the symmetry of the keypoint appearance, determining the orientation can be ambiguous. In this case, the SIFT detector returns a list of up to four possible orientations, constructing up to four frames (differing only by their orientation) for each detected image blob.

@image html sift-frame.png "SIFT keypoints are circular image regions with an orientation."

There are several parameters that influence the detection of SIFT keypoints. First, searching keypoints at multiple scales is obtained by constructing a so-called “Gaussian scale space”. The scale space is just a collection of images obtained by progressively smoothing the input image, which is analogous to gradually reducing the image resolution. Conventionally, the smoothing level is called the scale of the image. The construction of the scale space is influenced by the following parameters, set when creating the SIFT filter object by ::vl_sift_new():

- Number of octaves. Increasing the scale by an octave means doubling the size of the smoothing kernel, whose effect is roughly equivalent to halving the image resolution. By default, the scale space spans as many octaves as possible (i.e. roughly <code>log2(min(width,height))</code>), which has the effect of searching keypoints of all possible sizes.
- First octave index. By convention, the octave of index 0 starts with the image at full resolution. Specifying an index greater than 0 starts the scale space at a lower resolution (e.g. 1 halves the resolution). Similarly, specifying a negative index starts the scale space at a higher resolution image, and can be useful to extract very small features (since this is obtained by interpolating the input image, it does not make much sense to go past -1).
- Number of levels per octave. Each octave is sampled at this given number of intermediate scales (by default 3). Increasing this number might in principle return more refined keypoints, but in practice can make their selection unstable due to noise (see [1]).

Keypoints are further refined by eliminating those that are likely to be unstable, either because they are selected near an image edge, rather than an image blob, or are found on image structures with low contrast. Filtering is controlled by the following parameters:

- Peak threshold. This is the minimum amount of contrast to accept a keypoint. It is set by configuring the SIFT filter object by ::vl_sift_set_peak_thresh().
- Edge threshold. This is the edge rejection threshold. It is set by configuring the SIFT filter object by ::vl_sift_set_edge_thresh().

<table>
<caption>Summary of the parameters influencing the SIFT detector.</caption>
<tr style="font-weight:bold;">
<td>Parameter</td>
<td>See also</td>
<td>Controlled by</td>
<td>Comment</td>
</tr>
<tr>
<td>number of octaves</td>
<td> @ref sift-intro-detector </td>
<td>::vl_sift_new</td>
<td></td>
</tr>
<tr>
<td>first octave index</td>
<td> @ref sift-intro-detector </td>
<td>::vl_sift_new</td>
<td>set to -1 to extract very small features</td>
</tr>
<tr>
<td>number of scale levels per octave</td>
<td> @ref sift-intro-detector </td>
<td>::vl_sift_new</td>
<td>can affect the number of extracted keypoints</td>
</tr>
<tr>
<td>edge threshold</td>
<td> @ref sift-intro-detector </td>
<td>::vl_sift_set_edge_thresh</td>
<td>decrease to eliminate more keypoints</td>
</tr>
<tr>
<td>peak threshold</td>
<td> @ref sift-intro-detector </td>
<td>::vl_sift_set_peak_thresh</td>
<td>increase to eliminate more keypoints</td>
</tr>
</table>

@subsection sift-intro-descriptor SIFT Descriptor

@sa @ref sift-tech-descriptor "Descriptor technical details"

A SIFT descriptor is a 3-D spatial histogram of the image gradients that characterizes the appearance of a keypoint. The gradient at each pixel is regarded as a sample of a three-dimensional elementary feature vector, formed by the pixel location and the gradient orientation. Samples are weighted by the gradient norm and accumulated in a 3-D histogram @em h, which (up to normalization and clamping) forms the SIFT descriptor of the region. An additional Gaussian weighting function is applied to give less importance to gradients farther away from the keypoint center. Orientations are quantized into eight bins and the spatial coordinates into four each, as follows:

@image html sift-descr-easy.png "The SIFT descriptor is a spatial histogram of the image gradient."

SIFT descriptors are computed by either calling ::vl_sift_calc_keypoint_descriptor or ::vl_sift_calc_raw_descriptor.
They accept as input a keypoint frame, which specifies the descriptor center, its size, and its orientation on the image plane. The following parameters influence the descriptor calculation:

- Magnification factor. The descriptor size is determined by multiplying the keypoint scale by this factor. It is set
