【免费】HED边缘检测opencv推理.rar_hed边缘检测资源-CSDN文库

共5个文件

py：2个

prototxt：1个

caffemodel：1个

需积分: 0 5 浏览量更新于2021-03-24 2 收藏 53.52MB RAR 举报

边缘检测是计算机视觉领域中的一个基础任务，用于识别图像中物体的边界，这些边界通常代表了图像的重要特征。HED（Hierarchical Edge Detection）是一种基于深度学习的边缘检测方法，它利用了全卷积网络（Fully Convolutional Network, FCN）的优势，能够在图像上进行端到端的边缘检测。在本项目中，我们使用Python和OpenCV库来实现HED边缘检测的推理过程。 HED模型的核心是基于VGG16的预训练模型。VGG16是一个深度卷积神经网络，它在ImageNet数据集上进行了预训练，拥有强大的特征提取能力。HED将VGG16的多个层的输出作为边缘检测的预测，通过多尺度信息融合来提升边缘检测的准确性。在Python中，我们需要先加载预训练的HED模型权重。OpenCV库虽然主要设计用于图像处理，但它也支持加载和使用预训练的深度学习模型。你可以使用`cv2.dnn.readNetFromTorch()`或`cv2.dnn.readNetFromCaffe()`函数（根据模型格式）来加载模型。注意，确保模型权重文件与Python脚本在同一目录下，或者提供正确的路径。接下来，你需要准备输入图像。可以使用OpenCV的`cv2.imread()`函数读取图像，并使用`cv2.resize()`将其调整为模型所需的尺寸。然后，将图像转换为适合神经网络输入的数据格式，通常需要归一化和通道顺序的调整。执行推理时，使用`cv2.dnn.blobFromImage()`创建一个数据blob，它是一个四维张量，可以直接传递给模型。然后调用`net.forward()`函数，将blob作为输入，模型将返回多层的边缘预测结果。为了得到最终的边缘图像，你需要对模型的各个层输出进行融合。每个层的权重可以在HED的论文或实现中找到。这些权重用于加权平均各个层的预测，从而得到更精确的边缘结果。可以使用阈值处理来消除噪声，保留较明显的边缘。压缩包中的"HED1"可能包含了训练好的模型权重、示例图像、处理后的边缘图像或其他相关资源。在实际应用中，你可以根据自己的需求对这个模型进行微调，以适应特定的边缘检测场景，例如增加数据增强、调整模型结构等。 HED边缘检测结合了深度学习的强大学习能力与OpenCV的易用性，提供了一种高效且准确的边缘检测解决方案。通过Python编程，我们可以轻松地实现这一技术，应用于各种图像分析和处理任务。

收起资源包目录

HED边缘检测opencv推理.rar （5个子文件）

HED1

hed1.py 3KB

hed_pretrained_bsds.caffemodel 56.15MB

__init__.py 0B

001.JPG 970KB

deploy.prototxt 8KB

资源推荐

资源预览

资源评论

import cv2 as cv import argparse parser = argparse.ArgumentParser( description='This sample shows how to define custom OpenCV deep learning layers in Python. ' 'Holistically-Nested Edge Detection (https://arxiv.org/abs/1504.06375) neural network ' 'is used as an example model. Find a pre-trained model at https://github.com/s9xie/hed.') parser.add_argument('--input', help='Path to image or video. Skip to capture frames from camera', default='001.JPG') parser.add_argument('--prototxt', help='Path to deploy.prototxt', default='deploy.prototxt') parser.add_argument('--caffemodel', help='Path to hed_pretrained_bsds.caffemodel', default='hed_pretrained_bsds.caffemodel') parser.add_argument('--width', help='Resize input image to a specific width', default=500, type=int) parser.add_argument('--height', help='Resize input image to a specific height', default=500, type=int) args = parser.parse_args() # ! [CropLayenr] class CropLayer(object): def __init__(self, params, blobs): self.xstart = 0 self.xend = 0 self.ystart = 0 self.yend = 0 # Our layer receives two inputs. We need to crop the first input blob # to match a shape of the second one (keeping batch size and number of channels) def getMemoryShapes(self, inputs): inputShape, targetShape = inputs[0], inputs[1] batchSize, numChannels = inputShape[0], inputShape[1] height, width = targetShape[2], targetShape[3] # self.ystart = (inputShape[2] - targetShape[2]) / 2 # self.xstart = (inputShape[3] - targetShape[3]) / 2 self.ystart = int((inputShape[2] - targetShape[2]) / 2) self.xstart = int((inputShape[3] - targetShape[3]) / 2) self.yend = self.ystart + height self.xend = self.xstart + width return [[batchSize, numChannels, height, width]] def forward(self, inputs): return [inputs[0][:, :, self.ystart:self.yend, self.xstart:self.xend]] # ! [CropLayer] # ! [Register] cv.dnn_registerLayer('Crop', CropLayer) # ! [Register] # Load the model. net = cv.dnn.readNet(cv.samples.findFile(args.prototxt), cv.samples.findFile(args.caffemodel)) kWinName = 'Holistically-Nested Edge Detection' cv.namedWindow('Input', cv.WINDOW_NORMAL) cv.namedWindow(kWinName, cv.WINDOW_NORMAL) frame = cv.imread('001.JPG') cv.imshow('Input', frame) # cv.waitKey(0) inp = cv.dnn.blobFromImage(frame, scalefactor=1.0, size=(args.width, args.height), mean=(104.00698793, 116.66876762, 122.67891434), swapRB=False, crop=False) net.setInput(inp) out = net.forward() out = out[0, 0] out = cv.resize(out, (frame.shape[1], frame.shape[0])) cv.imshow(kWinName, out) cv.imwrite('result.png', out) cv.waitKey(0)