"""ResNet model.
Related papers:
https://arxiv.org/pdf/1603.05027v2.pdf
https://arxiv.org/pdf/1512.03385v1.pdf
https://arxiv.org/pdf/1605.07146v1.pdf
"""
from __future__ import absolute_import
from __future__ import division
from __future__ import print_function
from tensorflow.python.training import moving_averages
import tensorflow as tf
# Per-channel means (the standard tf-slim ImageNet values) used to center
# the input when finetuning resnet_v2_50.
_R_MEAN = 123.68
_G_MEAN = 116.78
_B_MEAN = 103.94
class Deeplab_v3(object):
    def __init__(self,
                 batch_norm_decay=0.99,
                 batch_norm_epsilon=1e-3):
self._batch_norm_decay = batch_norm_decay
self._batch_norm_epsilon = batch_norm_epsilon
        # Placeholder switching the model between training and inference mode.
self._is_training = tf.placeholder(tf.bool, name='is_training')
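        # Backbone configuration for resnet_v2_50: number of classes, the
        # per-block output channels, the stride applied in each block's last
        # unit (blocks 3 and 4 keep stride 1, for an output stride of 16),
        # and the number of residual units per block.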
self.num_class = 5
self.filters = [64, 256, 512, 1024, 2048]
self.strides = [2, 2, 1, 1]
self.n = [3, 4, 6, 3]
def forward_pass(self, x):
"""Build the core model within the graph"""
with tf.variable_scope('resnet_v2_50', reuse=tf.AUTO_REUSE):
size = tf.shape(x)[1:3]
x = x - [_R_MEAN, _G_MEAN, _B_MEAN]
x = self._conv(x, 7, 64, 2, 'conv1', False, False)
x = self._max_pool(x, 3, 2, 'max')
res_func = self._bottleneck_residual_v2
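            # Four residual blocks; each block's stride is applied in its
            # last unit (j == self.n[i] - 1), matching the tf-slim resnet_v2
            # layout.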
for i in range(4):
with tf.variable_scope('block%d' % (i + 1)):
for j in range(self.n[i]):
with tf.variable_scope('unit_%d' % (j + 1)):
if j == 0:
x = res_func(x, self.filters[i], self.filters[i+1], 1)
elif j == self.n[i] - 1:
x = res_func(x, self.filters[i+1], self.filters[i+1], self.strides[i])
else:
x = res_func(x, self.filters[i+1], self.filters[i+1], 1)
tf.logging.info('the shape of features after block%d is %s' % (i+1, x.get_shape()))
        # DeepLab v3 head.
with tf.variable_scope('DeepLab_v3', reuse=tf.AUTO_REUSE):
x = self._atrous_spatial_pyramid_pooling(x)
            x = self._conv(x, 1, self.num_class, 1, 'logits', False, False)
x = tf.image.resize_bilinear(x, size)
return x
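    # Example usage, as a sketch (the 513x513 input size is an assumption;
    # any spatial size works because the logits are resized back to the
    # input resolution):
    #
    #   model = Deeplab_v3()
    #   images = tf.placeholder(tf.float32, [None, 513, 513, 3])  # RGB, [0, 255]
    #   logits = model.forward_pass(images)   # [None, 513, 513, num_class]
    #   predictions = tf.argmax(logits, axis=-1)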
    def _atrous_spatial_pyramid_pooling(self, x):
        """Atrous spatial pyramid pooling (ASPP).

        One 1x1 and three 3x3 convolutions with rates 6, 12 and 18 (the
        DeepLab v3 rates for output stride 16), plus image-level features.
        """
        with tf.variable_scope('ASSP_layers'):
feature_map_size = tf.shape(x)
            image_level_features = tf.reduce_mean(x, [1, 2], keepdims=True)
image_level_features = self._conv(image_level_features, 1, 256, 1, 'global_avg_pool', True)
image_level_features = tf.image.resize_bilinear(image_level_features, (feature_map_size[1],
feature_map_size[2]))
at_pool1x1 = self._conv(x, kernel_size=1, filters=256, strides=1, scope='assp1', batch_norm=True)
at_pool3x3_1 = self._conv(x, kernel_size=3, filters=256, strides=1, scope='assp2', batch_norm=True, rate=6)
at_pool3x3_2 = self._conv(x, kernel_size=3, filters=256, strides=1, scope='assp3', batch_norm=True, rate=12)
at_pool3x3_3 = self._conv(x, kernel_size=3, filters=256, strides=1, scope='assp4', batch_norm=True, rate=18)
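            # Five parallel 256-channel branches; concatenation gives a
            # 1280-channel map that the final 1x1 convolution projects back
            # to 256 channels.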
net = tf.concat((image_level_features, at_pool1x1, at_pool3x3_1, at_pool3x3_2, at_pool3x3_3), axis=3)
net = self._conv(net, kernel_size=1, filters=256, strides=1, scope='concat', batch_norm=True)
return net
    def _bottleneck_residual_v2(self,
                                x,
                                in_filter,
                                out_filter,
                                stride):
"""Bottleneck residual unit with 3 sub layers, plan B shortcut."""
with tf.variable_scope('bottleneck_v2'):
origin_x = x
with tf.variable_scope('preact'):
preact = self._batch_norm(x)
preact = self._relu(preact)
residual = self._conv(preact, 1, out_filter // 4, stride, 'conv1', True, True)
residual = self._conv(residual, 3, out_filter // 4, 1, 'conv2', True, True)
residual = self._conv(residual, 1, out_filter, 1, 'conv3', False, False)
if in_filter != out_filter:
short_cut = self._conv(preact, 1, out_filter, stride, 'shortcut', False, False)
else:
short_cut = self._subsample(origin_x, stride, 'shortcut')
x = tf.add(residual, short_cut)
return x
    def _conv(self,
              x,
              kernel_size,
              filters,
              strides,
              scope,
              batch_norm=False,
              activation=False,
              rate=None):
"""Convolution."""
with tf.variable_scope(scope):
x_shape = x.get_shape().as_list()
w = tf.get_variable(name='weights',
shape=[kernel_size, kernel_size, x_shape[3], filters])
            if rate is None:
                x = tf.nn.conv2d(input=x,
                                 filter=w,
                                 padding='SAME',
                                 strides=[1, strides, strides, 1],
                                 name='conv')
            else:
                # tf.nn.atrous_conv2d has an implicit stride of 1, so the
                # `strides` argument is ignored on this path.
                x = tf.nn.atrous_conv2d(value=x,
                                        filters=w,
                                        padding='SAME',
                                        name='conv',
                                        rate=rate)
if batch_norm:
with tf.variable_scope('BatchNorm'):
x = self._batch_norm(x)
else:
b = tf.get_variable(name='biases', shape=[filters])
x = x + b
if activation:
x = tf.nn.relu(x)
return x
def _batch_norm(self, x):
x_shape = x.get_shape()
params_shape = x_shape[-1:]
axis = list(range(len(x_shape) - 1))
beta = tf.get_variable(name='beta',
shape=params_shape,
initializer=tf.zeros_initializer)
gamma = tf.get_variable(name='gamma',
shape=params_shape,
initializer=tf.ones_initializer)
moving_mean = tf.get_variable(name='moving_mean',
shape=params_shape,
initializer=tf.zeros_initializer,
trainable=False)
moving_variance = tf.get_variable(name='moving_variance',
shape=params_shape,
initializer=tf.ones_initializer,
trainable=False)
tf.add_to_collection('BN_MEAN_VARIANCE', moving_mean)
tf.add_to_collection('BN_MEAN_VARIANCE', moving_variance)
        # These update ops should only be performed when training.
mean, variance = tf.nn.moments(x, axis)
update_moving_mean = moving_averages.assign_moving_average(moving_mean,
mean,
self._batch_norm_decay,
name='MovingAvgMean')
update_moving_variance = moving_averages.assign_moving_average(moving_variance,
variance,
self._batch_norm_decay,
name='MovingAvgVariance')
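        # What follows is a minimal sketch of the usual completion (an
        # assumption; the code above defines the update ops but never applies
        # them): normalize with batch statistics while training and with the
        # moving averages at inference, registering the updates in the
        # standard UPDATE_OPS collection.
        tf.add_to_collection(tf.GraphKeys.UPDATE_OPS, update_moving_mean)
        tf.add_to_collection(tf.GraphKeys.UPDATE_OPS, update_moving_variance)
        mean, variance = tf.cond(self._is_training,
                                 lambda: (mean, variance),
                                 lambda: (moving_mean, moving_variance))
        x = tf.nn.batch_normalization(x, mean, variance, beta, gamma,
                                      self._batch_norm_epsilon)
        return x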