# TF2 implementation of SimCLR
This implementation is based on TensorFlow 2.x. We use `tf.keras` layers for building the model and use `tf.data` for our input pipeline. The model is trained using a [custom training loop](https://www.tensorflow.org/tutorials/distribute/custom_training) with `tf.distribute` on multiple TPUs.
<div align="center">
<img width="50%" alt="SimCLR Illustration" src="https://1.bp.blogspot.com/--vH4PKpE9Yo/Xo4a2BYervI/AAAAAAAAFpM/vaFDwPXOyAokAC8Xh852DzOgEs22NhbXwCLcBGAsYHQ/s1600/image4.gif">
</div>
<div align="center">
An illustration of SimCLR (from <a href="https://ai.googleblog.com/2020/04/advancing-self-supervised-and-semi.html">our blog here</a>).
</div>
<br/><br/>
## Pre-trained models for SimCLRv2
<a href="tf2/colabs/finetuning.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>
We have converted the TF1 checkpoints of the SimCLRv1 and SimCLRv2 models to TF2 [SavedModel](https://www.tensorflow.org/guide/saved_model) format:
* Pretrained SimCLRv2 models (with linear eval head): [gs://simclr-checkpoints-tf2/simclrv2/pretrained](https://console.cloud.google.com/storage/browser/simclr-checkpoints-tf2/simclrv2/pretrained)
* Fine-tuned SimCLRv2 models on 1% of labels: [gs://simclr-checkpoints-tf2/simclrv2/finetuned_1pct](https://console.cloud.google.com/storage/browser/simclr-checkpoints-tf2/simclrv2/finetuned_1pct)
* Fine-tuned SimCLRv2 models on 10% of labels: [gs://simclr-checkpoints-tf2/simclrv2/finetuned_10pct](https://console.cloud.google.com/storage/browser/simclr-checkpoints-tf2/simclrv2/finetuned_10pct)
* Fine-tuned SimCLRv2 models on 100% of labels: [gs://simclr-checkpoints-tf2/simclrv2/finetuned_100pct](https://console.cloud.google.com/storage/browser/simclr-checkpoints-tf2/simclrv2/finetuned_100pct)
* Supervised models with the same architectures: [gs://simclr-checkpoints-tf2/simclrv2/supervised](https://console.cloud.google.com/storage/browser/simclr-checkpoints-tf2/simclrv2/supervised)
* The distilled / self-trained models (after fine-tuning) are also provided:
* [gs://simclr-checkpoints-tf2/simclrv2/distill_1pct](https://console.cloud.google.com/storage/browser/simclr-checkpoints-tf2/simclrv2/distill_1pct)
* [gs://simclr-checkpoints-tf2/simclrv2/distill_10pct](https://console.cloud.google.com/storage/browser/simclr-checkpoints-tf2/simclrv2/distill_10pct)
We also provide examples of how to use the SavedModels in the `colabs/` folder. In addition to the TF1 colabs, we provide an `imagenet_results.ipynb` colab to verify the ImageNet results reported in the SimCLR v1 and v2 papers.
## Pre-trained models for SimCLRv1
The pre-trained models (base network with linear classifier layer) can be found below. Note that for these SimCLRv1 checkpoints, the projection head is not available.
| SavedModel | ImageNet Top-1 |
|--------------------------------------------------------------------------------------------------------------|------------------------|
|[ResNet50 (1x)](https://console.cloud.google.com/storage/browser/simclr-checkpoints-tf2/simclrv1/pretrain/1x) | 69.1 |
|[ResNet50 (2x)](https://console.cloud.google.com/storage/browser/simclr-checkpoints-tf2/simclrv1/pretrain/2x) | 74.2 |
|[ResNet50 (4x)](https://console.cloud.google.com/storage/browser/simclr-checkpoints-tf2/simclrv1/pretrain/4x) | 76.6 |
Additional SimCLRv1 checkpoints are available: [gs://simclr-checkpoints-tf2/simclrv1](https://console.cloud.google.com/storage/browser/simclr-checkpoints-tf2/simclrv1).
A note on the signature of the TensorFlow SavedModel: `logits_sup` contains the supervised classification logits for the 1000 ImageNet categories. The other outputs (e.g. `initial_max_pool`, `block_group1`) are intermediate ResNet activations; refer to `resnet.py` for the specifics.
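As a hedged sketch of how these outputs might be inspected (the path is a placeholder, and the calling convention is assumed to match the fine-tuning example further below):
```
import tensorflow as tf

# Placeholder path; point it at one of the SavedModel folders listed above.
saved_model = tf.saved_model.load('<path-to-simclrv1-savedmodel>')

images = tf.random.uniform([2, 224, 224, 3])  # RGB images with values in [0, 1]
outputs = saved_model(images, trainable=False)

print(sorted(outputs.keys()))       # e.g. 'block_group1', ..., 'final_avg_pool', 'logits_sup'
print(outputs['logits_sup'].shape)  # (2, 1000): supervised ImageNet logits
```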
## Environment setup
Our models are trained with TPUs. It is recommended to run distributed training with TPUs when using our code for pretraining.
The code can be run on multiple GPUs by replacing `tf.distribute.TPUStrategy` with `tf.distribute.MirroredStrategy`. See the TensorFlow distributed training [guide](https://www.tensorflow.org/guide/distributed_training) for an overview of `tf.distribute`.
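A minimal sketch of the two strategies (illustrative only, not the repo's actual setup code in `run.py`):
```
import tensorflow as tf

# Multi-GPU (or CPU) training: replicate the model across visible devices.
strategy = tf.distribute.MirroredStrategy()

# Cloud TPU training: connect to the TPU worker first, then build the strategy.
# resolver = tf.distribute.cluster_resolver.TPUClusterResolver(tpu='<tpu-name>')
# tf.config.experimental_connect_to_cluster(resolver)
# tf.tpu.experimental.initialize_tpu_system(resolver)
# strategy = tf.distribute.TPUStrategy(resolver)

with strategy.scope():
  # Variables created under the scope are distributed by the strategy.
  model = tf.keras.Sequential([tf.keras.layers.Dense(10)])
```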
The code is compatible with TensorFlow 2.x. See `requirements.txt` for all prerequisites; you can install them with the following command.
```
pip install -r requirements.txt
```
## Pretraining
To pretrain the model on CIFAR-10 with a CPU or one or more GPUs, try the following command:
```
python run.py --train_mode=pretrain \
--train_batch_size=512 --train_epochs=1000 \
--learning_rate=1.0 --weight_decay=1e-4 --temperature=0.5 \
--dataset=cifar10 --image_size=32 --eval_split=test --resnet_depth=18 \
--use_blur=False --color_jitter_strength=0.5 \
--model_dir=/tmp/simclr_test --use_tpu=False
```
To pretrain the model on ImageNet with Cloud TPUs, first check out the [Google Cloud TPU tutorial](https://cloud.google.com/tpu/docs/tutorials/mnist) for basic information on how to use Google Cloud TPUs.
Once you have created a virtual machine with Cloud TPUs and pre-downloaded the ImageNet data for [tensorflow_datasets](https://www.tensorflow.org/datasets/catalog/imagenet2012), set the following environment variables:
```
TPU_NAME=<tpu-name>
STORAGE_BUCKET=gs://<storage-bucket>
DATA_DIR=$STORAGE_BUCKET/<path-to-tensorflow-dataset>
MODEL_DIR=$STORAGE_BUCKET/<path-to-store-checkpoints>
```
The following command can be used to pretrain a ResNet-50 on ImageNet (which reflects the default hyperparameters in our paper):
```
python run.py --train_mode=pretrain \
--train_batch_size=4096 --train_epochs=100 --temperature=0.1 \
--learning_rate=0.075 --learning_rate_scaling=sqrt --weight_decay=1e-4 \
--dataset=imagenet2012 --image_size=224 --eval_split=validation \
--data_dir=$DATA_DIR --model_dir=$MODEL_DIR \
--use_tpu=True --tpu_name=$TPU_NAME --train_summary_steps=0
```
A batch size of 4096 requires at least 32 TPUs. Training for 100 epochs takes around 6 hours with 32 TPU v3s. Note that a learning rate of 0.3 with `learning_rate_scaling=linear` is equivalent to a learning rate of 0.075 with `learning_rate_scaling=sqrt` when the batch size is 4096. However, sqrt scaling tends to train better when a smaller batch size is used.
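As a quick sanity check of that equivalence, using the scaling rules from the paper (effective learning rate `lr * batch_size / 256` for linear scaling and `lr * sqrt(batch_size)` for sqrt scaling):
```
import math

batch_size = 4096
print(0.3 * batch_size / 256)         # linear scaling -> 4.8
print(0.075 * math.sqrt(batch_size))  # sqrt scaling   -> 4.8
```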
## Finetuning the linear head (linear eval)
You can simply set `--lineareval_while_pretraining=True` during pretraining, which trains the linear classifier alongside the pretraining. A `stop_gradient` operator is used to prevent label information from backpropagating into the representations.
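Conceptually, the trick looks like the following self-contained sketch (the encoder and head here are toy stand-ins, not the repo's actual model):
```
import tensorflow as tf

# Toy stand-ins for the ResNet encoder and the linear head.
encoder = tf.keras.Sequential([tf.keras.layers.Conv2D(8, 3),
                               tf.keras.layers.GlobalAveragePooling2D()])
linear_head = tf.keras.layers.Dense(10)

images = tf.random.uniform([4, 32, 32, 3])
labels = tf.one_hot([0, 1, 2, 3], 10)

with tf.GradientTape() as tape:
  features = encoder(images, training=True)
  # stop_gradient detaches the features, so the supervised loss below can only
  # update the linear head, never the encoder.
  logits = linear_head(tf.stop_gradient(features))
  loss = tf.reduce_mean(
      tf.nn.softmax_cross_entropy_with_logits(labels=labels, logits=logits))

# All gradients w.r.t. the encoder are None: no label information flows back.
print(tape.gradient(loss, encoder.trainable_variables))
```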
More conventionally, you can also finetune the linear head on top of a pretrained model after pretraining, as follows:
```
class Model(tf.keras.Model):
  def __init__(self, path):
    super(Model, self).__init__()
    # Load a pretrained SimCLR model.
    self.saved_model = tf.saved_model.load(path)
    # Linear head.
    self.dense_layer = tf.keras.layers.Dense(units=num_classes,
                                             name="head_supervised_new")
    self.optimizer = <your favorite optimizer>

  def call(self, x):
    with tf.GradientTape() as tape:
      # Use `trainable=False` since we do not wish to update batch norm
      # statistics of the loaded model. If finetuning everything, set this to
      # True.
      outputs = self.saved_model(x['image'], trainable=False)
      logits_t = self.dense_layer(outputs['final_avg_pool'])
      loss_t = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(
          labels=tf.one_hot(x['label'], num_classes), logits=logits_t))
      dense_layer_weights = self.dense_layer.trainable_weights
      print('Variables to train:', dense_layer_weights)
    # Note: We only compute gradients w.r.t. the linear head. To finetune all
    # weights, use self.trainable_weights instead.
    grads = tape.gradient(loss_t, dense_layer_weights)
    self.optimizer.apply_gradients(zip(grads, dense_layer_weights))
    return loss_t
```
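For completeness, here is a hedged usage sketch that drives the class above with `tf.data` / `tensorflow_datasets`. It assumes the placeholder path and optimizer above have been filled in, and uses CIFAR-10 purely as an illustrative dataset; the exact preprocessing expected by a given checkpoint should be taken from `colabs/finetuning.ipynb`.
```
import tensorflow as tf
import tensorflow_datasets as tfds

num_classes = 10  # the Dense head above reads this global

def preprocess(example):
  # Resize to the pretrained model's input size and scale values to [0, 1].
  image = tf.image.resize(tf.cast(example['image'], tf.float32) / 255., [224, 224])
  return {'image': image, 'label': example['label']}

ds = tfds.load('cifar10', split='train').map(preprocess).batch(32)

model = Model('<path-to-pretrained-savedmodel>')  # placeholder path

for step, batch in enumerate(ds.take(100)):
  loss = model(batch)
  if step % 10 == 0:
    print('step', step, 'loss', float(loss))
```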