复数版卷积神经网络，复数版CAFFE_复数卷积神经网络,复数卷积资源-CSDN文库

共743个文件

cpp：203个

hpp：118个

md：98个

需积分: 42 19 浏览量 2018-09-01 20:24:47 上传评论 11 收藏 8.12MB RAR 举报

复数版卷积神经网络（Complex Convolutional Neural Networks，简称Complex CNN）是深度学习领域的一个创新性研究方向，它引入了复数运算的概念，旨在处理具有复数属性的数据，如信号处理中的频域信息或者某些特殊的物理现象。在传统的卷积神经网络中，我们通常只使用实数进行计算，但在某些情况下，复数能够提供更丰富的表示能力。复数版CAFFE，即基于Caffe框架的复数扩展，是由著名的深度学习库Caffe进行修改和增强的版本。Caffe以其高效、模块化和易于部署的特点广受研究人员和开发者欢迎。复数版Caffe不仅保留了这些优点，还引入了对复数数据的支持，允许用户在构建神经网络时使用复数卷积层、复数全连接层、复数池化层、复数sigmoid层、复数幅度层以及复数反卷积层。复数卷积层是复杂CNN的核心组成部分，它在处理数据时应用了复数乘法和加法。与实数卷积相比，复数卷积可以捕获输入数据的相位和幅度信息，这对于处理如电磁波、声波等具有相位特性的数据特别有用。复数全连接层则将前一层的复数激活值映射到下一层次的复数权重，同样增强了模型对复杂数据结构的理解。复数池化层是对传统池化操作的复数扩展，它可以对输入复数特征图进行下采样，保留关键信息，同时减少计算量。复数sigmoid层则是激活函数的复数版本，它将复数输入转换为介于-1和1之间的复数值，保持非线性特性。复数幅度层负责提取复数特征的幅度信息，这对于分析信号强度至关重要。复数反卷积层（或称上采样层）用于在解卷积过程中恢复图像的分辨率，同样采用复数运算。在实际应用中，复数版Caffe可以应用于诸多领域，如雷达信号处理、声学建模、无线通信等，这些场景中复数运算能更好地捕捉物理现象的本质。通过使用这个库，研究人员和工程师可以设计和训练针对复数数据的深度学习模型，以解决传统方法难以处理的问题。总结来说，复数版Caffe是深度学习工具箱的重要拓展，它使得处理复数数据成为可能，开启了新的研究领域和应用前景。通过这个库，我们可以构建更为复杂且适应性强的神经网络模型，以应对那些需要考虑相位信息和幅度信息的挑战。

资源推荐

资源详情

资源评论

收起资源包目录

复数版卷积神经网络，复数版CAFFE （743个子文件）

caffe 2KB

gtest_main.cc 2KB

caffe.cloc 1KB

Utils.cmake 13KB

Cuda.cmake 11KB

Summary.cmake 7KB

Targets.cmake 7KB

Dependencies.cmake 7KB

FindLAPACK.cmake 7KB

ProtoBuf.cmake 4KB

FindMKL.cmake 3KB

FindNumPy.cmake 2KB

ConfigGen.cmake 2KB

gflags.cmake 2KB

glog.cmake 2KB

Misc.cmake 2KB

FindMatlabMex.cmake 2KB

FindLevelDB.cmake 2KB

FindAtlas.cmake 2KB

FindOpenBLAS.cmake 2KB

FindGFlags.cmake 2KB

lint.cmake 1KB

FindGlog.cmake 1KB

FindvecLib.cmake 1KB

FindLMDB.cmake 1KB

FindSnappy.cmake 1KB

FindNCCL.cmake 654B

CNAME 25B

gtest-all.cpp 329KB

test_net.cpp 79KB

test_upgrade_proto.cpp 71KB

test_pooling_layer.cpp 50KB

test_gradient_based_solver.cpp 44KB

test_convolution_layer.cpp 43KB

upgrade_proto.cpp 42KB

net.cpp 38KB

test_neuron_layer.cpp 34KB

test_complex_inner_product_layer.cpp 30KB

math_functions.cpp 25KB

test_split_layer.cpp 25KB

caffe_.cpp 21KB

test_scale_layer.cpp 21KB

_caffe.cpp 21KB

test_bias_layer.cpp 19KB

base_complex_conv_layer.cpp 18KB

data_transformer.cpp 18KB

test_random_number_generator.cpp 17KB

window_data_layer.cpp 17KB

solver.cpp 17KB

test_lrn_layer.cpp 17KB

test_data_layer.cpp 16KB

base_conv_layer.cpp 16KB

test_inner_product_layer.cpp 15KB

caffe.cpp 15KB

blob.cpp 14KB

complex_pooling_layer.cpp 14KB

complex_batch_norm_layer.cpp 14KB

test_io.cpp 13KB

sgd_solver.cpp 12KB

test_deconvolution_layer.cpp 12KB

im2col.cpp 12KB

test_accuracy_layer.cpp 11KB

pooling_layer.cpp 11KB

recurrent_layer.cpp 11KB

test_memory_data_layer.cpp 11KB

test_data_transformer.cpp 11KB

parallel.cpp 11KB

lrn_layer.cpp 11KB

common.cpp 10KB

test_lstm_layer.cpp 10KB

test_crop_layer.cpp 10KB

test_argmax_layer.cpp 10KB

cudnn_conv_layer.cpp 10KB

batch_norm_layer.cpp 10KB

test_reshape_layer.cpp 10KB

test_blob.cpp 9KB

test_reduction_layer.cpp 9KB

scale_layer.cpp 9KB

layer_factory.cpp 9KB

lstm_layer.cpp 9KB

classification.cpp 8KB

rnn_layer.cpp 8KB

im2col_layer.cpp 8KB

test_image_data_layer.cpp 8KB

complex_inner_product_layer.cpp 8KB

test_slice_layer.cpp 8KB

test_eltwise_layer.cpp 8KB

spp_layer.cpp 8KB

infogain_loss_layer.cpp 8KB

test_rnn_layer.cpp 8KB

test_concat_layer.cpp 8KB

test_dummy_data_layer.cpp 7KB

io.cpp 7KB

test_filler.cpp 7KB

hdf5.cpp 7KB

image_data_layer.cpp 7KB

test_embed_layer.cpp 7KB

complex_layer.cpp 6KB

test_math_functions.cpp 6KB

extract_features.cpp 6KB

共 743 条

--- title: LeNet MNIST Tutorial description: Train and test "LeNet" on the MNIST handwritten digit data. category: example include_in_docs: true priority: 1 --- # Training LeNet on MNIST with Caffe We will assume that you have Caffe successfully compiled. If not, please refer to the [Installation page](/installation.html). In this tutorial, we will assume that your Caffe installation is located at `CAFFE_ROOT`. ## Prepare Datasets You will first need to download and convert the data format from the MNIST website. To do this, simply run the following commands: cd $CAFFE_ROOT ./data/mnist/get_mnist.sh ./examples/mnist/create_mnist.sh If it complains that `wget` or `gunzip` are not installed, you need to install them respectively. After running the script there should be two datasets, `mnist_train_lmdb`, and `mnist_test_lmdb`. ## LeNet: the MNIST Classification Model Before we actually run the training program, let's explain what will happen. We will use the [LeNet](http://yann.lecun.com/exdb/publis/pdf/lecun-01a.pdf) network, which is known to work well on digit classification tasks. We will use a slightly different version from the original LeNet implementation, replacing the sigmoid activations with Rectified Linear Unit (ReLU) activations for the neurons. The design of LeNet contains the essence of CNNs that are still used in larger models such as the ones in ImageNet. In general, it consists of a convolutional layer followed by a pooling layer, another convolution layer followed by a pooling layer, and then two fully connected layers similar to the conventional multilayer perceptrons. We have defined the layers in `$CAFFE_ROOT/examples/mnist/lenet_train_test.prototxt`. ## Define the MNIST Network This section explains the `lenet_train_test.prototxt` model definition that specifies the LeNet model for MNIST handwritten digit classification. We assume that you are familiar with [Google Protobuf](https://developers.google.com/protocol-buffers/docs/overview), and assume that you have read the protobuf definitions used by Caffe, which can be found at `$CAFFE_ROOT/src/caffe/proto/caffe.proto`. Specifically, we will write a `caffe::NetParameter` (or in python, `caffe.proto.caffe_pb2.NetParameter`) protobuf. We will start by giving the network a name: name: "LeNet" ### Writing the Data Layer Currently, we will read the MNIST data from the lmdb we created earlier in the demo. This is defined by a data layer: layer { name: "mnist" type: "Data" transform_param { scale: 0.00390625 } data_param { source: "mnist_train_lmdb" backend: LMDB batch_size: 64 } top: "data" top: "label" } Specifically, this layer has name `mnist`, type `data`, and it reads the data from the given lmdb source. We will use a batch size of 64, and scale the incoming pixels so that they are in the range \[0,1\). Why 0.00390625? It is 1 divided by 256. And finally, this layer produces two blobs, one is the `data` blob, and one is the `label` blob. ### Writing the Convolution Layer Let's define the first convolution layer: layer { name: "conv1" type: "Convolution" param { lr_mult: 1 } param { lr_mult: 2 } convolution_param { num_output: 20 kernel_size: 5 stride: 1 weight_filler { type: "xavier" } bias_filler { type: "constant" } } bottom: "data" top: "conv1" } This layer takes the `data` blob (it is provided by the data layer), and produces the `conv1` layer. It produces outputs of 20 channels, with the convolutional kernel size 5 and carried out with stride 1. The fillers allow us to randomly initialize the value of the weights and bias. For the weight filler, we will use the `xavier` algorithm that automatically determines the scale of initialization based on the number of input and output neurons. For the bias filler, we will simply initialize it as constant, with the default filling value 0. `lr_mult`s are the learning rate adjustments for the layer's learnable parameters. In this case, we will set the weight learning rate to be the same as the learning rate given by the solver during runtime, and the bias learning rate to be twice as large as that - this usually leads to better convergence rates. ### Writing the Pooling Layer Phew. Pooling layers are actually much easier to define: layer { name: "pool1" type: "Pooling" pooling_param { kernel_size: 2 stride: 2 pool: MAX } bottom: "conv1" top: "pool1" } This says we will perform max pooling with a pool kernel size 2 and a stride of 2 (so no overlapping between neighboring pooling regions). Similarly, you can write up the second convolution and pooling layers. Check `$CAFFE_ROOT/examples/mnist/lenet_train_test.prototxt` for details. ### Writing the Fully Connected Layer Writing a fully connected layer is also simple: layer { name: "ip1" type: "InnerProduct" param { lr_mult: 1 } param { lr_mult: 2 } inner_product_param { num_output: 500 weight_filler { type: "xavier" } bias_filler { type: "constant" } } bottom: "pool2" top: "ip1" } This defines a fully connected layer (known in Caffe as an `InnerProduct` layer) with 500 outputs. All other lines look familiar, right? ### Writing the ReLU Layer A ReLU Layer is also simple: layer { name: "relu1" type: "ReLU" bottom: "ip1" top: "ip1" } Since ReLU is an element-wise operation, we can do *in-place* operations to save some memory. This is achieved by simply giving the same name to the bottom and top blobs. Of course, do NOT use duplicated blob names for other layer types! After the ReLU layer, we will write another innerproduct layer: layer { name: "ip2" type: "InnerProduct" param { lr_mult: 1 } param { lr_mult: 2 } inner_product_param { num_output: 10 weight_filler { type: "xavier" } bias_filler { type: "constant" } } bottom: "ip1" top: "ip2" } ### Writing the Loss Layer Finally, we will write the loss! layer { name: "loss" type: "SoftmaxWithLoss" bottom: "ip2" bottom: "label" } The `softmax_loss` layer implements both the softmax and the multinomial logistic loss (that saves time and improves numerical stability). It takes two blobs, the first one being the prediction and the second one being the `label` provided by the data layer (remember it?). It does not produce any outputs - all it does is to compute the loss function value, report it when backpropagation starts, and initiates the gradient with respect to `ip2`. This is where all magic starts. ### Additional Notes: Writing Layer Rules Layer definitions can include rules for whether and when they are included in the network definition, like the one below: layer { // ...layer definition... include: { phase: TRAIN } } This is a rule, which controls layer inclusion in the network, based on current network's state. You can refer to `$CAFFE_ROOT/src/caffe/proto/caffe.proto` for more information about layer rules and model schema. In the above example, this layer will be included only in `TRAIN` phase. If we change `TRAIN` with `TEST`, then this layer will be used only in test phase. By default, that is without layer rules, a layer is always included in the network. Thus, `lenet_train_test.prototxt` has two `DATA` layers defined (with different `batch_size`), one for the training phase and one for the testing phase. Also, there is an `Accuracy` layer which is included only in `TEST` phase for reporting the model accuracy every 100 iteration, as defined in `lenet_solver.prototxt`. ## Define t

评论收藏

内容反馈