Implementation of Deep Learning using Linear Support Vector Machines (deep learning and machine learning resource, CSDN Library)

19 files in total: 3 py, 2 meta, 2 checkpoint

Tags: deep learning, machine learning

Uploaded 2018-05-24 11:30:09 · 64.59 MB ZIP

cnn-svm-master.zip (19 files)

- cnn-svm-master/
  - README.md (7KB)
  - LICENSE (11KB)
  - model/
    - cnn_svm.py (9KB)
    - cnn_softmax.py (9KB)
  - setup.sh (504B)
  - main.py (3KB)
  - trained-cnn-softmax/
    - CNN-Softmax-9900.data-00000-of-00001 (37.48MB)
    - CNN-Softmax-9900.index (914B)
    - checkpoint (183B)
    - CNN-Softmax-9900.meta (76KB)
  - figures/
    - accuracy-loss-mnist.png (15KB)
    - accuracy-loss-fashion.png (19KB)
  - requirements.txt (40B)
  - trained-cnn-svm/
    - CNN-SVM-9900.index (914B)
    - CNN-SVM-9900.meta (85KB)
    - checkpoint (81B)
    - CNN-SVM-9900.data-00000-of-00001 (37.48MB)
  - logs/
    - Sat Dec 9 13:38:20 2017-training/
      - events.out.tfevents.1512797900.darth-Inspiron7559 (150KB)
    - Sat Dec 9 13:43:41 2017-training/
      - events.out.tfevents.1512798221.darth-Inspiron7559 (169KB)

An Architecture Combining Convolutional Neural Network (CNN) and Linear Support Vector Machine (SVM) for Image Classification
===

![](https://img.shields.io/badge/DOI-cs.CV%2F1712.03541-blue.svg)
[![DOI](https://zenodo.org/badge/113296846.svg)](https://zenodo.org/badge/latestdoi/113296846)
![](https://img.shields.io/badge/license-Apache--2.0-blue.svg)
[![PyPI](https://img.shields.io/pypi/pyversions/Django.svg)]()

*This project was inspired by Y. Tang's [Deep Learning using Linear Support Vector Machines](https://arxiv.org/abs/1306.0239) (2013).*

The full paper on this project may be read at [arXiv.org](https://arxiv.org/abs/1712.03541), [ResearchGate](https://www.researchgate.net/publication/321745073_An_Architecture_Combining_Convolutional_Neural_Network_CNN_and_Support_Vector_Machine_SVM_for_Image_Classification), and [Academia.edu](https://www.academia.edu/35401788/An_Architecture_Combining_Convolutional_Neural_Network_CNN_and_Support_Vector_Machine_SVM_for_Image_Classification).

## Abstract

Convolutional neural networks (CNNs) are similar to "ordinary" neural networks in the sense that they are made up of hidden layers consisting of neurons with "learnable" parameters. These neurons receive inputs, perform a dot product, and then follow it with a non-linearity. The whole network expresses the mapping between raw image pixels and their class scores. Conventionally, the Softmax function is the classifier used at the last layer of this network. However, there have been studies ([Alalshekmubarak and Smith, 2013](http://ieeexplore.ieee.org/abstract/document/6544391/); [Agarap, 2017](http://arxiv.org/abs/1709.03082); [Tang, 2013](https://arxiv.org/abs/1306.0239)) conducted to challenge this norm. The cited studies introduce the use of a linear support vector machine (SVM) in an artificial neural network architecture. This project is yet another take on the subject, and is inspired by (Tang, 2013).
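To make the contrast between the two classifiers concrete, the following is a minimal sketch, not the project's code, of the two per-example losses: softmax cross-entropy and a squared hinge (L2-SVM) objective in the spirit of Tang (2013). The class scores, the zero weight matrix, and the function names are hypothetical and chosen only for illustration.

```python
import math


def softmax_cross_entropy(scores, label):
    """Cross-entropy of softmax(scores) against the true class index."""
    m = max(scores)  # subtract the max for numerical stability
    log_sum_exp = m + math.log(sum(math.exp(s - m) for s in scores))
    return log_sum_exp - scores[label]


def squared_hinge_loss(scores, label, weights, c=1.0):
    """L2-SVM objective for one example: weight penalty + C * squared hinge.

    Labels are encoded one-vs-rest: +1 for the true class, -1 otherwise.
    """
    penalty = 0.5 * sum(w * w for row in weights for w in row)
    hinge = sum(
        max(0.0, 1.0 - (1.0 if k == label else -1.0) * s) ** 2
        for k, s in enumerate(scores)
    )
    return penalty + c * hinge


# Hypothetical last-layer scores for one image with three classes.
scores = [2.0, -1.5, 0.5]
weights = [[0.0, 0.0], [0.0, 0.0], [0.0, 0.0]]  # zero weights: no penalty term

print(softmax_cross_entropy(scores, label=0))
print(squared_hinge_loss(scores, label=0, weights=weights, c=1.0))
```

In this toy case only the third class violates the margin (its score 0.5 is above -1), so the squared hinge term is 1.5 squared. The softmax loss, by contrast, is never exactly zero, which is one intuition for why the two output layers can train differently.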
Empirical data has shown that the CNN-SVM model was able to achieve a test accuracy of ~99.04% using the MNIST dataset ([LeCun, Cortes, and Burges, 2010](http://yann.lecun.com/exdb/mnist/)). On the other hand, the CNN-Softmax was able to achieve a test accuracy of ~99.23% using the same dataset. Both models were also tested on the recently published Fashion-MNIST dataset ([Xiao, Rasul, and Vollgraf, 2017](https://arxiv.org/abs/1708.07747)), which is supposed to be a more difficult image classification dataset than MNIST ([Zalandoresearch, 2017](http://github.com/zalandoresearch/fashion-mnist)). This proved to be the case, as CNN-SVM reached a test accuracy of ~90.72%, while the CNN-Softmax reached a test accuracy of ~91.86%. These results may be improved if data preprocessing techniques were employed on the datasets, and if the base CNN model were relatively more sophisticated than the one used in this study.

## Usage

First, clone the project.

```bash
git clone https://github.com/AFAgarap/cnn-svm.git/
```

Run `setup.sh` to ensure that the prerequisite libraries are installed in the environment.

```bash
sudo chmod +x setup.sh
./setup.sh
```

Program parameters.

```bash
usage: main.py [-h] -m MODEL -d DATASET [-p PENALTY_PARAMETER] -c
               CHECKPOINT_PATH -l LOG_PATH

CNN & CNN-SVM for Image Classification

optional arguments:
  -h, --help            show this help message and exit

Arguments:
  -m MODEL, --model MODEL
                        [1] CNN-Softmax, [2] CNN-SVM
  -d DATASET, --dataset DATASET
                        path of the MNIST dataset
  -p PENALTY_PARAMETER, --penalty_parameter PENALTY_PARAMETER
                        the SVM C penalty parameter
  -c CHECKPOINT_PATH, --checkpoint_path CHECKPOINT_PATH
                        path where to save the trained model
  -l LOG_PATH, --log_path LOG_PATH
                        path where to save the TensorBoard logs
```

Then, go to the repository's directory, and run the `main.py` module with the desired parameters.
```bash
cd cnn-svm
python3 main.py --model 2 --dataset ./MNIST_data --penalty_parameter 1 --checkpoint_path ./checkpoint --log_path ./logs
```

## Results

The hyperparameters used in this project were assigned manually, not found through optimization.

|Hyperparameters|CNN-Softmax|CNN-SVM|
|---------------|-----------|-------|
|Batch size|128|128|
|Learning rate|1e-3|1e-3|
|Steps|10000|10000|
|SVM C|N/A|1|

The experiments were conducted on a laptop computer with Intel Core(TM) i5-6300HQ CPU @ 2.30GHz x 4, 16GB of DDR3 RAM, and NVIDIA GeForce GTX 960M 4GB GDDR5 GPU.

![](figures/accuracy-loss-mnist.png)

**Figure 1. Training accuracy (left) and loss (right) of CNN-Softmax and CNN-SVM on image classification using [MNIST](http://yann.lecun.com/exdb/mnist/).**

The orange plot refers to the training accuracy and loss of CNN-Softmax, with a test accuracy of 99.22999739646912%. On the other hand, the blue plot refers to the training accuracy and loss of CNN-SVM, with a test accuracy of 99.04000163078308%. The results do not corroborate the findings of [Tang (2013)](https://arxiv.org/abs/1306.0239) for [MNIST handwritten digits](http://yann.lecun.com/exdb/mnist/) classification. This may be attributed to the fact that neither data preprocessing nor dimensionality reduction was done on the dataset for this project.

![](figures/accuracy-loss-fashion.png)

**Figure 2. Training accuracy (left) and loss (right) of CNN-Softmax and CNN-SVM on image classification using [Fashion-MNIST](http://github.com/zalandoresearch/fashion-mnist).**

The red plot refers to the training accuracy and loss of CNN-Softmax, with a test accuracy of 91.86000227928162%. On the other hand, the light blue plot refers to the training accuracy and loss of CNN-SVM, with a test accuracy of 90.71999788284302%. The result on CNN-Softmax corroborates the finding by [zalandoresearch](https://github.com/zalandoresearch) on [Fashion-MNIST](https://github.com/zalandoresearch/fashion-mnist#benchmark).
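To put the accuracy gaps in perspective, the short sketch below converts the reported test accuracies (rounded to two decimals) into approximate counts of misclassified images. It assumes that both models were evaluated on the standard 10,000-image test split of each dataset, which the text above does not state explicitly; the helper name `misclassified` is hypothetical.

```python
# Reported test accuracies in percent, rounded to two decimals.
reported = {
    "MNIST": {"CNN-Softmax": 99.23, "CNN-SVM": 99.04},
    "Fashion-MNIST": {"CNN-Softmax": 91.86, "CNN-SVM": 90.72},
}
TEST_SET_SIZE = 10_000  # assumption: standard test split of both datasets


def misclassified(accuracy_percent, n=TEST_SET_SIZE):
    """Approximate number of test images a model with this accuracy gets wrong."""
    return round(n * (100.0 - accuracy_percent) / 100.0)


for dataset, models in reported.items():
    errors = {name: misclassified(acc) for name, acc in models.items()}
    gap = errors["CNN-SVM"] - errors["CNN-Softmax"]
    print(f"{dataset}: {errors}, CNN-SVM makes {gap} more errors")
```

Under that assumption, the difference between the two models amounts to 19 images on MNIST and 114 images on Fashion-MNIST.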
## Citation

To cite the paper, kindly use the following BibTeX entry:

```
@article{agarap2017architecture,
  title={An Architecture Combining Convolutional Neural Network (CNN) and Support Vector Machine (SVM) for Image Classification},
  author={Agarap, Abien Fred},
  journal={arXiv preprint arXiv:1712.03541},
  year={2017}
}
```

To cite the repository/software, kindly use the following BibTeX entry:

```
@misc{abien_fred_agarap_2017_1098369,
  author = {Abien Fred Agarap},
  title  = {AFAgarap/cnn-svm v0.1.0-alpha},
  month  = dec,
  year   = 2017,
  doi    = {10.5281/zenodo.1098369},
  url    = {https://doi.org/10.5281/zenodo.1098369}
}
```

## License

```
Copyright 2017 Abien Fred Agarap

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
```

Uploader: Bill_zhang5