Bag-of-Visual-Words Models for Adult Image Classification and Filtering
Thomas Deselaers¹*, Lexi Pimenidis², Hermann Ney¹*
¹ Human Language Technology and Pattern Recognition, ² Security and Privacy Research
Computer Science Department, RWTH Aachen University, Aachen, Germany
E-mail: deselaers@cs.rwth-aachen.de
* This work was partially funded by the Deutsche Forschungsgemeinschaft (DFG) under grant NE-572/6.
Abstract
We present a method to classify images into different categories of pornographic content in order to build a system for filtering pornographic images from network traffic. Although various systems for this application have been presented in the past, most of them are based on simple skin-colour features and perform rather poorly. Recent advances in image recognition, in particular in object classification, have shown that bag-of-visual-words approaches work well for many image classification problems. The system we present here is based on this approach, uses a task-specific visual vocabulary, and is trained and evaluated on an image database of 8500 images from different categories. We show that it clearly outperforms earlier systems on this dataset, and further evaluation on two novel web-traffic collections confirms the good performance of the proposed system.
1 Introduction
Rating images according to their content is an important
application area, with one main application in filtering
network traffic to prohibit e.g. viewing pornographic
material. One desired property of such a system is the
possibility to dynamically change the content-type that
is filtered to avoid the necessity of several such systems.
Different clients might require differently strict content-
filters (e.g. elementary schools or religious institutions
might require different filters than universities or private
employers). At home, people might want to enable such
a filter over the day, when children are using the com-
puter but disable it in the late evening [14]. Ideally, an
pornographic image filter is created once and then the
filter administrator can easily select which types of im-
ages he wants the filter to remove and which types of
images are allowed.
In the literature, different techniques for filtering pornographic images have been presented: the detection of skin-coloured areas is investigated in [10, 9], and skin-colour features are used in combination with other features such as texture features and colour histograms [7, 11, 15, 2, 9, 16]. Most of these systems build on neural networks or support vector machines as classifiers. In [14], specialised features for pornographic image classification are presented and used in a retrieval/nearest-neighbour classification scheme. The POESIA filter (http://www.poesia-filter.org/) contains an open-source implementation of a skin-colour-based filter. Other approaches fuse textual and visual information from web pages in order to achieve better performance [8].
Recently, bag-of-visual-words (BOVW) models, which were initially proposed for texture classification [3, 13], have gained enormous popularity in object classification [4, 5] and natural scene analysis [6]. BOVW models are inspired by the bag-of-words models used in text classification, where a document is represented by an unordered set of the words it contains. Analogously, an image is represented by an unordered set of discrete visual words, which are obtained by discretising local descriptors. The method presented here learns a task-specific visual vocabulary and employs a log-linear model to discriminate between different classes of content type.
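To make the classification step concrete, the following is a minimal sketch (not the authors' implementation) of a log-linear model over bag-of-visual-word histograms, trained by gradient ascent on the log-likelihood; the function names, learning rate, and number of epochs are illustrative assumptions.

```python
# Sketch of a log-linear classifier: p(c|h) = exp(w_c . h) / sum_c' exp(w_c' . h)
# over bag-of-visual-word histograms h. Illustrative only, not the paper's code.
import numpy as np

def train_loglinear(H, y, n_classes, lr=0.1, epochs=200):
    """H: (N, V) visual-word histograms, y: (N,) integer class labels."""
    N, V = H.shape
    W = np.zeros((n_classes, V + 1))          # one weight vector (plus bias) per class
    X = np.hstack([H, np.ones((N, 1))])       # append a constant bias feature
    Y = np.eye(n_classes)[y]                  # one-hot targets
    for _ in range(epochs):
        scores = X @ W.T
        scores -= scores.max(axis=1, keepdims=True)   # numerical stability
        P = np.exp(scores)
        P /= P.sum(axis=1, keepdims=True)     # class posteriors (softmax)
        W += lr * (Y - P).T @ X / N           # gradient of the mean log-likelihood
    return W

def classify(W, h):
    x = np.append(h, 1.0)
    return int(np.argmax(W @ x))
```

In this formulation the class posterior is a softmax over linear scores of the histogram entries, which is what is commonly meant by a log-linear model.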
2 Porn Image Identification
For porn image identification, we follow the BOVW approach, where each image is represented as a histogram of visual words. The visual words are quantised local features extracted from the images, and the vocabulary is learnt task-specifically from a training database.
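As an illustration of this representation, the sketch below (an assumption-laden example, not the paper's code) assigns each local descriptor of an image to its nearest visual word in a given vocabulary and builds a normalised count histogram; hard nearest-neighbour assignment is a common simplification of mixture-based quantisation.

```python
# Minimal sketch, assuming the visual vocabulary is given as a matrix of
# codeword centres (e.g. the means of the trained mixture densities).
import numpy as np

def bovw_histogram(descriptors, vocabulary):
    """descriptors: (n, d) local descriptors of one image,
    vocabulary: (V, d) visual-word centres."""
    # squared Euclidean distance between every descriptor and every visual word
    d2 = ((descriptors[:, None, :] - vocabulary[None, :, :]) ** 2).sum(axis=2)
    words = d2.argmin(axis=1)                           # hard assignment to nearest word
    hist = np.bincount(words, minlength=len(vocabulary)).astype(float)
    return hist / max(hist.sum(), 1.0)                  # normalise for varying patch counts
```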
2.1 Bag-of-Visual-Words Method
As local features, we extract image patches around difference-of-Gaussian interest points [12], which are scaled to a common size and then PCA-transformed, keeping 30 coefficients to reduce their dimensionality. The advantage of patches over, e.g., SIFT descriptors [12] is the straightforward inclusion of colour information, which is clearly important for the task addressed here.
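The following sketch illustrates this feature-extraction step under several assumptions: OpenCV's SIFT detector stands in for the difference-of-Gaussian interest point detector, the 16x16-pixel patch size is an illustrative choice not specified above, and the PCA is computed directly via an SVD.

```python
# Sketch of the local-feature step, not the original implementation.
import cv2
import numpy as np

PATCH_SIZE = 16   # assumed common patch size (not specified in the text)
PCA_DIMS = 30     # number of PCA coefficients kept, as stated above

def extract_patches(image_bgr):
    gray = cv2.cvtColor(image_bgr, cv2.COLOR_BGR2GRAY)
    keypoints = cv2.SIFT_create().detect(gray, None)    # DoG interest points (stand-in)
    patches = []
    for kp in keypoints:
        x, y, r = int(kp.pt[0]), int(kp.pt[1]), max(int(kp.size // 2), 1)
        patch = image_bgr[max(y - r, 0):y + r, max(x - r, 0):x + r]
        if patch.size == 0:
            continue
        patch = cv2.resize(patch, (PATCH_SIZE, PATCH_SIZE))   # common size, keeps colour
        patches.append(patch.reshape(-1).astype(np.float32))
    return np.array(patches)

def pca_reduce(patches, dims=PCA_DIMS):
    mean = patches.mean(axis=0)
    centred = patches - mean
    _, _, vt = np.linalg.svd(centred, full_matrices=False)
    return centred @ vt[:dims].T    # project each patch onto the first 30 components
```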
To create a visual vocabulary, we use an algorithm for unsupervised training of Gaussian mixture models. This algorithm creates a set of 2^#splits densities by iteratively splitting each existing density in the