人工智能-项目实践-检测-人脸检测加表情识别.zip_孪生网络人脸识别研究资源-CSDN文库

共73个文件

hdf5：35个

py：22个

jpg：4个

版权申诉

opencv

人工智能

人脸检测

表情识别

17 浏览量 2023-12-23 16:17:50 上传评论收藏 76.05MB ZIP 举报

资源推荐

资源详情

资源评论

收起资源包目录

人工智能-项目实践-检测-人脸检测加表情识别.zip （73个子文件）

emotion_recognition-master

report.pdf 958KB

src

__init__.py 0B

video_emotion_gender_demo.py 4KB

detect_face.py 31KB

image_gradcam_demo.py 3KB

utils

__init__.py 0B

preprocessor.py 642B

data_augmentation.py 10KB

inference.py 2KB

grad_cam.py 7KB

visualizer.py 6KB

datasets.py 6KB

web

__init__.py 1B

emotion_gender_processor.py 4KB

faces.py 1003B

train_gender_classifier.py 3KB

video_emotion_color_demo.py 3KB

video_dectect_emotion.py 4KB

video_gradcam_demo.py 3KB

train_emotion_classifier.py 3KB

image_emotion_gender_demo.py 3KB

models

__init__.py 0B

cnn.py 13KB

det3.npy 1.49MB

det2.npy 392KB

det1.npy 27KB

datasets

.gitignore 68B

REQUIREMENTS.txt 105B

trained_models

gender_models

simple_CNN.81-0.96.hdf5 7.35MB

gender_mini_XCEPTION.21-0.95.hdf5 784KB

emotion_models

fer2013_mini_XCEPTION.102-0.66.hdf5 852KB

simple_CNN.530-0.65.hdf5 7.47MB

fer2013_mini_XCEPTION.97-0.65.hdf5 852KB

fer2013_mini_XCEPTION.51-0.63.hdf5 852KB

fer2013_mini_XCEPTION.02-0.52.hdf5 852KB

simple_CNN.985-0.66.hdf5 7.47MB

fer2013_mini_XCEPTION.100-0.65.hdf5 852KB

fer2013_mini_XCEPTION.99-0.65.hdf5 852KB

fer2013_mini_XCEPTION.14-0.59.hdf5 852KB

fer2013_mini_XCEPTION.110-0.65.hdf5 852KB

fer2013_mini_XCEPTION.04-0.55.hdf5 852KB

fer2013_mini_XCEPTION.29-0.62.hdf5 852KB

fer2013_mini_XCEPTION.70-0.63.hdf5 852KB

fer2013_mini_XCEPTION.107-0.66.hdf5 852KB

mini_XCEPTION_KDEF.hdf5 852KB

fer2013_mini_XCEPTION.43-0.64.hdf5 852KB

tiny_XCEPTION_KDEF.hdf5 386KB

fer2013_mini_XCEPTION.37-0.62.hdf5 852KB

fer2013_mini_XCEPTION.00-0.47.hdf5 852KB

fer2013_mini_XCEPTION.25-0.60.hdf5 852KB

fer2013_mini_XCEPTION.27-0.62.hdf5 852KB

fer2013_mini_XCEPTION.10-0.58.hdf5 852KB

fer2013_mini_XCEPTION.32-0.62.hdf5 852KB

fer2013_mini_XCEPTION.05-0.56.hdf5 852KB

fer2013_mini_XCEPTION.12-0.58.hdf5 852KB

fer2013_mini_XCEPTION.41-0.62.hdf5 852KB

fer2013_mini_XCEPTION.38-0.62.hdf5 852KB

fer2013_mini_XCEPTION.03-0.53.hdf5 852KB

fer2013_mini_XCEPTION.15-0.60.hdf5 852KB

fer2013_mini_XCEPTION.11-0.58.hdf5 852KB

fer2013_mini_XCEPTION.08-0.57.hdf5 852KB

detection_models

haarcascade_frontalface_default.xml 908KB

fer2013_mini_XCEPTION.119-0.65.hdf5 848KB

fer2013_big_XCEPTION.54-0.66.hdf5 2.48MB

Dockerfile 532B

images

demo_results.png 723KB

color_demo.gif 26.62MB

gradcam_results.png 498KB

12_angry_men.jpg 134KB

robocup_team.png 3.62MB

emotion_classification.jpg 194KB

solvay_conference.jpg 133KB

test_image.jpg 503KB

Real-time Convolutional Neural Networks for

Emotion and Gender Classiﬁcation

Octavio Arriaga

Hochschule Bonn-Rhein-Sieg

Sankt Augustin Germany

Email: octavio.arriaga@smail.inf.h-brs.de

Paul G. Pl

oger

Hochschule Bonn-Rhein-Sieg

Sankt Augustin Germany

Email: paul.ploeger@h-brs.de

Matias Valdenegro

Heriot-Watt University

Edinburgh, UK

Email: m.valdenegro@hw.ac.uk

Abstract—In this paper we propose an implement a general

convolutional neural network (CNN) building framework for

designing real-time CNNs. We validate our models by creat-

ing a real-time vision system which accomplishes the tasks of

face detection, gender classiﬁcation and emotion classiﬁcation

simultaneously in one blended step using our proposed CNN

architecture. After presenting the details of the training pro-

cedure setup we proceed to evaluate on standard benchmark

sets. We report accuracies of 96% in the IMDB gender dataset

and 66% in the FER-2013 emotion dataset. Along with this we

also introduced the very recent real-time enabled guided back-

propagation visualization technique. Guided back-propagation

uncovers the dynamics of the weight changes and evaluates

the learned features. We argue that the careful implementation

of modern CNN architectures, the use of the current regu-

larization methods and the visualization of previously hidden

features are necessary in order to reduce the gap between slow

performances and real-time architectures. Our system has been

validated by its deployment on a Care-O-bot 3 robot used during

RoboCup@Home competitions. All our code, demos and pre-

trained architectures have been released under an open-source

license in our public repository.

I. INTRODUCTION

The success of service robotics decisively depends on a

smooth robot to user interaction. Thus, a robot should be

able to extract information just from the face of its user,

e.g. identify the emotional state or deduce gender. Interpret-

ing correctly any of these elements using machine learning

(ML) techniques has proven to be complicated due the high

variability of the samples within each task [4]. This leads to

models with millions of parameters trained under thousands of

samples [3]. Furthermore, the human accuracy for classifying

an image of a face in one of 7 different emotions is 65% ±

5% [4]. One can observe the difﬁculty of this task by trying

to manually classify the FER-2013 dataset images in Figure

1 within the following classes {“angry”, “disgust”, “fear”,

“happy”, “sad”, “surprise”, “neutral”}.

In spite of these difﬁculties, robot platforms oriented to

attend and solve household tasks require facial expressions

systems that are robust and computationally efﬁcient. More-

over, the state-of-the-art methods in image-related tasks such

as image classiﬁcation [1] and object detection are all based on

Convolutional Neural Networks (CNNs). These tasks require

CNN architectures with millions of parameters; therefore,

their deployment in robot platforms and real-time systems

Fig. 1: Samples of the FER-2013 emotion dataset [4].

Fig. 2: Samples of the IMDB dataset [9].

becomes unfeasible. In this paper we propose an implement

a general CNN building framework for designing real-time

CNNs. The implementations have been validated in a real-time

facial expression system that provides face-detection, gender

classiﬁcation and that achieves human-level performance when

classifying emotions. This system has been deployed in a

care-O-bot 3 robot, and has been extended for general robot

platforms and the RoboCup@Home competition challenges.

Furthermore, CNNs are used as black-boxes and often their

learned features remain hidden, making it complicated to

establish a balance between their classiﬁcation accuracy and

unnecessary parameters. Therefore, we implemented a real-

time visualization of the guided-gradient back-propagation

proposed by Springenberg [11] in order to validate the features

learned by the CNN.

II. RELATED WORK

Commonly used CNNs for feature extraction include a

set of fully connected layers at the end. Fully connected

layers tend to contain most of the parameters in a CNN.

Speciﬁcally, VGG16 [10] contains approximately 90% of all

its parameters in their last fully connected layers. Recent

architectures such as Inception V3 [12], reduced the amount

of parameters in their last layers by including a Global

Average Pooling operation. Global Average Pooling reduces

each feature map into a scalar value by taking the average over

all elements in the feature map. The average operation forces

the network to extract global features from the input image.

Modern CNN architectures such as Xception [1] leverage from

the combination of two of the most successful experimental

assumptions in CNNs: the use of residual modules [6] and

depth-wise separable convolutions [2]. Depth-wise separable

convolutions reduce further the amount of parameters by

separating the processes of feature extraction and combination

within a convolutional layer.

Furthermore, the state-of-the-art model for the FER2-2013

dataset is based on CNN trained with square hinged loss

[13]. This model achieved an accuracy of 71% [4] using

approximately 5 million parameters. In this architecture 98%

of all parameters are located in the last fully connected layers.

The second-best methods presented in [4] achieved an

accuracy of 66% using an ensemble of CNNs.

III. MODEL

We propose two models which we evaluated in accordance

to their test accuracy and number of parameters. Both models

were designed with the idea of creating the best accuracy

over number of parameters ratio. Reducing the number of

parameters help us overcoming two important problems. First,

the use of small CNNs alleviate us from slow performances

in hardware-constrained systems such robot platforms. And

second, the reduction of parameters provides a better gener-

alization under an Occam’s razor framework. Our ﬁrst model

relies on the idea of eliminating completely the fully connected

layers. The second architecture combines the deletion of the

fully connected layer and the inclusion of the combined

depth-wise separable convolutions and residual modules. Both

architectures were trained with the ADAM optimizer [8].

Following the previous architecture schemas, our initial ar-

chitecture used Global Average Pooling to completely remove

any fully connected layers. This was achieved by having in the

last convolutional layer the same number of feature maps as

number of classes, and applying a softmax activation function

Fig. 3: Our proposed model for real-time classiﬁcation.

to each reduced feature map. Our initial proposed architecture

is a standard fully-convolutional neural network composed of

9 convolution layers, ReLUs [5], Batch Normalization [7]

and Global Average Pooling. This model contains approx-

imately 600,000 parameters. It was trained on the IMDB

gender dataset, which contains 460,723 RGB images where

each image belongs to the class “woman” or “man”, and it

achieved an accuracy of 96% in this dataset. We also validated

this model in the FER-2013 dataset. This dataset contains

35,887 grayscale images where each image belongs to one

of the following classes {“angry”, “disgust”, “fear”, “happy”,

“sad”, “surprise”, “neutral”}. Our initial model achieved an

accuracy of 66% in this dataset. We will refer to this model

as “sequential fully-CNN”.

Our second model is inspired by the Xception [1] archi-

tecture. This architecture combines the use of residual mod-

ules [6] and depth-wise separable convolutions [2]. Residual

modules modify the desired mapping between two subsequent

layers, so that the learned features become the difference of the

original feature map and the desired features. Consequently,

the desired features H(x) are modiﬁed in order to solve an

easier learning problem F (X) such that:

H(x) = F (x) + x (1)

评论收藏

内容反馈

版权申诉

博士僧小星

粉丝: 2381
资源: 5995

人工智能-项目实践-检测-人脸检测加表情识别.zip

opencv的一些人脸项目

基于keras的人脸表情识别

Python使用OpenCV人脸检测、识别、轮廓标识、性别识别等功能应用（源代码）

适用于国内应用的独立于机器人的ROS包_Python_C++_下载.zip

PyQt5+Caffe+Opencv搭建人脸识别登录界面

人工智能-项目实践-检测-facenet人脸检测与识别系统.zip

人工智能-项目实践-检测-一键人脸归一化处理工具，包括人脸检测，人脸关键点检测，基于关键点的人脸对齐.zip

微信小程序AI识别人脸WechatAI--AI-.zip

人工智能-项目实践-计算机视觉-音视频流录制+同时语音识别+同时人脸识别+同时语音合成.zip

PHP simple_html_dom.php+正则 采集文章代码

simple_CNN.530-0.65.hdf5

simple_pid.zip_matlab例程_matlab_

test_hdf5.zip

DeepLearnToolbox_CNN_lzbV3.0_cnnbp.m

毕业设计 - 基于树莓派、OpenCV及Python语言的人脸识别.zip

人工智能-项目实践-C#-基于C# 使用 百度API人脸检测V3版 实现人脸检测展示数据.zip

人工智能-项目实践-C#-C# 超简单的离线人脸识别库.zip

人脸表情识别系统源码.zip

深度学习基于卷积神经网络的人脸面部表情识别项目源码+论文+面部表情数据集+训练好的模型

CNN快速入门代码.rar

yolov5_models.zip

emotion_recognition:pythontf中的CNN从48x48面部图像中识别出6种情绪

cnn_head_detection-master.zip

hdf5_1.12.0安装包.zip

基于亚博K210的人脸识别项目python源码.zip

人脸表情 / 微表情识别，毕业设计.zip

人脸表情识别.zip

Java开发基于seetaface6的人脸识别（活体检测）的封装源码.zip

本科毕业设计-基于深度学习的口罩佩戴检测及人脸识别系统源码.zip

最新资源

PHP simple_html_dom.php+正则采集文章代码

人工智能-项目实践-C#-基于C# 使用百度API人脸检测V3版实现人脸检测展示数据.zip