基于系统调用序列特征加权的Android恶意软件检测方法资源-CSDN文库

52 浏览量 2021-03-06 11:47:10 上传评论收藏 954KB PDF 举报

从给定文件信息中，我们可以提取出以下知识点： 1. Android恶意软件检测技术是当前的一个研究热点。智能手机的普及使得恶意软件的攻击目标发生了变化，不再仅限于传统PC，而是开始针对智能手机。智能手机存储和处理的信息量大幅度增长，这为恶意软件提供了更多的攻击机会。 2. 特征选择在有监督和无监督学习中是一个重要环节。在恶意软件检测研究中，很多研究倾向于使用有限大小的样本进行特征提取，这容易导致样本不均衡问题，即积极类（恶意软件）的特征被选择得过少，而消极类（正常软件）的特征过多。这种不均衡会严重影响恶意软件检测的准确性。 3. 在本文中，作者提出了一种研究恶意软件和正常软件生成的系统调用序列样本的新方法，并使用TF-RF（词频-随机森林）相关性分类特征加权方法来提取特征。该方法能够有效地保留多数类别（正类）的特征，并对少数类别（稀有类）的特征分类能力也表现良好，从而提升了恶意软件检测的准确性。 4. 文章提到了智能手机的碎片化和兼容性问题，以及恶意软件的变换、加密壳和反分析技术给恶意软件检测带来了挑战。 5. 由于智能手机的这些特性，传统的PC恶意软件检测技术并不完全适用于智能手机环境，因此需要开发新的技术来应对智能手机恶意软件的检测挑战。 6. 文章还提到了研究的项目背景，这项研究是在国家自然科学基金（项目编号***）支持的“基于程序流程水印的活跃流关联技术研究”项目下进行的。此外，研究是在中国河南省的国家数学与先进计算重点实验室进行的。 7. 文章的关键词包括Android、DF（可能指数据流）、恶意软件、SVM（支持向量机）和TF-RF（词频-随机森林），这些关键词指出了文章研究的主要内容和方法。通过以上信息，我们可以了解到目前智能手机恶意软件的检测是信息安全领域的一个重要问题，同时介绍了作者提出的一种新的检测方法，利用系统调用序列的特征加权来提高检测的准确性和效率。此外，该方法考虑了样本不平衡的问题，并针对性地解决。研究项目背景信息和参与机构也强调了该研究领域的专业性和实用性。

资源推荐

资源详情

资源评论

Acta Technica 62, No. 3B/2017, 371–380

 2017 Institute of Thermomechanics CAS, v.v.i.

Android malware detection method

based on system call sequence feature

weighting

Xueli Hu

, Shuai Yi

, Zhenxing Wang

Liancheng Zhang

Abstract. The amount of information carried by smartphones grows tremendously, which

makes them one of the attack targets. The malware detection technique intended for smartphones

has become a research hotspot. Feature selection is an important process in both supervised and

unsupervised learning. However, many malware detection studies tend to perform their feature

extraction based on very limited size samples, which can easily lead to the selection of unbalanced

samples. Nonetheless, the unbalanced samples problem in malware detection techniques has not

raised enough awareness and attention yet. In this paper, we propose an approach that studies the

system call sequence samples generated by both malicious and normal and uses TF-RF relevance

category feature weighting approach to extract features. The proposed approach can eﬀectively

retain the majority classes features (positive classes) and also has a good features classiﬁcation

capability for minority classes (rare classes), which improves malware detection accuracy.

Key words. Android, DF, malware, SVM, TF-RF.

1. Introduction

With increasing popularity of smartphones, malware which have been targeting

PCs for several decades have quietly extended their scope to them. The malwares

have caused severe damages to smartphones which greatly aﬀected their users. Due

to the smartphones fragmentation and compatibility problems, and malware tech-

niques such as transformation, encryption shell and anti-analysis, malware detection

is a very challenging task.

Previous studies show that if an application with malicious behavior wants to per-

This research was performed under the project: Studies on Active Flow Association Technique

Based on Package Program Flow Watermarking, supported by the National Nature and Science

Fund (Fund No. 61402526).

State Key Laboratory of Mathematical Engineering and Advanced Computing, Henan, 45000,

China; e-mail: success_receive@hotmail.com

http://journal.it.cas.cz

372 XUELI HU, SHUAI YI, ZHENXING WANG, LIANCHENG ZHANG

form its tasks, it must request the relevant services after being executed. If system

calls are monitored and behavior trails of the entire lifecycle system calls are ac-

quired, then system calls can be analyzed and the behaviors can be examined. Thus

it can be concluded whether the codes are malicious or not. References [1–5] selected

system calls as features for malwares detection. And some other related research

results extract diﬀerent features as features for malwares detection [6–8].However,

in [3] only the frequencies of a single system call are considered, whereas the depen-

dencies between certain system calls and their orders are not considered. Reference

[4] only considered part of system call events during the features selection, so the

detection is not comprehensive.

Due to the rapid development of malware, malware sample set update is not

timely, leading to obsolete samples, easy to lead to training phase of unbalanced

data sample sets problem. Malware detection is a typical unbalanced sample set

classiﬁcation problem. Unbalanced data means that some classes have very few

instances in the dataset (i.e., minority classes), while other classes have many in-

stances (i.e., majority classes). According to Gary M Weiss [9], unbalanced data set

can cause a series of problems, such as data sparseness, data fragmentation, and in-

ductive deviation. These problems might reduce the performances of the traditional

classiﬁcation methods. To improve the unbalanced data sets and improve the system

eﬃciency, [10] used sparse matrices extracted from local singular value decomposi-

tion in order to reduce the system load. But singular value decomposition cannot

reduce the impact of the unbalanced feature selection on the detection results.

Aiming at the above research problems, this paper proved a method. The paper

contributions of research presented in this paper are as listed below:

1. Proved an approach based on two anti-debugging techniques, the App self-

attachment and the dynamic process additional state, is proposed. the method

tracks all the system call sequences of the App progress and obtains the system call

traces of the entire lifecycle.

2. Most of the existing Android malware detection techniques are based on oc-

curring frequency of each individual system call, whereas the dependencies between

multiple system calls are neglected. However, that shortcoming is compensated in

the proposed approach by sorting out of some implicit features from the N-Gram

data sets. Experimental results have demonstrated that these features are highly

eﬀective in malware detection.

3. The TF-RF relevance category feature weighting method is adopted. Through

feature computation and inter-class correlation based on features selection, the ap-

proach can eﬀectively select as many beneﬁcial minority class features as is possible,

while maintaining the majority class features.

2. System architecture

According to the fact that the malware uses some certain system call sequences

to execute its malicious behavior and that these system call sequences rarely appear

in the normal codes, they can be used to extract the malware features. In order

to achieve malware detection, TF-RF algorithm was used for features extraction

剩余9页未读，继续阅读

评论收藏

内容反馈

weixin_38618819

粉丝: 4
资源: 894

基于系统调用序列特征加权的Android恶意软件检测方法

基于特征融合的Android恶意软件检测系统

基于特征生成方法的Android恶意软件检测方法.pdf

基于非常规特征的Android恶意软件检测方法.pdf

一种基于元信息的Android恶意软件检测方法

系统调用序列在Markov链上的反向传播神经网络：一种通过系统调用序列检测Android恶意软件的新方法

基于特征码的恶意软件检测实验

一种Android恶意软件检测模型.pdf

用于Android恶意软件检测的模型检查

依特征频率的安卓恶意软件异常检测的研究.pdf

基于抽象API调用序列的Android恶意软件检测方法.pdf

基于组合式算法的Android恶意软件检测方法

基于多类特征的Android应用恶意行为检测系统

一种Android平台恶意软件静态检测方法.pdf

Android恶意软件检测方法研究综述.pdf

基于深度置信网络的Android恶意软件检测.pdf

最新资源