
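[Review reference] "Automated Testing of Android Apps: A Systematic Literature Review". With the spread of Android smart devices, the platform's popularity has risen sharply in recent years. As of July 2017, the official app store, Google Play, distributed over 3 million Android applications across categories such as entertainment, personalisation, education and finance. This popularity among developer communities is partly attributed to an accessible development environment based on the familiar Java programming language and to libraries offering diverse functionality. Automated testing plays a vital role for Android apps and matters to users, developers and market maintainers alike. Given Android's widespread adoption and the specificities of its development model, academia has proposed various testing approaches to ensure that both functional and non-functional requirements are met. The paper aims to give a clear overview of the state of the art in Android app testing, highlight the main trends, identify the main methodologies applied, and enumerate the challenges that Android testing approaches face as well as the directions where community effort is still needed. Through a systematic literature review (SLR), the researchers eventually selected 103 relevant research papers published in leading conferences and journals until 2016. In-depth analysis of this literature yields several key findings and highlights the challenges that future Android testing researchers need to address. The paper also proposes several concrete research directions, including how testing approaches can address recurrent issues in app updates, the continuous increase in app size, and the fragmentation of the Android ecosystem.

1. Introduction. The spread of Android devices has created demand for efficient, reliable testing. Millions of apps mean a large number of potential bugs and performance problems, and automated testing has become a key tool for ensuring quality. The developer community's enthusiasm for Android stems from its open development environment and rich resources, but this also makes testing more complex, especially when coping with constantly changing platform versions and device diversity.

2. Main trends and methodologies. In the literature review, the researchers found a variety of testing approaches, such as model-driven testing, dynamic analysis and static analysis. Model-driven testing builds a model of app behaviour and then generates test cases from it; dynamic analysis collects information at run time to detect potential problems; static analysis checks the code as early as the coding stage, without actually running the app. Each approach has its strengths and weaknesses and suits different scenarios and needs.

3. Challenges and future directions. The main challenges facing Android testing include compatibility, performance optimisation, security testing and user interface testing. Compatibility testing must ensure that an app behaves consistently across devices and Android versions; performance testing focuses on responsiveness and resource consumption; security testing concerns the protection of user data; and user interface testing focuses on user experience. To overcome these challenges, the researchers call for smarter test generation techniques, more effective methods for assessing test coverage, and testing frameworks that adapt to the rapidly changing Android ecosystem.

4. App updates and growth in size. As apps grow larger and are updated more frequently, testing cost and complexity rise accordingly. The researchers suggest developing new testing strategies, such as incremental testing that targets only the updated parts, and lightweight continuous integration schemes, to improve testing efficiency.

5. Ecosystem fragmentation. The fragmentation of Android makes testing harder, because each version may have different APIs and features. The researchers therefore call for testing methods that are compatible across versions and for testing tools that can simulate a variety of device environments.

In summary, automated testing of Android apps is an important part of safeguarding their quality and user experience. This systematic literature review offers a comprehensive view of research in the field, pointing out current approaches, challenges and future research priorities, and provides valuable guidance for the continued improvement of Android apps.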


Automated Testing of Android Apps: A Systematic Literature Review

Pingfan Kong, Li Li, Jun Gao, Kui Liu, Tegawendé F. Bissyandé, Jacques Klein

Abstract—Automated testing of Android apps is essential for app users, app developers and market maintainer communities alike. Given the widespread adoption of Android and the specificities of its development model, the literature has proposed various testing approaches for ensuring that not only functional requirements but also non-functional requirements are satisfied. In this paper, we aim at providing a clear overview of the state-of-the-art works around the topic of Android app testing, in an attempt to highlight the main trends, pinpoint the main methodologies applied and enumerate the challenges faced by the Android testing approaches as well as the directions where the community effort is still needed. To this end, we conduct a Systematic Literature Review (SLR) during which we eventually identified 103 relevant research papers published in leading conferences and journals until 2016. Our thorough examination of the relevant literature has led to several findings and highlighted the challenges that Android testing researchers should strive to address in the future. After that, we further propose a few concrete research directions where testing approaches are needed to solve recurrent issues in app updates, continuous increases of app sizes, as well as the Android ecosystem fragmentation.

1 INTRODUCTION

Android smart devices have become pervasive after gaining tremendous popularity in recent years. As of July 2017, Google Play, the official app store, is distributing over 3 million Android applications (i.e., apps), covering over 30 categories ranging from entertainment and personalisation apps to education and financial apps. Such popularity among developer communities can be attributed to the accessible development environment based on the familiar Java programming language as well as the availability of libraries implementing diverse functionalities [1]. The app distribution ecosystem around the official store and other alternative stores such as Anzhi and AppChina is further attractive for users to find apps and for organisations to market their apps [2].

Unfortunately, the distribution ecosystem of Android is porous to poorly-tested apps [3]–[5]. Yet, as reported by Kochhar [3], error-prone apps can significantly impact user experience and lead to a downgrade of their ratings, eventually harming the reputation of app developers and their organizations [5]. It is thus becoming more and more important to ensure that Android apps are sufficiently tested before they are released on the market. However, instead of manual testing, which is often laborious, time-consuming and error-prone, the ever-growing complexity and the enormous number of Android apps call for scalable, robust and trustworthy automated testing solutions.

• The corresponding author.

• P. Kong, J. Gao, K. Liu, T. Bissyandé, and J. Klein are with the Interdisciplinary Centre for Security, Reliability and Trust, University of Luxembourg, Luxembourg.

• L. Li is with the Faculty of Information Technology, Monash University, Australia. E-mail: li.li@monash.edu

Manuscript received XXX; revised XXX. This work was supported by the Fonds National de la Recherche (FNR), Luxembourg, under projects CHARACTERIZE C17/IS/11693861 and Recommend C15/IS/10449467.

[Figure 1: diagram linking the Testing Approaches (Testing Environment) and the Android Device through the steps (1) Install App, (2) Static Analysis, (3) Send Test Cases, (4) Observe Execution Behaviour and (5) Clean Environment.]

Fig. 1: Process of testing Android apps.

Android app testing aims at testing the functionality, usability and compatibility of apps running on Android devices [6], [7]. Fig. 1 illustrates a typical working process. At Step (1), the target app is installed on an Android device. Then in Step (2), the app is analysed to generate test cases. We remind the readers that this step (in dashed line) is optional and some testing techniques such as automated random testing do not need to obtain pre-knowledge for generating test cases. Subsequently, in Step (3), these test cases are sent to the Android device to exercise the app. In Step (4), execution behaviour is observed and collected from all sorts of perspectives. Finally, in Step (5), the app is uninstalled and relevant data is wiped. We would like to remind the readers that installation of the target app is sometimes not a necessity, e.g., frameworks like Robolectric allow tests to run directly in the JVM. In fact, Fig. 1 can be borrowed to describe the workflow of testing almost any software besides Android apps. Android app testing, however, falls in a unique context and often fails to use general testing techniques [8]–[13]. There are several differences with traditional (e.g., Java) application testing that motivate research on Android app testing. We enumerate and consider for our review a few common challenges:

First, although apps are developed in Java, traditional Java-based testing tools are not immediately usable on Android apps since most control-flow interactions in Android are governed by specific event-based mechanisms such as the Inter-Component Communication (ICC [14]). To address this first challenge, several new testing tools have been specifically designed for taking Android specificities into account. For example, RERAN [15] was proposed for testing Android apps through a timing- and touch-sensitive record-and-replay mechanism, in an attempt to capture, represent and replay complicated non-discrete gestures such as the circular bird swipe with increasing slingshot tension in Angry Birds.

Second, Android fragmentation, in terms of the diversity of available OS versions and target devices (e.g., screen size varieties), is becoming more acute as testing strategies now have to take into account different execution contexts [16], [17].

Third, the Android ecosystem attracts a massive number of apps requiring scalable approaches to testing. Furthermore, these apps do not generally come with open source code, which may constrain the testing scenarios.

Finally, it is challenging to generate test cases with perfect coverage in order to find faults in Android apps. Traditional test case generation approaches based on symbolic execution and tools such as Symbolic Pathfinder (SPF) are challenged by the fact that Android apps are available in Dalvik bytecode that differs from Java bytecode. In other words, traditional Java-based symbolic execution approaches cannot be directly applied to tackle Android apps. Furthermore, the event-driven nature of apps and the framework libraries pose further obstacles for systematic generation of test cases [18].
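To make the five-step process of Fig. 1 concrete, the following Python sketch lists one possible set of `adb` commands for a random-testing run. It is an illustration under stated assumptions rather than a tool from the surveyed literature: the helper name `build_test_workflow`, the APK path and the package name are hypothetical, and the Android `monkey` tool stands in for any random test generator. Step (2) is omitted because random testing needs no prior analysis of the app.

```python
from typing import List


def build_test_workflow(apk_path: str, package: str,
                        event_count: int = 500) -> List[List[str]]:
    """Return adb commands for steps (1), (3), (4) and (5) of Fig. 1.

    Step (2), static analysis, is optional and skipped here.
    """
    return [
        # (1) Install the target app (-r replaces an existing install).
        ["adb", "install", "-r", apk_path],
        # (3) Send test cases: monkey injects pseudo-random UI events.
        ["adb", "shell", "monkey", "-p", package, "-v", str(event_count)],
        # (4) Observe execution behaviour: dump the device log and exit.
        ["adb", "logcat", "-d"],
        # (5) Clean the environment: uninstall the app and its data.
        ["adb", "uninstall", package],
    ]


# Hypothetical APK path and package name, for illustration only.
for command in build_test_workflow("app-debug.apk", "com.example.app", 100):
    print(" ".join(command))
```

Each command list can be handed to `subprocess.run` when a device or emulator is attached; printing the commands first keeps the sketch runnable without one.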

Given the variety of challenges in testing Android apps, it is important for this field, which has already produced a significant number of approaches, to reflect on what has already been solved, and on what remains to tackle. To the best of our knowledge, there is no related literature review or survey summarizing the topic of Android testing. Thus, we attempt to meet this need through a comprehensive study. Concretely, we undertake a systematic literature review (SLR), carefully following the guidelines proposed by Kitchenham et al. [19] and the lessons learned from applying SLR within the software engineering domain by Brereton et al. [20]. To achieve our goal, we have searched and identified a set of relevant publications from four well-known repositories including the ACM Digital Library and from major testing-related venues such as ISSTA and ICSE. Then, we have performed a detailed overview on the current state of research in testing Android apps, focusing on the types and phases of the testing approaches applied as well as on a trend analysis in research directions. Eventually, we summarize the limitations of the state-of-the-art approaches and highlight potential new research directions.

The main contributions of this paper are:

• We build a comprehensive repository tracking the research community effort to address the challenges in testing Android apps. In order to enable an easy navigation of the state-of-the-art, thus enabling and encouraging researchers to push the current frontiers in Android app testing, we make all collected and built information publicly available at http://lilicoding.github.io/TA2Repo/

• We analyse in detail the key aspects in testing Android apps and provide a taxonomy for clearly summarising and categorising all related research works.

• Finally, we investigate the current state of the art, enumerate the salient limitations and pinpoint a few directions for furthering the research in Android testing.

[Figure 2: flow diagram with the stages Research Question Identification, Keywords Identification, Repository and CCF-Ranked Venues Search, Results Merging, Exclusion Criteria Application, Data Extraction, Cross Checking and SLR Report; the intermediate artefacts are the Harvested Publications and the Primary Publications.]

Fig. 2: Process of the SLR.

The rest of the paper is organized as follows: Section 2 depicts the methodology of this systematic literature review, including a general overview and detailed reviewing processes of our approach. In Section 3, we present the results of our selected primary publications, along with a preliminary trend and statistical analysis on those collected publications. Later, we introduce our data extraction strategy and its corresponding findings in the following two sections: Sections 4 and 5. After that, we discuss the trends we observed and challenges the community should attempt to address in Section 6 and enumerate the threats to validity of this SLR in Section 7. A comparison of this work with literature studies is given in Section 8 and finally we conclude this SLR in Section 9.

2 METHODOLOGY OF THIS SLR

We now introduce the methodology applied in this SLR. We remind the readers that an SLR follows a well-defined strategy to systematically identify, examine, synthesize, evaluate and compare all available literature works on a specific topic, resulting in a reliable and replicable report [19], [21], [22]. Fig. 2 illustrates the process of our SLR. At the beginning, we define relevant research questions (cf. Section 2.1) to frame our investigations. The following steps are unfolded to search and consolidate the relevant literature, before extracting data for answering the research questions, and finalizing the report.

Concretely, to harvest all relevant publications, we identify a set of search keywords and apply them in two separate processes: 1) online repository search and 2) major¹ venues search. All results are eventually merged for further reviewing (cf. Section 2.2). Next, we apply some exclusion criteria on the merged list of publications, to exclude irrelevant papers (e.g., papers not written in English) or less relevant papers (e.g., short papers), in order to focus on a small, but highly relevant, set of primary publications (cf. Section 2.3). Finally, we have developed various metrics and reviewed the selected primary publications against these metrics through full paper examination. After the examination, we cross-check the extracted results to ensure their correctness and eventually we report on the findings to the research community (cf. Section 2.4).

1. We rely on the China Computer Federation (CCF) ranking of computer science venues.

2.1 Initial Research Questions

Given the common challenges enumerated in the Introduction section, which have motivated several research lines in Android apps, we investigate several research questions to highlight how and which challenges have been focused on in the literature. In particular, with regard to the fact that Android has programming specificities (e.g., event-based mechanisms, GUI), we categorize test concerns targeted by the research community. With regard to the challenge of ensuring scalability, we study the test levels which are addressed in research works. With regard to the challenge of generating test cases, we investigate in detail the fundamental testing techniques leveraged. Finally, with regard to the fragmentation of the Android ecosystem, we explore the extent of validation schemes for research approaches. Overall, we note that testing Android apps is a broad activity that can target a variety of functional and non-functional requirements and verification issues, leverage different techniques and focus on different granularity levels and phases. Our investigation thus starts with the following related research questions:

• RQ1: What are the test concerns? With this research question, we survey the various objectives sought by Android app testing researchers. In general, we investigate the testing objectives at a high level to determine what requirements (e.g., security, performance, defects, energy) the literature addresses. We look more in-depth into the specificities of Android programming, to enumerate the priorities that are tackled by the community, including which concerns (e.g., GUI and ICC mechanism) are factored in the design of testing strategies.

• RQ2: Which test levels are addressed? With the second research question, we investigate the levels (i.e., when the tests are relevant in the app development process) that research works target. The community could indeed benefit from knowing to what extent regression testing is (or is not) developed for apps which are now commonly known to evolve rapidly.

• RQ3: How are the testing approaches built? In the third research question, we process detailed information on the design and implementation of test approaches. In particular, we investigate the fundamental techniques (e.g., concolic testing or mutation testing) leveraged, as well as the amount of input information (i.e., to what extent the tester should know about the app prior to testing) that approaches require to perform.

• RQ4: To what extent are the testing approaches validated? Finally, the fourth research question investigates the metrics, datasets and procedures in the literature for measuring the effectiveness of state-of-the-art approaches. Answers to this question may shed light on the gaps in the research agenda of Android testing.

2.2 Search Strategy

We now detail the search strategy that we applied to harvest literature works related to Android app testing.

Identification of search keywords. Our review focuses on two key aspects: Testing and Android. Since a diversity of terms may be used by authors to refer, broadly or precisely, to any of these aspects, we rely on the extended set of keywords identified in Table 1. Our final search string is then constructed as a conjunction of these two categories of keywords (search string = cat1 & cat2), where each category is represented as a disjunction of its keywords (cat = kw1 | kw2 | kw3).

TABLE 1: Search Keywords

Category   Keywords
Android    android, mobile, portable device, smartphone, smart phone, smart device
Test       test, testing, measure, measurement, measuring, check, checking, detect, detecting, detection
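The construction of the search string can be sketched in a few lines of Python. This is a minimal illustration, not the authors' script: the function names are hypothetical, and the quoting and the `|` and `&` operators follow the notation of the text rather than the query syntax of any particular repository.

```python
# Keyword categories copied from Table 1.
ANDROID_KEYWORDS = ["android", "mobile", "portable device",
                    "smartphone", "smart phone", "smart device"]
TEST_KEYWORDS = ["test", "testing", "measure", "measurement", "measuring",
                 "check", "checking", "detect", "detecting", "detection"]


def disjunction(keywords):
    """Join the keywords of one category with '|' (logical OR)."""
    return "(" + " | ".join(f'"{kw}"' for kw in keywords) + ")"


def build_search_string(*categories):
    """Join the per-category disjunctions with '&' (logical AND)."""
    return " & ".join(disjunction(category) for category in categories)


print(build_search_string(ANDROID_KEYWORDS, TEST_KEYWORDS))
```

A real query would have to be translated into each repository's own advanced-search syntax, which differs from one database to another.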

Online repository search. We use the search string on online literature databases to find and collect relevant papers. We have considered four widely used repositories for our work: ACM Digital Library², IEEE Xplore Digital Library³, SpringerLink⁴, and ScienceDirect⁵. The “advanced” search functionality of the four selected online repositories is known to be inaccurate, which usually results in a huge set of irrelevant publications that add noise to the final paper set [22]. Indeed, those irrelevant publications do not really match our keywords criteria. For example, they may not contain any of the keywords shown in the Test category. Thus, we develop scripts (combining Python and Shell) to perform off-line matching verification on the papers yielded by those search engines, where the scripts follow exactly the same criteria that we have used for online repository search. For example, regarding the keywords enumerated in the Test category, if none of them is present in a publication, the scripts will mark that publication as irrelevant and subsequently exclude it from the candidate list.
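The off-line verification step can be pictured as follows. The authors' scripts are not reproduced in the paper, so this Python sketch only assumes one plausible reading: a record is kept when its text contains at least one keyword of each category from Table 1. Matching keywords as case-insensitive word prefixes (so that "tests" matches "test") is an assumption of this sketch, as are the function names and the sample records.

```python
import re

# Keyword categories copied from Table 1.
ANDROID_KEYWORDS = ["android", "mobile", "portable device",
                    "smartphone", "smart phone", "smart device"]
TEST_KEYWORDS = ["test", "testing", "measure", "measurement", "measuring",
                 "check", "checking", "detect", "detecting", "detection"]


def mentions_any(text, keywords):
    """True if any keyword starts a word in the text (case-insensitive)."""
    return any(re.search(r"\b" + re.escape(kw), text, re.IGNORECASE)
               for kw in keywords)


def is_candidate(text):
    """Conjunction of the two categories: cat1 & cat2."""
    return (mentions_any(text, ANDROID_KEYWORDS)
            and mentions_any(text, TEST_KEYWORDS))


# Made-up record texts, for illustration only.
records = [
    "Automated GUI tests for Android apps",
    "Malware analysis on smartphones",
    "Unit testing of web services",
]
candidates = [record for record in records if is_candidate(record)]
print(candidates)
```

Only the first record survives: the second lacks any keyword of the Test category and the third lacks any keyword of the Android category.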

Major venues search. Since we only consider a few repositories for search, the coverage can be limited given that a few conferences such as NDSS⁶ and SEKE⁷ do not host their proceedings in the aforementioned repositories. Thus, to mitigate the threat to validity of not including all relevant papers, we further explicitly search in proceedings of all major venues in computer science. We have chosen the comprehensive CCF-ranking of venues⁸ and leveraged the DBLP⁹ repository to collect the Digital Object Identifiers (DOI) of the publications in order to crawl abstracts and all publication metadata. Since this search process considers major journal and conference venues, the resulting set of literature papers should be a representative collection of the state-of-the-art.

2. http://dl.acm.org/

3. http://ieeexplore.ieee.org/Xlpore/home.jsp

4. http://link.springer.com

5. http://www.sciencedirect.com

6. The Network and Distributed System Security Symposium

7. International Conference on Software Engineering & Knowledge Engineering

8. http://www.ccf.org.cn/sites/ccf/paiming.jsp. We only take into account the software engineering and security categories because, from what we have observed, they contain the majority of papers related to testing Android apps.

9. http://dblp.uni-trier.de

2.3 Exclusion Criteria

After execution of our search based on the provided keywords, a preliminary manual scanning showed that the results are rather coarse-grained since they included a number of irrelevant or less relevant publications which, nonetheless, matched¹⁰ the keywords. It is thus necessary to perform a fine-grained inclusion/exclusion in order to focus on a consistent and reliable set of primary publications and reduce the eventual effort in further in-depth examination. For this SLR, we have applied the following exclusion criteria:

1) Papers that are not written in English are filtered out since English is the common language spoken in the worldwide scientific peer-reviewing community.

2) Short papers are excluded, mainly because such papers are often work-in-progress or idea papers: on the one hand, short papers are generally not mature, and, on the other hand, many of them will eventually appear later in a full paper format. In the latter case, mature works are likely to already be included in our final set. In this work, we take a given publication as a short paper when it has 4 pages or fewer in IEEE/ACM-like double-column format¹¹ or 8 pages or fewer in LNCS-like single-column format, as short papers are likely to be 4 pages in double-column format and 8 pages in single-column format.

3) Papers that are irrelevant to testing Android apps are excluded. Our search keywords indeed included broad terms such as mobile and smartphone as we aimed at finding all papers related to Android even when the term “Android” was not specifically included in the title and abstract. As a consequence, the results also contain papers that only deal with mobile apps for other platforms such as iOS and Windows, which we exclude.

4) Duplicated papers are removed. It is quite common for authors to publish an extended version of their conference paper to a journal venue. However, these papers share most of the ideas and approach steps. To consider both of them would result in a biased weighting of the metrics in the review. To mitigate this, we identify duplicate papers by first comparing paper titles, abstracts and authors and then further manually checking when a given pair of records share a major part of their contents. We filter out the least recent publication when duplication is confirmed.

5) Papers that conduct comparative evaluations, including surveys on different approaches of testing Android apps, are excluded. Such papers indeed do not introduce new technical contributions for testing Android apps.

6) Papers in which the testing approach targets the operating system, networks, or hardware, rather than mobile apps, are excluded.

7) Papers that assess¹² existing testing methods are also filtered out. The publications that they discuss are supposed to be already included in our search results.

8) Papers demonstrating how to set up environments and platforms to retrieve runtime data from Android apps are excluded. These papers are also important for Android app testing, but they are not focusing on new testing methodology.

9) Finally, some of our keywords (e.g., “detection” of issues, “testing” of apps) have led to the retrieval of irrelevant literature works that must be excluded. We have mainly identified two types of such papers: the first includes papers that perform detection of malicious apps using machine learning (and not testing); the second includes papers that describe the building of complex platforms, adopting existing mature testing methodologies.

10. The keywords were found for example to be mentioned in the related sections of the identified papers.

11. Note that we have actually kept a short paper entitled “GuiDiff: a regression testing tool for graphical user interface” because it is very relevant to our study and it does not have an extended version released in the following years.

We refer to all collected papers that remain after the application of exclusion criteria as primary publications. These publications are the basis for extracting review data.
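Two of the criteria above lend themselves to automation. The Python sketch below encodes the page thresholds of criterion 2 and flags candidate duplicates for criterion 4. It is an illustration only: the `Paper` record, the function names and the 0.8 title-similarity threshold are assumptions, abstracts are left out for brevity, and flagged pairs would still need the manual check that the criterion describes.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from itertools import combinations
from typing import List, Tuple


@dataclass(frozen=True)
class Paper:
    title: str
    authors: Tuple[str, ...]
    year: int
    pages: int
    layout: str  # "double" (IEEE/ACM-like) or "single" (LNCS-like)


# Criterion 2: a paper at or below these page counts is a short paper.
SHORT_PAPER_LIMIT = {"double": 4, "single": 8}


def is_short_paper(paper: Paper) -> bool:
    return paper.pages <= SHORT_PAPER_LIMIT[paper.layout]


def likely_duplicates(papers: List[Paper],
                      min_title_similarity: float = 0.8):
    """Criterion 4: return (older, newer) pairs to check manually.

    A pair is flagged when the papers share an author and their
    lower-cased titles are similar enough (threshold is an assumption).
    """
    pairs = []
    for a, b in combinations(papers, 2):
        shares_author = bool(set(a.authors) & set(b.authors))
        similarity = SequenceMatcher(None, a.title.lower(),
                                     b.title.lower()).ratio()
        if shares_author and similarity >= min_title_similarity:
            older, newer = sorted((a, b), key=lambda p: p.year)
            pairs.append((older, newer))
    return pairs
```

Following criterion 4, the older record of each confirmed pair is the one to drop, so the extended journal version is retained.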

2.4 Review Protocol

Concretely, the review is conducted in two phases: 1) First, we perform an abstract review and quick full paper scan to filter out irrelevant papers based on the exclusion criteria defined above. At the end of this phase, the set of primary publications is known. 2) Subsequently, we perform a full review of each primary publication and extract relevant information that is necessary for answering all of our research questions.

In practice, we have split our primary publications among all the co-authors to conduct the data extraction step. We have further cross-checked all the extracted results: when some results are in disagreement, informal discussions are conducted until a consensus is reached.

3 PRIMARY PUBLICATIONS SELECTION

TABLE 2: Summary of the selection of primary publications.

Step Count

Repository and Major Venues Search 9259

After reviewing titles/abstracts (scripts) 472

After reviewing titles/abstracts 255

After skimming/scanning full paper 171

After ﬁnal discussion 103

Table 2 summarizes statistics of collected papers during the search phase. Overall, our repository search and major venue search have yielded in total 9,259 papers.

Following the criteria in Section 2, the number of papers satisfying the matching requirements (checked by scripts, cf. Table 2) immediately drops from 9,259 to 472. We then manually go through the title and abstract of each paper to further dismiss those that match the exclusion criteria. After this step, the set of papers is reduced to 255 publications. Subsequently, we go through the full content of papers in the set, leading to the exclusion of 84 more papers. Finally, after discussion among the authors for the rest of the set, we reach a consensus on considering 103 publications as relevant primary publications.
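The counts in Table 2 can be checked with simple arithmetic. The short Python sketch below (variable and function names are illustrative) derives how many papers each step removes and the overall share that survives.

```python
# Step names and counts copied from Table 2.
FUNNEL = [
    ("Repository and Major Venues Search", 9259),
    ("After reviewing titles/abstracts (scripts)", 472),
    ("After reviewing titles/abstracts", 255),
    ("After skimming/scanning full paper", 171),
    ("After final discussion", 103),
]


def dropped_per_step(funnel):
    """Pair each step after the first with the number of papers it removes."""
    return [(name, previous - count)
            for (_, previous), (name, count) in zip(funnel, funnel[1:])]


for name, dropped in dropped_per_step(FUNNEL):
    print(f"{name}: -{dropped}")

retention = FUNNEL[-1][1] / FUNNEL[0][1]
print(f"Overall retention: {retention:.1%}")
```

The steps remove 8,787, 217, 84 and 68 papers respectively, which agrees with the 84 exclusions reported for the full-content pass; about 1.1% of the harvested papers end up as primary publications.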

12. For example, [23] and [24] propose tools and algorithms for measuring the code coverage of testing methods.
