
Received November 22, 2018, accepted December 19, 2018, date of publication December 28, 2018,

date of current version January 23, 2019.

Digital Object Identifier 10.1109/ACCESS.2018.2890156

Mashup-Oriented API Recommendation via Random Walk on Knowledge Graph

XIN WANG, HAO WU, AND CHING-HSIEN HSU, (Senior Member, IEEE)

School of Information Science and Engineering, Yunnan University, Kunming 650500, China

Department of Computer Science and Information Engineering, National Chung Cheng University, Chiayi 62102, Taiwan

Corresponding author: Hao Wu ([email protected])

This work was supported in part by the National Natural Science Foundation of China under Grant 61562090, Grant 61472345, and Grant 61562092, and in part by the China Postdoctoral Science Foundation under Grant 2016M592721.

ABSTRACT With the growing prosperity of the Web API economy, mashup-oriented API recommendation has become an important requirement, and methods built on a variety of technical principles have been proposed to address it. In recent years, the Web API ecosystem has accumulated a wealth of knowledge that can be used to enhance recommendation models; however, work that exploits this knowledge is still limited. To address this gap, we present a graph-based algorithmic framework for mashup-oriented API recommendation. Specifically, we design a concise knowledge graph schema to encode mashup-specific contexts and model the mashup requirement with graph entities. We then exploit random walks with restart to assess the potential relevance between the mashup requirement and the Web APIs according to the knowledge graph. In addition, we propose query-specific weighting strategies to enhance the knowledge graph construction. The experimental results demonstrate that the proposed method outperforms several state-of-the-art methods, robustly reduces computational overhead, and suppresses the negative Matthew effect in API recommendation.

INDEX TERMS Mashup development, API recommendation, random walks with restart, knowledge graph.

I. INTRODUCTION

Web APIs are application programming interfaces through which web applications can realize storage services, message services, computing services and other capabilities. The number of accessible Web APIs has grown consistently over the past years. ProgrammableWeb, the largest online API registry, has recently tracked more than 20,000 Web APIs. As Web APIs become the backbone of Web, cloud, mobile and machine learning applications, an API ecosystem has gradually formed and an API economy is emerging [1]. However, given the rapid development of the information society and the emergence of a large number of additional requirements, the functions of individual existing APIs are increasingly unable to meet complex business needs. In response, mashups have emerged as a technology that integrates multiple services to match users' requirements, even for users with little programming skill [2].

Unfortunately, with the rapid growth of the number of Web APIs, quickly selecting the right Web APIs from a large number of candidates covering a wide range of functionalities has become increasingly challenging for inexperienced developers. Therefore, it is necessary to develop recommendation techniques that help developers identify relevant Web APIs satisfying the needs of mashup development in less time [3], [4]. In recent years, numerous efforts have been made to address this issue. Existing works can be coarsely classified into two categories: one focuses on the principle of collaborative filtering [5]–[8], and the other focuses on estimating the relevance between the mashup requirements and the candidate APIs [9]–[13]. Various technologies, e.g., matrix factorization [7], [8], topic modeling [9], [10] and link analysis [11], and various features, e.g., texts, tags, topics and popularity, are exploited to enhance the accuracy of recommendations [13]–[17].

Nevertheless, most existing works use a single source of information and are therefore vulnerable to data sparsity. For example, methods based on text similarity or topic mining perform worse once the text description is poor or insufficient [7]. In contrast, a knowledge graph usually contains much richer facts and connections among APIs, mashups and other items. A knowledge graph is a type of directed heterogeneous graph in which nodes correspond to entities and edges correspond to relations. These semantics

VOLUME 7, 2019

2169-3536  2018 IEEE. Translations and content mining are permitted for academic research only.

Personal use is also permitted, but republication/redistribution requires IEEE permission.

See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.

7651

X. Wang et al.: Mashup-Oriented API Recommendation via Random Walk on Knowledge Graph

can help us to understand mashup patterns accurately. The Web API ecosystem has accumulated a wealth of knowledge that can be used to enhance recommendation models [18]; however, research attention in this regard is still limited. In particular, a unified and easy-to-use approach is absent. To take advantage of this knowledge, existing methods usually combine different technologies, resulting in relatively complex algorithms that are hard to understand and put into practice. Moreover, how to exploit key knowledge instead of all the knowledge to benefit API recommendation, and how to effectively express users' requirements, need to be further explored.

Inspired by these observations, we propose a simple yet effective recommendation framework based on random walks on the knowledge graph for mashup development. In this approach, we use a knowledge graph to capture the most relevant information related to Web APIs and mashups. Then, we use Random Walks with Restart (RWR) to estimate the relatedness between the mashup requirements and the candidate APIs. We also propose three query-specific weighting strategies to improve the relatedness estimation. In this way, we can effectively address the problem of data sparsity, providing an elegant way to utilize the abundant knowledge in the API ecosystem and achieving better recommendation performance.

The main contributions of this paper can be summarized as follows:

• We propose an effective knowledge graph schema that captures the most useful knowledge for mashup-oriented API recommendation, together with three query weighting strategies that enhance the recommendation effectiveness of RWR.

• Comprehensive experimental analyses show that the proposed method provides highly accurate recommendation results in comparison with state-of-the-art methods. In particular, RWR using our graph schema improves on RWR using the full graph schema by at least 5.9-9.5% on Recall@{5,10} and at least 4.5-5.7% on NDCG@{5,10}. With the query-specific weighting strategies, we further improve the recommendation effects of RWR by at least 11.9-17.7% on Recall@{5,10} and at least 12.9-13.3% on NDCG@{5,10}.

The rest of the paper is structured as follows. Section II provides background on the API-specific knowledge graph and the random walk method. Section III presents our enhanced model, which introduces the refined knowledge graph and the query weighting strategies. We evaluate our methods via a set of experiments in Section IV. We review the works most relevant to ours in Section V. Finally, we conclude our work and point to future work.

II. PRELIMINARY MODEL

A. KNOWLEDGE GRAPH FOR APIS RECOMMENDATION

In the past decades, the use of Web APIs has increased significantly. However, the lack of semantic descriptions of Web APIs limits their discovery, sharing, integration and consumption [18]. To cope with this problem, Dojchinovski and Vitvar [18] presented the Linked Web APIs dataset, which provides semantic descriptions of Web APIs that capture their provenance, temporal, technical, functional, and non-functional aspects. In particular, the Linked Web APIs ontology is designed to capture the most relevant information related to Web APIs and mashups. This provides us with a knowledge base (graph) to guide API recommendation.

FIGURE 1. A full schema of knowledge graph for mashup-oriented API recommendation.

The full schema of the knowledge graph is presented in Fig. 1. It includes the key entities Tag, Category, Mashup, WebAPI, Agent, Protocol and DataFormat, as well as the relationships among them and their associated attributes, such as isPrimaryTopicof, created, creator, rating, title, name, type, label, publisher, homepage, wasAttributedTo, supportedProtocol, supportedDataFormat, sslSupport, etc. In total, there are 3286 tag entities and 66 category entities. Here, we deliberately distinguish Tag entities from Category entities, considering that tags are usually user-generated in a free-form way while categories are assigned according to a standard vocabulary.

B. RANDOM WALKS WITH RESTART

PageRank is a node-ranking technique based on Markovian walks in a directed graph $G = (V, E)$, where $V$ ($|V| = n$) is the node set and $E$ is the edge set. The surfer jumps from one node to another along a link with a constant probability $\alpha$ (the damping factor), or gets bored and jumps to a random node with probability $1 - \alpha$. Assuming $P$ is the ranking vector for all nodes in $G$, the PageRank value $P_i$ is the probability that the surfer is at node $i$. A fast way to compute PageRank is the power iteration method,

$$P^{(t)} = \alpha M P^{(t-1)} + (1 - \alpha) e \qquad (1)$$

where $M$ is a row-stochastic ($n \times n$) matrix. Beginning with an arbitrary vector $P^{(0)}$, Eq. 1 is solved by applying the operator $P = \alpha M P + (1 - \alpha) e$ in succession until $|P^{(t+1)} - P^{(t)}| < \epsilon$ for a small threshold $\epsilon$. When the personalized vector $e$ is set to prefer a subset of $V$ [19], the PageRank model is usually called Random Walks with Restart (RWR) [20], [31].
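The power iteration of Eq. 1 can be sketched in a few lines of plain Python. This is a minimal illustration, not the authors' implementation: the function name `rwr`, the default values of `alpha`, `eps` and `max_iter`, and the toy three-node graph are all assumptions. Because $M$ is described as row-stochastic, the sketch also assumes that $P$ is a column vector and therefore propagates scores with the transpose of $M$.

```python
def rwr(M, e, alpha=0.85, eps=1e-10, max_iter=1000):
    """Iterate P <- alpha * M^T P + (1 - alpha) * e until the L1 change is below eps.

    M is a row-stochastic matrix given as a list of rows; e is the restart
    (personalized) vector and is expected to sum to 1.
    """
    n = len(M)
    p = list(e)  # any starting vector works; e is a convenient choice
    for _ in range(max_iter):
        nxt = [(1.0 - alpha) * e[j] for j in range(n)]
        for i in range(n):
            for j in range(n):
                # probability mass at node i flows to node j with probability M[i][j]
                nxt[j] += alpha * M[i][j] * p[i]
        diff = sum(abs(a - b) for a, b in zip(nxt, p))
        p = nxt
        if diff < eps:
            break
    return p


# Hypothetical toy graph: node 0 links to 1 and 2, node 1 links to 2, node 2 links to 0.
M = [
    [0.0, 0.5, 0.5],
    [0.0, 0.0, 1.0],
    [1.0, 0.0, 0.0],
]
e = [1.0, 0.0, 0.0]  # the walker always restarts at node 0
scores = rwr(M, e)
```

With a restart vector concentrated on node 0, the scores measure relatedness to that node rather than global importance, which is the property the recommendation setting relies on.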


RWR has been used as a measure of relatedness in various recommendation scenarios and has been shown to achieve superior performance and to alleviate data sparsity [20], [21]. It can be adapted to recommend Web APIs for mashup development as follows: a) given a knowledge graph, treat every edge, regardless of its type, as a bidirectional link with a uniform weight of 1; b) set $e$ to prefer the node representing a mashup; c) find the vector $P^{(t)}$ (where $t$ is the state after convergence) using Eq. 1; d) sort APIs by their rankings in $P^{(t)}$ and generate the top-N recommendations [22]. Specifically, given a source node $i$ and a target node $j$, the number of links from $i$ to $j$ is denoted $L(i, j)$; $L(i, k)$ is defined in the same way, where $k$ ranges over the nodes connected to node $i$. The matrix $M$ is initialized as follows:

$$M_{ij} = \begin{cases} \dfrac{L(i, j)}{\sum_{k} L(i, k)} & \text{if } L(i, j) > 0, \\ 0 & \text{otherwise} \end{cases} \qquad (2)$$
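Step a) and the initialization in Eq. 2 can be sketched as follows. The function name `transition_matrix` and the node names (`mashup:m1`, `api:maps`, and so on) are hypothetical and chosen only for illustration; the sketch simply counts links in both directions and normalizes each row.

```python
from collections import defaultdict


def transition_matrix(edges):
    """Return (nodes, M) with M[i][j] = L(i, j) / sum_k L(i, k), following Eq. 2.

    Every edge (u, v) is treated as a bidirectional link of weight 1 (step a).
    """
    link = defaultdict(lambda: defaultdict(float))  # link[u][v] holds L(u, v)
    for u, v in edges:
        link[u][v] += 1.0
        link[v][u] += 1.0
    nodes = sorted(link)
    index = {name: i for i, name in enumerate(nodes)}
    M = [[0.0] * len(nodes) for _ in nodes]
    for u, neighbours in link.items():
        total = sum(neighbours.values())  # sum over k of L(u, k)
        for v, count in neighbours.items():
            M[index[u]][index[v]] = count / total
    return nodes, M


# Hypothetical miniature knowledge graph.
edges = [
    ("mashup:m1", "api:maps"),
    ("mashup:m1", "api:photos"),
    ("mashup:m1", "tag:travel"),
    ("api:maps", "tag:travel"),
    ("api:photos", "tag:social"),
]
nodes, M = transition_matrix(edges)
```

Each row of `M` sums to 1, so the matrix is row-stochastic as required by Eq. 1; a node with three neighbours, such as `mashup:m1` here, passes one third of its probability mass to each of them.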

RWR is simple and effective; however, it has a couple of drawbacks that lead to unsatisfying results in mashup-oriented API recommendation. One problem concerns computational efficiency. As the computational complexity of RWR is $O(n^{2}t)$, the computational cost will be high when the number of entities in the knowledge graph is large. Also, not all the information in the knowledge graph contributes to the accuracy of recommendations. Thus, we need to refine the knowledge graph to reduce the computational cost. Another problem is the negative Matthew effect, which means that highly popular APIs will always achieve higher ranking values [23]. This negative effect reduces the diversity of recommendation lists and lowers the accuracy of recommendations [22]. For instance, the Google Maps API has been used more than 2000 times in mashup developments and frequently ranks high in the recommendation lists, even if it does not match any requirement.

III. ENHANCED MODEL

A. REFINING KNOWLEDGE GRAPH

To improve the RWR-based recommendation model, one feasible approach is to set a walking boundary that reduces unnecessary random surfing. We customize the query boundaries based on the mashup requirements: (I) only nodes of type Tag or Category, which carry rich semantic information and have a strong transitivity ability, are used to represent the mashup requirements, since other nodes are either unique or lack sufficient feature information; (II) only the nodes and edges within the boundary are used to create the knowledge graph, and the other nodes and edges are omitted. Figure 2 specifies the walking boundary. Formally, we use the following notations to model the requirements of mashups:

Definition 1: A mashup requirement is specified by $Q = Q_{cat} \cup Q_{tag}$, where $Q_{cat}$ is the set of entities of type Category, and $Q_{tag}$ is the set of entities of type Tag.

Definition 2: A query node $q$ represents a mashup to be established (i.e., the test node in Figure 2).

FIGURE 2. A refined schema of knowledge graph where the boundary of the dotted box is used to create a data graph. The node named test represents the node of target mashup.

API recommendation then corresponds to repeating a spreading process from the query node $q$ to the nodes in $Q$, and in turn to other nodes, until a stable global state is achieved. With this strategy, the number of nodes visited by RWR is greatly reduced, and thus the computational cost is lower. In addition, the negative Matthew effect is suppressed to some extent. According to our experiments, the number of visited nodes can be reduced by 98%, while the recommendation accuracy can be increased by around 10%. To distinguish it from the original model that uses the full knowledge graph, which we call RF, we call the approach that uses the refined knowledge graph RR.
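The refinement step can be illustrated with a small sketch that drops everything outside a walking boundary and then attaches the query node to the requirement entities. Figure 2 is not reproduced here, so the set of entity types inside the boundary (`BOUNDARY_TYPES`) is an assumption, as are the function name `refine` and all node names.

```python
# Assumed boundary: the entity types kept in the refined graph (not confirmed by the text).
BOUNDARY_TYPES = {"Mashup", "WebAPI", "Tag", "Category"}


def refine(edges, node_type, q_cat, q_tag, query_node="test"):
    """Keep edges whose endpoints both lie inside the boundary, then link the query node to Q."""
    kept = [
        (u, v)
        for u, v in edges
        if node_type[u] in BOUNDARY_TYPES and node_type[v] in BOUNDARY_TYPES
    ]
    requirement = set(q_cat) | set(q_tag)  # Q is the union of the category and tag sets
    kept.extend((query_node, entity) for entity in sorted(requirement))
    return kept


# Hypothetical miniature graph with two out-of-boundary nodes.
node_type = {
    "mashup:m1": "Mashup", "api:maps": "WebAPI", "api:photos": "WebAPI",
    "tag:travel": "Tag", "cat:mapping": "Category",
    "agent:alice": "Agent", "protocol:rest": "Protocol",
}
edges = [
    ("mashup:m1", "api:maps"),
    ("api:maps", "tag:travel"),
    ("api:maps", "cat:mapping"),
    ("api:photos", "tag:travel"),
    ("mashup:m1", "agent:alice"),    # dropped: Agent is outside the assumed boundary
    ("api:maps", "protocol:rest"),   # dropped: Protocol is outside the assumed boundary
]
refined = refine(edges, node_type, q_cat={"cat:mapping"}, q_tag={"tag:travel"})
```

The walk then starts from the `test` node and can only reach APIs through the tags and categories named in the requirement, which is why the refined graph both shrinks the state space and limits how much score flows to popular but unrelated APIs.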

It is worth noting that we use tags and categories to describe user needs for the following reasons: (1) Tags/categories can be regarded as precise summaries of API functions, so they carry richer information. (2) Tags/categories have been recognized as successful in organizing and sharing resources in information systems, especially in the era of the social Web, on websites such as Flickr, YouTube and Delicious. This is also true for service/API repositories, e.g., ProgrammableWeb and Seekda.

B. QUERY-SPECIFIC WEIGHTING

Up to now, we have used a uniform weight for all links when constructing the knowledge graph and have not considered the impact of different link weights on the recommendation effects. It is almost impossible to find the globally optimal configuration of all link weights. For this reason, we introduce some simple, easy-to-operate heuristic strategies that adjust the weights of specific links to reflect the influence of user preferences on mashup development.

Q1: Intuitively, the more information related to $Q$ an API contains, the more important it should be. Therefore, we strengthen the weights of the links between $Q$ and those APIs to reflect this principle.

Q2: In many cases, the sets $Q_{cat}$ and $Q_{tag}$ contain the same keywords. For example, a mashup requirement may include both social_tag and social_category. In our dataset, 95.5% of the keywords in $Q_{cat}$ also appear in $Q_{tag}$, so we should give more weight to the APIs that they jointly point to.
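To make the intent of Q1 and Q2 concrete, the sketch below assigns weights to the links between requirement entities and APIs. The text above states only the principles, not the formulas, so the specific forms used here are illustrative placeholders rather than the authors' method: Q1 is approximated by the number of requirement entities an API matches, and Q2 by a multiplier `q2_boost` applied when an API is linked to both the tag and the category form of the same keyword. The function names, the `_tag` / `_category` naming convention for entities, and the API names are likewise assumptions.

```python
def keyword(entity):
    """Strip the illustrative '_tag' / '_category' suffix to recover the shared keyword."""
    return entity.rsplit("_", 1)[0]


def weight_links(api_links, q_cat, q_tag, q2_boost=2.0):
    """Return {(entity, api): weight} for links between requirement entities and APIs.

    api_links maps each API to the set of Tag/Category entities it is linked to.
    """
    q_cat, q_tag = set(q_cat), set(q_tag)
    requirement = q_cat | q_tag
    weights = {}
    for api, entities in api_links.items():
        hits = entities & requirement
        # Keywords this API matches through both a category and a tag in Q.
        both = {keyword(x) for x in hits & q_cat} & {keyword(x) for x in hits & q_tag}
        for entity in hits:
            w = float(len(hits))         # Q1 (assumed form): more matches, stronger links
            if keyword(entity) in both:  # Q2 (assumed form): tag and category agree
                w *= q2_boost
            weights[(entity, api)] = w
    return weights


# Hypothetical requirement and API annotations.
q_cat = {"social_category"}
q_tag = {"social_tag", "photo_tag"}
api_links = {
    "api:flickr": {"social_tag", "social_category", "photo_tag"},
    "api:maps": {"photo_tag", "mapping_tag"},
}
weights = weight_links(api_links, q_cat, q_tag)
```

In this toy input, `api:flickr` matches three requirement entities and is reached through both `social_tag` and `social_category`, so its links receive larger weights than the single matching link of `api:maps`; links to entities outside the requirement are left at their default weight and do not appear in the returned mapping.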
