CIKM ’20, October 19–23, 2020, Virtual Event, Ireland Su Yan, Xin Chen, Ran Huo, Xu Zhang, and Leyu Lin
efficiency. It is noteworthy that the user profile is highly connected
to the strategies and models used in the recall and ranking layers; it is,
therefore, necessary to build an accurate user profile for online
news recommendation systems.
The process of building a user-tag profile is shown in the blue
dashed rectangle in Figure 1. First, we collect data and process it
in the feature generation and label organization steps (the process
is detailed in the Experiment Setup section). Then, for each user, all
"clicked tags" (tags within news articles that the user clicked) in the
user's reading history are collected in the candidate selection step,
and the user's preference for these tags is calculated in the model
training step. One user can have thousands of clicked tags. Moreover,
when predicting a user's preferences on unseen tags, the candidate
set can contain a large number of tags.
While building the user-tag profile, it is essential to select features
and learn feature interactions efficiently, since the model input
consists of numerous sparse features from multiple fields, as shown
in the left part of Figure 1. In news recommendation systems, clicked
articles are taken as positive samples, while articles browsed but not
clicked are taken as negative samples. However, directly applying
this scheme to tagging treats every tag within a clicked news
article as a positive sample, which can be problematic, since a
user who clicks an article may be interested in only one of its tags.
Based on these considerations, we aim to answer the
following two questions:
• RQ1: How to automatically select useful features and learn
the interactions between features within and among different
fields?
• RQ2: How to learn a user's preference over the different tags in
each clicked news article?
In recent years, deep neural networks have been widely used
in recommendation systems, where user and news article features
are utilized and their interactions are learned. One widely used
deep recommendation model is the YouTube model proposed by
Google [4]. However, we have identified two weaknesses of this
model that we aim to improve. First, the YouTube model uses
an average pooling layer to merge multiple input feature embeddings,
failing to consider that useful and useless features should be
assigned different weights. Furthermore, the YouTube model uses
concatenation to merge features across different fields and feeds the
merged output into the upper layer through an MLP (multilayer
perceptron). In experiments, we observed that the weights of some fields
are underestimated, especially when these fields are not highly
related to the labels, which hinders feature fusion across fields.
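To make the first weakness concrete, the following minimal numpy sketch contrasts uniform average pooling (the YouTube-style merge) with a single-query attention pool that scores each feature embedding before merging. The query vector and dimensions here are illustrative assumptions, not the paper's actual parameters.

```python
import numpy as np

def average_pool(embeddings):
    """YouTube-style merge: every feature gets the same weight,
    regardless of how useful it is."""
    return embeddings.mean(axis=0)

def attention_pool(embeddings, query):
    """Weighted merge: a learned query vector scores each feature
    embedding, and a softmax turns the scores into weights."""
    scores = embeddings @ query                 # one usefulness score per feature
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()                    # softmax over features
    return weights @ embeddings                 # weighted sum, shape (d,)

rng = np.random.default_rng(0)
emb = rng.normal(size=(5, 8))   # 5 sparse-feature embeddings of dim 8
q = rng.normal(size=8)          # hypothetical learned query vector
avg = average_pool(emb)
att = attention_pool(emb, q)
```

Both pools return a vector of the embedding dimension; the difference is only in how the per-feature weights are assigned.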
To avoid these pitfalls of the YouTube model, and inspired by the
attention mechanism [20], which captures useful word and sentence
embeddings for document classification, we design an attention fusion
layer within and across feature fields. In the original attention
mechanism, only one query vector determines feature
usefulness. We believe that multi-head attention helps preserve more
useful features from multiple aspects; our model therefore uses
two query vectors, each shared across the attention units of one
head.
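A minimal sketch of this idea, under the assumption that each of the two query vectors drives one attention head and is shared across all fields (function name and shapes are illustrative):

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def shared_query_multihead_pool(embeddings, queries):
    """Pool one field's feature embeddings with several shared query
    vectors: each query acts as one attention head producing its own
    weighted sum, and the heads are concatenated."""
    heads = []
    for q in queries:                   # one head per shared query vector
        w = softmax(embeddings @ q)     # normalized usefulness weights
        heads.append(w @ embeddings)    # head output, shape (d,)
    return np.concatenate(heads)        # shape (num_heads * d,)

rng = np.random.default_rng(1)
field_emb = rng.normal(size=(6, 8))     # 6 sparse features in one field
queries = rng.normal(size=(2, 8))       # two shared query vectors (two heads)
fused = shared_query_multihead_pool(field_emb, queries)
```

With two heads over 8-dimensional embeddings, the fused field representation is a 16-dimensional vector, so each head can emphasize a different aspect of the same field.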
Furthermore, we propose a cross feature layer to enhance model
performance. FM-based feature interaction methods such as AFM [18]
and NFM [7] are widely used, where the Hadamard products of pairwise
hidden vectors are summed into a single vector of the same size as
the hidden vectors. In the user profiling task, these
methods may lose user information. It is therefore
advisable to output all inner-product values of the pairwise hidden
vectors. This helps to learn a user's multiple interests and leads to
higher performance in multi-label classification.
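The contrast can be sketched as follows: an NFM-style bi-interaction collapses all pairwise Hadamard products into one d-dimensional vector, whereas keeping every pairwise inner product yields one scalar per feature pair. This is an illustrative sketch of the two output shapes, not the paper's exact layer.

```python
import numpy as np

def nfm_style_cross(vectors):
    """NFM-style bi-interaction: sum the Hadamard products of all
    pairs of hidden vectors into one vector of the same size d."""
    n, d = vectors.shape
    out = np.zeros(d)
    for i in range(n):
        for j in range(i + 1, n):
            out += vectors[i] * vectors[j]   # elementwise product
    return out                               # shape (d,)

def all_inner_products(vectors):
    """Keep each pairwise inner product as a separate scalar,
    yielding an n*(n-1)/2-dimensional dense crossed vector."""
    n, _ = vectors.shape
    return np.array([vectors[i] @ vectors[j]
                     for i in range(n) for j in range(i + 1, n)])

rng = np.random.default_rng(2)
hidden = rng.normal(size=(4, 8))       # 4 hidden vectors of dim 8
summed = nfm_style_cross(hidden)       # shape (8,)
crossed = all_inner_products(hidden)   # shape (6,): 4 choose 2 pairs
```

Note that summing the crossed scalars recovers the sum of the NFM-style output, which shows the summed form discards exactly the per-pair detail that the all-pairs form keeps.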
The contributions of this paper are as follows: 1) We propose a
user-tag profiling model (UTPM), which makes use of multiple
fields of user information and is also suitable for other user profiling
tasks. 2) In this model, we introduce a multi-head attention mechanism
with shared query vectors to capture the important attributes
within each field and to merge multiple fields by assigning each field
a reasonable weight. 3) We propose a specially designed FM-based
cross feature layer to promote user profiling, where all crossed values,
along with the linear values, are fed as a dense vector to the next layer
to generate the final user embedding. 4) Specifically for the
user-tag profiling task, where each news article contains several
tags, we design a joint loss to learn each user-tag preference, which is
shown to achieve better performance than separate
training.
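As a rough illustration of the joint-loss idea, one plausible sketch scores every tag of a single impression against the user embedding and sums the per-tag binary cross-entropies, so all user-tag preferences in that article are learned together rather than one tag at a time. The function name, labels, and shapes below are hypothetical, not taken from the paper.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def joint_tag_loss(user_emb, tag_embs, labels):
    """Hypothetical joint loss sketch: one preference logit per tag
    (user-tag inner product), with a summed binary cross-entropy so
    all tags of the impression are trained jointly."""
    logits = tag_embs @ user_emb      # one score per tag in the article
    probs = sigmoid(logits)
    eps = 1e-9                        # numerical safety for the logs
    return -np.sum(labels * np.log(probs + eps)
                   + (1 - labels) * np.log(1 - probs + eps))

rng = np.random.default_rng(3)
u = rng.normal(size=8)               # final user embedding
tags = rng.normal(size=(5, 8))       # embeddings of 5 tags in one article
y = np.array([1., 0., 1., 0., 0.])   # assumed per-tag preference labels
loss = joint_tag_loss(u, tags, y)
```

Because the loss sums over tags of the same impression, gradients for all of that article's tags flow through the shared user embedding in one step, in contrast to training a separate binary task per tag.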
2 RELATED WORK
Tag recommendation system.
In recent years, tag recommendation techniques have received increasing
attention. An adaptation of user-based collaborative filtering and a
graph-based recommender have been proposed for tag recommendation
systems [8]. Meanwhile, Vig introduces a tag-based explainable
recommendation system [16], studying two key components, tag relevance
and user-tag preference, both of which
improve effectiveness. Researchers at Sina Weibo designed an integrated
recommendation algorithm to collectively explore the social
relationships among users and the co-occurrence and semantic
relationships among tags [19]. So far, the methods applied to
the tag recommendation problem have mainly been collaborative
algorithms based simply on the co-occurrence between users
and tags. In reality, however, these methods cannot fully utilize
multi-field user information, which could help discover users' interests. It
is necessary to apply state-of-the-art deep models to enhance the
performance of user profiling tasks.
Attention Mechanism.
The attention mechanism originates from Neural Machine
Translation (NMT) [1], where words are assigned
different weights in different contexts. Attention has been
used successfully not only in a variety of NLP tasks, including
reading comprehension, abstractive summarization, and textual
entailment [10], but also in recommendation systems [3, 21]. One
type of self-attention [7] learns normalized weights over words and
sentences for a given classification task, where only one query vector
is learned. Liu [11] utilizes this self-attention mechanism to fuse
features among fields and achieves better performance than
concatenation with an MLP for the online news recommendation task. Another
kind of self-attention [15] studies the inner correlation between
words, where each word is assigned its own query vector. Song [14]
utilizes this self-attention mechanism to design a stack of cross
networks called AutoInt, which can learn high-order feature
interactions among different fields.