没有合适的资源?快使用搜索试试~ 我知道了~
温馨提示
内容概要:本文探讨了电子商务产品评价系统的重要性和应用场景,提出了一种基于评分和评论数据的产品评估模型(RRBS Model)和声誉模型(Reputation Model)。通过量化客户对产品的反应并随时间变化进行趋势分析,模型能有效预测产品的成功与否。此外,还通过实验验证了高评分序列能激励更多积极评论,但低评分对负面评论的影响不明显。 适合人群:从事数据分析、市场调研、电子商务产品管理和市场营销的专业人士。 使用场景及目标:帮助企业理解和优化其在线产品的市场表现,提高竞争力和销售业绩。 其他说明:本文提供了详细的模型构建方法和实际案例,有助于读者深入了解和应用相关技术。
资源推荐
资源详情
资源评论
关注数学模型
获取更多资讯
2
MATHmodels
Problem Chosen
C
2020
MCM/ICM
Summary Sheet
Team Control Number
2004647
Good Morning Sunshine:
A Rating and Review Based Analysis
Summary
The Review and Rating System has been playing a huge role during the evolu-
tion of E-commerce. Customers use it to help make the best purchase decisions,
while vendors can utilize it to collect opinions on how to make improvements.
Therefore, in a vendor’s perspective, it is of essential to learn to most efficiently
extract the most valuable information out of mountains of customers’ voices.
Firstly, based on the attributes we find useful in the provided data sets, we pro-
pose the Rating and Review Based Score (RRBS) Model, which defines a product’s
score—a data measure based on information refined from ratings and reviews—
that finely quantifies and describes customers’ responses to a product. Based on
the RRBS Model, we further introduce the Reputation Model, which gives a time-
based quantification of a product’s reputation over time periods. Our model eval-
uation results show that a quantified reputation has a strong correlation with the
corresponding product’s sales in terms of trend. This fact intuitively proves our
models’ correctness.
Then, in order to predict a product’s potential success or failure in the future,
we propose the the Successfulness Prediction Model, utilizing the knowledge ba-
sis established from our previously proposed models. Based on the premise that a
product’s success is closely connected to its sales, we are able to address the suc-
cessfulness prediction task by predicting a product’s future reputation with calcu-
lated reputation series, followed by comparison between the predicted values and
a pre-set threshold. As a result, the successfulness of products can be effectively
predicted.
Next, we evaluate star ratings’ incitement to custmoers’ specific reviews by record-
ing customers’ reviews right after a series of same ratings (either high or low), and
calculate the chance of them being either positive, negative or neutral. It turns out
a series of high ratings do incite more positive reviews; at the other end, we cannot
confidently determine a series of low ratings’ incitement to negative reviews. Sub-
sequently, we discover the association between rating levels and specific quality
descriptors from review text by matching review contents with graded-words. The
results reveal strong connection between postive words and high ratings, and simi-
larly, negative words and low ratings.
Finally, to assist with Sunshine Company’s upcoming business, we propose our
devised marketing strategies and some important design features for each product.
Keywords: the RRBS Model; the Reputation Model; Correlation Analysis;
关注数学模型
获取更多资讯
2
MATHmodels
Team # 2004647 Page 1 of 24
Contents
1 Introduction 2
1.1 Problem Restatement . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.2 Literature Review . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.3 Data Cleaning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.4 Modeling Framework . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
2 Assumptions & Nomenclature 3
3 the RRBS Model 4
3.1 Model Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
3.2 the Rating Vector . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
3.3 the Review Vector . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
3.3.1 The Review-based Measure Equation . . . . . . . . . . . . . . . . . . . . . 8
3.3.2 the Sentiment Polarity Matrix . . . . . . . . . . . . . . . . . . . . . . . . . . 9
4 the Reputation Model 10
4.1 Time Weight Sequence . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
4.2 Quantification of Reputation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
4.3 Model Evaluation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
4.3.1 Trend Similarity between Quantified Reputation and Sales . . . . . . . . . 11
4.3.2 Kendall’s Tau Method . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
5 the Successfulness Prediction Model 13
5.1 Gaussian Process Regression . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
5.2 Reputation Mapping . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
5.3 Successfulness Threshold . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
6 Results 16
6.1 Evaluation of Star Rating’s Incitement . . . . . . . . . . . . . . . . . . . . . . . . . 16
6.2 Association between Review Wording and Rating Levels . . . . . . . . . . . . . . 18
6.3 Our Strategies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
6.3.1 Marketing Strategies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
6.3.2 Important Design Features . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
7 Sensitivity Analysis 20
8 Strengths and Weaknesses 20
8.1 Strengths . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
8.2 Weaknesses . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
9 Conclusion 21
10 Our Letter 22
Appendices 25
关注数学模型
获取更多资讯
2
MATHmodels
Team # 2004647 Page 2 of 24
1 Introduction
"Amazing things will happen when you listen to the consumer."
—Jonathan Midenhall, CMO of Airbnb
E-commerce has sprouted for the last few decades all over the world, and now it has be-
come one of the most prosperous and promising modern industries. During this process, the
Review and Rating System has been playing a huge role all along. Not only does it help cus-
tomers find out what is best for them when making purchase decisions, but also it helps sellers
realize what problems their products have and thus things can be improved. Therefore, it is
significant for companies to know how to make the most use of it and apply the knowledge to
improving product designs and marketing strategies.
1.1 Problem Restatement
Sunshine Company is about to launch three new products in the marketplace, including a mi-
crowave, a baby pacifier and a hair dryer. Data of customer-supplied reviews and ratings for
microwave ovens, baby pacifiers and hair dryers sold on Amazon.com are supplied, each of
which has a time label on. Analysis of customers’ ratings and reviews of other companies’s
competing products is required in order to figure out the optimal online marketing strategies
and possible design features they can employ to make their products more desirable to cus-
tomers.
To achieve our goals, specifically, we need to:
• Identify the most informative data measures based on reviews and ratings for Sunshine
Company to track, once they launch their three new products in the online marketplace.
• Discover time-based measures and patterns that can suggest a product’s reputation’s
trend of increase or decrease.
• Combine text-based measures and rating-based measures to discover measures that best
indicate a product’s potential success or faliure.
• Find out whether specific star ratings incite more reviews.
• Find out whether rating levels and specific quality descriptors of text-based reviews are
strongly associated.
1.2 Literature Review
Early work have proposed several methods for utilizing reviews and ratings of online prod-
ucts. In 2009, Koren et al. [4] proposed Latent Factor Models (LFM) to address the rating
prediction task. In 2013, McAuley et al. [5] took advantage of the abundunt information in
reviews using Latent Dirichlet Allocation (LDA) to achieve topic distributions. Later in 2016,
Tan et al. [8] proposed the Rating-Boosted Latent Topics (RBLT) Framework, combining tex-
tual reviews with users’ sentiment orientations, and thus recommendations are made more
accurate.
However, some of the valuable information in reviews is not fully utilized in the introduced
works, such as varying text lengths of different reviews and the sentiment intensity of specific
关注数学模型
获取更多资讯
2
MATHmodels
Team # 2004647 Page 3 of 24
words (e.g. "fine" is less positively intense compared with "good"). Moreover, other impor-
tant attributes in modern E-commerce recommendation systems, such as helpfulness ratings,
reviewers’ credibility and whether or not reviewers are verified buyers, are rarely taken into
account. The quantification of reviews and ratings’ combinations are not yet proposed, and
data measures based on reivews and ratings are not provided either.
1.3 Data Cleaning
Upon observation on the provided data, we notice there are some misplaced data in each data
set, where some review items belong to products of wrong categories. For example, in the
file hair_dryer.tsv the review item with the product_parent value #466871293 (which uniquely
identifies a product) belongs to a hair spray product, instead of a hair dryer. Similarly, in
the file microwave.tsv the review item with the product_parent value #311592014 belongs to a
fridge, instead of a microwave. We find these mistakes almost exclusively happen to those
products with single digit review records. Along with the perception that sparse data do not
contribute to our modeling well, we omit all product items with single digit review records.
1.4 Modeling Framework
Our modeling framework can be illustrated as shown in Figure 1.
Figure 1: Modeling Framework
2 Assumptions & Nomenclature
To simplify our modeling, we make the following assumptions:
关注数学模型
获取更多资讯
2
MATHmodels
Team # 2004647 Page 4 of 24
Asm. 1 Provided data do not include spam reviews, either from competitors or mischievous
customers.
Asm. 2 Contents of every review are legitimate and unbiased.
Asm. 3 The number of review records of a product equals to the sales of the same product
aggregated since it was launched, i.e., there are no missed or excluded records in any of
the provided data sets.
Asm. 4 Customers’ reviews and ratings were posted the same day they purchased the prod-
ucts, i.e., the time consumption of procedures such as packaging, delivery and users’
confirmation of quality is neglected.
Asm. 5 Each customer has at most one review and rating record on one type of product (either
a microwave, a pacifier or a hair dryer).
We have a list of the main notations we define during our modeling, as shown in Table 1.
3 the RRBS Model
In this section, we propose the RRBS Model (viz., the Rating and Review Based Score Model) to
serve as a data measure, combining star ratings and textual information from reviews. When
determining the quantification process, we take into account nearly all influencing attributes
from the provided data. As a result, a highly informative data measure is presented for Sun-
shine Company to track, once their three new products are launched.
Table 1: Notations we use in future discussion.
Symbol Description
i Product type (either Microwaves, Pacifiers or Hair dryers), i = 1, 2, 3
k Unique model identifier of a type of products
t Timestamp identifier (unit: Month)
n Number of rating and review records of a model within a specific month
score
(t)
i,k
Measure for customer responses a product model receives
α Rating vector—an n-dimensional row vector
β Review vector—an n-dimensional column vector
λ Rating weight parameter
A Sentiment polarity matrix—an n-by-n diagonal matrix
Φ Review-quantifying vector—an n-dimensional column vector
θ
j
Vine factor
s
j
Sentiment intensity factor
v
j
Validity factor
h
j
Helpfulness-rating factor
L
j
Review-length factor
ϵ
j
Correction term
score Time series of a model’s scores
Rep
(t)
i,k
Quantified reputation
剩余28页未读,继续阅读
资源评论
pk_xz123456
- 粉丝: 2257
- 资源: 2143
下载权益
C知道特权
VIP文章
课程特权
开通VIP
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
最新资源
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功