【免费】利用R语言编写的数据挖掘大作业资源-CSDN文库

共5个文件

md：1个

xlsx：1个

pdf：1个

需积分: 0 34 浏览量 2024-01-12 21:30:30 上传评论收藏 1.82MB ZIP 举报

资源推荐

资源详情

资源评论

收起资源包目录

利用R语言编写的数据挖掘大作业。着重分析朴素贝叶斯判别分析算法、 AdaBoost 算法以及随机森林算法在口红销量预测中的效果，并在随机森林算法中进行模型优化。Using R language data mining big homework. The effects o….zip （5个子文件）

48941918

《数据挖掘实验》课程设计--周伟.pdf 1023KB

main.R 20KB

口红-data.xlsx 272KB

《数据挖掘实验》课程设计PPT--周伟.pptx 1020KB

README.md 1KB

# data-mining-R 从网站爬取口红销售数据，分析影响销售数据的重要因素以及根据销售因素建模预测其销售量。本文先将数据进行预处理得到实验数据，然后着重分析朴素贝叶斯判别分析算法、 AdaBoost 算法以及随机森林算法在口红销量预测中的效果，并在随机森林算法中进行模型优化。通过实验结果表明总评价数、价格和描述分这三个因素对销售量的影响较大，对三个算法对比分析得出随机森林算法预测错误率最低，有较好的预测效果。 Crawling lipstick sales data from the website, analyzing the important factors affecting sales data and predicting sales volume according to sales factors modeling. In this paper, we first preprocess the data to get the experimental data, and then focus on the analysis of Naive Bayesian Discriminant Analysis (Naive Bayesian Discriminant Analysis), AdaBoost algorithm and random forest algorithm in lipstick sales forecasting effect, and in the random forest algorithm to optimize the model. The experimental results show that the total evaluation number, price and description of these three factors have a greater impact on sales. The comparison of the three algorithms shows that the random forest algorithm has the lowest prediction error rate, and has a better prediction effect. main.R文件是代码的源文件。

评论收藏

内容反馈