Summary
=======
This dataset (ml-25m) describes 5-star rating and free-text tagging activity from [MovieLens](http://movielens.org), a movie recommendation service. It contains 25000095 ratings and 1093360 tag applications across 62423 movies. These data were created by 162541 users between January 09, 1995 and November 21, 2019. This dataset was generated on November 21, 2019.
Users were selected at random for inclusion. All selected users had rated at least 20 movies. No demographic information is included. Each user is represented by an id, and no other information is provided.
The data are contained in the files `genome-scores.csv`, `genome-tags.csv`, `links.csv`, `movies.csv`, `ratings.csv` and `tags.csv`. More details about the contents and use of all these files follows.
This and other GroupLens data sets are publicly available for download at <http://grouplens.org/datasets/>.
Usage License
=============
Neither the University of Minnesota nor any of the researchers involved can guarantee the correctness of the data, its suitability for any particular purpose, or the validity of results based on the use of the data set. The data set may be used for any research purposes under the following conditions:
* The user may not state or imply any endorsement from the University of Minnesota or the GroupLens Research Group.
* The user must acknowledge the use of the data set in publications resulting from the use of the data set (see below for citation information).
* The user may not redistribute the data without separate permission.
* The user may not use this information for any commercial or revenue-bearing purposes without first obtaining permission from a faculty member of the GroupLens Research Project at the University of Minnesota.
* The executable software scripts are provided "as is" without warranty of any kind, either expressed or implied, including, but not limited to, the implied warranties of merchantability and fitness for a particular purpose. The entire risk as to the quality and performance of them is with you. Should the program prove defective, you assume the cost of all necessary servicing, repair or correction.
In no event shall the University of Minnesota, its affiliates or employees be liable to you for any damages arising out of the use or inability to use these programs (including but not limited to loss of data or data being rendered inaccurate).
If you have any further questions or comments, please email <grouplens-info@umn.edu>
Citation
========
To acknowledge use of the dataset in publications, please cite the following paper:
> F. Maxwell Harper and Joseph A. Konstan. 2015. The MovieLens Datasets: History and Context. ACM Transactions on Interactive Intelligent Systems (TiiS) 5, 4: 19:1–19:19. <https://doi.org/10.1145/2827872>
Further Information About GroupLens
===================================
GroupLens is a research group in the Department of Computer Science and Engineering at the University of Minnesota. Since its inception in 1992, GroupLens's research projects have explored a variety of fields including:
* recommender systems
* online communities
* mobile and ubiquitious technologies
* digital libraries
* local geographic information systems
GroupLens Research operates a movie recommender based on collaborative filtering, MovieLens, which is the source of these data. We encourage you to visit <http://movielens.org> to try it out! If you have exciting ideas for experimental work to conduct on MovieLens, send us an email at <grouplens-info@cs.umn.edu> - we are always interested in working with external collaborators.
Content and Use of Files
========================
Verifying the Dataset Contents
------------------------------
We encourage you to verify that the dataset you have on your computer is identical to the ones hosted at [grouplens.org](http://grouplens.org). This is an important step if you downloaded the dataset from a location other than [grouplens.org](http://grouplens.org), or if you wish to publish research results based on analysis of the MovieLens dataset.
We provide a [MD5 checksum](http://en.wikipedia.org/wiki/Md5sum) with the same name as the downloadable `.zip` file, but with a `.md5` file extension. To verify the dataset:
# on linux
md5sum ml-25m.zip; cat ml-25m.zip.md5
# on OSX
md5 ml-25m.zip; cat ml-25m.zip.md5
# windows users can download a tool from Microsoft (or elsewhere) that verifies MD5 checksums
Check that the two lines of output contain the same hash value.
Formatting and Encoding
-----------------------
The dataset files are written as [comma-separated values](http://en.wikipedia.org/wiki/Comma-separated_values) files with a single header row. Columns that contain commas (`,`) are escaped using double-quotes (`"`). These files are encoded as UTF-8. If accented characters in movie titles or tag values (e.g. Misérables, Les (1995)) display incorrectly, make sure that any program reading the data, such as a text editor, terminal, or script, is configured for UTF-8.
User Ids
--------
MovieLens users were selected at random for inclusion. Their ids have been anonymized. User ids are consistent between `ratings.csv` and `tags.csv` (i.e., the same id refers to the same user across the two files).
Movie Ids
---------
Only movies with at least one rating or tag are included in the dataset. These movie ids are consistent with those used on the MovieLens web site (e.g., id `1` corresponds to the URL <https://movielens.org/movies/1>). Movie ids are consistent between `ratings.csv`, `tags.csv`, `movies.csv`, and `links.csv` (i.e., the same id refers to the same movie across these four data files).
Ratings Data File Structure (ratings.csv)
-----------------------------------------
All ratings are contained in the file `ratings.csv`. Each line of this file after the header row represents one rating of one movie by one user, and has the following format:
userId,movieId,rating,timestamp
The lines within this file are ordered first by userId, then, within user, by movieId.
Ratings are made on a 5-star scale, with half-star increments (0.5 stars - 5.0 stars).
Timestamps represent seconds since midnight Coordinated Universal Time (UTC) of January 1, 1970.
Tags Data File Structure (tags.csv)
-----------------------------------
All tags are contained in the file `tags.csv`. Each line of this file after the header row represents one tag applied to one movie by one user, and has the following format:
userId,movieId,tag,timestamp
The lines within this file are ordered first by userId, then, within user, by movieId.
Tags are user-generated metadata about movies. Each tag is typically a single word or short phrase. The meaning, value, and purpose of a particular tag is determined by each user.
Timestamps represent seconds since midnight Coordinated Universal Time (UTC) of January 1, 1970.
Movies Data File Structure (movies.csv)
---------------------------------------
Movie information is contained in the file `movies.csv`. Each line of this file after the header row represents one movie, and has the following format:
movieId,title,genres
Movie titles are entered manually or imported from <https://www.themoviedb.org/>, and include the year of release in parentheses. Errors and inconsistencies may exist in these titles.
Genres are a pipe-separated list, and are selected from the following:
* Action
* Adventure
* Animation
* Children's
* Comedy
* Crime
* Documentary
* Drama
* Fantasy
* Film-Noir
* Horror
* Musical
* Mystery
* Romance
* Sci-Fi
* Thriller
* War
* Western
* (no genres listed)
Links Data File Structure (links.csv)
---------------------------------------
Identifiers that can be used to link to other sources of movie data are contained in the file `links.csv`. Each line of this file after the header row represents one movie, and has the following format:
movieId,imdbId,tmdbId
movieId is an identi
没有合适的资源?快使用搜索试试~ 我知道了~
毕业设计-基于矩阵分解的推荐算法研究源码.zip
共75个文件
ipynb:34个
csv:17个
pkl:12个
1.该资源内容由用户上传,如若侵权请联系客服进行举报
2.虚拟产品一经售出概不退款(资源遇到问题,请及时私信上传者)
2.虚拟产品一经售出概不退款(资源遇到问题,请及时私信上传者)
版权申诉
0 下载量 51 浏览量
2022-11-18
17:01:46
上传
评论 2
收藏 200.28MB ZIP 举报
温馨提示
毕业设计-基于矩阵分解的推荐算法研究源码.zip毕业设计-基于矩阵分解的推荐算法研究源码.zip毕业设计-基于矩阵分解的推荐算法研究源码.zip毕业设计-基于矩阵分解的推荐算法研究源码.zip毕业设计-基于矩阵分解的推荐算法研究源码.zip毕业设计-基于矩阵分解的推荐算法研究源码.zip毕业设计-基于矩阵分解的推荐算法研究源码.zip毕业设计-基于矩阵分解的推荐算法研究源码.zip毕业设计-基于矩阵分解的推荐算法研究源码.zip毕业设计-基于矩阵分解的推荐算法研究源码.zip毕业设计-基于矩阵分解的推荐算法研究源码.zip毕业设计-基于矩阵分解的推荐算法研究源码.zip毕业设计-基于矩阵分解的推荐算法研究源码.zip毕业设计-基于矩阵分解的推荐算法研究源码.zip毕业设计-基于矩阵分解的推荐算法研究源码.zip毕业设计-基于矩阵分解的推荐算法研究源码.zip毕业设计-基于矩阵分解的推荐算法研究源码.zip毕业设计-基于矩阵分解的推荐算法研究源码.zip毕业设计-基于矩阵分解的推荐算法研究源码.zip毕业设计-基于矩阵分解的推荐算法研究源码.zip毕业设计-基于矩阵分解的推荐算法研究源码
资源推荐
资源详情
资源评论
收起资源包目录
毕业设计——基于矩阵分解的推荐算法研究.zip (75个子文件)
UndergraduateGraduationProject-master
BX-book
2-Data_Processing.ipynb 6KB
3-LFM_Model.ipynb 20KB
1-Expore_Data.ipynb 10KB
4-BiasSVD_Model.ipynb 15KB
.ipynb_checkpoints
3-LFM_Model-checkpoint.ipynb 20KB
2-Data_Processing-checkpoint.ipynb 6KB
4-BiasSVD_Model-checkpoint.ipynb 15KB
1-Expore_Data-checkpoint.ipynb 10KB
MovieLens
4-Bias_SVD_Model.ipynb 25KB
2-Data_Processing.ipynb 6KB
1-Explore_Datal.ipynb 13KB
3-LFM_Model.ipynb 23KB
.ipynb_checkpoints
3-LFM_Model-checkpoint.ipynb 23KB
2-Data_Processing-checkpoint.ipynb 6KB
4-Bias_SVD_Model-checkpoint.ipynb 31KB
1-Explore_Datal-checkpoint.ipynb 16KB
AmazonProduct
4-Bias_SVD_Model.ipynb 17KB
2-Data_Processing.ipynb 6KB
3-LFM_Model.ipynb 12KB
.ipynb_checkpoints
3-LFM_Model-checkpoint.ipynb 12KB
2-Data_Processing-checkpoint.ipynb 6KB
4-Bias_SVD_Model-checkpoint.ipynb 17KB
1-Explore_Data-checkpoint.ipynb 15KB
1-Explore_Data.ipynb 15KB
dataset
ml-25m
item_users.pkl 36KB
README.txt 10KB
ratings.csv 55.22MB
user_index.pkl 64KB
BiasSVD_modelParameter.json 888KB
tags.csv 35.97MB
movies.csv 2.84MB
user_item_score.mtx 60KB
genome-tags.csv 17KB
links.csv 1.25MB
user_items.pkl 50KB
pred_R.mtx 57.49MB
.ipynb_checkpoints
README-checkpoint.txt 10KB
train-checkpoint.csv 146KB
test-checkpoint.csv 37KB
item_index.pkl 44KB
BX-CSV-Dump
item_users.pkl 49KB
BX-Book-Ratings.csv 28.16MB
user_index.pkl 49KB
BiasSVD_modelParameter.json 921KB
train.csv 25.19MB
user_item_score.mtx 22KB
BX-Books.csv 73.93MB
BX-Users.csv 11.45MB
user_items.pkl 40KB
pred_R.mtx 63.6MB
.ipynb_checkpoints
train-checkpoint.csv 25.19MB
user_item_score-checkpoint.mtx 10.99MB
item_index.pkl 42KB
test.csv 6.3MB
amazon-ratings
item_users.pkl 16KB
user_index.pkl 40KB
BiasSVD_modelParameter.json 735KB
train.csv 57KB
user_item_score.mtx 47KB
user_items.pkl 39KB
ratings_Beauty.csv 78.61MB
pred_R.mtx 18.4MB
.ipynb_checkpoints
ratings_Beauty-checkpoint.csv 78.61MB
item_index.pkl 11KB
test.csv 14KB
Model_Fusion.ipynb 817B
.ipynb_checkpoints
3-LFM_Model-checkpoint.ipynb 21KB
2-Data_Processing-checkpoint.ipynb 6KB
Explore_Pyplot-checkpoint.ipynb 2KB
Explore_Data_Modeling-checkpoint.ipynb 59KB
4-Bias_SVD_Model-checkpoint.ipynb 19KB
Model_Fusion-checkpoint.ipynb 817B
1-Explore_Data_Model-checkpoint.ipynb 14KB
Explore_Data_Modeling.ipynb 53KB
Explore_Pyplot.ipynb 3KB
共 75 条
- 1
资源评论
程序员张小妍
- 粉丝: 1w+
- 资源: 2599
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
最新资源
- tensorflow-gpu-2.7.4-cp37-cp37m-manylinux2010-x86-64.whl
- 多段线、 圆、弧转多段线(仅我可见)
- tensorflow-2.7.2-cp38-cp38-manylinux2010-x86-64.whl
- yeyue-p8Yi4-ve4a83792.apk
- tensorflow-gpu-2.7.3-cp38-cp38-manylinux2010-x86-64.whl
- 五相感应电机矢量控制模型MATLAB
- RGLED (1) (1).circ
- IMG_20240427_215747.jpg
- python下前端WEB学习笔记
- 田间种植行排号自动生成工具
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功