Summary
=======
This dataset (ml-latest-small) describes 5-star rating and free-text tagging activity from [MovieLens](http://movielens.org), a movie recommendation service. It contains 100836 ratings and 3683 tag applications across 9742 movies. These data were created by 610 users between March 29, 1996 and September 24, 2018. This dataset was generated on September 26, 2018.
Users were selected at random for inclusion. All selected users had rated at least 20 movies. No demographic information is included. Each user is represented by an id, and no other information is provided.
The data are contained in the files `links.csv`, `movies.csv`, `ratings.csv` and `tags.csv`. More details about the contents and use of all these files follows.
This is a *development* dataset. As such, it may change over time and is not an appropriate dataset for shared research results. See available *benchmark* datasets if that is your intent.
This and other GroupLens data sets are publicly available for download at <http://grouplens.org/datasets/>.
Usage License
=============
Neither the University of Minnesota nor any of the researchers involved can guarantee the correctness of the data, its suitability for any particular purpose, or the validity of results based on the use of the data set. The data set may be used for any research purposes under the following conditions:
* The user may not state or imply any endorsement from the University of Minnesota or the GroupLens Research Group.
* The user must acknowledge the use of the data set in publications resulting from the use of the data set (see below for citation information).
* The user may redistribute the data set, including transformations, so long as it is distributed under these same license conditions.
* The user may not use this information for any commercial or revenue-bearing purposes without first obtaining permission from a faculty member of the GroupLens Research Project at the University of Minnesota.
* The executable software scripts are provided "as is" without warranty of any kind, either expressed or implied, including, but not limited to, the implied warranties of merchantability and fitness for a particular purpose. The entire risk as to the quality and performance of them is with you. Should the program prove defective, you assume the cost of all necessary servicing, repair or correction.
In no event shall the University of Minnesota, its affiliates or employees be liable to you for any damages arising out of the use or inability to use these programs (including but not limited to loss of data or data being rendered inaccurate).
If you have any further questions or comments, please email <grouplens-info@umn.edu>
Citation
========
To acknowledge use of the dataset in publications, please cite the following paper:
> F. Maxwell Harper and Joseph A. Konstan. 2015. The MovieLens Datasets: History and Context. ACM Transactions on Interactive Intelligent Systems (TiiS) 5, 4: 19:1–19:19. <https://doi.org/10.1145/2827872>
Further Information About GroupLens
===================================
GroupLens is a research group in the Department of Computer Science and Engineering at the University of Minnesota. Since its inception in 1992, GroupLens's research projects have explored a variety of fields including:
* recommender systems
* online communities
* mobile and ubiquitious technologies
* digital libraries
* local geographic information systems
GroupLens Research operates a movie recommender based on collaborative filtering, MovieLens, which is the source of these data. We encourage you to visit <http://movielens.org> to try it out! If you have exciting ideas for experimental work to conduct on MovieLens, send us an email at <grouplens-info@cs.umn.edu> - we are always interested in working with external collaborators.
Content and Use of Files
========================
Formatting and Encoding
-----------------------
The dataset files are written as [comma-separated values](http://en.wikipedia.org/wiki/Comma-separated_values) files with a single header row. Columns that contain commas (`,`) are escaped using double-quotes (`"`). These files are encoded as UTF-8. If accented characters in movie titles or tag values (e.g. Misérables, Les (1995)) display incorrectly, make sure that any program reading the data, such as a text editor, terminal, or script, is configured for UTF-8.
User Ids
--------
MovieLens users were selected at random for inclusion. Their ids have been anonymized. User ids are consistent between `ratings.csv` and `tags.csv` (i.e., the same id refers to the same user across the two files).
Movie Ids
---------
Only movies with at least one rating or tag are included in the dataset. These movie ids are consistent with those used on the MovieLens web site (e.g., id `1` corresponds to the URL <https://movielens.org/movies/1>). Movie ids are consistent between `ratings.csv`, `tags.csv`, `movies.csv`, and `links.csv` (i.e., the same id refers to the same movie across these four data files).
Ratings Data File Structure (ratings.csv)
-----------------------------------------
All ratings are contained in the file `ratings.csv`. Each line of this file after the header row represents one rating of one movie by one user, and has the following format:
userId,movieId,rating,timestamp
The lines within this file are ordered first by userId, then, within user, by movieId.
Ratings are made on a 5-star scale, with half-star increments (0.5 stars - 5.0 stars).
Timestamps represent seconds since midnight Coordinated Universal Time (UTC) of January 1, 1970.
Tags Data File Structure (tags.csv)
-----------------------------------
All tags are contained in the file `tags.csv`. Each line of this file after the header row represents one tag applied to one movie by one user, and has the following format:
userId,movieId,tag,timestamp
The lines within this file are ordered first by userId, then, within user, by movieId.
Tags are user-generated metadata about movies. Each tag is typically a single word or short phrase. The meaning, value, and purpose of a particular tag is determined by each user.
Timestamps represent seconds since midnight Coordinated Universal Time (UTC) of January 1, 1970.
Movies Data File Structure (movies.csv)
---------------------------------------
Movie information is contained in the file `movies.csv`. Each line of this file after the header row represents one movie, and has the following format:
movieId,title,genres
Movie titles are entered manually or imported from <https://www.themoviedb.org/>, and include the year of release in parentheses. Errors and inconsistencies may exist in these titles.
Genres are a pipe-separated list, and are selected from the following:
* Action
* Adventure
* Animation
* Children's
* Comedy
* Crime
* Documentary
* Drama
* Fantasy
* Film-Noir
* Horror
* Musical
* Mystery
* Romance
* Sci-Fi
* Thriller
* War
* Western
* (no genres listed)
Links Data File Structure (links.csv)
---------------------------------------
Identifiers that can be used to link to other sources of movie data are contained in the file `links.csv`. Each line of this file after the header row represents one movie, and has the following format:
movieId,imdbId,tmdbId
movieId is an identifier for movies used by <https://movielens.org>. E.g., the movie Toy Story has the link <https://movielens.org/movies/1>.
imdbId is an identifier for movies used by <http://www.imdb.com>. E.g., the movie Toy Story has the link <http://www.imdb.com/title/tt0114709/>.
tmdbId is an identifier for movies used by <https://www.themoviedb.org>. E.g., the movie Toy Story has the link <https://www.themoviedb.org/movie/862>.
Use of the resources listed above is subject to the terms of each provider.
Cross-Validation
----------------
Prior versions of the MovieLens dataset included either pre-computed cross-folds or scripts to perform this computation.
没有合适的资源?快使用搜索试试~ 我知道了~
温馨提示
【资源说明】 1.项目代码均经过功能验证ok,确保稳定可靠运行。欢迎下载食用体验! 2.主要针对各个计算机相关专业,包括计算机科学、信息安全、数据科学与大数据技术、人工智能、通信、物联网等领域的在校学生、专业教师、企业员工。 3.项目具有丰富的拓展空间,不仅可作为入门进阶,也可直接作为毕设、课程设计、大作业、初期项目立项演示等用途。 4.当然也鼓励大家基于此进行二次开发。在使用过程中,如有问题或建议,请及时沟通。 5.期待你能在项目中找到乐趣和灵感,也欢迎你的分享和反馈! 【项目介绍】 (Hadoop和spark项目)基于Java+python开发的电影推荐系统源码+说明+数据.zip 1.ALS(推荐算法) item_base基于物品的推荐算法 RecommendRDD/DF基于ALS推荐模型的推荐算法 运行环境(hadoop+spark) 2.movie 电影推荐系统网站 3.tutorial(爬虫) 4.movie.sql 数据库脚本 5.movie.war 6.movielen 数据集 基于Hadoop和spark由java和python语言开发的电影推荐系统. ### 开发语言 1.python _2.java_ ### 环境 1.Hadoop2.7.0 2.spark2.x 3.jdk1.8 4.python3.7 5.tomcat8.0 6.mysql5.6 含sql数据库
资源推荐
资源详情
资源评论
收起资源包目录
(Hadoop和spark项目)基于Java+python开发的电影推荐系统源码+说明+数据库.zip (244个子文件)
scrapy.cfg 259B
MovieDao.class 7KB
Querymovieaction.class 3KB
Movie.class 3KB
Commentsscoreaction.class 3KB
Categoryaction.class 2KB
Loginaction.class 2KB
Moviedetailaction.class 2KB
Browsiedaction.class 2KB
Registeraction.class 2KB
Recommendaction.class 2KB
JdbcUtil.class 2KB
Moviesimlaraction.class 2KB
MovieService.class 2KB
UserDao.class 2KB
Movieaction.class 1KB
Loginoutaction.class 1KB
User.class 1KB
Similartab.class 947B
LoginService.class 800B
RegistServer.class 773B
Category.class 762B
Rectab.class 744B
.classpath 2KB
org.eclipse.wst.common.component 444B
org.eclipse.wst.jsdt.ui.superType.container 49B
style3.css 65KB
loaders.css 52KB
layui.css 36KB
global.css 18KB
layer.css 17KB
page.css 17KB
layer.css 16KB
component2.css 12KB
layui.mobile.css 9KB
laydate.css 9KB
star-main.css 8KB
selectSeat.css 7KB
buyTickets.css 6KB
layer.css 6KB
default2.css 6KB
styles.css 5KB
movieDetail.css 5KB
header.css 3KB
main.css 3KB
demo2.css 2KB
movieList.css 2KB
style2.css 2KB
normalize.css 2KB
normalize2.css 2KB
demo.css 1KB
default.css 1KB
code.css 1KB
footer.css 55B
ratings.csv 2.37MB
ratings.csv 2.37MB
movies.csv 483KB
links.csv 193KB
tags.csv 116KB
icomoon.eot 2KB
loading-0.gif 6KB
loading-0.gif 6KB
loading-2.gif 2KB
loading-2.gif 2KB
loading-1.gif 701B
loading-1.gif 701B
title.ico 17KB
ALS.iml 398B
druid-1.0.15.jar 1.86MB
lombok.jar 1.62MB
jackson-databind-2.9.7.jar 1.29MB
mysql-connector-java-5.1.40-bin.jar 968KB
jstl-1.2.jar 405KB
jackson-core-2.9.7.jar 316KB
commons-beanutils-1.8.3.jar 227KB
jackson-annotations-2.9.7.jar 65KB
commons-logging-1.1.1.jar 59KB
commons-dbutils-1.4.jar 48KB
MovieDao.java 6KB
Movie.java 2KB
Querymovieaction.java 2KB
Commentsscoreaction.java 2KB
MovieService.java 2KB
JdbcUtil.java 1KB
Browsiedaction.java 1KB
Loginaction.java 1KB
Registeraction.java 1KB
Categoryaction.java 1KB
Moviedetailaction.java 1KB
Recommendaction.java 1KB
Moviesimlaraction.java 1KB
UserDao.java 948B
Movieaction.java 946B
Loginoutaction.java 829B
User.java 702B
Similartab.java 526B
LoginService.java 417B
RegistServer.java 402B
Category.java 396B
Rectab.java 366B
共 244 条
- 1
- 2
- 3
资源评论
- 小鱼有亿点甜2024-04-25资源内容详实,描述详尽,解决了我的问题,受益匪浅,学到了。
.whl
- 粉丝: 3907
- 资源: 4858
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
最新资源
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功