# Amazon product data description
## dataset
http://snap.stanford.edu/data/amazon/productGraph/
Image-based recommendations on styles and substitutes
J. McAuley, C. Targ
### sample revieiws
```
{
"reviewerID": "A2SUAM1J3GNN3B",
"asin": "0000013714",
"reviewerName": "J. McDonald",
"helpful": [2, 3],
"reviewText": "I bought this for my husband who plays the piano. He is having a wonderful time playing these old hymns. The music is at times hard to read because we think the book was published for singing from more than playing from. Great purchase though!",
"overall": 5.0,
"summary": "Heavenly Highway Hymns",
"unixReviewTime": 1252800000,
"reviewTime": "09 13, 2009"
}
```
where
- reviewerID - ID of the reviewer, e.g. A1RSDE90N6RSZF
- asin - ID of the product, e.g. 0000013714
- reviewerName - name of the reviewer
- helpful - helpfulness rating of the review, e.g. 2/3
- reviewText - text of the review
- overall - rating of the product
- summary - summary of the review
- unixReviewTime - time of the review (unix time)
- reviewTime - time of the review (raw)
### sample metadata
```angular2
{
"asin": "0000031852",
"title": "Girls Ballet Tutu Zebra Hot Pink",
"price": 3.17,
"imUrl": "http://ecx.images-amazon.com/images/I/51fAmVkTbyL._SY300_.jpg",
"related":
{
"also_bought": ["B00JHONN1S", "B002BZX8Z6", "B00D2K1M3O", "0000031909", "B00613WDTQ", "B00D0WDS9A", "B00D0GCI8S", "0000031895", "B003AVKOP2", "B003AVEU6G", "B003IEDM9Q", "B002R0FA24", "B00D23MC6W", "B00D2K0PA0", "B00538F5OK", "B00CEV86I6", "B002R0FABA", "B00D10CLVW", "B003AVNY6I", "B002GZGI4E", "B001T9NUFS", "B002R0F7FE", "B00E1YRI4C", "B008UBQZKU", "B00D103F8U", "B007R2RM8W"],
"also_viewed": ["B002BZX8Z6", "B00JHONN1S", "B008F0SU0Y", "B00D23MC6W", "B00AFDOPDA", "B00E1YRI4C", "B002GZGI4E", "B003AVKOP2", "B00D9C1WBM", "B00CEV8366", "B00CEUX0D8", "B0079ME3KU", "B00CEUWY8K", "B004FOEEHC", "0000031895", "B00BC4GY9Y", "B003XRKA7A", "B00K18LKX2", "B00EM7KAG6", "B00AMQ17JA", "B00D9C32NI", "B002C3Y6WG", "B00JLL4L5Y", "B003AVNY6I", "B008UBQZKU", "B00D0WDS9A", "B00613WDTQ", "B00538F5OK", "B005C4Y4F6", "B004LHZ1NY", "B00CPHX76U", "B00CEUWUZC", "B00IJVASUE", "B00GOR07RE", "B00J2GTM0W", "B00JHNSNSM", "B003IEDM9Q", "B00CYBU84G", "B008VV8NSQ", "B00CYBULSO", "B00I2UHSZA", "B005F50FXC", "B007LCQI3S", "B00DP68AVW", "B009RXWNSI", "B003AVEU6G", "B00HSOJB9M", "B00EHAGZNA", "B0046W9T8C", "B00E79VW6Q", "B00D10CLVW", "B00B0AVO54", "B00E95LC8Q", "B00GOR92SO", "B007ZN5Y56", "B00AL2569W", "B00B608000", "B008F0SMUC", "B00BFXLZ8M"],
"bought_together": ["B002BZX8Z6"]
},
"salesRank": {"Toys & Games": 211836},
"brand": "Coxlures",
"categories": [["Sports & Outdoors", "Other Sports", "Dance"]]
}
```
where
- asin - ID of the product, e.g. 0000031852
- title - name of the product
- price - price in US dollars (at time of crawl)
- imUrl - url of the product image
- related - related products (also bought, also viewed, bought together, buy after viewing)
- salesRank - sales rank information
- brand - brand name
- categories - list of categories the product belongs to
## data process
数据处理代码来自 https://github.com/zhougr1993/DeepInterestNetwork
进行了部分调整
- 转换成tfrecord方便和estimator进行对接
- 直接读/写不定长数组,不在原始数据做padding降低内存占用
- 训练集/测试集在tfrecord进行区分
```
bash 0_download_raw.sh
python 1_convert_pd.py
python 2_remap_id.py
python 3_build_dataset.py
python 4_dump_tfrecord.py
```
没有合适的资源?快使用搜索试试~ 我知道了~
资源推荐
资源详情
资源评论
收起资源包目录
基于python的CTR模型代码和学习笔记总结.rar (91个子文件)
基于python的CTR模型代码和学习笔记总结
main.py 3KB
paper
[NFM]Neural Factorization Machines for Sparse Predictive Analytics.pdf 3.49MB
[xDeepFM]xDeepFM- Combining Explicit and Implicit Feature Interactions for Recommender Systems.pdf 1.44MB
[Deep&Cross]Deep & Cross Network for Ad Click Predictions.pdf 232KB
[PNN] Product-based Neural Networks for User Response Prediction (SJTU 2016).pdf 470KB
[NCF] Neural Collaborative Filtering (NUS 2017).pdf 1.42MB
[FM Method]Factorization Machines.pdf 187KB
[AFM] Attentional Factorization Machines - Learning the Weight of Feature Interactions via Attention Networks (ZJU 2017).pdf 3.39MB
[AutoInt]- Automatic Feature Interaction Learning via Self-Attentive Neural Networks.pdf 1.52MB
[FiBiNET]- Combining Feature Importance and Bilinear feature Interaction for Click-Through Rate Prediction.pdf 939KB
[FFM]Field-aware Factorization Machines for CTR Prediction.pdf 361KB
[AFM]Attentional Factorization Machines- Learning the Weight of Feature Interactions via Attention Networks∗.pdf 987KB
[DeepFM]DeepFM- A Factorization-Machine based Neural Network for CTR Prediction.pdf 1.14MB
[FNN] Deep Learning over Multi-field Categorical Data (UCL 2016).pdf 566KB
[Wide&Deep] Wide & Deep Learning for Recommender Systems (Google 2016).pdf 480KB
[FM Model] Fast Context-aware Recommendations with Factorization Machines (UKON 2011).pdf 291KB
[Deep Crossing] Deep Crossing - Web-Scale Modeling without Manually Crafted Combinatorial Features (Microsoft 2016).pdf 434KB
[DIN]Deep Interest Network for Click-Through Rate Prediction.pdf 8.13MB
[GBDT+LR]Practical Lessons from Predicting Clicks on Ads at Facebook.pdf 774KB
[DIEN]Deep Interest Evolution Network for Click-Through Rate Prediction.pdf 2.07MB
utils.py 6KB
data
ali_ccp
readme.md 407B
frappe
frappe.test.libfm 1.98MB
readme.md 2KB
frappe.valid.libfm 3.97MB
frappe.train.libfm 13.89MB
census
readme.md 2KB
data_loader.py 922B
valid.csv 1.68MB
train.csv 3.36MB
amazon
readme.md 3KB
0_download_raw.sh 310B
3_build_dataset.py 1KB
4_dump_tfrecord.py 2KB
1_convert_pd.py 665B
2_remap_id.py 2KB
movie_len
ratings.csv 2.37MB
const
census_const.py 2KB
amazon_const.py 731B
__init__.py 731B
frappe_const.py 93B
model
xDeepFM
xDeepFM.py 5KB
__init__.py 0B
preprocess.py 786B
AFM
__init__.py 0B
preprocess.py 786B
AFM.py 5KB
FM
FM.py 2KB
FM_keras.py 4KB
__init__.py 0B
preprocess.py 580B
NFM
NFM.py 4KB
__init__.py 0B
preprocess.py 785B
FNN
FNN.py 2KB
__init__.py 0B
preprocess.py 580B
wide_and_deep
wide_and_deep.py 1KB
__init__.py 0B
preprocess.py 2KB
DIN
DIN.py 5KB
__init__.py 0B
preprocess.py 498B
DeepCrossing
DeepCrossing.py 2KB
__init__.py 0B
preprocess.py 659B
FiBiNET
__init__.py 0B
preprocess.py 786B
FiBiNET.py 8KB
PNN
PNN.py 3KB
__init__.py 0B
preprocess.py 660B
DCN
DCN.py 5KB
__init__.py 0B
preprocess.py 659B
DeepFM
DeepFM.py 6KB
__init__.py 0B
preprocess.py 786B
EMMLP
EMMLP.py 3KB
__init__.py 0B
preprocess.py 959B
DIEN
preprocess.py 0B
FFM
FFM.py 2KB
__init__.py 0B
preprocess.py 638B
__init__.py 0B
.gitignore 183B
layers.py 2KB
playground
feature_columnn
feature_column_play.ipynb 15KB
Embedding
Embedding.py 3KB
config.py 2KB
共 91 条
- 1
资源评论
爱吃苹果的Jemmy
- 粉丝: 75
- 资源: 1148
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
最新资源
- 4399GameSem_116_13955_207551_6.apk
- python 3.9.19源码编译包
- php-8.2.18-Win32-vs16-x64.rar
- 字节跳动青训营-抖音项目
- SQL资料手册,语句教程,高级查询语句语法
- 上位机和串口建立 Modbus 协议进行数据传输,并使用 Mysql 数据库存储,能够实现实时温湿度显示和动态变化曲线,历史数据
- Attachment 1_chazhi.xlsx
- 安卓项目,实现虚拟摇杆通过wifi串口发送nema-0183协议实现小吊舱方向控制
- 基于modbus协议的大屏数据监控,使用modbus slave模拟数据,串口服务器获取温湿度
- 下载资源.zip
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功