##Elo Merchant Category Recommendation
该比赛是巴西的一个信用卡公司,
##源数据
###train (201918条)
- card_id 信用卡唯一标识符
- first_active_month 购买的记录时间,精确到月,格式为:'YYYY-MM'
- feature_1 脱敏特征1
- feature_2 脱敏特征2
- feature_3 脱敏特征3
- target 在2个月之后的数据评分。Loyalty numerical score calculated 2 months after historical and evaluation period
...
+------------------+---------------+---------+---------+---------+-----------+
|first_active_month|card_id |feature_1|feature_2|feature_3|target |
+------------------+---------------+---------+---------+---------+-----------+
|2017-06 |C_ID_92a2005557|5 |2 |1 |-0.8202826 |
|2017-01 |C_ID_3d0044924f|4 |1 |0 |0.39291325 |
|2016-08 |C_ID_d639edf6cd|2 |2 |0 |0.68805599 |
+------------------+---------------+---------+---------+---------+-----------+
###test (123624条)
与train结构一致,没有target项
+------------------+---------------+---------+---------+---------+
|first_active_month|card_id |feature_1|feature_2|feature_3|
+------------------+---------------+---------+---------+---------+
|2017-04 |C_ID_0ab67a22ab|3 |3 |1 |
|2017-01 |C_ID_130fd0cbdd|2 |3 |0 |
|2017-08 |C_ID_b709037bc5|5 |1 |1 |
+------------------+---------------+---------+---------+---------+
###historical_transaction(29112361,2千900万)
信用卡过去三个月的消费记录
- card_id
- month_lag 距离参考日期的月份间隔
- purchase_date 购买时间
- authorized_flag 置信标签,Y是经过确认的,N是没有的,new中都是Y
- installments 分期付款期数
- merchant_category_id 客户商品id (anonymized )
- subsector_id 商家商品种类id (anonymized )
- merchant_id 商家id(脱敏)
- purchase_amount 正则化后的购买数量
- city_id 城市脱敏id
- state_id 州脱敏id
- category_1 脱敏商品1
- category_2 脱敏商品2
- category_3 脱敏商品3
...
+---------------+---------------+-------+----------+------------+----------+--------------------+---------------+---------+---------------+-------------------+----------+--------+------------+
|authorized_flag|card_id |city_id|category_1|installments|category_3|merchant_category_id|merchant_id |month_lag|purchase_amount|purchase_date |category_2|state_id|subsector_id|
+---------------+---------------+-------+----------+------------+----------+--------------------+---------------+---------+---------------+-------------------+----------+--------+------------+
|Y |C_ID_4e6213e9bc|88 |N |0 |A |80 |M_ID_e020e9b302|-8 |-0.70333091 |2017-06-25 15:33:07|1.0 |16 |37 |
|Y |C_ID_4e6213e9bc|88 |N |0 |A |367 |M_ID_86ec983688|-7 |-0.73312848 |2017-07-15 12:10:45|1.0 |16 |16 |
|Y |C_ID_4e6213e9bc|88 |N |0 |A |80 |M_ID_979ed661fc|-6 |-0.720386 |2017-08-09 22:04:29|1.0 |16 |37 |
|Y |C_ID_4e6213e9bc|88 |N |0 |A |560 |M_ID_e6d5ae8ea6|-5 |-0.73535241 |2017-09-02 10:06:26|1.0 |16 |34 |
+---------------+---------------+-------+----------+------------+----------+--------------------+---------------+---------+---------------+-------------------+----------+--------+------------+
###new_merchant_period(196303,196万行)
信用卡当前月的消费记录
结构与historical一致
+---------------+---------------+-------+----------+------------+----------+--------------------+---------------+---------+---------------+-------------------+----------+--------+------------+
|authorized_flag|card_id |city_id|category_1|installments|category_3|merchant_category_id|merchant_id |month_lag|purchase_amount|purchase_date |category_2|state_id|subsector_id|
+---------------+---------------+-------+----------+------------+----------+--------------------+---------------+---------+---------------+-------------------+----------+--------+------------+
|Y |C_ID_4e6213e9bc|88 |N |0 |A |80 |M_ID_e020e9b302|-8 |-0.70333091 |2017-06-25 15:33:07|1.0 |16 |37 |
|Y |C_ID_4e6213e9bc|88 |N |0 |A |367 |M_ID_86ec983688|-7 |-0.73312848 |2017-07-15 12:10:45|1.0 |16 |16 |
|Y |C_ID_4e6213e9bc|88 |N |0 |A |80 |M_ID_979ed661fc|-6 |-0.720386 |2017-08-09 22:04:29|1.0 |16 |37 |
|Y |C_ID_4e6213e9bc|88 |N |0 |A |560 |M_ID_e6d5ae8ea6|-5 |-0.73535241 |2017-09-02 10:06:26|1.0 |16 |34 |
|Y |C_ID_4e6213e9bc|88 |N |0 |A |80 |M_ID_e020e9b302|-11 |-0.72286538 |2017-03-10 01:14:19|1.0 |16 |37 |
|Y |C_ID_4e6213e9bc|333 |N |0 |A |80 |M_ID_50af771f8d|0 |-0.73488659 |2018-02-24 08:45:05|1.0 |9 |37 |
|Y |C_ID_4e6213e9bc|88 |N |0 |A |278 |M_ID_5e8220e564|-11 |-0.71685477 |2017-03-21 00:10:51|1.0 |16 |37 |
|Y |C_ID_4e6213e9bc|3 |N |0 |A |80 |M_ID_9d41786a50|-3 |-0.65704925 |2017-11-18 20:05:55|1.0 |16 |37 |
|Y |C_ID_4e6213e9bc|88 |N |0 |A |80 |M_ID_979ed661fc|-8 |-0.73796702 |2017-06-01 22:02:56|1.0 |16 |37 |
|Y |C_ID_4e6213e9bc|88 |N |0 |A |80 |M_ID_74ba14b5fc|-11 |-0.71535212 |2017-03-16 15:41:22|1.0 |16 |37 |
|Y |C_ID_4e6213e9bc|88 |N |0 |A |683 |M_ID_1449f22bfb|-9 |-0.73413526 |2017-05-09 12:42:07|1.0 |16 |34 |
|Y |C_ID_4e6213e9bc|-1 |N |0 |A |560 |M_ID_7c5e93af2f|0 |-0.72792931 |2018-02-08 20:05:45|null |-1 |34 |
|Y |C_ID_4e6213e9bc|88 |N |0 |A |80 |M_ID_e020e9b302|-8 |-0.74164852 |2017-06-08 18:02:29|1.0 |16 |37 |
|Y |C_ID_4e6213e9bc|88 |N |0 |A |130 |M_ID_e8fb39882d|-7 |-0.72737333 |2017-07-14 12:59:38|1.0 |16 |41 |
|Y |C_ID_4e6213e9bc|88 |N |0 |A |80 |M_ID_97e86eae5f|-4 |-0.73188128 |2017-10-24 13:29:29|1.0 |16 |37 |
|Y |C_ID_4e6213e9bc|88 |N |0 |A |560 |M_ID_e6d5ae8ea6|-8 |-0.73819242 |2017-06-14 07:40:48|1.0 |16 |34 |
|Y |C_ID_4e6213e9bc|69 |N |0 |A |879 |M_ID_00a6ca8a8a|0 |-0.72587068 |2018-02-07 12:19:33|1.0 |9 |29 |
|Y |C_ID_4e6213e9bc|88 |N |0 |A |278 |M_ID_21e1552dab|-7 |-0.73488659 |201
没有合适的资源?快使用搜索试试~ 我知道了~
基于SparkML2.0进行的Kaggle、JData等比赛.zip
共125个文件
scala:79个
py:17个
java:11个
1.该资源内容由用户上传,如若侵权请联系客服进行举报
2.虚拟产品一经售出概不退款(资源遇到问题,请及时私信上传者)
2.虚拟产品一经售出概不退款(资源遇到问题,请及时私信上传者)
版权申诉
0 下载量 61 浏览量
2023-09-30
15:16:53
上传
评论
收藏 320KB ZIP 举报
温馨提示
比赛项目源码
资源推荐
资源详情
资源评论
收起资源包目录
基于SparkML2.0进行的Kaggle、JData等比赛.zip (125个子文件)
TitanicPassengersTrainData.csv 59KB
TitanicPassengersTestData.csv 59KB
housing.data 48KB
winutils.exe 40KB
.gitignore 19B
SparkML.iml 31KB
测试泄露文件之间的关联性.ipynb 15KB
SmartFile.java 13KB
SmartTimer.java 6KB
SmartFileExtend.java 2KB
SmartProperties.java 1KB
Loader.java 1KB
Loader.java 883B
Consts.java 559B
Tools.java 422B
Start.java 385B
GCTest.java 299B
SmartException.java 151B
Readme.md 18KB
README.md 881B
log4j.properties 620B
hadoop.properties 29B
jdbc.properties 15B
SantanderGP.py 33KB
fea_exact.py 16KB
LagSelectFakeRows.py 10KB
explore_source.py 7KB
util.py 7KB
LightGBMWithBayesanOptimization.py 6KB
Feature_scoring_vs_zeros.py 6KB
Features_24.py 5KB
lgb.py 3KB
model.py 3KB
np.py 1KB
data_anlyse.py 679B
PrimeCity.py 514B
test1.py 466B
stantderTest.py 337B
offline_spark.py 256B
test2.py 138B
main.scala 60KB
FeatureExact.scala 20KB
TimeFuture.scala 16KB
TrainModel.scala 14KB
RecordFeatures.scala 13KB
DataCollect.scala 11KB
OrderAndActionCluster.scala 10KB
CustomerFeatures.scala 8KB
Explore.scala 7KB
OpTitanicSimple.scala 7KB
Record.scala 6KB
OpTitanic.scala 5KB
Models.scala 5KB
Constant.scala 5KB
OpBoston.scala 4KB
Run.scala 4KB
GASimpleWithTransmogriAIMain.scala 4KB
ARIMATrain.scala 4KB
Customer.scala 4KB
OpElo.scala 4KB
FeatureUtils.scala 4KB
OpBoston2.scala 4KB
GASimpleWithTransmogriAIloadModel.scala 4KB
DataExplore.scala 3KB
ArrayTest.scala 3KB
TimeSeriesTest.scala 3KB
UserDefineEvaluator.scala 3KB
JointTest.scala 3KB
Run.scala 3KB
SparkUtil.scala 2KB
SimpleReaderTest.scala 2KB
ScoreEvaluator.scala 2KB
OpTitanicMini.scala 2KB
csvTest.scala 2KB
ModelUtils.scala 2KB
Explore2.scala 2KB
XGBoostModel.scala 2KB
OnehotTest.scala 2KB
TransmogrifAITest.scala 2KB
SetSuite.scala 2KB
Run.scala 1KB
BostonFeatures.scala 1KB
TitanicFeatures.scala 1KB
test2.scala 1KB
ShowFeatures.scala 1KB
yyy.scala 1KB
GetMeansOfResult.scala 1KB
WordCount.scala 1KB
dateTimeTest.scala 1KB
DataUtils.scala 1KB
EloConstants.scala 996B
CaseGroupKeyTest.scala 970B
PviotTest.scala 939B
ARIMATest.scala 939B
TrainTest.scala 925B
WorkCount.scala 903B
OptionTest.scala 857B
createTrainMini.scala 809B
JDBCSink.scala 775B
Constants.scala 739B
共 125 条
- 1
- 2
资源评论
学术菜鸟小晨
- 粉丝: 1w+
- 资源: 4931
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
最新资源
- 基于图像的三维模型重建C++源代码+文档说明(高分课程设计)
- 基于聚焦法的工件立体测量方案,根据数据进行三维重建 使用HALCON处理图像,MATLAB拟合数据+源代码+数据集+效果图
- 锄战三国村 修改:货币使用不减 v1.10(2) 原创 (中文).apk
- 基于python实现的单目双目视觉三维重建+源代码+图像图片(高分课程设计)
- 基于C+++OPENCV的全景图像拼接源码(课程设计)
- 基于Python+OpenCV对多张图片进行全景图像拼接,消除鬼影,消除裂缝+源代码+文档说明+界面截图(高分课程设计)
- 基于C++实现的全景图像拼接源码(课程设计)
- 基于SIFT特征点提取和RASIC算法实现全景图像拼接python源码+文档说明+界面截图+详细注释(95分以上课程大作业)
- 基于matlab实现眼部判别的疲劳检测系统+源代码+全部数据+文档说明+详细注释+使用说明+截图(高分课程设计)
- 基于Matlab的异常姿势识别系统+源代码+全部数据+文档说明+详细注释+使用说明+截图(高分课程设计)
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功