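Spark Training Repository

This repository contains many different examples, exercises, and tutorials for the Spark and Hadoop trainings conducted by dimajix. The latest version is always available on GitHub at https://github.com/dimajix/spark-training

Contents

The repository contains different types of documents:
- Source code for Spark / Scala
- Jupyter notebooks for PySpark
- Zeppelin notebooks for Spark / Scala
- Hive SQL scripts
- Pig scripts
- ...and more

External dependencies

Some notebooks require test data provided by dimajix on S3 at s3://dimajix-training/data/.

Building executables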
spark-training: a repository for Spark training (312 files)
common.conf 1KB
wordcount.conf 96B
global-sales.csv 585B
persons_header.csv 137B
persons_header.csv 135B
persons_headerless.csv 117B
persons_headerless.csv 117B
Dockerfile 173B
Dockerfile 148B
.gitignore 13B
.gitignore 10B
.gitignore 10B
.gitignore 10B
.gitignore 10B
.gitignore 10B
.gitignore 10B
.gitignore 10B
.gitignore 10B
NYC Taxi Trips - Part 3 - Analyze - Full.ipynb 1.28MB
NYC Taxi Trips - Part 4 - Integrate - Full.ipynb 1.16MB
Weather Analysis - Full.ipynb 730KB
PySpark Bike Sharing Pipeline Solution.ipynb 629KB
PySpark Bike Sharing Regression Full.ipynb 484KB
NYC Taxi Trips - Part 2 - Refine - Full.ipynb 317KB
05 - UDFs Full.ipynb 254KB
COVID-19 - Full.ipynb 238KB
Grouped Regression - Full.ipynb 225KB
NYC Taxi Trips - Part 5 - ML - Full.ipynb 201KB
PySpark DataFrame Full.ipynb 156KB
NYC Taxi Trips - Part 1 - Preparation - Full.ipynb 98KB
10 - Working with JSONs - Full.ipynb 96KB
House Prices Solution.ipynb 90KB
Sales Analysis - Full.ipynb 90KB
Weather Analysis Solution.ipynb 85KB
NYC Taxi Trips - Part 5 - ML - Skeleton.ipynb 82KB
01 - Execution Plan - Full.ipynb 79KB
Execution Plan - Full.ipynb 79KB
Twitter Latest Tweet - Full.ipynb 75KB
08 - Repartitioning - Full.ipynb 69KB
Pivoting - Full.ipynb 66KB
House Price Pipeline Solution.ipynb 63KB
Amazon Baby Products Classification Solution.ipynb 54KB
From Pandas to Spark - Full.ipynb 53KB
02 - Caching Data - Full.ipynb 52KB
Caching Data - Full.ipynb 52KB
PySpark DataFrame Skeleton.ipynb 51KB
Weather Analysis Solution.ipynb 50KB
Amazon Baby Products Pipeline Solution.ipynb 50KB
Iterative Algorithms - Full.ipynb 45KB
Pandas UDFs - Full.ipynb 44KB
09 - Bucketing - Full.ipynb 44KB
06 - Broadcast Variables - Full.ipynb 44KB
03 - Checkpointing - Full.ipynb 43KB
Checkpointing - Full.ipynb 43KB
Python-Introduction.ipynb 37KB
COVID-19 - Skeleton.ipynb 34KB
NYC Taxi Trips - Part 3 - Analyze - Skeleton.ipynb 31KB
PySpark Introduction.ipynb 27KB
04 - Broadcast Joins - Full.ipynb 27KB
NYC Taxi Trips - Part 1 - Preparation - Skeleton.ipynb 24KB
08 - Repartitioning - Skeleton.ipynb 24KB
Audioscrobbler - Full.ipynb 23KB
Sales Analysis - Skeleton.ipynb 22KB
Weather Analysis Solution.ipynb 21KB
From Pandas to Spark - Skeleton.ipynb 20KB
05 - UDFs Skeleton.ipynb 18KB
07 - Accumulators - Full.ipynb 18KB
01 - Execution Plan - Skeleton.ipynb 17KB
Execution Plan - Skeleton.ipynb 17KB
Weather Analysis - Skeleton.ipynb 16KB
House Price Pipeline Exercise.ipynb 16KB
NYC Taxi Trips - Part 4 - Integrate - Skeleton.ipynb 15KB
Amazon Baby Products Classification Exercise.ipynb 15KB
Amazon Baby Products Pipeline Exercise.ipynb 15KB
PySpark Bike Sharing Pipeline Exercise.ipynb 14KB
09 - Bucketing - Skeleton.ipynb 13KB
10 - Working with JSONs - Skeleton.ipynb 12KB
Parallel Query Execution.ipynb 12KB
NYC Taxi Trips - Part 2 - Refine - Skeleton.ipynb 12KB
PySpark Bike Sharing Regression Skeleton.ipynb 12KB
PySpark Weather Analysis with Broadcast Variables Solution.ipynb 12KB
06 - Broadcast Variables - Skeleton.ipynb 12KB
House Prices Exercise.ipynb 12KB
Pandas UDFs - Skeleton.ipynb 11KB
Weather Analysis Exercise.ipynb 11KB
02 - Caching Data - Skeleton.ipynb 11KB
Caching Data - Skeleton.ipynb 11KB
Simple UDFs - Full.ipynb 11KB
Grouped Regression - Skeleton.ipynb 11KB
03 - Checkpointing - Skeleton.ipynb 10KB
Checkpointing - Skeleton.ipynb 10KB
07 - Accumulators - Skeleton.ipynb 10KB
04 - Broadcast Joins - Skeleton.ipynb 10KB
PySpark Weather Analysis with Broadcast Variables Exercise.ipynb 10KB
Pivoting - Skeleton.ipynb 9KB
Audioscrobbler - Skeleton.ipynb 9KB
Iterative Algorithms - Skeleton.ipynb 8KB
Python Essentials.ipynb 7KB
Twitter Latest Tweet - Skeleton.ipynb 7KB
Weather Data Hive Preparation.ipynb 7KB