2.7 Windows 单机模式的 Spark 安装和 Python 开发环境配置.....................................................118
2.7.1 准备工作 ..........................................................................................................................118
2.7.2 下载 Anaconda .................................................................................................................118
2.7.3 安装 Anaconda .................................................................................................................119
2.7.4 配置 Anaconda 环境变量 ................................................................................................119
2.7.5 测试 Anaconda .................................................................................................................119
2.7.6 下载 JDK 1.8 .....................................................................................................................120
2.7.7 配置 JDK 环境变量...........................................................................................................120
2.7.8 测试 JDK 1.8 .....................................................................................................................121
2.7.9 安装 Scala.........................................................................................................................122
2.7.10 下载 Spark 3.1.0 ...............................................................................................................128
2.7.11 安装 Spark 3.1.0 ...............................................................................................................128
2.7.12 配置 Spark 环境变量 .......................................................................................................128
2.7.13 配置日志显示级别 ..........................................................................................................129
2.7.14 下载 Hadoop 支持模块....................................................................................................129
2.7.15 安装 Hadoop 支持模块....................................................................................................130
2.7.16 配置 Hadoop 支持模块的环境变量................................................................................130
2.7.17 测试 Spark ........................................................................................................................130
2.7.18 测试 pyspark.....................................................................................................................131
2.7.19 运行示例代码 ..................................................................................................................131
2.8 Spark 的开发环境搭建(intelliJ IDEA) ....................................................................................132
2.8.1 建立新项目 ..........................................................................................................................133
2.8.2 编写代码 ..............................................................................................................................135
2.8.3 生成程序包 ..........................................................................................................................138
2.8.4 Spark 单机版的决策树测试 ................................................................................................139
2.9 Spark 集群的安装与设置 ..........................................................................................................141
2.9.1 Ubuntu 12.04 下 Hadoop 2.2.0 集群搭建 ...........................................................................146
2.9.2 Ubuntu 14.04 下安装 Hadoop2.4.0(单机模式) .............................................................154
2.10 启动 Spark 集群 .......................................................................................................................166
2.11 Spark 运行 WordCount.............................................................................................................176
评论0
最新资源