没有合适的资源？快使用搜索试试~ 我知道了~

文库首页操作系统OS藏经阁-Dockerizing Spark Workloads.pdf

藏经阁-Dockerizing Spark Workloads.pdf

阿里云

需积分: 5 0 下载量 71 浏览量 2023-08-26 11:35:12 上传评论收藏 4.36MB PDF 举报

温馨提示

试读

34页

藏经阁-Dockerizing Spark Workloads.pdf

资源推荐

资源详情

资源评论

Lessons Learned From

Dockerizing Spark Workloads

Thomas Phelan Nanda Vijaydev

Chief Architect, BlueData Data Scientist, BlueData

@tapbluedata @nandavijaydev

February 8, 2017

Outline

• Docker Containers and Big Data

• Spark on Docker: Challenges

• How We Did It: Lessons Learned

• Key Takeaways

• Q & A

Distributed Spark Environments

• Data scientists want flexibility:

– New tools, latest versions of Spark, Kafka, H2O, et.al.

– Multiple options – e.g. Zeppelin, RStudio, JupyterHub

– Fast, iterative prototyping

• IT wants control:

– Multi-tenancy

– Data security

– Network isolation

Why “Dockerize”?

Infrastructure

•

Agility and elasticity

•

Standardized environments

(dev, test, prod)

•

Portability (on-premises and

public cloud)

•

Efficient (higher resource

utilization)

Applications

•

Fool-proof packaging (configs,

libraries, driver versions, etc.)

•

Repeatable builds and

orchestration

•

Faster app dev cycles

•

Lightweight (virtually no

performance or startup penalty)

The Journey to Spark on Docker

Start with a clear

goal in sight

Begin with your Docker toolbox

of a single container and basic

networking and storage

So you want to run Spark on Docker in a

multi-tenant enterprise deployment?

Warning: there are some pitfalls & challenges

剩余33页未读，继续阅读

评论收藏

内容反馈

资源评论

资源反馈

评论星级较低，若资源使用遇到问题可联系上传者，3个工作日内问题未解决可申请退款~

weixin_40191861_zj

粉丝: 62
资源: 1万+

上传资源快速赚钱

我的内容管理展开

我的资源快来上传第一个资源

我的收益

登录查看自己的收益

我的积分登录查看自己的积分

我的C币登录后查看C币余额

我的收藏

我的下载

下载帮助

前往需求广场，查看用户热搜

藏经阁-Dockerizing Spark Workloads.pdf

藏经阁-Lessons Learned From Dockerizing Spark Workloads.pdf

藏经阁-Accelerating SparkML Workloads.pdf

藏经阁-Tuning Apache Spark for Large Scale Workloads.pdf

藏经阁-Accelerating SparkML Workloads on the Intel Xeon FPGA Platfo

JESD219A-01 2022 SOLID-STATE DRIVE (SSD) ENDURANCE WORKLOADS.pdf

Evaluation of Partitioning of Real-Time Workloads on Linux.pdf

开源项目-akrylysov-pogreb.zip

vmotion-perf-vsphere5.pdf )

WiscKey - Separating Keys from Values.pdf

Real-time.Big.Data.Analytics.17843

Kubernetes Cookbook - Sebastien Goasguen -2018.02.14

Learning Docker_Faster App Development and Deployment, 2nd-Packt(2017).pdf

Research Advances in Cloud Computing-Springer(2017).pdf

Concurrency in Main-Memory Database Systems

DB - F1 Lightning- HTAP as a Service.pdf

LessonsLearnedFromDockerizingSparkWorkloads.pdf

docker-hyperdex-ycsb:来自带有 java-bindings 和 ycsb 的 docker hyperdex 1.6

python-assured-workloads

操作系统学习与考试系统(XOSCATS)

SquareLine-Studio 1.3.0安装包

王道操作系统课件 2024

C语言规范标准-C99(中文版)

ELF解析工具 v1.7（elf格式解析工具)

计算机组成原理：最详细笔记 word格式下载

KeepOutlookRunning.7z

dell r730xd 调速工具

Sim-EKB-Install-2022-11-27.zip

贵州电信天邑TY1613-s905l3-b-rtl8822cs当贝固件刷机教程

22-003-T-九联UNT403A-UNT413A-M401A-M411A-S905L3A处理器线刷固件-当贝桌面纯净版

最新资源