© Cloudera, Inc. All rights reserved.
Apache Kudu & Apache Spark SQL for Fast Analytics on Fast Data
Mike Percy
Software Engineer at Cloudera
Apache Kudu PMC member
Kudu Overview
Traditional Hadoop Storage Leaves a Gap
Use cases that fall between HDFS and HBase were difficult to manage
[Figure: HDFS and HBase positioned by Pace of Data and Pace of Analysis; scale labels: Unchanging, Fast Changing / Frequent Updates, Append-Only, Real-Time]
• HDFS: fast scans, analytics, and processing of static data
• HBase: fast on-line updates and data serving
• Arbitrary storage (active archive)
• Analytic gap: fast analytics on fast-changing or frequently-updated data, covered only by complex hybrid architectures
Kudu: Fast Analytics on Fast-Changing Data
New storage engine enables new Hadoop use cases
[Figure: HDFS, HBase, and Kudu positioned by Pace of Data and Pace of Analysis; scale labels: Unchanging, Fast Changing / Frequent Updates, Append-Only, Real-Time]
• HDFS: fast scans, analytics, and processing of static data
• HBase: fast on-line updates and data serving
• Arbitrary storage (active archive)
• Kudu fills the analytic gap: fast analytics on fast-changing or frequently-updated data
• Modern analytic applications often require complex data flows and difficult integration work to move data between HBase and HDFS
Apache Kudu: Scalable and fast tabular storage
Tabular
• Represents data in structured tables like a relational database
• Strict schema, finite column count, no BLOBs
• Individual record-level access to 100+ billion row tables
Scalable
• Tested up to 275 nodes (~3 PB cluster)
• Designed to scale to thousands of nodes and tens of petabytes
Fast
• Millions of read/write operations per second across cluster
• Multiple GB/second read throughput per node
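Because Kudu exposes data as strictly typed tables, Spark SQL can treat a Kudu table much like any other relational source. The following is a minimal sketch of that idea from PySpark, not code taken from the slides. It assumes the kudu-spark connector is on the Spark classpath; the helper names (`kudu_read_options`, `load_kudu_table`), the master address, and the table name are hypothetical placeholders.

```python
# Sketch only: reading a Kudu table into a Spark DataFrame from PySpark.
# Assumes the kudu-spark connector is on the Spark classpath; the master
# address and table name passed in by callers are hypothetical placeholders.

# Data source name registered by the kudu-spark connector.
KUDU_FORMAT = "org.apache.kudu.spark.kudu"


def kudu_read_options(master: str, table: str) -> dict:
    """Build the option map the kudu-spark data source expects."""
    return {"kudu.master": master, "kudu.table": table}


def load_kudu_table(spark, master: str, table: str):
    """Return a DataFrame backed by a Kudu table.

    `spark` is an existing SparkSession. Row data is scanned lazily,
    when an action runs on the returned DataFrame.
    """
    opts = kudu_read_options(master, table)
    return spark.read.format(KUDU_FORMAT).options(**opts).load()
```

With a session in scope, a call such as `load_kudu_table(spark, "kudu-master:7051", "metrics")` could be registered with `createOrReplaceTempView("metrics")` and then queried through `spark.sql(...)`. Keeping the option map in its own function makes the connection settings easy to inspect without a running cluster.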