没有合适的资源?快使用搜索试试~ 我知道了~
藏经阁-Scaling 30 TB s of Data lake with Apache HBase and Scala DSL
需积分: 5 0 下载量 80 浏览量
2023-08-26
16:29:09
上传
评论
收藏 2.18MB PDF 举报
温馨提示
试读
34页
藏经阁-Scaling 30 TB s of Data lake with Apache HBase and Scala DSL
资源推荐
资源详情
资源评论
hosted by
Scaling 30 TB’s of Data Lake
with Apache HBase and Scala
DSL at Production
Chetan Khatri
hosted by
Who Am I
Lead - Data Science, Technology Evangelist @ Accion labs India Pvt. Ltd.
Contributor @ Apache Spark, Apache HBase, Elixir Lang, Spark HBase
Connectors.
Co-Authored University Curriculum @ University of Kachchh, India.
Data Engineering @: Nazara Games, Eccella Corporation.
Advisor - Data Science Lab, University of Kachchh, India.
M.Sc. - Computer Science from University of Kachchh, India.
hosted by
Agenda
01
02
04
03
What is Apache HBase
Why Apache HBase
Apache Spark and Scala
Apache Spark HBase Connector
05
Case Study: Retail Analytics
Architecturing Fast Data Processing Platform to
Scale 30 TB Data in Production
hosted by
Add the title
• Modules do not limit the size,
number, interval, can be adjusted
according to need
• Modules do not limit the size,
number, interval, can be adjusted
according to need
• Modules do not limit the size,
number, interval, can be adjusted
according to need
Source: https://hbase.apache.org/
● Column-oriented NoSQL
● Non-relational
● Distributed database build on top of HDFS.
● Modeled after Google’s BigTable.
● Built for fault-tolerant application with billions/
trillions of rows and millions of columns.
● Very low latency and near real-time random
reads and random writes.
● Replication, end-to-end checksums,
automatic rebalancing with HDFS.
● Compression
● Bloom filters
● MapReduce over HBase data.
● Best at fetching rows by key, scanning
ranges of rows with ordered partitioning.
What is Apache HBase
hosted by
What is Apache Spark ?
Source: https://spark.apache.org/
Structured Data / SQL -
Spark SQL
Graph Processing -
GraphX
Machine Learning -
MLlib
Streaming - Spark Streaming,
Structured Streaming
剩余33页未读,继续阅读
资源评论
weixin_40191861_zj
- 粉丝: 62
- 资源: 1万+
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功