没有合适的资源?快使用搜索试试~ 我知道了~
Delta Lake数据湖 English Version
需积分: 2 2 下载量 148 浏览量
2023-01-03
21:47:42
上传
评论
收藏 11.34MB PDF 举报
温馨提示
试读
104页
Delta Lake - The Definitive Guide 基于Delta Lake的湖仓一体的现代数据架构,主要包括常规的基本代码用例,历史审计和时间旅行管理,流批一体等原理讲解和使用代码样例讲解和分析
资源推荐
资源详情
资源评论
With Early Release ebooks, you get books in their earliest
form—the authors’ raw and unedited content as they write—
so you can take advantage of these technologies long before
the official release of these titles.
Denny Lee, Tathagata Das, and Vini Jaiswal
Delta Lake: The Denitive Guide
Modern Data Lakehouse Architectures
with Delta Lake
Boston Farnham Sebastopol
Tokyo
Beijing Boston Farnham Sebastopol
Tokyo
Beijing
978-1-098-10452-8
Delta Lake: The Denitive Guide
by Denny Lee, Tathagata Das, and Vini Jaiswal
Copyright © 2022 O’Reilly Media. All rights reserved.
Printed in the United States of America.
Published by O’Reilly Media, Inc., 1005 Gravenstein Highway North, Sebastopol, CA 95472.
O’Reilly books may be purchased for educational, business, or sales promotional use. Online editions are
also available for most titles (http://oreilly.com). For more information, contact our corporate/institutional
sales department: 800-998-9938 or corporate@oreilly.com.
Acquisitions Editor: Jessica Haberman
Development Editor: Gary O’Brien
Production Editor: Christopher Faucher
Interior Designer: David Futato
Cover Designer: Karen Montgomery
April 2022:
First Edition
Revision History for the Early Release
2021-04-20: First Release
2021-05-07: Second Release
See http://oreilly.com/catalog/errata.csp?isbn=9781098104597 for release details.
The O’Reilly logo is a registered trademark of O’Reilly Media, Inc. Delta Lake: e Denitive Guide, the
cover image, and related trade dress are trademarks of O’Reilly Media, Inc.
The views expressed in this work are those of the authors, and do not represent the publisher’s views.
While the publisher and the authors have used good faith efforts to ensure that the information and
instructions contained in this work are accurate, the publisher and the authors disclaim all responsibility
for errors or omissions, including without limitation responsibility for damages resulting from the use of
or reliance on this work. Use of the information and instructions contained in this work is at your own
risk. If any code samples or other technology this work contains or describes is subject to open source
licenses or the intellectual property rights of others, it is your responsibility to ensure that your use
thereof complies with such licenses and/or rights.
This work is part of a collaboration between O’Reilly and Databricks. See our statement of editorial inde‐
pendence.
Table of Contents
1.
Basic Operations on Delta Lakes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
What is Delta Lake? 8
How to start using Delta Lake 9
Using Delta Lake via local Spark shells 9
Leveraging GitHub or Maven 10
Using Databricks Community Edition 10
Basic operations 11
Creating your first Delta table 11
Unpacking the Transaction Log 14
What Is the Delta Lake Transaction Log? 16
How Does the Transaction Log Work? 18
Dealing With Multiple Concurrent Reads and Writes 30
Other Use Cases 35
Diving further into the transaction log 35
Table Utilities 36
Review table history 36
Vacuum History 37
Retrieve Delta table details 39
Generate a manifest file 41
Convert a Parquet table to a Delta table 42
Convert a Delta table to a Parquet table 43
Restore a table version 43
Summary 48
2.
Time Travel with Delta Lake. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
Introduction 49
Under the hood of a Delta Table 50
The Delta Directory 50
v
剩余103页未读,继续阅读
资源评论
ZL小屁孩
- 粉丝: 18
- 资源: 25
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功