ReadingsinDatabaseSystems-5th-edition资源-CSDN文库

需积分: 50 52 浏览量 2017-02-06 10:39:37 上传评论 1 收藏 370KB PDF 举报

资源推荐

资源详情

资源评论

Readings

Database

Systems

Fifth Edition

edited by

Peter Bailis

Joseph M. Hellerstein

Michael Stonebraker

Readings in Database Systems, 5th Edition (2015)

Preface

In the ten years since the previous edition of Read-

ings in Database Systems, the ﬁeld of data management

has exploded. Database and data-intensive systems to-

day operate over unprecedented volumes of data, fueled

in large part by the rise of “Big Data” and massive de-

creases in the cost of storage and computation. Cloud

computing and microarchitectural trends have made dis-

tribution and parallelism nearly ubiquitous concerns.

Data is collected from an increasing variety of hetero-

geneous formats and sources in increasing volume, and

utilized for an ever increasing range of tasks. As a re-

sult, commodity database systems have evolved consid-

erably along several dimensions, from the use of new

storage media and processor designs, up through query

processing architectures, programming interfaces, and

emerging application requirements in both transaction

processing and analytics. It is an exciting time, with

considerable churn in the marketplace and many new

ideas from research.

In this time of rapid change, our update to the tradi-

tional “Red Book” is intended to provide both a ground-

ing in the core concepts of the ﬁeld as well as a commen-

tary on selected trends. Some new technologies bear

striking resemblance to predecessors of decades past,

and we think it’s useful for our readers to be familiar

with the primary sources. At the same time, technology

trends are necessitating a re-evaluation of almost all di-

mensions of database systems, and many classic designs

are in need of revision. Our goal in this collection is

to surface important long-term lessons and foundational

designs, and highlight the new ideas we believe are most

novel and relevant.

Accordingly, we have chosen a mix of classic, tradi-

tional papers from the early database literature as well as

papers that have been most inﬂuential in recent develop-

ments, including transaction processing, query process-

ing, advanced analytics, Web data, and language design.

Along with each chapter, we have included a short com-

mentary introducing the papers and describing why we

selected each. Each commentary is authored by one of

the editors, but all editors provided input; we hope the

commentaries do not lack for opinion.

When selecting readings, we sought topics and pa-

pers that met a core set of criteria. First, each selec-

tion represents a major trend in data management, as

evidenced by both research interest and market demand.

Second, each selection is canonical or near-canonical;

we sought the most representative paper for each topic.

Third, each selection is a primary source. There are

good surveys on many of the topics in this collection,

which we reference in commentaries. However, read-

ing primary sources provides historical context, gives

the reader exposure to the thinking that shaped inﬂuen-

tial solutions, and helps ensure that our readers are well-

grounded in the ﬁeld. Finally, this collection represents

our current tastes about what is “most important”; we

expect our readers to view this collection with a critical

eye.

One major departure from previous editions of the

Red Book is the way we have treated the ﬁnal two sec-

tions on Analytics and Data Integration. It’s clear in

both research and the marketplace that these are two of

the biggest problems in data management today. They

are also quickly-evolving topics in both research and in

practice. Given this state of ﬂux, we found that we had

a hard time agreeing on “canonical” readings for these

topics. Under the circumstances, we decided to omit of-

ﬁcial readings but instead offer commentary. This obvi-

ously results in a highly biased view of what’s happen-

ing in the ﬁeld. So we do not recommend these sections

as the kind of “required reading” that the Red Book has

traditionally tried to offer. Instead, we are treating these

as optional end-matter: “Biased Views on Moving Tar-

gets”. Readers are cautioned to take these two sections

with a grain of salt (even larger that the one used for the

rest of the book.)

We are releasing this edition of the Red Book free

of charge, with a permissive license on our text that al-

lows unlimited non-commercial re-distribution, in mul-

tiple formats. Rather than secure rights to the rec-

ommended papers, we have simply provided links to

Google Scholar searches that should help the reader lo-

cate the relevant papers. We expect this electronic for-

mat to allow more frequent editions of the “book.” We

plan to evolve the collection as appropriate.

A ﬁnal note: this collection has been alive since

1988, and we expect it to have a long future life. Ac-

cordingly, we have added a modicum of “young blood”

to the gray beard editors. As appropriate, the editors of

this collection may further evolve over time.

Peter Bailis

Joseph M. Hellerstein

Michael Stonebraker

Readings in Database Systems, 5th Edition (2015)

Chapter 1: Background

Introduced by Michael Stonebraker

Selected Readings:

Joseph M. Hellerstein and Michael Stonebraker. What Goes Around Comes Around. Readings in Database

Systems, 4th Edition (2005).

Joseph M. Hellerstein, Michael Stonebraker, James Hamilton. Architecture of a Database System. Foundations

and Trends in Databases, 1, 2 (2007).

I am amazed that these two papers were written a

mere decade ago! My amazement about the anatomy

paper is that the details have changed a lot just a few

years later. My amazement about the data model paper

is that nobody ever seems to learn anything from history.

Lets talk about the data model paper ﬁrst.

A decade ago, the buzz was all XML. Vendors were

intent on adding XML to their relational engines. In-

dustry analysts (and more than a few researchers) were

touting XML as “the next big thing”. A decade later it

is a niche product, and the ﬁeld has moved on. In my

opinion, (as predicted in the paper) it succumbed to a

combination of:

• excessive complexity (which nobody could un-

derstand)

• complex extensions of relational engines, which

did not seem to perform all that well and

• no compelling use case where it was wildly ac-

cepted

It is a bit ironic that a prediction was made in the

paper that X would win the Turing Award by success-

fully simplifying XML. That prediction turned out to be

totally wrong! The net-net was that relational won and

XML lost.

Of course, that has not stopped “newbies” from rein-

venting the wheel. Now it is JSON, which can be viewed

in one of three ways:

• A general purpose hierarchical data format. Any-

body who thinks this is a good idea should read

the section of the data model paper on IMS.

• A representation for sparse data. Consider at-

tributes about an employee, and suppose we wish

to record hobbies data. For each hobby, the data

we record will be different and hobbies are funda-

mentally sparse. This is straightforward to model

in a relational DBMS but it leads to very wide,

very sparse tables. This is disasterous for disk-

based row stores but works ﬁne in column stores.

In the former case, JSON is a reasonable encod-

ing format for the “hobbies” column, and several

RDBMSs have recently added support for a JSON

data type.

• As a mechanism for “schema on read”. In effect,

the schema is very wide and very sparse, and es-

sentially all users will want some projection of

this schema. When reading from a wide, sparse

schema, a user can say what he wants to see at

run time. Conceptually, this is nothing but a pro-

jection operation. Hence, ’schema on read” is just

a relational operation on JSON-encoded data.

In summary, JSON is a reasonable choice for sparse

data. In this context, I expect it to have a fair amount of

“legs”. On the other hand, it is a disaster in the mak-

ing as a general hierarchical data format. I fully ex-

pect RDBMSs to subsume JSON as merely a data type

(among many) in their systems. In other words, it is a

reasonable way to encode spare relational data.

No doubt the next version of the Red Book will

trash some new hierarchical format invented by people

who stand on the toes of their predecessors, not on their

shoulders.

The other data model generating a lot of buzz in the

last decade is Map-Reduce, which was purpose-built by

Google to support their web crawl data base. A few

years later, Google stopped using Map-Reduce for that

application, moving instead to Big Table. Now, the rest

of the world is seeing what Google ﬁgured out earlier;

Map-Reduce is not an architecture with any broad scale

applicability. Instead the Map-Reduce market has mor-

剩余53页未读，继续阅读

评论收藏

内容反馈

etah000

粉丝: 2
资源: 9

Readings in Database Systems-5th-edition

最新资源

Readings in Database Systems-5th-edition

第5版数据库系统中的读物Readings in Database Systems, 5th Edition

Readings in Database System

Readings In Database Systems 5th Edition（含论文）

Readings in Database Systems 5th_ Peter Bailis.pdf

Readings in Database Systems, 4th Edition part 1

数据库系统基础教程 中文版

Readings in Database Systems s4th edition, part 2

Computer Networks A Systems Approach 5th edition

Database Systems Using Oracle A Simplified Guide to SQL and PLSQL, 2nd edition

Database Systems Design, Implementation and Management 9th edition

Database Management Systems_2nd edition.pdf

Database Systems 2nd edition

Redbooks in database systems

A Little Book of Python for Multivariate Analysis 等 28 本

Real-Time Systems Design and Analysis, 4th Edition

college writing skills with readings 7th ed

Fundamentals of Database Systems 6th edition

Fundamentals of Database Systems, 6th edition

EE7722-Readings-Perf-GPU+RDNA.pdf

数据库领域图灵奖获得者

Steps to Writing Well with Additional Readings

英文原版-Patterns for College Writing Brief Edition A Rhetorical Reader and Guide 13th Edition

Information.Science.Reference.Selected.Readings.on.Database.Technologies.and.Applications.Aug.2008.eBook-DDU

CET-Readings.pdf

设计&原理&管理 - 安全技术资料汇总（共1份）.zip

unicode-mandarin-readings:

Parallel I/O for Cluster Computing

免费下载Navicat15安装包+工具+教程.zip

cmu 15445 2023spring project0

最新资源

数据库系统基础教程中文版