没有合适的资源?快使用搜索试试~ 我知道了~
Data Structures and Algorithms for Big Databases
需积分: 10 8 下载量 30 浏览量
2012-12-09
14:46:30
上传
评论
收藏 14.27MB PDF 举报
温馨提示


试读
208页
I/O model and cache-oblivious analysis Write-optimized data structures How write-optimized data structures can help file systems Block-replacement algorithms Indexing strategies Log-structured merge trees Bloom filters
资源推荐
资源详情
资源评论



















Data Structures and Algorithms for
Big Databases
Michael A. Bender
Stony Brook & Tokutek
Bradley C. Kuszmaul
MIT & Tokutek

Big data problem
oy vey
???
???
???
data indexing
query processor
queries + answers
???
365
42
data ingestion
Important and universal problem.
Hot topic.
2

Big data problem
oy vey
???
???
???
data indexing
query processor
queries + answers
???
365
42
data ingestion
For on-disk data, one sees funny tradeoffs in the speeds
of data ingestion, query speed, and freshness of data.
Important and universal problem.
Hot topic.
2

Don’t Thrash: How to Cache Your Hash in Flash
data indexing
query processor
queries +
answers
???
42
data
ingestion
Funny tradeoff in ingestion, querying, freshness
• Typical record of all kinds of metadata is < 150 bytes.
• Different parts of metadata are accessed separately.
• “I'm trying to create indexes on a table with 308 million rows. It took ~20
minutes to load the table but 10 days to build indexes on it.”
‣
MySQL bug #9544
• “Select queries were slow until I added an index onto the timestamp field...
Adding the index really helped our reporting, BUT now the inserts are taking
forever.”
‣
Comment on mysqlperformanceblog.com
• “They indexed their tables, and indexed them well,
And lo, did the queries run quick!
But that wasn’t the last of their troubles, to tell–
Their insertions, like molasses, ran thick.”
‣
Not from Alice in Wonderland by Lewis Carroll
3

Don’t Thrash: How to Cache Your Hash in Flash
data indexing
query processor
queries +
answers
???
42
data
ingestion
Funny tradeoff in ingestion, querying, freshness
• Typical record of all kinds of metadata is < 150 bytes.
• Different parts of metadata are accessed separately.
• “I'm trying to create indexes on a table with 308 million rows. It took ~20
minutes to load the table but 10 days to build indexes on it.”
‣
MySQL bug #9544
• “Select queries were slow until I added an index onto the timestamp field...
Adding the index really helped our reporting, BUT now the inserts are taking
forever.”
‣
Comment on mysqlperformanceblog.com
• “They indexed their tables, and indexed them well,
And lo, did the queries run quick!
But that wasn’t the last of their troubles, to tell–
Their insertions, like molasses, ran thick.”
‣
Not from Alice in Wonderland by Lewis Carroll
4
剩余207页未读,继续阅读
资源评论


sysubo
- 粉丝: 4
- 资源: 3
上传资源 快速赚钱
我的内容管理 展开
我的资源 快来上传第一个资源
我的收益
登录查看自己的收益我的积分 登录查看自己的积分
我的C币 登录后查看C币余额
我的收藏
我的下载
下载帮助


安全验证
文档复制为VIP权益,开通VIP直接复制
