Optimizing Flash-based Key-value Cache Systems (Resource Summary)


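Flash-based key-value cache systems offer a cost-efficient solution for high-speed key-value caching. As Internet services increasingly demand low latency, in-memory key-value caches such as Memcached and Redis have become the first line of defense. These systems rely on large amounts of expensive, power-hungry dynamic random-access memory (DRAM). To lower the total cost of ownership (TCO), flash-based key-value caches have recently attracted strong interest in industry. Facebook, for example, deployed a flash-based, Memcached-compatible key-value cache called McDipper. McDipper reportedly allowed Facebook to reduce the number of deployed servers by as much as 90% while still delivering more than 90% of "get responses" with sub-millisecond latency. Twitter has a similar flash-based key-value cache called Fatcache.

Systems such as McDipper and Fatcache typically use commercial solid-state drives (SSDs) directly and adopt a Memcached-like architecture to store and manage key-value pairs in flash. For example, they organize key-value pairs into slabs of different size classes and use an in-memory hash table to maintain the key-to-value mapping. This approach is simple but inefficient. The paper argues for reconsidering the hardware/software architecture by opening device-level details directly to the key-value cache system. Such a redesign bridges the semantic gap between the two layers and connects them closely. By combining domain knowledge of key-value caches with the inherent device-level properties of flash, it aims to maximize the efficiency of a key-value cache on flash devices while minimizing its weaknesses. The authors are implementing a prototype on the Open-channel SSD hardware platform, and preliminary experiments show very promising results.

The paper focuses on how to optimize flash-based key-value cache systems. It is concerned not only with storage-level cache performance but also with rethinking the hardware and software architecture to address the cost and efficiency problems of conventional cache systems.

Optimizing a flash-based key-value cache first requires a solid understanding of the existing architecture and how it works. Memcached is a widely used, high-performance, distributed in-memory object cache that was originally designed to speed up dynamic web applications. It uses a simple key-value storage model and an in-memory hash table to locate data quickly. However, once the data volume exceeds physical memory capacity, performance drops sharply. Using flash SSDs for key-value caching takes advantage of their non-volatility, high read/write speed, and relatively low price, which can substantially reduce the TCO of the cache system. At the same time, the block structure and erase-before-write behavior of flash pose new challenges for optimization: for example, how to choose data block sizes and order writes so as to limit erase cycles, and how to manage the persistence and consistency of data in flash. Addressing these challenges depends on a deep understanding of both flash device characteristics and the way key-value caches operate.

The paper proposes improving performance by redesigning the system architecture so that hardware and software are more tightly integrated. This includes, but is not limited to:

1. Exploiting device-level properties of the SSD directly, for example by revisiting how the flash translation layer (FTL) manages data blocks.
2. Adapting the in-memory hash table strategy to the read/write characteristics of flash devices.
3. Designing new data organization and storage schemes that raise the cache hit ratio and reduce wear caused by writes.
4. Designing cache replacement policies suited to the limited write endurance of flash.
5. Implementing efficient consistency maintenance to keep cached data accurate and reliable.

Research on flash-based key-value caches continues to advance, and the optimizations above still leave many technical challenges open: how to reduce latency while sustaining high throughput, how to cut unnecessary writes without sacrificing data consistency, and how to keep the system stable in complex network environments. As the technology and optimization strategies mature, key-value cache systems should become more efficient, reliable, and economical.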


Optimizing Flash-based Key-value Cache Systems

Zhaoyan Shen†, Feng Chen‡, Yichen Jia‡, Zili Shao†

†Department of Computing, Hong Kong Polytechnic University
‡Computer Science & Engineering, Louisiana State University

Abstract

Flash-based key-value cache systems, such as Facebook’s McDipper [1] and Twitter’s Fatcache [2], provide a cost-efficient solution for high-speed key-value caching. These cache solutions typically take commercial SSDs and adopt a Memcached-like scheme to store and manage key-value pairs in flash. Such a practice, though simple, is inefficient. We advocate reconsidering the hardware/software architecture design by directly opening device-level details to key-value cache systems. This co-design approach can effectively bridge the semantic gap and closely connect the two layers. Leveraging the domain knowledge of key-value caches and the unique device-level properties, we can maximize the efficiency of a key-value cache system on flash devices while minimizing its weaknesses. We are implementing a prototype based on the Open-channel SSD hardware platform. Our preliminary experiments show very promising results.

1 Introduction

High-speed key-value caches, such as Memcached and Redis, are the “first line of defense” in today’s low-latency Internet services. Traditionally, these in-memory key-value caches rely heavily on large amounts of expensive and power-hungry DRAM. In order to lower the Total Cost of Ownership (TCO), a more cost-efficient alternative, the flash-based key-value cache, has recently attracted strong interest in industry [1, 2]. Facebook, for example, deploys a flash-based, Memcached-compatible key-value cache system called McDipper [1]. It is reported that McDipper allows Facebook to reduce the number of deployed servers by as much as 90% while still delivering more than 90% of “get responses” with sub-millisecond latencies [3]. Twitter also has a similar flash-based key-value cache solution, called Fatcache [2].

Typically, these flash-based key-value cache systems directly use commercial flash SSDs and adopt a Memcached-like scheme to manage key-value cache data in flash, such as organizing key-values into slabs of different size classes and using an in-memory hash table to maintain the key-to-value mapping. Such a design, though simple, disregards an important fact: the key-value cache system and the underlying flash storage both have unique properties. Simply treating the flash SSD as faster storage and the key-value cache as a regular application not only fails to exploit various optimization opportunities but also raises several critical problems, namely redundant mapping, double garbage collection, and over-overprovisioning. In this study, we advocate reconsidering the current software/hardware architecture in order to design an efficient key-value cache system that is highly optimized for flash.

2 Background and Motivation

2.1 Flash-based key-value caches

The existing flash-based key-value cache system design is fairly similar to its in-memory counterpart: both use slab-based space management. Here we use Twitter’s Fatcache [2] as an example for explanation:

The flash SSD space is first segmented into slabs. Each slab is typically several megabytes in size and is further divided into an array of slots (a.k.a. chunks) of equal size. Each slot stores a “value” item. Slabs are logically organized into different slab classes based on their slot sizes. An incoming value item is stored in a slab whose slot size is the best fit for its size. For quick access, a hash mapping table is maintained in memory to map the keys to the slabs that contain the corresponding values. Querying a key-value pair (get) is accomplished by searching the in-memory hash table and loading the corresponding slab block from flash into memory. Updating a key-value pair (set) is realized by writing the updated value to a new location and updating the mapping table entry. Deleting a key-value pair (delete) simply removes the mapping from the hash table. The deleted or obsolete value items are left for garbage collection (GC) later. The current design has three critical problems, which have motivated us to perform this study.
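The get/set/delete flow described above can be illustrated with a small in-memory model. This is only a sketch under stated assumptions, not Fatcache's actual code: the names `SlabCache`, `Slab`, `SLAB_SIZE`, and `SLOT_SIZES` are hypothetical, the slab and slot sizes are arbitrary, and a Python list stands in for flash-resident slots.

```python
from dataclasses import dataclass, field

SLAB_SIZE = 1 << 20                  # assumed 1 MiB per slab, for illustration only
SLOT_SIZES = [64, 256, 1024, 4096]   # hypothetical slab classes (slot sizes in bytes)


@dataclass
class Slab:
    slot_size: int
    slots: list = field(default_factory=list)  # stand-in for slots stored in flash

    def is_full(self) -> bool:
        return len(self.slots) >= SLAB_SIZE // self.slot_size


class SlabCache:
    def __init__(self):
        self.slabs = []     # all slabs, in allocation order
        self.open = {}      # slot size -> id of the slab currently being filled
        self.index = {}     # in-memory hash table: key -> (slab id, slot id)
        self.obsolete = 0   # slots left behind for a later garbage collection

    def _slot_size_for(self, value: bytes) -> int:
        # Best fit: the smallest slot size that can hold the value.
        for size in SLOT_SIZES:
            if len(value) <= size:
                return size
        raise ValueError("value larger than the largest slot size")

    def set(self, key, value: bytes) -> None:
        size = self._slot_size_for(value)
        slab_id = self.open.get(size)
        if slab_id is None or self.slabs[slab_id].is_full():
            self.slabs.append(Slab(size))
            slab_id = len(self.slabs) - 1
            self.open[size] = slab_id
        slab = self.slabs[slab_id]
        slab.slots.append(value)      # always write to a new location
        if key in self.index:
            self.obsolete += 1        # the old copy stays in place until GC
        self.index[key] = (slab_id, len(slab.slots) - 1)

    def get(self, key):
        loc = self.index.get(key)
        if loc is None:
            return None
        slab_id, slot_id = loc
        return self.slabs[slab_id].slots[slot_id]

    def delete(self, key) -> bool:
        # Only the mapping is removed; the value itself is left for GC.
        if self.index.pop(key, None) is None:
            return False
        self.obsolete += 1
        return True
```

In this model an update never overwrites the old slot, so `obsolete` grows with every update or delete. That mirrors why the application layer needs its own garbage collection on top of whatever the SSD does internally.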

2.2 Critical Issues

• Problem 1: Redundant mapping. Modern flash SSDs implement a complex Flash Translation Layer (FTL) in the firmware. A key function of the FTL is to translate Logical Block Addresses (LBAs) to physical flash memory pages. Although a variety of mapping schemes exist [8], for performance reasons, high-end SSDs often adopt page-level mapping for fine-grained logical-to-physical address translation. As a result, for a 1TB SSD with a 4KB page size, a page-level mapping table could be as large as 1GB. Integrating such a large DRAM on
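The 1GB estimate for the page-level mapping table can be reproduced with a short calculation. The sketch below assumes a flat table with one 4-byte entry per page and binary units (1TB = 2^40 bytes, 4KB = 2^12 bytes). The entry size is an assumption, since the text does not state it, and `page_map_bytes` is a hypothetical helper name.

```python
def page_map_bytes(capacity_bytes: int, page_bytes: int, entry_bytes: int = 4) -> int:
    """Size of a flat page-level mapping table with one entry per flash page.

    entry_bytes=4 is an assumed entry width, not a figure from the text.
    """
    pages = capacity_bytes // page_bytes
    return pages * entry_bytes


TIB = 1 << 40
KIB = 1 << 10
GIB = 1 << 30

table = page_map_bytes(1 * TIB, 4 * KIB)
print(table // GIB, "GiB")  # prints "1 GiB"
```

Under these assumptions the device has 2^28 pages, and 2^28 entries of 4 bytes each occupy 2^30 bytes, which matches the figure quoted above.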
