Overhead-free in-place recovery and repair schemes of XOR-based regenerating codes


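### Overhead-Free In-Place Recovery and Repair Schemes for XOR-Based Regenerating Codes

#### Summary

The paper proposes refined recovery and repair schemes for the XOR-based MBR (minimum-bandwidth regenerating) storage code introduced by Hou et al. The code is due to Hou et al.; the recovery and repair schemes are the contribution of this paper. The schemes have zero transmission overhead for both recovery and repair: the number of bits transmitted is exactly equal to the number of bits recovered or repaired. They rely mainly on XOR operations, so their computational complexity is lower than that of earlier schemes. They also need only a small amount of auxiliary space, which qualifies them as in-place.

#### Introduction

Regenerating codes were designed for distributed storage systems, where they address data redundancy and recovery after node failures. In a typical setting, a file of $M$ bits is encoded and stored at $n$ storage nodes, and the file can be recovered by accessing any $k$ of them. When a node fails, a new node is rebuilt from $d$ surviving nodes so that the system keeps its recovery and regeneration properties. Such a code is called an $[n, k, d]$ regenerating code. Each node stores $\alpha$ bits, and the regenerating bandwidth, i.e., the total number of bits communicated from the $d$ helper nodes during regeneration, is $\gamma$ bits.

Dimakis et al. [3] characterized the fundamental tradeoff between per-node storage and regenerating bandwidth. Two points on the optimal tradeoff curve are of particular interest: the minimum-storage regenerating (MSR) point and the minimum-bandwidth regenerating (MBR) point. A code at the MSR point stores $\alpha = M/k$ bits per node. Because any $k$ nodes suffice to recover the file, recovery has no overhead: the total recovery bandwidth equals the file size. A code at the MBR point has the minimum repair bandwidth $\gamma = \frac{2dM}{k(2d-k+1)}$ with $\alpha = \gamma$. Reading $k\alpha = \frac{2dM}{2d-k+1}$ bits from any $k$ nodes suffices to recover the file, but this quantity exceeds $M$ whenever $k > 1$, so an MBR code does not automatically have zero recovery overhead.

#### Key points

1. **Basics of regenerating codes**
   - **Definition**: a regenerating code is an erasure code for distributed storage that supports both file recovery and repair of failed nodes.
   - **Parameters**: $n$ is the total number of storage nodes, $k$ is the number of nodes needed to recover the file, and $d$ is the number of surviving nodes contacted to repair one failed node.
   - **Storage-bandwidth tradeoff**: along the optimal tradeoff curve, increasing the per-node storage decreases the regenerating bandwidth, and vice versa.
2. **Minimum-bandwidth regenerating (MBR) point**
   - **Definition**: the point on the tradeoff curve with the minimum repair bandwidth.
   - **Properties**:
     - $\gamma = \frac{2dM}{k(2d-k+1)}$ is the minimum repair bandwidth.
     - $\alpha = \gamma$: each node stores exactly as many bits as the repair bandwidth.
     - Reading $k\alpha = \frac{2dM}{2d-k+1}$ bits from any $k$ nodes is sufficient to recover the file.
3. **Overhead-free in-place recovery and repair**
   - **Zero transmission overhead**: no extra bits are transmitted during recovery or repair.
   - **Low complexity**: the schemes use mainly XOR operations, which lowers the computational complexity relative to earlier schemes.
   - **Small auxiliary space**: only a small amount of extra space is needed, so the schemes run in place.
4. **XOR-based MBR regenerating storage code**
   - **Role of XOR**: encoding, recovery, and repair are built mainly from XOR and shift operations.
   - **Optimization**: the schemes refine the recovery and repair procedures for this code to reduce bandwidth, computation, and storage.

#### Conclusion

The paper presents overhead-free in-place recovery and repair schemes for XOR-based MBR regenerating storage codes. By removing the transmission overhead, lowering the computational complexity, and reducing the auxiliary space, the schemes make these codes more efficient to use in distributed storage systems.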


Overhead-free In-place Recovery and Repair Schemes of XOR-based Regenerating Codes

Ximing Fu∗, Zhiqing Xiao†, and Shenghao Yang‡

∗Department of Computer Science and Technology, Tsinghua University

†Department of Electronic Engineering, Tsinghua University

‡Institute of Network Coding, The Chinese University of Hong Kong

Abstract: In this paper, refined recovery and repair schemes are proposed for a storage system using the XOR-based MBR regenerating storage code proposed by Hou et al. Our schemes have zero transmission overhead for both recovery and repair, i.e., the total number of transmitted bits for repair/recovery is exactly equal to the total number of bits repaired/recovered. Further, our schemes use mainly XOR operations and have lower complexity than that of the previous schemes. Moreover, our schemes require only a small amount of auxiliary space, which qualifies our schemes as in-place.

I. INTRODUCTION

Regenerating codes were proposed for distributed storage systems in [1]–[3]. In a typical setting, a file of M bits is encoded and stored at n storage nodes. The file can be recovered by accessing any k storage nodes. When a storage node fails, a new node can be regenerated by accessing any d surviving nodes such that the new set of n storage nodes preserves the above recovery and regenerating properties. A regenerating code with the above parameters is also called an [n, k, d] code. Suppose that each storage node stores α bits and that the regenerating bandwidth, i.e., the total number of bits communicated from the d nodes during regeneration, is γ bits. Dimakis et al. [3] characterized a fundamental tradeoff between the storage per node and the regenerating bandwidth.

Two extremal points on the optimal storage-bandwidth tradeoff curve are of particular interest, i.e., the minimum-storage regenerating (MSR) point and the minimum-bandwidth regenerating (MBR) point. The regenerating codes attaining the MSR point store $\alpha = M/k$ bits in each node. Since any k nodes can be used to recover the original file, such regenerating codes have zero recovery overhead, i.e., the total recovery bandwidth is equal to the file size. The regenerating codes attaining the MBR point have the minimum repair bandwidth $\gamma = \frac{2dM}{k(2d-k+1)}$ and α = γ. Accessing $k\alpha = \frac{2dM}{2d-k+1}$ bits from any k nodes is sufficient to recover the original file. But kα is strictly larger than M when k > 1, so an MBR code may not have zero recovery overhead. We focus on MBR codes in this paper.
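As a quick numerical check of the two operating points, the following minimal sketch evaluates the standard expressions α = M/k at the MSR point and α = γ = 2dM/(k(2d−k+1)) at the MBR point, together with the recovery overhead kα − M. The function names and the sample parameters are illustrative assumptions, and the MSR repair bandwidth Md/(k(d−k+1)) is the usual expression from the storage-bandwidth tradeoff rather than something stated in this section.

```python
from fractions import Fraction


def msr_point(M, k, d):
    """(alpha, gamma) at the minimum-storage regenerating point (hypothetical helper)."""
    alpha = Fraction(M, k)
    gamma = Fraction(M * d, k * (d - k + 1))
    return alpha, gamma


def mbr_point(M, k, d):
    """(alpha, gamma) at the minimum-bandwidth regenerating point (hypothetical helper)."""
    gamma = Fraction(2 * M * d, k * (2 * d - k + 1))
    return gamma, gamma  # alpha equals gamma at the MBR point


def recovery_overhead(M, k, alpha):
    """Extra bits read when all alpha bits of k nodes are downloaded."""
    return k * alpha - M


if __name__ == "__main__":
    M, k, d = 120, 3, 4  # sample parameters, chosen only for illustration
    for name, point in (("MSR", msr_point), ("MBR", mbr_point)):
        alpha, gamma = point(M, k, d)
        print(name, "alpha =", alpha, "gamma =", gamma,
              "overhead =", recovery_overhead(M, k, alpha))
```

With these sample values the MSR point reads exactly M bits during recovery, while downloading every stored bit from k MBR nodes reads more than M bits, which is the overhead the rest of the paper sets out to remove.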

Rashmi, Shah and Kumar [4] provided product-matrix constructions of MBR codes for all valid values of [n, k, d] and MSR codes for d ≥ 2k − 2. The product-matrix MBR codes, however, require matrix operations over finite fields for encoding and recovery, which leads to high complexity for practical systems. To resolve this complexity issue, Hou et al. [5]–[7] proposed BASIC codes using an exclusive-or (XOR) version of the product-matrix constructions. For BASIC codes, encoding and recovery mainly use binary XOR and shift operations, so the computational complexities are significantly reduced.

This work was supported in part by the National Basic Research Program of China (973 Program) under Grant 2013CB834205 and the National Natural Science Foundation of China (NSFC) under Grant 61133013 and 61471215. This work was partially funded by a grant from the University Grants Committee of the Hong Kong Special Administrative Region (Project No. AoE/E-02/08).

In a BASIC MBR code, a file of M bits is divided into $B = kd - \binom{k}{2}$ sequences, each of which consists of L = M/B bits. After encoding, each storage node stores d encoded packets. In the recovery algorithm introduced in [5], kd packets are retrieved from k storage nodes and the B sequences are recovered by solving d linear systems of dimension k. Due to the shift operations, the encoded packets may be of different lengths, but all have at least L bits. This packet overhead (the number of bits in a packet minus L) affects the recovery bandwidth. The recovery bandwidth, the extra decoding storage and the computational complexity of the recovery algorithm of [5] are given in Table I. In the worst case, about 2M bits are transmitted to recover the M bits.
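The packet counts in this paragraph can be turned into a small calculation. The sketch below assumes the product-matrix MBR message size B = kd − k(k−1)/2 and computes L = M/B, then a lower bound on the ratio of retrieved bits to file size when all kd packets of at least L bits each are fetched. The helper names are hypothetical, and the bound ignores the packet overhead caused by shifts, so the actual bandwidth of the scheme in [5] is at least this large.

```python
from fractions import Fraction


def basic_mbr_layout(M, k, d):
    """Return (B, L): number of sequences and bits per sequence (hypothetical helper)."""
    B = k * d - k * (k - 1) // 2  # k*(k-1) is always even, so the division is exact
    if M % B != 0:
        raise ValueError("M must be a multiple of B for equal-length sequences")
    return B, M // B


def min_retrieval_ratio(k, d):
    """Lower bound on (bits fetched) / M when all k*d packets are retrieved.

    Each packet has at least L = M/B bits, so at least k*d*L = (k*d/B)*M bits move.
    """
    B = k * d - k * (k - 1) // 2
    return Fraction(k * d, B)


if __name__ == "__main__":
    print(basic_mbr_layout(90, 3, 4))          # sample values for illustration
    print(float(min_retrieval_ratio(3, 4)))
    print(float(min_retrieval_ratio(100, 100)))
```

For d = k the ratio equals 2k/(k+1), which approaches 2 as k grows. This is consistent with the statement that about 2M bits may be transmitted in the worst case.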

In this paper, we propose a more efficient recovery scheme for BASIC codes, which works for all valid values of [n, k, d]. Our recovery scheme has two stages: the retrieving stage and the decoding stage. In a BASIC MBR code, each node stores more than M/k bits. In the retrieving stage of our scheme, exactly M bits are retrieved from any k storage nodes. Therefore, our scheme achieves the optimal recovery bandwidth exactly. In other words, our scheme implies that it is possible to achieve zero recovery overhead for MBR codes. In the decoding stage of our scheme, the M bits retrieved in the first stage are used to recover the original file. Our algorithm is similar to the ZigZag decoding of Sung and Gong [8] designed for a storage code based on XOR and shift operations. We optimize their algorithm for BASIC MBR codes to gain lower computational and storage complexities. Specifically, the number of XOR operations used in our recovery scheme is





. After retrieving the data from the storage nodes, our decoding algorithm overwrites the data during execution, and only consumes O(k log L) extra storage space for auxiliary variables. After the algorithm executes, the M bits in the memory storing the retrieved data become exactly the desired file. Therefore, our decoding algorithm is in-place.
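To illustrate the flavor of zigzag-style, in-place decoding with zero retrieval overhead, here is a toy sketch. It is not the BASIC MBR code or the algorithm of this paper: it assumes just two L-bit sequences a and b, one packet a ⊕ b, and one packet a ⊕ (b shifted by one position). The function names are hypothetical. The decoder reads only L bits of each packet (2L bits in total, the size of the data) and overwrites the packet buffers with the decoded sequences, using only a loop index as auxiliary state.

```python
def encode(a, b):
    """Toy shift-and-XOR encoder: p0 = a ^ b, p1 = a ^ (b delayed by one bit)."""
    L = len(a)
    p0 = [x ^ y for x, y in zip(a, b)]
    p1 = list(a) + [0]            # one extra bit of packet overhead from the shift
    for i in range(L):
        p1[i + 1] ^= b[i]
    return p0, p1


def zigzag_decode_in_place(p0, p1):
    """Overwrite p0 with b and p1 with a, given p0 and the first L bits of p1."""
    L = len(p0)
    for i in range(L):
        if i > 0:
            p1[i] ^= p0[i - 1]    # remove the already decoded b[i-1]; p1[i] becomes a[i]
        p0[i] ^= p1[i]            # remove a[i]; p0[i] becomes b[i]


if __name__ == "__main__":
    a, b = [1, 0, 1, 1], [0, 1, 1, 0]
    p0, p1 = encode(a, b)
    received_p1 = p1[:len(a)]     # the last bit of p1 is never transmitted
    zigzag_decode_in_place(p0, received_p1)
    print(received_p1 == a, p0 == b)
```

The decoder starts from the one bit that is exposed by the shift (position 0 of the second packet) and alternates between the two packets, which is the zigzag pattern. In this toy case it uses 2L − 1 XOR operations.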

The packet overhead also affects the repair bandwidth. For the repair scheme of BASIC codes in [4], [5], the total number
