PrefixHashTree.pdf (CSDN Library resource)

Uploaded 2021-02-27 12:08:33 · 140 KB · PDF

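Summary of the main points of PrefixHashTree.pdf:

Distributed Hash Tables (DHTs) are scalable, robust, self-organizing distributed systems that support exact-match lookups. Using a structured overlay network, a DHT maps a given key to the network node that holds the object associated with that key. This lookup(key) operation can support the standard hash table operations put(key, value) and get(key). Because the interface is so broadly applicable, many systems have been built on top of DHTs, including file systems, indirection services, event notification, and content distribution networks.

The Prefix Hash Tree (PHT) is a DHT-based data structure that uses the DHT's lookup interface to build a trie-based structure. It is efficient (updates are doubly logarithmic in the size of the domain being indexed) and resilient (the failure of any given PHT node does not affect the availability of data stored at other nodes). The PHT lets a DHT answer more sophisticated queries: it handles range queries efficiently, which is useful for large data sets, and it supports queries such as prefix matching that a conventional DHT does not.

The authors are Sriram Ramabhadran (University of California, San Diego), Sylvia Ratnasamy (Intel Research, Berkeley), Joseph M. Hellerstein (University of California, Berkeley and Intel Research, Berkeley), and Scott Shenker (International Computer Science Institute, Berkeley and University of California, Berkeley). Their aim is to overcome the limits DHTs face with more complex queries by introducing a new data structure that extends what a DHT can be asked.

The paper also notes that DHTs were designed in the "Internet style": scalability and ease of deployment take priority over strict semantics, which is what makes DHTs practical for a wide range of Internet applications. Overall, the paper explains how the PHT extends a DHT to support more complex queries and describes the underlying principles and application scenarios.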


Prefix Hash Tree

An Indexing Data Structure over Distributed Hash Tables

Sriram Ramabhadran∗, University of California, San Diego

Sylvia Ratnasamy, Intel Research, Berkeley

Joseph M. Hellerstein, University of California, Berkeley and Intel Research, Berkeley

Scott Shenker, International Comp. Science Institute, Berkeley and University of California, Berkeley

ABSTRACT

Distributed Hash Tables are scalable, robust, and self-organizing peer-to-peer systems that support exact match lookups. This paper describes the design and implementation of a Prefix Hash Tree - a distributed data structure that enables more sophisticated queries over a DHT. The Prefix Hash Tree uses the lookup interface of a DHT to construct a trie-based structure that is both efficient (updates are doubly logarithmic in the size of the domain being indexed), and resilient (the failure of any given node in the Prefix Hash Tree does not affect the availability of data stored at other nodes).

Categories and Subject Descriptors

C.2.4 [Comp. Communication Networks]: Distributed Systems - distributed applications; E.1 [Data Structures]: distributed data structures; H.3.1 [Information Storage and Retrieval]: Content Analysis and Indexing - indexing methods

General Terms

Algorithms, Design, Performance

Keywords

distributed hash tables, data structures, range queries

1. INTRODUCTION

The explosive growth but primitive design of peer-to-peer file-sharing applications such as Gnutella [7] and KaZaa [29] inspired the research community to invent Distributed Hash Tables (DHTs) [31, 24, 14, 26, 22, 23]. Using a structured overlay network, DHTs map a given key to the node in the network holding the object associated with that key; this lookup operation lookup(key) can be used to support the canonical put(key,value) and get(key) hash table operations. The broad applicability of this lookup interface has allowed a wide variety of systems to be built on top of DHTs, including file systems [9, 27], indirection services [30], event notification [6], content distribution networks [10] and many others.

∗ Email: sriram@cs.ucsd.edu
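To make the relationship between lookup(key), put(key,value) and get(key) concrete, here is a minimal in-process sketch. It is not taken from the paper: the class name ToyDHT, the SHA-1 hash ring used to pick the responsible node, and the per-node dictionaries are all assumptions chosen for illustration, and no networking or routing is modeled.

```python
import hashlib


class ToyDHT:
    """In-process stand-in for a DHT (hypothetical; one dict per node, no network)."""

    def __init__(self, node_names):
        # Assumption: nodes are placed on a hash ring by hashing their names.
        self.ring = sorted((self._hash(name), name) for name in node_names)
        self.stores = {name: {} for name in node_names}

    @staticmethod
    def _hash(text):
        return int.from_bytes(hashlib.sha1(text.encode()).digest(), "big")

    def lookup(self, key):
        """Return the node responsible for key: the first node at or after hash(key)."""
        h = self._hash(key)
        for point, name in self.ring:
            if point >= h:
                return name
        return self.ring[0][1]  # wrap around the ring

    def put(self, key, value):
        # put is just lookup followed by a local store on the responsible node.
        self.stores[self.lookup(key)][key] = value

    def get(self, key):
        # get is just lookup followed by a local read on the responsible node.
        return self.stores[self.lookup(key)].get(key)
```

With three nodes, `dht = ToyDHT(["node-a", "node-b", "node-c"])`, a call to `dht.put("song.mp3", "peer-17")` followed by `dht.get("song.mp3")` returns the stored value, and both calls reach the same node because they share the same lookup.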

DHTs were designed in the Internet style: scalability and ease of deployment triumph over strict semantics. In particular, DHTs are self-organizing, requiring no centralized authority or manual configuration. They are robust against node failures and easily accommodate new nodes. Most importantly, they are scalable in the sense that both latency (in terms of the number of hops per lookup) and the local state required typically grow logarithmically in the number of nodes; this is crucial since many of the envisioned scenarios for DHTs involve extremely large systems (such as P2P music file sharing). However, DHTs, like the Internet, deliver "best-effort" semantics; puts and gets are likely to succeed, but the system provides no guarantees. As observed by others [36, 5], this conflict between scalability and strict semantics appears to be inevitable and, for many large-scale Internet systems, the former is deemed more important than the latter.

While DHTs have enjoyed some success as a building block for Internet-scale applications, they are seriously deficient in one regard: they only directly support exact match queries. Keyword queries can be derived from these exact match queries in a straightforward but inefficient manner; see [25, 20] for applications of this to DHTs. Equality joins can also be supported within a DHT framework; see [15]. However, range queries, asking for all objects with values in a certain range, are particularly difficult to implement in DHTs. This is because DHTs use hashing to distribute keys uniformly and so cannot rely on any structural properties of the key space, such as an ordering among keys.
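The difficulty can be seen in a few lines. The sketch below is illustrative only: dht_id and naive_range_query are hypothetical names, SHA-1 is an arbitrary choice of hash, and a plain dictionary stands in for the DHT. Consecutive keys hash to unrelated identifiers, so without further structure a range query falls back to one exact-match probe per candidate key in the range.

```python
import hashlib


def dht_id(key: int) -> int:
    """Hash an integer key to a 160-bit identifier (SHA-1 is an arbitrary choice)."""
    digest = hashlib.sha1(str(key).encode()).digest()
    return int.from_bytes(digest, "big")


def naive_range_query(store, lo, hi):
    """Probe every candidate key in [lo, hi] with an exact-match lookup.

    `store` maps hashed identifiers to values and stands in for the DHT.
    The cost grows with the width of the range, not with the number of hits.
    """
    hits = []
    for candidate in range(lo, hi + 1):
        value = store.get(dht_id(candidate))
        if value is not None:
            hits.append(value)
    return hits


# Ten consecutive keys: their identifiers carry no trace of the original order.
keys = list(range(100, 110))
ids = [dht_id(k) for k in keys]
print(ids == sorted(ids))
```

A query over a range of width 1,000,000 would issue 1,000,000 probes even if only two objects fall inside it, which is the inefficiency an ordered index is meant to avoid.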

Figure 1: Prefix Hash Tree. A binary trie over 6-bit keys. Leaf nodes: 000*, 00100*, 001010*, 001011*, 0011*, 01*, 10*, 110*, 111*. Keys: 000001, 000100, 001001, 001010, 001011, 010000, 010101, 100010, 101011, 101111, 110000, 110010, 110011, 110110, 111000, 111010.

Range queries arise quite naturally in a number of potential application domains:

Databases: Peer-to-peer databases [15] need to support SQL-type relational queries in a distributed fashion. Range predicates are a key component in SQL.

Distributed computing: Resource discovery requires locating resources within certain size ranges in a decentralized manner.

Location-aware computing: Many applications want to locate nearby resources (computing, human or commercial) based on a user's current location, which is essentially a 2-dimensional range query based on geographic coordinates.

Scientific computing: Parallel N-body computations [34] require 3-dimensional range queries for accurate approximations.

In this paper, we address the problem of efficiently supporting 1-dimensional range queries over a DHT. Our main contribution is a novel trie-based distributed data structure called Prefix Hash Tree (henceforth abbreviated as PHT) that supports such queries. As a corollary, the PHT can also support heap queries ("what is the maximum/minimum?"), proximity queries ("what is the nearest element to X?"), and, in a limited way, multi-dimensional analogues of the above, thereby greatly expanding the querying facilities of DHTs. PHT is efficient, in that updates are doubly logarithmic in the size of the domain being indexed. Moreover, PHT is self-organizing and load-balanced. PHT also tolerates failures well; while it cannot by itself protect against data loss when nodes go down¹, the failure of any given node in the Prefix Hash Tree does not affect the availability of data stored at other nodes.
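As a rough illustration of why a trie with prefix-labeled leaves makes range and minimum queries natural, the sketch below uses the leaf labels and 6-bit keys listed for Figure 1. It runs entirely in local memory and scans the leaf labels directly, so it is not the paper's distributed algorithm; the function names and the bucket dictionary are assumptions made for this example.

```python
D = 6  # key length in bits, matching the keys listed for Figure 1

LEAF_LABELS = ["000", "00100", "001010", "001011", "0011", "01", "10", "110", "111"]
KEYS = [
    "000001", "000100", "001001", "001010", "001011", "010000", "010101", "100010",
    "101011", "101111", "110000", "110010", "110011", "110110", "111000", "111010",
]


def leaf_for(key):
    """Return the unique leaf label that is a prefix of key."""
    return next(label for label in LEAF_LABELS if key.startswith(label))


def build_buckets(keys):
    """Group keys under their leaf label; each bucket is kept in sorted order."""
    buckets = {label: [] for label in LEAF_LABELS}
    for key in sorted(keys):
        buckets[leaf_for(key)].append(key)
    return buckets


def leaf_interval(label):
    """Smallest and largest D-bit integers whose binary form starts with label."""
    return int(label.ljust(D, "0"), 2), int(label.ljust(D, "1"), 2)


def range_query(buckets, lo, hi):
    """Collect keys in [lo, hi], visiting only leaves whose interval overlaps it."""
    hits = []
    # The labels are prefix-free, so lexicographic order equals numeric order.
    for label in sorted(buckets):
        low, high = leaf_interval(label)
        if high < lo or low > hi:
            continue
        hits.extend(k for k in buckets[label] if lo <= int(k, 2) <= hi)
    return hits


def minimum(buckets):
    """Smallest stored key: first key of the leftmost non-empty leaf."""
    for label in sorted(buckets):
        if buckets[label]:
            return buckets[label][0]
    return None
```

For example, with `buckets = build_buckets(KEYS)`, the call `range_query(buckets, 0b001000, 0b010111)` touches only the leaves 00100, 001010, 001011, 0011 and 01 and skips the rest, because each leaf label determines a contiguous interval of the key space.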

But perhaps the most crucial property of PHT is that it is built entirely on top of the lookup interface, and thus can run over any DHT. That is, PHT uses only the lookup(key) operation common to all DHTs and does not, as in SkipGraph [1] and other such approaches, assume knowledge of nor require changes to the DHT topology or routing behavior. While designs that rely on such lower-layer knowledge and modifications are appropriate for contexts where the DHT is expressly deployed for the purpose of supporting range queries, we address the case where one must use a pre-existing DHT. This is particularly important if one wants to make use of publicly available DHT services, such as OpenHash [18].
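The following sketch shows one way a trie could live entirely behind a put/get interface, under assumptions that this excerpt does not spell out: every trie node is stored in the DHT under its prefix label, tagged as "leaf" or "internal", and the leaf for a key is found by binary search over the D + 1 possible prefix lengths. DictDHT, publish_trie and find_leaf are hypothetical names, and a real DHT would hash each label before routing. With a domain of size 2^D, the search needs on the order of log D probes, which is doubly logarithmic in the domain size.

```python
D = 6  # key length in bits, matching the keys listed for Figure 1

LEAF_LABELS = ["000", "00100", "001010", "001011", "0011", "01", "10", "110", "111"]


class DictDHT:
    """Stand-in for a DHT that exposes only put/get and counts get calls."""

    def __init__(self):
        self._table = {}
        self.gets = 0

    def put(self, key, value):
        self._table[key] = value

    def get(self, key):
        self.gets += 1
        return self._table.get(key)


def publish_trie(dht):
    """Store every leaf and every proper prefix of a leaf (internal node) by label."""
    for label in LEAF_LABELS:
        dht.put(label, "leaf")
        for length in range(len(label)):
            dht.put(label[:length], "internal")


def find_leaf(dht, key):
    """Binary search over prefix lengths 0..D using only dht.get."""
    lo, hi = 0, D
    while lo <= hi:
        mid = (lo + hi) // 2
        node = dht.get(key[:mid])
        if node == "leaf":
            return key[:mid]
        if node == "internal":
            lo = mid + 1   # the leaf lies deeper in the trie
        else:
            hi = mid - 1   # no such node: the prefix is longer than the leaf
    return None
```

After `publish_trie(dht)`, looking up the key 010101 probes the labels 010, 0 and 01 in that order and stops at the leaf 01. Nothing in find_leaf depends on how the DHT routes requests, which is the property the paragraph above emphasizes.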

The remainder of the paper is organized as follows. Section 2 describes the design of the PHT data structure. Section 3 presents the results of an experimental evaluation. Section 4 surveys related work and Section 5 concludes.

2. DATA STRUCTURE

This section describes the PHT data structure, along with related algorithms.

2.1 PHT Description

For the sake of simplicity, it is assumed that the domain being indexed is {0, 1}^D, i.e., binary strings

¹ But PHT can take advantage of any replication or other data-preserving technique employed by a DHT.
