opes.rar_OPES_同态加密资源-CSDN文库

共1个文件

pdf：1个

版权申诉

109 浏览量 2022-09-22 21:39:19 上传评论收藏 131KB RAR 举报

同态加密是密码学中的一个先进概念，它允许在加密数据上直接进行计算，而无需先解密。这种技术在云计算、数据隐私保护和分布式计算等领域具有巨大的潜力。"opes.rar_OPES_同态加密"这个压缩包文件，很可能是包含了一篇关于OPES（Order Preserving Encryption Scheme，有序保持加密方案）的详细研究报告或学术论文，OPES是同态加密的一种变体，主要设计用于在保持数据排序性的前提下进行加密。同态加密的核心思想是，对明文执行的任何操作，都可以转化为对密文的操作，然后将结果解密后得到与原操作相同的结果。这样，即使数据在传输或存储时被加密，服务提供商也能在不解密的情况下处理这些数据，从而保证了数据的隐私性。 OPES作为一种特殊类型的同态加密，它的重点在于保持数据的排序性。这意味着即使数据被加密，我们仍然可以根据加密后的值进行比较和排序，这对于数据库查询优化尤其有用。例如，在云存储中，用户可以加密他们的数据并上传到云端，然后在不暴露原始数据的情况下，让云服务器执行搜索、排序等操作。同态加密的实现通常分为全同态加密（Fully Homomorphic Encryption, FHE）和部分同态加密（Partial Homomorphic Encryption, PHE）。OPES属于PHE，因为它只支持有限类型的操作，比如比较和排序。FHE则更加强大，它可以支持任意复杂的计算，但实现起来也更为复杂，目前仍处于研究阶段。 OPES的实现通常基于一些特定的加密算法，如Paillier公钥加密系统，这种系统能够对加密数值进行加法和乘法操作，同时保持排序性。然而，同态加密往往面临着效率低下的问题，因为加密和解密过程通常比传统加密算法更耗时，且计算复杂度更高。在实际应用中，OPES可能用于诸如隐私保护的搜索引擎、匿名数据共享和分布式计算等场景。例如，医疗记录可以在保持隐私的前提下进行排序和检索，而金融交易数据可以加密存储，但仍能进行合法性和合规性的检查。通过阅读"opes.pdf"这篇文档，我们可以深入了解OPES的工作原理、安全性分析、性能评估以及可能的应用场景。它可能会涵盖OPES与其他同态加密方案的对比，以及如何在实际系统中实现和优化OPES。对于理解和应用同态加密技术，这份资料无疑是非常有价值的。

资源推荐

资源详情

资源评论

收起资源包目录

opes.rar （1个子文件）

opes.pdf 160KB

Order Preserving Encryption for Numeric Data

Rakesh Agrawal Jerry Kiernan Ramakrishnan Srikant Yirong Xu

IBM Almaden Research Center

650 Harry Road, San Jose, CA 95120

ABSTRACT

Encryption is a well established technology for protecting sensi-

tive data. However, once encrypted, data can no longer be easily

queried aside from exact matches. We present an order-preserving

encryptionschemefor numeric data that allows any comparison op-

eration to be directly applied on encrypted data. Query results pro-

duced are sound (no false hits) and complete (no false drops). Our

scheme handles updates gracefully and new values can be added

without requiring changes in the encryption of other values. It al-

lows standard database indexes to be built over encrypted tables

and can easily be integrated with existing database systems. The

proposed scheme has been designed to be deployed in application

environments in which the intruder can get access to the encrypted

database, but does not have prior domain information such as the

distribution of values and cannot encrypt or decrypt arbitrary val-

ues of his choice. The encryption is robust againstestimation of the

true value in such environments.

1. INTRODUCTION

Database systems typically offer access control as the means to

restrict access to sensitive data. This mechanism protects the pri-

vacyof sensitive information provided data is accessedusingthe in-

tended database system interfaces. However, access control, while

important and necessary, is often insufﬁcient. Attacks upon com-

puter systems have shown that information can be compromised

if an unauthorized user simply gains access to the raw database

ﬁles, bypassing the database access control mechanism altogether.

For instance, a recent article published in the Toronto Star [14] de-

scribes an incident where a disk containing the records of several

hundred bank customers was being auctioned on eBay. The bank

had inadvertently sold the disk to the eBay re-seller as used equip-

ment without deleting its contents. Drawing upon privacy legisla-

tions and guidelinesworldwide, Hippocraticdatabasesalsoidentify

the protection of personal data from unauthorized acquisition as a

vital requirement [1].

Permission to make digital or hard copies of all or part of this work for

personal or classroom use is granted without fee provided that copies are

not made or distributed for proﬁt or commercial advantage, and that copies

bear this notice and the full citation on the ﬁrst page. To copy otherwise, to

republish, to post on servers or to redistribute to lists, requires prior speciﬁc

permission and/or a fee.

SIGMOD 2004 June 13-18, 2004, Paris, France.

:::

$5.00.

Encryption is a well established technology for protecting sensi-

tive data [7] [22] [24]. Unfortunately, the integration of existing

encryption techniques with database systems causes undesirable

performance degradation. For example, if a column of a table con-

taining sensitive information is encrypted, and is used in a query

predicate with a comparison operator, an entire table scan would

be needed to evaluate the query. The reason is that the current en-

cryption techniques do not preserve order and therefore database

indices such as B-tree can no longer be used. Thus the query exe-

cution over encrypted databases can become unacceptably slow.

We presentan encryption techniquecalledOPES(Order Preserv-

ing Encryption Scheme) that allows comparison operations to be

directly applied on encrypteddata,without decrypting theoperands.

Thus, equality and range queries as well as the MAX, MIN, and

COUNT queries can be directly processed over encrypted data.

Similarly, GROUP BY and ORDER BY operations can also be ap-

plied. Only when applying SUM or AVG to a group do the values

need to be decrypted. OPES is also endowed with the following

properties:



The results of query processing over data encrypted using

OPES are exact. They neither contain any false positives nor

miss any answer tuple. This feature of OPES sharply differ-

entiates it from schemes such as [13] that produce a superset

of answer, necessitating ﬁltering of extraneous tuples in a

rather expensive and complex post-processing step.



OPES handles updates gracefully. A value in a column can

be modiﬁed or a new value can be inserted in a column with-

out requiring changes in the encryption of other values.



OPES can easily be integrated with existing database sys-

tems as it has been designed to work with the existing index-

ing structures such as B-trees. The fact that the database is

encrypted can be made transparent to the applications.

Measurements from an implementation of OPES in DB2 show that

the time and space overhead of OPES are reasonable for it to be

deployed in real systems.

1.1 Estimation Exposure

The security of an encryption scheme is conventionally assessed

by analyzing whether an adversary can ﬁnd the key used for en-

cryption. See [22] [24] for a categorization of different levels of

attacks against a cryptosystem.

When dealing with sensitive numeric data, an adversary does

not have to determine the exact data value

corresponding to an

Figure 1: Transparentencryption in the “trusted databasesoft-

ware with vulnerable storage” setting.

encrypted value

; a breach may occur if the adversary succeeds

in obtaining a tight estimate of

. For a numeric domain

,if

an adversary can estimate with

conﬁdence that a data value

lies within the interval

[

] then the interval width

(

)

domain-width

(

)

deﬁnes the amount of estimation exposure

conﬁdence level.

Clearly, any order-preserving encryption scheme is vulnerable

to tight estimation exposure if the adversary can choose any num-

ber of unencrypted (encrypted) values of his liking and encrypt

(decrypt) them into their corresponding encrypted (unencrypted)

values. Similarly, any order-preserving encryption is not secure

against tight estimation exposure if the adversary canguess the do-

main and knows the distribution of values in that domain.

We consider an application environmentwhere the goal is safety

from an adversary who has access to all (but only) encryptedvalues

(the so called ciphertext only attack [22] [24]), and does not have

any special information about the domain. We will particularly fo-

cus on robustness against estimation exposure.

1.2 Threat Model

We assume (see Figure 1):



The storage system usedby the database softwareis vulnera-

ble to compromise. While current database systems typically

perform their own storage management, the storage system

remains part of the operating system. Attacks against storage

could be performed by accessing database ﬁles following a

path other than through the database software, or in the ex-

treme, by physical removal of the storage media.



The database software is trusted. We trust the database

software to transform query constants into their encrypted

values and decrypt the query results. Similarly, we assume

that an adversary does not have access to the values in the

memory of the database software.



All disk-resident data is encrypted. In addition to the data

values, the database software also encrypts schema infor-

mation such as table and column names, metadata such as

column statistics, as well as values written to recovery logs.

Otherwise, an adversary may be able to use this information

to guess data distributions.

1.3 Pedagogical Assumptions and Notations

The focus of this paper is on developing order-preserving en-

cryption techniques for numeric values and assumes conventional

encryption [22] [24] for other data types as well as for encrypting

information such as schema names and metadata. We will some-

time refer to unencrypted data values as plaintext. Similarly, en-

crypted values will also be referred to as ciphertext.

We will assume that the databaseconsists of a single table, which

in turn consists of a single column. The domain of the column will

be initially assumed to be a subset of integer values,

[

min

max

)

The extension for real values is given later in the paper.

Assume the database

consists of a total of

plaintext values.

Out of these,

values are unique, which will be represented as

;::: ;p

The corresponding encrypted

values will be represented as

;::: ;c

Duplicates can sometimes be used to guess the distribution of a

domain, particularly if the distribution is highly skewed. A closely

related problem is that if the number of distinct values is small (e.g.,

day of the month), it is easy to guess the domain. We will ini-

tially assume that the domain to be encrypted either does not con-

tain many duplicates or contains a distribution that can withstand a

duplicate attack, and discuss the handling of duplicates later in the

paper.

1.4 Paper Layout

The rest of the paper is organized as follows. We ﬁrst discuss re-

lated work in Section 2. We give an overview of OPES in Section 3.

The next three sections give details of the three main phases of

OPES. We describe extensions to handle real values and duplicates

in Section 7. In Section 8, we study the quality of the encryption

produced by OPES and present performance measurements from a

DB2 implementation. We conclude with a summary and directions

for future work in Section 9.

2. RELATED WORK

Summation of Random Numbers A simple scheme has been

proposed in [3] that computes the encrypted value

of integer

,where

is the

th value generated by a se-

cure pseudo-random number generator

. Unfortunately, the cost

of making

calls to

for encrypting or decrypting

can be pro-

hibitive for large values of

A more serious problem is the vulnerability to estimation ex-

posure. Since the expected gap between two encrypted values is

proportional to the gap between the corresponding plaintext val-

ues, the nature of the plaintext distribution can be inferred from the

encrypted values. Figure 2 showsthe distributions of encryptedval-

ues obtained using this scheme for data values sampled from two

different distributions: Uniform and Gaussian. In each case, once

both the input and encrypted distributions are scaled to be between

0 and 1, the number of points in each bucket is almost identical for

the plaintext and encrypted distributions. Thus the percentile of a

point in the encrypted distribution is also identical to its percentile

in the plaintext distribution.

(a) Input: Uniform

100

150

200

250

300

0 1

Number of points

Scaled Domain

Original

Encrypted

(b) Input: Gaussian

100

200

300

400

500

600

700

0 1

Number of points

Scaled Domain

Original

Encrypted

Figure 2: Summation of random numbers: Distribution of en-

crypted values tracks the input distribution.

Polynomial Functions In [12], a sequence of strictly increasing

polynomial functions is used for encrypting integer values while

preserving their order. These polynomial functions can simply be

of the ﬁrst or second order, with coefﬁcients generated from the en-

cryption key. An integer value is encrypted by applying the func-

tions in such a way that the output of a function becomes the input

of the next function. Correspondingly, an encrypted value is de-

crypted by solving these functions in reverse order. However, this

encryption methoddoes not take the input distribution into account.

Therefore the shape of the distribution of encrypted values depends

on the shape of the input distribution, as shown in Figure 3 for the

encryption function given in Example 10 in [12]. This illustration

suggests that this scheme may reveal information about the input

distribution, which can be exploited.

Bucketing In [13], tuples are encrypted using conventional en-

cryption, but an additional bucket id is created for each attribute

value. This bucket id, which represents the partition to which the

unencrypted value belongs, can be indexed. The constants ap-

pearing in a query are replaced by their corresponding bucket ids.

Clearly, the result of a query will contain false hits that must be

removed in a post-processing step after decrypting the tuples re-

turned by the query. This ﬁltering can be quite complex since the

bucket ids may have been used in joins, subqueries, etc. The num-

ber of false hits dependson the width of the partitions involved. It is

shown in [13] that the post-processingoverhead can become exces-

sive if a coarse partitioning is used for bucketization. On the other

hand, a ﬁne partitioning makes the scheme vulnerable to estimation

exposure, particularly if an equi-width partitioning is used.

It has been pointed out in [6] that the indexes proposed in [13]

can open the door to interference and linking attacks. Instead, they

(a) Input: Uniform

500

1000

1500

2000

2500

0 1

Number of points

Scaled Domain

Original

Encrypted

(b) Input: Gaussian

500

1000

1500

2000

2500

0 1

Number of points

Scaled Domain

Original

Encrypted

Figure 3: Polynomial functions: Encryption of different input

distributions look different.

build a B-tree over plaintext values, but then encrypt every tuple

and the B-tree at the node level using conventionalencryption. The

advantageof this approachis that the contentof B-tree is not visible

to an untrusted database server. The disadvantageis that the B-tree

traversal can now be performed by the front-end only by execut-

ing a sequence of queries that retrieve tree nodes at progressively

deeper level.

Other RelevantWork Rivest et al. [21] suggest that the limit on

manipulating encrypted data arises from the choice of encryption

functions used, and there exist encryption functions that permit en-

crypted data to be operated on directly for many sets of interesting

operations. They call these functions “privacy homomorphisms”.

The focus of [21] and the subsequent follow-up work [2] [8] [9]

has been on designing privacy homomorphisms to enable arith-

metic on encrypted data, but the comparison operations were not

investigated in this line of research.

In [10], a simple but effective scheme has been proposed to en-

crypt a look-up directory consisting of (key, value) pairs. The goal

is to allow the corresponding value to be retrieved if and only if

a valid key is provided. The essential idea is to encrypt complete

tuples, but associate with every tuple the one-way hash value of its

key. Thus, no tuple will be retrieved if an invalid key is presented.

Answering range queries was not a goal of this system.

In [23], interesting schemes are proposed to support keyword

searches over an encrypted text repository. The driving application

for this work is the efﬁcient retrieval of encrypted email messages.

Naturally, they do not discuss relational queries and it is not clear

how their techniques can be adapted for relational databases.

In [4], a smart card with encryption and query processing ca-

pabilities is used to ensure the authorized and secure retrieval of

encrypted data stored on untrusted servers. Encryption keys are

(a) Input: Uniform, Target: Zipf

500

1000

1500

2000

2500

0 1

Number of points

Scaled Domain

Original

Target

Encrypted

(b) Input: Gaussian, Target: Zipf

500

1000

1500

2000

2500

0 1

Number of points

Scaled Domain

Original

Target

Encrypted

Figure 4: Illustrating OPES.

maintained on the smart card. The smart card can translate exact

match queries into equivalent queries over encrypted data. How-

ever, range queries require creating a disjunction for every pos-

sible value in the range, which is infeasible for real data values.

The smart card implementation could beneﬁt from our encryption

scheme in that range queries could be translated into equivalent

queries over encrypted data.

In [25], the security and tamper resistance of a database stored

on a smart card is explored. They consider snooping attacks for

secrecy, and spooﬁng,splicing, and replay attacks for tamper resis-

tance. Retrieval performance is not the focus of their work and it

is not clear how much of their techniques apply to general purpose

databases not stored in specialized devices.

Amongstcommercial database products,Oracle 8i allows values

in any of the columns of a table to be encrypted [18]. However,

the encrypted column can no longer participate in indexing as the

encryption is not order-preserving.

Related work also includes researchon order-preserving hashing

[5] [11]. However, protecting the hash values from cryptanalysis is

not the concern of this body of work. Similarly, the construction of

original values from the hash values is not required.

3. PROPOSED ORDER-PRESERVING EN-

CRYPTION SCHEME

The basic idea of OPES is to take as input a user-provided tar-

get distribution and transform the plaintext values in such a way

that the transformation preserves the order while the transformed

values follow the target distribution. Figure 4 shows the result of

running OPES with different input distributions and the same target

distribution. Notice that the distribution of encrypted values looks

identical in both 4(a) and 4(b), even though the input distributions

were very different.

3.1 Intuition

To understand the intuition behind OPES algorithm, consider the

following encryption scheme:

Generate

unique values from a user-speciﬁed target distri-

bution and sort them into a table

. The encrypted value

is then given by

[

]

.Thatis,the

th plaintext value in the

sorted list of

plaintext values is encrypted into the

th value in

the sorted list of

values obtained from the target distribution.

The decryption of

requires a lookup into a reverse map. Here

is the encryption key that must be kept secret.

Clearly, this scheme does not reveal any information about the

original values apart from the order, since the encrypted values

were generated solely from the user-speciﬁed target distribution,

without using any information from the original distribution. Even

if an adversary has all of the encrypted values, he cannot infer

from those values. By appropriately choosing target distribution,

the adversary can be forced to make large estimation errors.

This simple scheme, while instructive, has the following short-

comings for it to be used for encrypting large databases:



The size of encryption key is twice as large as the number of

unique values in the database.



Updates are problematic. When adding a new value

,where

<p<p

, we will need to re-encrypt all

;j > i

OPES has been designedsuch that the result of encryption is sta-

tistically indistinguishable from the one obtained using the above

scheme,thereby providing the same level of security, while remov-

ing its shortcomings.

3.2 Overview of OPES

When encrypting a given database

, OPES makes use of all

the plaintext values currently present

andalsousesadatabase

of sampled values from the target distribution. Only the encrypted

database

is stored on disk. At the same time, OPES also creates

some auxiliary information

, which the database system uses to

decrypt encoded values or encrypt new values. Thus

serves the

function of the encryption key. This auxiliary information is kept

encrypted using conventionalencryption techniques.

OPES works in three stages:

1. Model: The input and target distributions are modeled as

piece-wise linear splines.

2. Flatten: The plaintext database

is transformed into a “ﬂat”

database

suchthatthe valuesin

are uniformly distributed.

It is possible to avoid immediate re-encryption by choosing an

encrypted value for

in the interval (

), but

would still

needupdating. Moreover, there might be cases where

and therefore inserting a new value will require re-encryption of

existing values.

Note that the encryption scheme

[

]

circumvents the up-

date problem. But now the size of the key becomes the size of the

domain. It is also vulnerable to percentile exposure, as discussed

earlier in Section 2.

If an installation is creating a new database, the database admin-

istrator can provide a sample of expected values.

评论收藏

内容反馈

版权申诉

alvarocfc

粉丝: 128
资源: 1万+

opes.rar_OPES_同态加密

基于同态加密的智能电网安全数据融合技术.rar

T112019-数据智能技术峰会-利用同态加密实现安全的数据交付-2019.12-22页.rar

Assignment 2.rar

复杂电子信息系统关键数据加密防护仿真.pdf

一种保序加密域数据库认证水印算法.pdf

数据库中的数据加密技术 (2006年)

MECC煤泥微乳捕收剂的制备与性能研究

yii2-magic-scopes:查询魔法范围的 Yii2 行为

辛基酚聚氧乙烯醚磺酸盐的合成与界面张力的测定 (2010年)

地铁通信技术方案比选分析.pdf

icap协议rfd3507

超滤膜分离纯化山麦冬多糖的研究 (2006年)

冰河的渗透实战笔记-冰河.pdf

stm32f103 adc采样+dma传输+fft处理 频率计_fft处理_stm32_ADCFFT_频率计_ADC采样_

ISO21434.pdf

Web安全漏洞扫描工具-AWVS14

J-LINK V10 V11固件.rar

CTF 竞赛入门指南（ctf-all-in-one）.pdf

Web中间件常见漏洞总结.pdf

jts-1.14.zip

DEAP2.1.zip_DEA2.1软件下载_dea 2.1软件下载_deap2.1_deap2.1基础模型_dea模型

数据结构与算法分析--C语言描述_数据结构与算法_

CobaltStrike4.4.zip

RK3568硬件设计资料.zip_C#

cisp-pte渗透测试资源下载 （考试环境+题库）

QT帮助文档_中文版_QT中文版帮助文档_

goby2021红队专版，1.8.255

pconline1478255959502.rar

最新资源

stm32f103 adc采样+dma传输+fft处理频率计_fft处理_stm32_ADCFFT_频率计_ADC采样_

cisp-pte渗透测试资源下载（考试环境+题库）