【免费】矩阵编码-MatrixEmbeddingforLargePayloads,1资源-CSDN文库

需积分: 0 112 浏览量 2022-08-04 12:33:23 上传评论收藏 225KB PDF 举报

资源详情

资源评论

资源推荐

Matrix Embedding for Large Payloads

Jessica Fridrich

and David Soukal

Department of Electrical and Computer Engineering;

Department of Computer Science;

SUNY Binghamton, Binghamton, NY 13902-6000, USA

ABSTRACT

Matrix embedding is a general coding method that can be applied to most steganographic schemes to improve

their embedding eﬃciency—the number of message bits embedded per one embedding change. Because smaller

number of embedding changes is less likely to disrupt statistic properties of the cover object, schemes that employ

matrix embedding generally have better steganographic security. This gain is more important for long messages

than for shorter ones because longer messages are easier to detect. Previously introduced approaches to matrix

embedding based on Hamming codes are, however, not eﬃcient for long messages. In this paper, we present

novel matrix embedding schemes that are eﬃcient for embedding messages close to the embedding capacity. One

is based on a family of codes constructed from simplex codes and the second one on random linear codes of

small dimension. The embedding eﬃciency of the proposed methods is evaluated with respect to theoretically

achievable bounds.

Keywords: steganography, covering codes, matrix embedding, simplex codes

1. INTRODUCTION

Statistical undetectability is the main requirement for a steganographic scheme. By undetectability, we under-

stand the inability of an attacker to distinguish between stego and cover objects with success rate better than

random guessing, given the knowledge of the embedding algorithm and the source of cover media. There are

four main factors that inﬂuence the steganographic security

1. Type of cover media

2. Method for selection of places within the cover that might be modiﬁed

3. The embedding operation

4. The number of embedding changes

If two diﬀerent embedding schemes share 1)–3), the one that introduces fewer embedding changes will be less

detectable because it is less likely to disturb the statistics of the cover to trigger detection.

Matrix embedding improves embedding eﬃciency—the expected number of random message bits embedded

with one embedding change. Matrix embedding was discovered by Crandall

in 1998 and analyzed by Bierbrauer.

It was also independently re-discovered by Willems et al.

and Galand et al.

Westfeld

was the ﬁrst one to

incorporate matrix embedding in his F5 algorithm. Intuitively, it is clear that the gain in embedding eﬃciency

is larger for short messages than for longer ones. However, improving the embedding eﬃciency for increasingly

shorter messages becomes progressively less important for the overall security because short messages are more

diﬃcult to detect than longer ones. Matrix embedding based on binary Hamming codes

is, however, far from

theoretically achievable bounds for payloads larger than 67% of embedding capacity.

In this paper, we attempt to remedy this situation and propose two coding methods that enable eﬃcient

matrix embedding for long messages. The ﬁrst method uses simplex codes and codes derived from them, while

Further author information:

J.F.: E-mail: fridrich@binghamton.edu, Telephone: +1 607 777 6177

the second method uses codes of small dimension with random generator matrix. Section 2 introduces the

necessary basic concepts from coding theory. The embedding mechanism of matrix embedding based on binary

Hamming codes is reviewed in Section 3. Relating covering codes with steganography enables us to derive upper

bounds on achievable embedding eﬃciency in Section 4. In Section 5, matrix embedding based on simplex codes

and random linear codes with small dimension is explained. The code designs and coding algorithms are supplied

with pseudo-codes to ease the code implementation for practitioners. The paper is concluded in Section 6.

2. BASIC CONCEPTS

In this section, we introduce a few elementary concepts from coding theory and some simple facts that will be

needed in the rest of the paper. A good introductory text to this subject is, for example, the book by Sloane et

al.

Throughout the text, boldface symbols denote vectors or matrices while the caligraphiccalligraphic font is

reserved for sets.

The space of all n-bit column vectors x = (x

, . . . , x

)

, where ()

denotes the matrix transpose, will be

denoted F

. A binary code C is any subset of F

. The vectors in C are called codewords. The set F

is a

linear vector space if we deﬁne the sum of two vectors x, y ∈ F

and a multiplication of a vector by scalar

a ∈ {0, 1} using the usual arithmetics in the ﬁnite ﬁeld GF(2) = {0, 1}. Note that in binary arithmetics, sum

is the same as diﬀerence. The Hamming weight w(x) of a vector x is deﬁned as the number of ones in x, i.e.,

w(x) = x

+ · · · + x

. The distance between two vectors x and y is deﬁned as d(x, y) = w(x − y). We denote

as B(x, r) the ball with center x ∈ F

and radius r,

B(x, r) = {y ∈ F

|d(x, y) ≤ r}.

The distance between x and subset C ⊂ F

is deﬁned as d(x, C) = min

c∈C

d(x, c) = d(c, c

) for some c

∈ C. The

covering radius R of C is deﬁned as

R = max

x∈F

d(x, C).

The covering radius is determined by the vector most distant from C. In this text, we will need one more concept

called the “average distance to code,”

= 2

−n

x∈F

d(x, C),

which is the average distance between a randomly selected vector from F

and C. It follows directly from the

deﬁnitions that R

≤ R.

For any subset C and vector x, x + C = {y ∈ F

|y = x + c, c ∈ C}. The redundancy r of a code C is deﬁned

as r = log

|C|

, where |C| is the cardinality of C.

Codes that form a linear vector subspace of F

are called linear codes. If the vector subspace C has dimension

k, we say that C is a linear code of length n and dimension k (and codimension n − k). We can also say that

C is an [n, k] code. Since there are 2

codewords in an [n, k] code, the redundancy of a linear code is equal to

its codimension r = n − k. Each [n, k] code has a basis consisting of k vectors. By writing the basis vectors as

rows of an k × n matrix G, we obtain a generator matrix of C. Each codeword can be written as a unique linear

combination of rows from G.

Given two vectors x, y ∈ F

, their dot product is deﬁned as x · y = x

+ · · · + x

, all operations

in GF(2). The vectors x and y are orthogonal if x · y = 0. The orthogonal complement of C is deﬁned as

⊥

= {x ∈ F

|x ·c = 0 for all c ∈ C}, which is an [n, n − k] code. It is called the dual code to C and its generator

matrix H has n − k rows and n columns. From orthogonality, Hx = 0 for each x ∈ C. The matrix H is called

the parity check matrix of C.

For any x ∈ F

, the vector s = Hx ∈ F

is called the syndrome of x. For each syndrome s ∈ F

n−k

, the set

C(s) = {x ∈ F

|Hx = s} is called a coset. Note that C(0) = C. It should be clear that cosets associated with

diﬀerent syndromes are disjoint. From elementary linear algebra, every coset can be written as C(s) = x + C,

where x ∈ C(s) is arbitrary. Therefore, there are total of 2

n−k

disjoint cosets, each consisting of 2

vectors. Any

member of the coset C(s) with the smallest Hamming weight is called a coset leader and will be denoted as e

(s).

The following two simple lemmas will be needed in the text.

Lemma 2.1. Given a coset C(s), for any x ∈ C(s), d(x, C) = w(e

(s)). Moreover, if d(x, C) = d(x, c

) for some

∈ C, the vector x − c

is a coset leader.

Proof. d(x, C) = min

c∈C

w(x − c) = min

y∈C(s)

w(y) = w(e

(s)). The second equality follows from the fact

that if c runs through C, x − c goes through all members of the coset C(s).

Lemma 2.2. If C is [n, k] with an (n − k) × n parity check matrix H and covering radius R, then any syndrome

s ∈ F

n−k

can be written as a sum of at most R columns of H and R is the smallest such number. Thus, the

covering radius can also be deﬁned as the maximal weight of all coset leaders while the average distance to code

is equal to the average weight of coset leaders.

Proof. Any x ∈ F

belongs to exactly one coset C(s). We know from Lemma 2.1 that d(x, C) = w(e

(s)).

But the weight w(e

(s)) is the smallest number of columns in H that must be added to obtain s.

Lemma 2.3. (Sphere-covering bound) For any code C ⊂ F

with covering radius R

|C| ≥

V (n, R)

, (1)

where V (n, R) is the volume of a ball of radius R in F

, V (n, R) =

i=0





. Moreover, for R < n/2,

log

V (n, R) ≤ nH(R/n), (2)

where H(x) = −x log

x − (1 − x) log

(1 − x) is the binary entropy function.

Proof. Each ball with radius R covers V (n, R) vectors. The balls with centers at codewords cover the whole

space but they may have non-empty intersection. Thus, we must have |C|V (n, R) ≥ 2

. The upper bound (2)

is a frequently used inequality in coding and its proof is not essential for understanding the rest of this paper.

The reader is referred to Lemma 2.4.4 in Ref. 7.

3. MATRIX EMBEDDING USING BINARY HAMMING CODES

In this section, we describe binary Hamming codes and explain how they can be used for matrix embedding.

Binary Hamming codes are [2

− 1, 2

− 1 − p] linear codes with parity check matrix H of dimensions p × (2

− 1)

whose columns are binary expansions of numbers 1, . . . , 2

−1. For example, the parity check matrix H for p = 3

H =







0 0 0 1 1 1 1

0 1 1 0 0 1 1

1 0 1 0 1 0 1







For any syndrome s ∈ F

, let dec(s) be the integer whose binary expansion is s. It is easy to see that for any

non-zero syndrome s, the vector e

(s) = (0, . . . , 0, 1, 0, . . . , 0)

with 1 at the dec(s)-th place is the leader of the

coset C(s) because He

(s) = s.

Let us assume that the cover object is an image consisting of N pixels. Most steganographic schemes assign

a bit to each possible pixel value, for example, as the LSB of the grayscale value. The embedding then usually

proceeds by changing the pixel values to match their assigned bits to the desired message bits. To do so, one

might for example ﬂip the LSB of the pixel grayscale value. Assuming the embedded message is a random

bit-stream, the probability that each pixel will have to be changed is 1/2. Thus, on average we embed 2 bits per

embedding change. We can also say that the scheme has embedding eﬃciency of 2.

To improve the embedding eﬃciency using matrix embedding, we divide the cover image into N/n subsets,

each consisting of n pixels, where n is the length of an appropriately chosen code. For matrix embedding using

the binary Hamming code, n = 2

−1. We now show that we can embed p message bits in each subset by making

at most one embedding change.

剩余11页未读，继续阅读

评论收藏

内容反馈

鲸阮

粉丝: 19
资源: 303

矩阵编码-Matrix Embedding for Large Payloads,1

评论0

最新资源

矩阵编码-Matrix Embedding for Large Payloads,1

评论0

Modular-Matrix-Inverse-Java:这将对用 Java 编码的矩阵进行模块化逆运算，这在大多数情况下有助于密码学

论文研究-Matrix Embedding Based on Trellis Structure of Linear Block Codes.pdf

Adversarial Attribute-Text Embedding for Person Search with Natural Language Que

Deep Learning via Semi-Supervised Embedding.pdf

经过处理的腾讯中文词汇/短语向量 tencent-ailab-embedding-zh-d200-v0.2.0-s

Survey of Visual-Semantic Embedding Methods for Zero-Shot Imag

Survey of Visual-Semantic Embedding Methods for Zero-Shot Im

Billion-scale Commodity Embedding for E-commerce Recommendation in Alibaba.pdf

Low-Rank Embedding for Robust Image Feature Extraction

In-Situ_De-embedding.pdf

用于多维缩放的高性能样本外嵌入技术_High Performance Out-of-sample Embedding Techn

Neural Word Embedding as Implicit Matrix Factorization (5477-neural-word-embedding-as-implicit-matrix-factorization)-计算机科学

孔繁爽__LENA Locality-Expanded Neural Embedding for Knowledge Base

孔繁爽__LENA_Locality-Expanded Neural Embedding for Knowledge Base

spark-face-embedding-源码.rar

Chinese-Text-Classification-Pytorch-mas

Schroff 等。 - 2015 - FaceNet A Unified Embedding for Face Recognition .pdf

2019-UNSUPERVISED INDUCTIVE WHOLE-GRAPH EMBEDDING BY PRESERVING

A Structured Self-attentive Sentence Embedding

ACL2020---基于Knowledge-Embedding的多跳知识图谱问答.rar

BurpLoaderKeygen.jar.zip

最新版ISO/IEC 27001:2022、ISO 27002:2022中英文合集

Goby红队版-win-x64-2.4.7版本

Chrome Header Editor 插件

ISO SAE 21434-2021 中文版.pdf

OpenVAS GVM 中文翻译补丁

安全认证cisp教材全套

STM32F103C8T6核心板-电路原理图1.PDF

最新资源