1. Introduction
Three-dimensional (3D) video can provide viewers with a high-quality and immersive multimedia
experience, and it has drawn increasing attention from both industry and academia [1]. Two
typical 3D applications have appeared in the form of three-dimensional television (3DTV) [2] and
free-viewpoint television (FTV) [3]. In 3DTV applications, multiple views from different viewing angles
can be rendered to give depth perception of the scene, while in FTV applications, viewers can
interactively select arbitrary viewpoints within a certain range.
The basic format of 3D video is a multiview representation, usually captured simultaneously
by multiple cameras at slightly displaced positions [4]. However, as the number of views grows,
the huge amount of multiview video data poses great challenges for 3D applications, such as
data compression and transmission. To solve this problem, the multiview video plus depth (MVD)
format has emerged as an efficient data representation for 3D systems. Compared with a pure multiview
video format without depth information, the main advantage of the MVD format is that desired virtual
views at arbitrary viewpoint positions can be conveniently synthesized via the depth-image-based
rendering (DIBR) technique [5].
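To make the role of the depth map concrete, one common MVD convention (an assumption here, not a
formulation taken from [5]) maps each 8-bit depth value v back to metric depth Z between the near
and far clipping planes, and, for rectified cameras with focal length f and baseline b, shifts each
pixel by the resulting disparity d:

    Z = \left[ \frac{v}{255}\left( \frac{1}{Z_{\mathrm{near}}} - \frac{1}{Z_{\mathrm{far}}} \right)
        + \frac{1}{Z_{\mathrm{far}}} \right]^{-1}, \qquad d = \frac{f\,b}{Z}.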
Depth images represent the distance between the camera and the objects in the scene. They are
often treated as grey-scale image sequences, similar to the luminance component of texture video.
However, unlike texture video, the depth image has its own special characteristics. First, the depth
image signal is much sparser than texture video under certain transform bases, such as the Discrete
Cosine Transform (DCT) or the Discrete Wavelet Transform (DWT). It contains no texture but sharp
object boundaries, since the grey levels are nearly constant within an object and change abruptly
across object boundaries. Furthermore, the depth image is not used directly for display; rather, it
plays an important role in virtual view synthesis. Distortion of the depth data, especially around
object boundaries, seriously degrades the quality of the rendered virtual views [6]. Therefore, how
to exploit the characteristics of depth images for efficient compression is an essential problem
in 3D systems.
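The sparsity claim is easy to check numerically. The following sketch is ours, not the authors':
the block size, threshold, and synthetic blocks are illustrative assumptions. It counts the
significant 2-D DCT coefficients of a piecewise-constant, depth-like block and of a noisy,
texture-like block:

    import numpy as np
    from scipy.fft import dctn

    rng = np.random.default_rng(0)
    B = 8  # block size

    # Depth-like block: piecewise constant with one sharp object boundary.
    depth = np.full((B, B), 50.0)
    depth[:, B // 2:] = 200.0

    # Texture-like block: a crude stand-in with fine random detail.
    texture = 128.0 + 40.0 * rng.standard_normal((B, B))

    def significant(block, tau=1.0):
        """Count 2-D DCT coefficients whose magnitude exceeds tau."""
        return int((np.abs(dctn(block, norm="ortho")) > tau).sum())

    print("depth block:  ", significant(depth), "of", B * B)
    print("texture block:", significant(texture), "of", B * B)

Only a handful of coefficients in the first DCT row survive for the step-edge block, while the
texture-like block spreads its energy over nearly all 64 coefficients.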
In view of the sparsity of depth images, we attempt to apply compressive sensing (CS) [7] to
represent depth information efficiently. CS is a new method for capturing and representing
compressible signals at a rate significantly below the conventional Shannon/Nyquist rate. According
to the Shannon/Nyquist sampling theorem, a signal must be sampled at least twice as fast as its
bandwidth to avoid losing information. Owing to its much lower sampling rate, CS avoids the heavy
burden of data storage and processing placed on a conventional encoder.
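In the standard CS formulation (notation assumed here, since the symbols are not introduced in this
section), a length-N signal x with sparse transform coefficients s is acquired through M \ll N
linear measurements and recovered by l1 minimization:

    s = \Psi^{T} x, \qquad y = \Phi s, \qquad \Phi \in \mathbb{R}^{M \times N}, \; M \ll N,
    \hat{s} = \arg\min_{s} \| s \|_{1} \quad \text{subject to} \quad y = \Phi s.

Recovering \hat{s} from only M measurements is what allows CS to sample below the Nyquist rate.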
In recent years, CS has been applied to image compression; the basic framework is shown in Figure 1.
At the encoder, the input image is processed block by block. For each block, a sparse transform,
such as the DCT or DWT, is used to produce coefficients with sparse characteristics. Compressive
sensing is then employed to encode the transform coefficients, generating the same number of
measurements for each block. At the decoder, a convex optimization method, such as the log-barrier
or method of multipliers [8], can be adopted for CS recovery. Finally, the corresponding inverse
transform is used to reconstruct the image.
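The following is a minimal end-to-end sketch of this framework, ours rather than the authors'
implementation: the block size, measurement count, and Gaussian measurement matrix are assumptions,
and cvxpy's generic basis-pursuit solver stands in for the log-barrier/multiplier recovery of [8].

    import numpy as np
    import cvxpy as cp
    from scipy.fft import dctn, idctn

    rng = np.random.default_rng(0)
    B = 8            # block size
    N = B * B        # transform coefficients per block
    M = 32           # CS measurements per block (M < N)

    # One random Gaussian measurement matrix, shared by all blocks.
    Phi = rng.standard_normal((M, N)) / np.sqrt(M)

    def encode_block(block):
        """Sparse transform (2-D DCT) followed by CS measurement."""
        s = dctn(block, norm="ortho").ravel()
        return Phi @ s

    def decode_block(y):
        """Basis pursuit: l1-minimal coefficients consistent with y."""
        s = cp.Variable(N)
        cp.Problem(cp.Minimize(cp.norm1(s)), [Phi @ s == y]).solve()
        return idctn(s.value.reshape(B, B), norm="ortho")

    # Depth-like test block: constant regions with a sharp boundary.
    block = np.full((B, B), 50.0)
    block[:, B // 2:] = 200.0
    recon = decode_block(encode_block(block))
    print("max abs error:", np.abs(recon - block).max())

Because such a block has only a few non-negligible DCT coefficients, M = 32 measurements should
recover it almost exactly; a texture block of the same size would generally need far more.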
Block compressed sensing of natural images has been proposed in which the same measurement matrix
is used for every block; it is claimed that this suffices to capture the complicated geometric
structures of natural images [9]. A new image/video coding approach is proposed, which can