Maggioni_Efficient_Multi-Stage_Video_CVPR_2021_supplemental.pdf
Efficient Multi-Stage Video Denoising with Recurrent Spatio-Temporal Fusion
Supplementary Materials
Matteo Maggioni*, Yibin Huang*, Cheng Li*, Shuai Xiao, Zhongqian Fu, Fenglong Song
Huawei Noah’s Ark Lab
{matteo.maggioni, huangyibin1, licheng89, xiaoshuai7, fuzhongqian, songfenglong}@huawei.com
1. Implementation Aspects
1.1. Learnable Invertible Transforms
Color Transform. The C × C color transform matrix is analogous to a YUV transformation in the RGB domain. A YUV transform matrix has size C = 3; however, the proposed model is designed for raw data, so in our case the matrix has size C = 4, in order to transform each color in the CFA Bayer pattern (e.g., RG₁G₂B). Practically, the matrix is defined as [2]
M =
\begin{bmatrix}
 0.5    &  0.5    &  0.5    &  0.5    \\
-0.5    &  0.5    &  0.5    & -0.5    \\
 0.65   &  0.2784 & -0.2784 & -0.65   \\
-0.2784 &  0.65   & -0.65   &  0.2784
\end{bmatrix}
=
\begin{bmatrix} Y \\ U \\ V \\ W \end{bmatrix}
\quad (1)
where each row has unit norm and corresponds to a different
color transform basis. The luminance component Y can be
easily recognized in the first row of (1), and unsurprisingly
it corresponds to an (energy-preserving) average of the four
input color channels. In our context, the matrix M will be
used to initialize the 1 × 1 × C × C kernel of a (point-wise)
convolutional layer.
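As a concrete illustration, the matrix in (1) can be applied as a point-wise transform over the four packed Bayer channels; the NumPy sketch below uses a per-pixel `einsum` as a stand-in for the 1 × 1 convolutional layer, and verifies that the rows of M are (approximately) orthonormal, so the transpose inverts the transform.

```python
import numpy as np

# Color transform matrix M from Eq. (1): rows are the Y, U, V, W bases.
M = np.array([
    [ 0.5,     0.5,     0.5,     0.5   ],
    [-0.5,     0.5,     0.5,    -0.5   ],
    [ 0.65,    0.2784, -0.2784, -0.65  ],
    [-0.2784,  0.65,   -0.65,    0.2784],
])

# Each row has (approximately) unit norm and the rows are mutually
# orthogonal, so M is close to orthonormal and M^T inverts it.
assert np.allclose(np.linalg.norm(M, axis=1), 1.0, atol=1e-4)
assert np.allclose(M @ M.T, np.eye(4), atol=1e-4)

# A packed RG1G2B raw frame: H x W spatial sites, 4 color channels.
rng = np.random.default_rng(0)
raw = rng.random((32, 32, 4))

# Point-wise (1x1) convolution == per-pixel matrix multiply.
yuvw = np.einsum('hwc,dc->hwd', raw, M)

# Inverse transform via the transpose; the round-trip recovers the input.
back = np.einsum('hwd,cd->hwc', yuvw, M.T)
assert np.allclose(back, raw, atol=1e-3)
```

In the model itself, M only initializes the 1 × 1 × C × C kernel; after training, the learned matrix may drift from orthonormality unless invertibility is otherwise enforced.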
Frequency Transform. As initialization values for our learnable frequency transform we use filters obtained from standard wavelet families. In fact, each wavelet type has a pair of decomposition filters, a low-pass ψ_L and a high-pass ψ_H, as well as a complementary pair of reconstruction filters, again a low-pass φ_L and a high-pass φ_H. These are all real 1-D filters of size 1 × n, where n ∈ ℕ⁺ is an even integer. We use these filters to generate the corresponding n × n convolutional kernels. For example, the 2-D LL decomposition kernel is obtained as ψ_L ⊗ ψ_L, where ⊗ denotes the outer product. We show all components involved in the learning and application of the frequency transform in Fig. 1.
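For instance, taking the Haar family (an assumption for illustration; the initialization is not tied to one specific wavelet), the four 2-D kernels, the strided-convolution decomposition, and the transposed-convolution reconstruction can be sketched as:

```python
import numpy as np

# Haar decomposition filters (n = 2): low-pass and high-pass.
s = 1.0 / np.sqrt(2.0)
psi_L = np.array([s,  s])
psi_H = np.array([s, -s])

# 2-D kernels as outer products of the 1-D filters: LL, LH, HL, HH.
kernels = [np.outer(a, b) for a in (psi_L, psi_H) for b in (psi_L, psi_H)]

def dwt2(x):
    """Stride-2 convolution of x with the four 2x2 kernels (analysis)."""
    H, W = x.shape
    return [
        np.array([[np.sum(x[i:i+2, j:j+2] * k) for j in range(0, W, 2)]
                  for i in range(0, H, 2)])
        for k in kernels
    ]

def idwt2(subbands):
    """Transposed stride-2 convolution: scatter coefficients back (synthesis)."""
    h, w = subbands[0].shape
    x = np.zeros((2 * h, 2 * w))
    for band, k in zip(subbands, kernels):
        for i in range(h):
            for j in range(w):
                x[2*i:2*i+2, 2*j:2*j+2] += band[i, j] * k
    return x

# For orthonormal Haar the reconstruction filters equal the decomposition
# filters, so analysis followed by synthesis is the identity.
rng = np.random.default_rng(1)
x = rng.random((8, 8))
assert np.allclose(idwt2(dwt2(x)), x)
```

With learned filters the identity is no longer automatic, which is why the forward and inverse kernels are both derived from the filters being trained.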
[Figure 1 diagram: the forward and inverse wavelet filters are learned, expanded into forward and inverse 2-D kernels via the outer product, and applied as a strided convolution before the model and a transposed convolution after it in the frequency domain; their composition reduces to the identity.]
Figure 1: Frequency transform: convolutional kernels correspond to the outer product ⊗ of the learned filters.

1.2. Models
VBM4D. VBM4D [5] is a traditional algorithm originally designed to remove independent and identically distributed zero-mean Gaussian noise from grayscale or RGB video. However, in our experiments we apply VBM4D to sRGB videos generated by an ISP [8] applied to the noisy raw data. The noise is therefore neither independent, nor identically distributed, nor white. These are not ideal conditions for VBM4D, but we optimize its σ parameter, which controls the amount of denoising, to maximize the PSNR on the validation data: we simply perform a grid search to find the best σ for each ISO setting and each dataset.
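The per-ISO grid search can be sketched as follows; `denoise` is a hypothetical placeholder for running VBM4D at a given σ (not an actual VBM4D binding), and the toy shrinkage denoiser exists only to make the sketch runnable.

```python
import numpy as np

def psnr(clean, est, peak=1.0):
    """Peak signal-to-noise ratio in dB."""
    mse = np.mean((clean - est) ** 2)
    return 10.0 * np.log10(peak ** 2 / mse)

def grid_search_sigma(pairs, denoise, sigmas):
    """Pick the sigma maximizing mean PSNR over (noisy, clean) pairs."""
    best_sigma, best_psnr = None, -np.inf
    for sigma in sigmas:
        score = np.mean([psnr(c, denoise(n, sigma)) for n, c in pairs])
        if score > best_psnr:
            best_sigma, best_psnr = sigma, score
    return best_sigma, best_psnr

# Toy stand-in denoiser: shrink toward the frame mean with strength s.
rng = np.random.default_rng(2)
clean = rng.random((16, 16))
noisy = clean + 0.1 * rng.normal(size=clean.shape)
toy = lambda x, s: (1 - s) * x + s * np.mean(x)

sigma, score = grid_search_sigma([(noisy, clean)], toy, np.linspace(0, 1, 21))
```

Since the grid includes σ = 0 (the identity), the selected setting can never score below the unprocessed input; in practice each ISO level and dataset gets its own σ.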
FastDVDnet. We use the original FastDVDnet implementation provided by the authors [6]. FastDVDnet is designed for Gaussian noise removal and uses a uniform noise map, corresponding to the variance of the distribution, as an additional input to the network. Since we deal with signal-dependent noise, we replace the uniform map with the variance map computed according to the raw noise model defined in (2) of the main paper. In order to decrease model complexity, we reduce the number of channels. Specifically, in the 82.61 GFLOPs version we use 8 channels in the input layers, 16 channels in the highest-resolution scale, and 24 everywhere else; in the 22.16 GFLOPs version we use 8 channels everywhere.
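Assuming the common affine signal-dependent raw model σ²(y) = a·y + b as a stand-in for (2) of the main paper (whose exact parametrization is not reproduced in this supplement), the per-pixel variance map could be built and attached to the network input as:

```python
import numpy as np

def variance_map(raw, shot_gain, read_var):
    """Per-pixel noise variance under an affine (shot + read) model.

    raw: noisy raw frame, used as an estimate of the clean signal;
    shot_gain, read_var: hypothetical per-ISO calibration parameters.
    """
    return np.clip(shot_gain * raw + read_var, 0.0, None)

rng = np.random.default_rng(3)
frame = rng.random((4, 32, 32))          # 4 packed Bayer channels
var = variance_map(frame, 0.01, 1e-4)    # illustrative parameter values

# Concatenate as extra input channels, replacing the uniform Gaussian
# noise map that FastDVDnet normally receives.
net_input = np.concatenate([frame, var], axis=0)   # shape (8, 32, 32)
```

The clipping guards against negative variance estimates when the noisy frame is used in place of the unknown clean signal.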