when modelling the underlying distribution of
human conversations.
Generative Adversarial Nets (GANs) (Goodfellow et al., 2014; Chen et al., 2016) offer an effective architecture for jointly training a generative model and a discriminative classifier to generate sharp and realistic images. This architecture could potentially be applied to conversational response generation as well, to relieve the safe response problem: the generative part can be a Seq2Seq-based model that generates response utterances for given queries, and the discriminative part can evaluate the quality of the generated utterances along diverse dimensions against human-produced responses.
However, unlike in image generation, training such a GAN for text generation is not straightforward. The decoding phase of the Seq2Seq model usually involves sampling discrete words from the predicted distributions, which are then fed into the training of the discriminator. This sampling procedure is non-differentiable and therefore breaks back-propagation from the discriminator to the generator, as the sketch below illustrates.
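To see the problem concretely, here is a minimal PyTorch sketch (our illustration, not from any cited work) of a single decoding step: the sampled word id is an integer tensor detached from the computation graph, so the discriminator's signal never reaches the generator's parameters.

```python
# Minimal sketch of why sampling discrete words blocks gradients
# (our illustration; sizes and names are arbitrary).
import torch

vocab_size, emb_dim = 1000, 64
logits = torch.randn(1, vocab_size, requires_grad=True)  # decoder output
embedding = torch.nn.Embedding(vocab_size, emb_dim)

probs = torch.softmax(logits, dim=-1)
word_id = torch.multinomial(probs, num_samples=1)  # non-differentiable sampling
fake_word = embedding(word_id)                     # fed to the discriminator
loss = fake_word.sum()                             # stand-in for a discriminator loss
loss.backward()
print(logits.grad)  # None: no gradient flows back to the generator
```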
To the best of our knowledge, Reinforcement Learning (RL) was first introduced to address the above problem (Li et al., 2017; Yu et al., 2017): the score predicted by a discriminator is used as the reinforcement signal to train the generator, yielding a hybrid model of GAN and RL (sketched below). To train the RL phase, however, Li et al. (2017) introduced two approximations for computing the reward at each action (word) selection step: a Markov Chain Monte Carlo (MCMC) sampling method and a partial utterance scoring approach. They state that the former is time-consuming, while the latter lowers performance due to overfitting caused by adding a large number of partial utterances to the training set. We further argue that, beyond the time complexity of MCMC, RL itself is not an optimal choice either. As our experimental results in Section 5.1 show, a more elegant design of an end-to-end differentiable GAN can significantly increase the model's performance on this text generation task.
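For concreteness, the following is a hedged sketch of such a policy-gradient update, where the discriminator's score on a sampled utterance serves as the reward. The `generator.sample` interface and other names are hypothetical, and the per-step reward machinery of Li et al. (2017) (MCMC rollouts, partial-utterance scoring) is collapsed into a single utterance-level reward.

```python
# Hypothetical REINFORCE-style generator update for a GAN-RL hybrid.
# generator and discriminator are assumed interfaces, not the cited models.
import torch

def reinforce_step(generator, discriminator, query, optimizer):
    # Sample a response and keep the per-word log-probabilities.
    response_ids, log_probs = generator.sample(query)
    # The discriminator's scalar score acts as the reward (no gradient through it).
    reward = discriminator(query, response_ids).detach()
    # Policy gradient: maximize the expected reward.
    loss = -(log_probs.sum() * reward)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return reward.item()
```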
In this paper, we propose a novel variant of GAN for conversational response generation, which introduces an approximate embedding layer to replace the sampling-based decoding phase, such that the entire model is continuous and differentiable. Experiments on two datasets show that the proposed method significantly outperforms three representative existing approaches on both relevance- and diversity-oriented automatic metrics. Human evaluations further demonstrate the potential of the proposed model.
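To preview the idea, one way to make the decoding step differentiable is to feed the discriminator the probability-weighted average of word embeddings instead of a sampled word. The sketch below is a simplified illustration under that reading; the noise term and all names are our assumptions, not the exact layer defined in Section 3.

```python
# Simplified sketch of an approximate embedding step (our reading of the idea,
# not the paper's exact formulation).
import torch

def approximate_embedding(logits, embedding_matrix, noise_std=0.1):
    # logits: (batch, vocab); embedding_matrix: (vocab, emb_dim).
    # Noise on the logits mimics the stochasticity of sampling
    # (the noise term is an assumption of this sketch).
    noisy_logits = logits + noise_std * torch.randn_like(logits)
    probs = torch.softmax(noisy_logits, dim=-1)
    # Expected embedding: fully differentiable w.r.t. the logits.
    return probs @ embedding_matrix
```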
2 Related Work
Inspired by recent advances in Neural Machine
Translation (NMT), Ritter et al. (2011) and
Vinyals and Le (2015) have shown that single-
turn short-text conversations can be modelled as
a generative process trained using query-response
pairs accumulated on social networks. Earlier work focused only on paired word sequences, while Zhou et al. (2016) and Iulian et al. (2017) demonstrated that the comprehensibility of the generated responses can benefit from multi-view training over words, coarse tokens, and utterances. Moreover, Sordoni et al. (2015)
proposed a context-aware response generation
model that goes beyond single-turn conversations.
In addition, attention mechanisms were introduced into Seq2Seq-based models by Shang et al. (2015) and Chen et al. (2017) to capture topic and dialog focus information, which has proven helpful for improving query-response relevance (Wu et al., 2016). Additional features such as persona information (Li et al., 2016b) and latent semantics (Zhou et al., 2017; Serban et al., 2017) have also proven beneficial in this context.
Compared to previous work, this paper focuses on single-turn conversation modelling and employs a GAN to yield informative responses.
3 Building a Conversational Response
Generator via GAN
3.1 Notations
Let $\mathcal{D} = \{(q_i, r_i)\}_{i=1}^{N}$ be a set of $N$ single-turn human-human conversations, where $q_i = (w_{q_i,1}, \dots, w_{q_i,t}, \dots, w_{q_i,m})$ is a query, $r_i = (w_{r_i,1}, \dots, w_{r_i,t}, \dots, w_{r_i,n})$ is the response to $q_i$, and $w_{q_i,t}$ and $w_{r_i,t}$ denote the $t$-th words in $q_i$ and $r_i$, respectively. This paper aims to learn a generative model $G(r|q)$, guided by a discriminator $D$, that can predict informative responses with good diversity for arbitrary input queries.
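As a concrete (invented) illustration of this data format, each training instance is simply a tokenized query paired with a tokenized human response:

```python
# Toy rendering of the corpus {(q_i, r_i)}_{i=1}^N: tokenized query-response
# pairs (the example conversations are invented for illustration).
corpus = [
    (["how", "are", "you", "?"], ["i", "am", "fine", ",", "thanks", "."]),
    (["any", "plans", "tonight", "?"], ["maybe", "a", "movie", "."]),
]
for q_i, r_i in corpus:
    print(" ".join(q_i), "->", " ".join(r_i))
```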