Analogies Explained: Towards Understanding Word Embeddings
Carl Allen¹   Timothy Hospedales¹
Abstract
Word embeddings generated by neural network methods such as word2vec (W2V) are well known to exhibit seemingly linear behaviour, e.g. the embeddings of analogy "woman is to queen as man is to king" approximately describe a parallelogram. This property is particularly intriguing since the embeddings are not trained to achieve it. Several explanations have been proposed, but each introduces assumptions that do not hold in practice. We derive a probabilistically grounded definition of paraphrasing that we re-interpret as word transformation, a mathematical description of "w_x is to w_y". From these concepts we prove existence of linear relationships between W2V-type embeddings that underlie the analogical phenomenon, identifying explicit error terms.
1. Introduction
The vector representation, or embedding, of words underpins much of modern machine learning for natural language processing (e.g. Turney & Pantel (2010)). Where, previously, embeddings were generated explicitly from word statistics, neural network methods are now commonly used to generate neural embeddings that are of low dimension relative to the number of words represented, yet achieve impressive performance on downstream tasks (e.g. Turian et al. (2010); Socher et al. (2013)). Of these, word2vec² (W2V) (Mikolov et al., 2013a) and Glove (Pennington et al., 2014) are amongst the best known and on which we focus.
Interestingly, such embeddings exhibit seemingly linear behaviour (Mikolov et al., 2013b; Levy & Goldberg, 2014a), e.g. the respective embeddings of analogies, or word relationships of the form "w_a is to w_a* as w_b is to w_b*", often satisfy w_a* − w_a + w_b ≈ w_b*, where w_i is the embedding of word w_i. This enables analogical questions such as "man is to king as woman is to ..?" to be solved by vector addition and subtraction. Such high order structure is surprising since word embeddings are trained using only pairwise word co-occurrence data extracted from a text corpus.

¹School of Informatics, University of Edinburgh. Correspondence

²Throughout, we refer to the more commonly used Skipgram implementation of W2V with negative sampling (SGNS).

Proceedings of the 36th International Conference on Machine Learning, Long Beach, California, PMLR 97, 2019. Copyright 2019 by the author(s).
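The vector-offset procedure described above can be sketched with toy embeddings. This is a minimal illustration, not the paper's method: the 2-d vectors and the `solve_analogy` helper are hand-crafted assumptions chosen so that the parallelogram relation holds exactly, whereas learned embeddings satisfy it only approximately.

```python
import numpy as np

# Hand-crafted 2-d embeddings (dimensions loosely: "royalty", "gender").
# These are illustrative, not learned vectors.
emb = {
    "man":   np.array([0.0,  1.0]),
    "woman": np.array([0.0, -1.0]),
    "king":  np.array([1.0,  1.0]),
    "queen": np.array([1.0, -1.0]),
    "apple": np.array([-1.0, 0.2]),
}

def solve_analogy(w_a, w_a_star, w_b, embeddings):
    """Return the word w_b* whose embedding has highest cosine similarity
    to w_a* - w_a + w_b, excluding the three query words themselves."""
    target = embeddings[w_a_star] - embeddings[w_a] + embeddings[w_b]
    best, best_sim = None, -np.inf
    for word, vec in embeddings.items():
        if word in (w_a, w_a_star, w_b):
            continue
        sim = vec @ target / (np.linalg.norm(vec) * np.linalg.norm(target))
        if sim > best_sim:
            best, best_sim = word, sim
    return best

print(solve_analogy("man", "king", "woman", emb))  # → queen
```

Excluding the query words from the candidate set mirrors standard practice in analogy evaluation, since the nearest vector to w_a* − w_a + w_b is frequently w_a* itself.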
We first show that where embeddings factorise pointwise mutual information (PMI), it is paraphrasing that determines when a linear combination of embeddings equates to that of another word. We say king paraphrases man and royal, for example, if there is a semantic equivalence between king and {man, royal} combined. We can measure such equivalence with respect to probability distributions over nearby words, in line with Firth's maxim "You shall know a word by the company it keeps" (Firth, 1957). We then show that paraphrasing can be reinterpreted as word transformation with additive parameters (e.g. from man to king by adding royal) and generalise to also allow subtraction. Finally, we prove that by interpreting an analogy "w_a is to w_a* as w_b is to w_b*" as word transformations w_a to w_a* and w_b to w_b* sharing the same parameters, the linear relationship observed between word embeddings of analogies follows (see overview in Fig 4). Our key contributions are:
• to derive a probabilistic definition of paraphrasing and show that it governs the relationship between one (PMI-derived) word embedding and any sum of others;

• to show how paraphrasing can be generalised and interpreted as the transformation from one word to another, giving a mathematical formulation for "w_x is to w_x*";

• to provide the first rigorous proof of the linear relationship between word embeddings of analogies, including explicit, interpretable error terms; and

• to show how these relationships materialise between vectors of PMI values, and so too in word embeddings that factorise the PMI matrix, or approximate such a factorisation e.g. W2V and Glove.
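The PMI matrix referred to in the last contribution can be built directly from co-occurrence counts, and low-dimensional embeddings obtained by factorising it (the connection between SGNS and PMI factorisation is due to Levy & Goldberg, 2014). The sketch below uses a toy corpus, window size, and dimension chosen purely for illustration; it is not the construction used in the paper's proofs.

```python
import numpy as np

# Toy corpus; in practice co-occurrence statistics come from a large corpus.
corpus = "the king rules the land the queen rules the land".split()
window = 2

vocab = sorted(set(corpus))
idx = {w: i for i, w in enumerate(vocab)}

# Symmetric-window co-occurrence counts C[w, c].
C = np.zeros((len(vocab), len(vocab)))
for i, w in enumerate(corpus):
    for j in range(max(0, i - window), min(len(corpus), i + window + 1)):
        if i != j:
            C[idx[w], idx[corpus[j]]] += 1

total = C.sum()
p_wc = C / total
p_w = C.sum(axis=1) / total  # co-occurrence marginals
# PMI(w, c) = log [ p(w, c) / (p(w) p(c)) ]; unobserved cells left at 0 here.
with np.errstate(divide="ignore"):
    PMI = np.where(C > 0, np.log(p_wc / np.outer(p_w, p_w)), 0.0)

# A rank-d factorisation of the PMI matrix yields word and context
# embeddings whose dot products approximate PMI values.
d = 3
U, S, Vt = np.linalg.svd(PMI)
W = U[:, :d] * np.sqrt(S[:d])    # word embeddings, one row per word
Ctx = Vt[:d].T * np.sqrt(S[:d])  # context embeddings
```

Splitting the singular values evenly between the two factors (`sqrt(S)` on each side) is one common convention; the linear relationships discussed in the paper hold between rows of the PMI matrix itself, and so carry over to any embeddings that (approximately) factorise it.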
2. Previous Work
Intuition for the presence of linear analogical relationships, or linguistic regularity, amongst word embeddings was first suggested by Mikolov et al. (2013a;b) and Pennington et al. (2014), and has been widely discussed since (e.g. Levy & Goldberg (2014a); Linzen (2016)). More recently, several theoretical explanations have been proposed: