【免费】HumorinWordEmbeddings-CockamamieGobbledegookforNincompoops资源-CSDN文库

自然语言处理

需积分: 0 99 浏览量 2022-08-03 15:14:09 上传评论收藏 550KB PDF 举报

资源详情

资源评论

资源推荐

Humor in Word Embeddings:

Cockamamie Gobbledegook for Nincompoops

WARNING: This paper contains words that people rated humorous including many that are offensive in nature.

Limor Gultchin

Genevieve Patterson

Nancy Baym

Nathaniel Swinger

Adam Tauman Kalai

Abstract

While humor is often thought to be beyond the

reach of Natural Language Processing, we show

that several aspects of single-word humor corre-

late with simple linear directions in Word Embed-

dings. In particular: (a) the word vectors capture

multiple aspects discussed in humor theories from

various disciplines; (b) each individual’s sense of

humor can be represented by a vector, which can

predict differences in people’s senses of humor

on new, unrated, words; and (c) upon clustering

humor ratings of multiple demographic groups,

different humor preferences emerge across the dif-

ferent groups. Humor ratings are taken from the

work of Engelthaler and Hills (2017) as well as

from an original crowdsourcing study of 120,000

words. Our dataset further includes annotations

for the theoretically-motivated humor features we

identify.

1. Introduction

Detecting and generating humor is a notoriously difﬁcult

task for AI systems. While Natural Language Processing

(NLP) is making impressive advances in many frontiers

such as machine translation and question answering, NLP

progress on humor has been slow. This reﬂects the fact that

humans rarely agree upon what is humorous. Multiple types

of humor exist, and numerous theories were developed to

explain what makes something funny. Recent research sup-

porting the existence of single-word humor (Engelthaler &

Hills, 2017; Westbury et al., 2016) deﬁnes a more manage-

able scope to study with existing machine learning tools.

Word Embeddings (WEs) have been shown to capture nu-

merous properties of words (e.g., Mikolov et al., 2013a;b);

University of Oxford

TRASH

Microsoft Research

Lexington High School. Correspondence to: Limor Gultchin

<limor.gultchin@jesus.ox.ac.uk>.

Proceedings of the

International Conference on Machine

Learning, Long Beach, California, PMLR 97, 2019. Copyright

2019 by the author(s).

coupled with single-word humor as a possible research di-

rection, it is natural to study if and how WEs can capture

this type of humor. To assess the ability of WEs to explain

individual word humor, we draw on a long history of humor

theories and put them to the test.

To many readers, it may not be apparent that individual

words can be amusing in and of themselves, devoid of con-

text. However, Engelthaler & Hills (2017), henceforth re-

ferred to as EH, found some words consistently rated as

more humorous than others, through a crowdsourced study

of about ﬁve thousand nouns. We ﬁrst use their publicly

available 5k mean word-humor ratings to identify a “humor

vector,” i.e., a linear direction, in several WEs that correlate

(over

0.7

) with these 5k mean humor ratings. While these

correlations establish statistical signiﬁcance, little insight is

obtained into how the embeddings capture different aspects

of humor and differences between people’s senses of humor.

To complete this picture, we performed crowdsourcing stud-

ies to create additional datasets which we make publicly

available: (a) beginning with a set of 120k common words

and phrases chosen from a word embedding, a crowdsourc-

ing ﬁltering process yielded a set of 8,120 and a further set of

216 words

(in Appendix B) rated most humorous, (b) over

1,500 crowd workers rated these latter 216 words through

six-way comparisons each yielding a personal ﬁrst choice

out of dozens of other personally highly-ranked words, and

these sets) were each annotated by multiple workers accord-

ing to six humor features drawn from the aforementioned

theories of humor.

Our analysis suggests that individual-word humor indeed

possesses many aspects of humor that have been discussed

in general theories of humor, and that many of these as-

pects of humor are captured by WEs. For example, ‘incon-

gruity theory,’ which we discuss shortly, can be found in

words which juxtapose surprising combinations of words

These words, including gobbledegook and nincompoops, were

rated as more humorous than the words in the EH study, which

used common words from psychology experiments. The top-rated

EH words were booty, tit, booby, hooter, and nitwit.

本内容试读结束，登录后可阅读更多

下载后可阅读完整内容，剩余9页未读，立即下载

评论收藏

内容反馈

臭人鹏

粉丝: 24
资源: 328

Humor in Word Embeddings-Cockamamie Gobbledegook for Nincompoops

评论0

最新资源

Humor in Word Embeddings-Cockamamie Gobbledegook for Nincompoops

评论0

Mastering VBA for Microsoft Office 2007

Subtitling Humor Translation in Sitcoms Based on Skopostheorie T

Visual Knowledge Discovery and Machine Learning-Springer(2018).pdf

The Way of the Web Tester(Pragmatic,2016)

AAA and Network Security for Mobile Access.pdf

Pro LINQ: Language Integrated Query in C# 2010 （含源码）

Pro LINQ: Language Integrated Query in C# 2010

Apress.Pro.LINQ.Language.Integrated.Query.in.C.Sharp.2008.Nov.2007.eBook-BBL

matlab人脸匹配代码-humor_project:funny_project

The.Way.of.the.Web.Tester.A.Beginners.Guide.to.Automating.Tests

有关humor的ppt

Perl Best Practices: Standards and Styles for Developing Maintainable Code

Manning EJB3.0 in action

The Art of Computer Programming, Volume 4 Fascicle 2

The Reasoned Schemer, 2nd Edition

Design Patterns in PHP and Laravel [2017]

Code Like a Pythonista: Idiomatic Python.txt

新闻幽默「News Humor」-crx插件

BurpLoaderKeygen.jar.zip

最新版ISO/IEC 27001:2022、ISO 27002:2022中英文合集

Goby红队版-win-x64-2.4.7版本

Chrome Header Editor 插件

ISO SAE 21434-2021 中文版.pdf

OpenVAS GVM 中文翻译补丁

安全认证cisp教材全套

iNodeClient-MacOS-7.30(E0630).tar.gz

STM32F103C8T6核心板-电路原理图1.PDF

软件工程导论(第六版)课后习题答案1

最新资源