[Paper Reading] Rethinking the Role of Demonstrations: What Makes In-Context Learning Work?
"In-context Learning"(上下文学习)
是一种机器学习技术,旨在利用上下文信息来提高模型的学习效果。在这种方
法中,模型通过与特定环境或场景进行交互,从中获取信息并改进自身的表
现。这种学习方式尤其适用于自然语言处理等任务,因为语言通常是在特定的
语境中产生和理解的。通过在具体的上下文中进行学习,模型可以更好地理解
语言的含义和语境,并做出更准确的预测或响应。
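To make this concrete, here is a minimal sketch of how a few-shot prompt is typically assembled for a sentiment task. The template, field names, and examples are illustrative assumptions, not the paper's exact format.

```python
# Minimal sketch of in-context learning prompt construction (illustrative
# template, not the paper's exact format). k input-label pairs are placed
# before the test input, and the LM completes the final label with no
# parameter updates.

def build_icl_prompt(demonstrations, test_input):
    """Concatenate k demonstrations, then the unlabeled test input."""
    blocks = [f"Review: {x}\nSentiment: {y}" for x, y in demonstrations]
    blocks.append(f"Review: {test_input}\nSentiment:")
    return "\n\n".join(blocks)

demos = [
    ("A gripping, beautifully shot film.", "positive"),
    ("Dull plot and wooden acting.", "negative"),
]
print(build_icl_prompt(demos, "An unexpected delight from start to finish."))
```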
"Rethinking the Role of Demonstrations: What Makes In-Context Learning Work?" is by Sewon Min, Xinxi Lyu, Ari Holtzman, Mikel Artetxe, Mike Lewis, Hannaneh Hajishirzi, and Luke Zettlemoyer, from the University of Washington, Meta AI, and the Allen Institute for AI.
The paper investigates the mechanism behind in-context learning in large language models (LMs): how a model can infer a new task from a small number of input-label pairs (demonstrations) and predict outputs for new inputs.
The paper's core claims and findings include:
1. Effectiveness of in-context learning: large language models can perform new tasks through in-context learning even with only a few demonstrations, and this consistently outperforms zero-shot inference across a wide range of tasks.
2. Ground truth in demonstrations: the authors find that ground truth labels in the demonstrations are not required. Randomly replacing the labels barely affects performance on classification and multi-choice tasks, indicating that the model does not rely on the input-label mapping in the demonstrations to perform the task (see the sketch after this list).
3. Which aspects of demonstrations matter: further analysis shows that the few examples a demonstration provides are key to conveying the label space, the distribution of the input text, and the overall format of the sequence; together, these factors make in-context learning effective.
4. Effect of meta-training: models trained with a meta-training objective (such as MetaICL) lean even more on the format of the demonstrations rather than on the input-label mapping, suggesting that meta-training pushes models to exploit the simpler aspects of demonstrations.
5. Experimental setup: the authors run experiments on 12 different models, including the GPT-3 family, and evaluate on 26 datasets covering sentiment analysis, paraphrase detection, natural language inference, hate speech detection, question answering, and sentence completion.
6. Discussion and conclusions: the paper discusses learning at test time, language model capacity, the connection to instruction-following models, and the possibility of substantially improving zero-shot performance. The authors also note limitations, such as the influence of task types and datasets and the challenge of extending the experiments to generation tasks.
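As a rough illustration of the ablation in point 2, the sketch below replaces each demonstration's gold label with one drawn uniformly at random from the label space. Function and variable names are ours, not from the authors' code.

```python
import random

# Sketch of the core ablation: swap each gold label for a label sampled
# uniformly at random from the label space. Names are illustrative.

def randomize_labels(demonstrations, label_space, seed=0):
    """Return the demonstrations with labels resampled at random."""
    rng = random.Random(seed)
    return [(text, rng.choice(label_space)) for text, _ in demonstrations]

demos = [
    ("A gripping, beautifully shot film.", "positive"),
    ("Dull plot and wooden acting.", "negative"),
]
# The paper's finding: prompting with these corrupted demonstrations barely
# hurts classification and multi-choice accuracy.
print(randomize_labels(demos, ["positive", "negative"]))
```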
Overall, the paper offers a new understanding of how large language models perform in-context learning and points to directions for future research, particularly on how to better exploit these models for zero-shot learning.
[Tencent Docs]
Rethinking_the_Role_of_Demonstrations-_What_Makes_In-Context_Learning_Work_.pdf
https://docs.qq.com/pdf/DVmpzQnZlelFvYWhS
Rethinking the Role of Demonstrations:
What Makes In-Context Learning Work?

Sewon Min¹,²  Xinxi Lyu¹  Ari Holtzman¹  Mikel Artetxe²  Mike Lewis²  Hannaneh Hajishirzi¹,³  Luke Zettlemoyer¹,²
¹University of Washington  ²Meta AI  ³Allen Institute for AI
{sewon,alrope,ahai,hannaneh,lsz}@cs.washington.edu
{artetxe,mikelewis}@meta.com
Abstract

Large language models (LMs) are able to in-context learn: perform a new task via inference alone by conditioning on a few input-label pairs (demonstrations) and making predictions for new inputs. However, there has been little understanding of how the model learns and which aspects of the demonstrations contribute to end task performance. In this paper, we show that ground truth demonstrations are in fact not required: randomly replacing labels in the demonstrations barely hurts performance on a range of classification and multi-choice tasks, consistently over 12 different models including GPT-3. Instead, we find that other aspects of the demonstrations are the key drivers of end task performance, including the fact that they provide a few examples of (1) the label space, (2) the distribution of the input text, and (3) the overall format of the sequence. Together, our analysis provides a new way of understanding how and why in-context learning works, while opening up new questions about how much can be learned from large language models through inference alone.
1 Introduction

Large language models (LMs) have shown impressive performance on downstream tasks by simply conditioning on a few input-label pairs (demonstrations); this type of inference has been referred to as in-context learning (Brown et al., 2020). Despite in-context learning consistently outperforming zero-shot inference on a wide range of tasks (Zhao et al., 2021; Liu et al., 2021), there is little understanding of how it works and which aspects of the demonstrations contribute to end task performance.

In this paper, we show that ground truth demonstrations are in fact not required for effective in-context learning (Section 4). Specifically, replacing the labels in demonstrations with random labels barely hurts performance in a range of classification and multi-choice tasks (Figure 1). The result is consistent over 12 different models including the GPT-3 family (Radford et al., 2019; Min et al., 2021b; Wang and Komatsuzaki, 2021; Artetxe et al., 2021; Brown et al., 2020). This strongly suggests, counter-intuitively, that the model does not rely on the input-label mapping in the demonstrations to perform the task.

[Figure 1: Results in classification (top) and multi-choice tasks (bottom), using three LMs with varying size. Reported on six datasets on which GPT-3 is evaluated; the channel method is used. See Section 4 for the full results. In-context learning performance drops only marginally when labels in the demonstrations are replaced by random labels.]
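The Figure 1 caption mentions the channel method (Min et al., 2021b). As a rough sketch of the distinction, and not the authors' code: direct inference scores P(label | input) while channel inference scores P(input | label). The `lm_logprob` parameter below is a hypothetical stand-in for a real LM's conditional log-probability.

```python
# Hedged sketch of the two inference strategies referenced in Figure 1.
# `lm_logprob(prefix, continuation)` is a hypothetical stand-in for the
# log-probability a real LM assigns to `continuation` given `prefix`.

def direct_predict(lm_logprob, prompt, test_input, labels):
    """Direct method: argmax over labels y of P(y | demonstrations, x)."""
    return max(labels, key=lambda y: lm_logprob(f"{prompt}{test_input}\n", y))

def channel_predict(lm_logprob, prompt, test_input, labels):
    """Channel method: argmax over labels y of P(x | demonstrations, y)."""
    return max(labels, key=lambda y: lm_logprob(f"{prompt}{y}\n", test_input))
```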
Further analysis investigates which parts of demonstrations actually do contribute to the performance. We identify possible aspects of demonstrations (e.g., the label space and the distribution of the input text) and evaluate a series of variants of the demonstrations to quantify the impact of each (Section 5). We find that: (1) the label space and the distribution of the input text specified by the demonstrations are both key to in-context learning (regardless of whether the labels are correct for individual inputs); (2) specifying the overall format is also crucial, e.g., when the label space is unknown, using random English words as labels is significantly better than using no labels; and

(arXiv:2202.12837v2 [cs.CL], 20 Oct 2022)
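To illustrate the kinds of demonstration variants Section 5 compares, here is a sketch that builds demonstrations with random English words as labels and with labels removed. The naming and the word list are our assumptions, not the paper's exact conditions.

```python
import random

# Sketch of demonstration variants in the spirit of Section 5 (our naming,
# not the paper's exact conditions): labels replaced by random English
# words, and labels removed entirely.

RANDOM_ENGLISH_WORDS = ["cloud", "marble", "violin", "harbor"]  # illustrative

def with_random_word_labels(demonstrations, seed=0):
    """Replace every label with a random English word (format preserved)."""
    rng = random.Random(seed)
    return [(x, rng.choice(RANDOM_ENGLISH_WORDS)) for x, _ in demonstrations]

def without_labels(demonstrations):
    """Drop labels entirely, breaking the input-label format."""
    return [(x, "") for x, _ in demonstrations]
```

On the paper's account, the first variant preserves the overall sequence format and so should outperform the second when the label space is unknown.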
(Preview ends here; the remaining 29 pages are not shown. Continue reading in the full PDF linked above.)