Fake News Detection via NLP is Vulnerable to Adversarial Attacks
Zhixuan Zhou¹,², Huankang Guan¹, Meghana Moorthy Bhat² and Justin Hsu²
¹Hongyi Honor College, Wuhan University, Wuhan, China
²Department of Computer Science, University of Wisconsin-Madison, Madison, USA
{kyriezoe, hkguan}@whu.edu.cn, {mbhat2, justhsu}@cs.wisc.edu
Keywords: Fake News Detection, NLP, Attack, Fact Checking, Outsourced Knowledge Graph
Abstract: News plays a significant role in shaping people's beliefs and opinions. Fake news has long been a problem, but it was not widely exposed to the public until the past election cycle for the 45th President of the United States. While quite a few detection methods have been proposed to combat fake news since 2015, they focus mainly on linguistic aspects of an article without any fact checking. In this paper, we argue that these models have the potential to misclassify fact-tampering fake news as well as under-written real news. Through experiments on Fakebox, a state-of-the-art fake news detector, we show that fact-tampering attacks can be effective. To address these weaknesses, we argue that fact checking should be adopted in conjunction with linguistic characteristics analysis, so as to truly separate fake news from real news. A crowdsourced knowledge graph is proposed as a straw man solution for collecting timely facts about news events.
1 INTRODUCTION
Fake news is an increasingly common feature of to-
day’s political landscape. To help address this issue,
researchers and media experts have proposed fake
news detectors adopting natural language processing
(NLP) to analyze word patterns and statistical corre-
lations of news articles. While these detectors achieve
impressive accuracy on existing examples of manip-
ulated news, the analysis is typically quite shallow—
roughly, models check whether news articles conform
to standard norms and styles used by professional
journalists. This leads to two drawbacks.
First, these models can detect fake news only
when it is under-written, for instance when the
content is totally unrelated to the headline (so-called
"clickbait") or when the article includes words con-
sidered biased or inflammatory. While this
criterion suffices to detect many existing examples of
fake news, more sophisticated rumor disseminators
can craft subtler attacks, for instance taking a
well-written real news article and tampering with the ar-
ticle in a targeted way. By preserving the original
subject matter and relating the content tightly to the
headline without using biased phrases, an adversar-
ial article can easily evade detection. To demon-
strate this kind of attack, we evaluate a state-of-the-art
model called Fakebox. We introduce three classes of
attacks: fact distortion, subject-object exchange and
cause confounding. We generate adversarial versions
of real news from a dataset by McIntire (2018), and
show that Fakebox achieves low accuracy when clas-
sifying these examples.
At the same time, the requirements imposed by current
detectors are often too strict. Real news which is
under-written or discusses certain political and re-
ligious topics is likely to be mistakenly rejected, re-
gardless of its accuracy. This is a particularly seri-
ous problem for open platforms, such as Twitter in
the United States and TouTiao in China, where much
of the news is contributed by users with diverse back-
grounds. To prevent frustrating false positives, plat-
forms still rely heavily on manual work to
separate fake news from real news. We provide ex-
perimental evidence of Fakebox's potential to mis-
classify real news.
Taken together, our experiments highlight vulner-
able aspects of fake news detection methods based
purely on NLP. Without deeper semantic knowledge,
such detectors are easily fooled by fact-tampering at-
tacks and can suffer from a high rate of false pos-
itives, mistakenly rejecting real but under-written
news that is not composed in a journalistic style.
To address these problems, we argue that some form
of fact-based knowledge must be adopted alongside
NLP-based models. What this knowledge is remains
to be seen, but we consider a straw man solution: a
crowdsourced knowledge graph that aggregates infor-