1、情感因子/舆情因子(NLP)
*代码参考(研究对象为中国,仅做代码参考):
https://github.com/Coldog2333/Financial-NLP
文章 1
Tobias Daudert, Paul Buitelaar, Sapna Negi, Leveraging News Sentiment to Improve
Microblog Sentiment Classification in the Financial Domain
https://www.aclweb.org/anthology/W18-3107/
简介:通过机器学习模型对 microblog 中显示有关金融情况的情绪信息进行
处理。
数据:
(1)the microblog dataset (M) from the Semeval 2017 Task 5 - subtask 1 (Cortis
et al., 2017). This dataset contains 2,488 microblogs retrieved from Twitter3 collected
between March 11th and 18th 2016 as well as StockTwits.
https://www.acl-
bg.org/proceedings/2017/RANLPStud%202017/pdf/RANLPStud002.pdf
(2)the newly created microblogs-related news dataset (MRN) consists of 106
news, specifically, it contains the news’ titles, urls, time and date, a sentiment score,
and, if available, a description for each news. The news data was gathered from multiple
sources such as wsj.com or bloomberg.com.
方法:
The ML algorithm chosen for this task is a Support Vector Machine (SVM).
评论0