Spanish corpora for sentiment analysis: a survey
Language Resources and Evaluation (IF 2.7), Pub Date: 2019-05-31, DOI: 10.1007/s10579-019-09470-8
María Navas-Loro, Víctor Rodríguez-Doncel

Corpora play an important role when training machine learning systems for sentiment analysis. However, Spanish is underrepresented in these corpora, as most primarily include English texts. This paper describes 20 Spanish-language text corpora—collected to support different tasks related to sentiment analysis, ranging from polarity to emotion categorization. We present a brand-new framework for the characterization of corpora. This includes a number of features to help analyze resources at both corpus level and document level. This survey—besides depicting the overall landscape of corpora in Spanish—supports sentiment analysis practitioners with the task of selecting the most suitable resources.

Updated: 2019-05-31