A Data-driven Neural Network Architecture for Sentiment Analysis,arXiv - CS - Computation and Language

当前位置： X-MOL 学术 › arXiv.cs.CL › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

A Data-driven Neural Network Architecture for Sentiment Analysis
arXiv - CS - Computation and Language Pub Date : 2020-06-30 , DOI: arxiv-2006.16642
Erion \c{C}ano and Maurizio Morisio

The fabulous results of convolution neural networks in image-related tasks, attracted attention of text mining, sentiment analysis and other text analysis researchers. It is however difficult to find enough data for feeding such networks, optimize their parameters, and make the right design choices when constructing network architectures. In this paper we present the creation steps of two big datasets of song emotions. We also explore usage of convolution and max-pooling neural layers on song lyrics, product and movie review text datasets. Three variants of a simple and flexible neural network architecture are also compared. Our intention was to spot any important patterns that can serve as guidelines for parameter optimization of similar models. We also wanted to identify architecture design choices which lead to high performing sentiment analysis models. To this end, we conducted a series of experiments with neural architectures of various configurations. Our results indicate that parallel convolutions of filter lengths up to three are usually enough for capturing relevant text features. Also, max-pooling region size should be adapted to the length of text documents for producing the best feature maps. Top results we got are obtained with feature maps of lengths 6 to 18. An improvement on future neural network models for sentiment analysis, could be generating sentiment polarity prediction of documents using aggregation of predictions on smaller excerpt of the entire text.

中文翻译：

用于情感分析的数据驱动神经网络架构

卷积神经网络在图像相关任务中的出色表现，引起了文本挖掘、情感分析和其他文本分析研究人员的关注。然而，在构建网络架构时，很难找到足够的数据来馈送此类网络、优化其参数并做出正确的设计选择。在本文中，我们介绍了两个大型歌曲情感数据集的创建步骤。我们还探索了卷积和最大池化神经层在歌词、产品和电影评论文本数据集上的使用。还比较了简单灵活的神经网络架构的三种变体。我们的目的是发现任何可以作为类似模型参数优化指南的重要模式。我们还想确定导致高性能情感分析模型的架构设计选择。为此，我们对各种配置的神经架构进行了一系列实验。我们的结果表明，滤波器长度最多为三个的并行卷积通常足以捕获相关的文本特征。此外，最大池化区域大小应适应文本文档的长度，以生成最佳特征图。我们得到的最佳结果是使用长度为 6 到 18 的特征图获得的。未来用于情感分析的神经网络模型的改进可以使用对整个文本的较小摘录的预测聚合来生成文档的情感极性预测。我们的结果表明，滤波器长度最多为三个的并行卷积通常足以捕获相关的文本特征。此外，最大池化区域大小应适应文本文档的长度，以生成最佳特征图。我们得到的最佳结果是使用长度为 6 到 18 的特征图获得的。未来用于情感分析的神经网络模型的改进可以使用对整个文本的较小摘录的预测聚合来生成文档的情感极性预测。我们的结果表明，滤波器长度最多为三个的并行卷积通常足以捕获相关的文本特征。此外，最大池化区域大小应适应文本文档的长度，以生成最佳特征图。我们得到的最佳结果是使用长度为 6 到 18 的特征图获得的。未来用于情感分析的神经网络模型的改进可以使用对整个文本的较小摘录的预测聚合来生成文档的情感极性预测。

更新日期：2020-07-01

点击分享查看原文

点击收藏

阅读更多本刊最新论文