Using Machine Learning to Predict the Sentiment of Online Reviews: A New Framework for Comparative Analysis,Archives of Computational Methods in Engineering

当前位置： X-MOL 学术 › Arch. Computat. Methods Eng. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Using Machine Learning to Predict the Sentiment of Online Reviews: A New Framework for Comparative Analysis
Archives of Computational Methods in Engineering ( IF 9.7 ) Pub Date : 2021-01-08 , DOI: 10.1007/s11831-020-09464-8
Gregorius Satia Budhi , Raymond Chiong , Ilung Pranata , Zhongyi Hu

Online reviews are becoming increasingly important for decision-making. Consumers often refer to online reviews for opinions before making a purchase. Marketers also acknowledge the importance of online reviews and use them to improve product success. However, the massive amount of online review data, as well as its unstructured nature, is a challenge for anyone wanting to derive a conclusion quickly. In this paper, we propose a novel framework for gauging the ratings of online reviews using machine learning techniques. This framework uses a combination of text pre-processing and feature extraction methods. Here, we investigate four different aspects of the new framework. First, we assess the performance of single and ensemble classifiers in predicting sentiment—positive or negative—initially on a specific dataset (Yelp), but subsequently also on two other datasets (Amazon's product reviews and a movie review dataset). Second, using the best identified classifiers, we improve the accuracy with which neutral polarity can be predicted, an ability largely overlooked in the literature. Third, we further improve the performance of these classifiers by testing different pre-processing and feature extraction methods. Finally, we measure how well our deep learning approach performs on the same task compared to the best previously identified classifiers. Our extensive testing shows that the linear-kernel support vector machine, logistic regression and multilayer perceptron are the three best single classifiers in terms of accuracy, precision, recall, and F-measure. Their performance could be further improved if they were used as base classifiers for ensemble models. We also observe that several text pre-processing techniques—negation word identification, word elongation correction, and part of speech lemmatisation (combined with Terms Frequency and N-gram words)—can increase accuracy. In addition, we demonstrate that the general sentiment of lexicons such as SentiWordNet 3.0 and SenticNet 4 can be used to generate features with good results, although deep learning models can perform equally well. Experiments with different datasets confirm that our framework provides consistent outcomes. In particular, we have focused on improving the accuracy of neutral sentiment, and we conclude by showing how this can be achieved without sacrificing the accuracy of positive or negative ratings.

中文翻译：

利用机器学习预测在线评论的情绪：一种新的比较分析框架

在线评论对于决策变得越来越重要。消费者在购买商品之前通常会参考在线评论以获取意见。营销人员还承认在线评论的重要性，并使用它们来提高产品的成功率。但是，海量的在线评论数据以及其非结构化的性质，对于想要快速得出结论的任何人都是一个挑战。在本文中，我们提出了一种使用机器学习技术来衡量在线评论评分的新颖框架。该框架结合了文本预处理和特征提取方法。在这里，我们研究了新框架的四个不同方面。首先，我们评估单个分类器和整体分类器最初在特定数据集（Yelp）上预测情绪（正面或负面）时的表现，但随后还涉及其他两个数据集（亚马逊的产品评论和电影评论数据集）。第二，使用最佳识别的分类器，我们提高了预测中性极性的准确性，这一能力在文献中被大大忽略了。第三，我们通过测试不同的预处理和特征提取方法来进一步提高这些分类器的性能。最后，我们测量与以前确定的最佳分类器相比，我们的深度学习方法在同一任务上的执行情况。我们的广泛测试表明，就准确性，精度，召回率和准确性而言，线性核支持向量机，逻辑回归和多层感知器是三个最佳的单一分类器。使用最佳识别的分类器，我们提高了预测中性极性的准确性，而该能力在文献中被大大忽略了。第三，我们通过测试不同的预处理和特征提取方法来进一步提高这些分类器的性能。最后，我们测量与以前确定的最佳分类器相比，我们的深度学习方法在同一任务上的执行情况。我们的广泛测试表明，就准确性，精度，召回率和准确性而言，线性核支持向量机，逻辑回归和多层感知器是三个最佳的单一分类器。使用最佳识别的分类器，我们提高了预测中性极性的准确性，而该能力在文献中被大大忽略了。第三，我们通过测试不同的预处理和特征提取方法来进一步提高这些分类器的性能。最后，我们测量与以前确定的最佳分类器相比，我们的深度学习方法在同一任务上的执行情况。我们的广泛测试表明，就准确性，精度，召回率和准确性而言，线性核支持向量机，逻辑回归和多层感知器是三个最佳的单一分类器。我们通过测试不同的预处理和特征提取方法来进一步提高这些分类器的性能。最后，我们测量与以前确定的最佳分类器相比，我们的深度学习方法在同一任务上的执行情况。我们的广泛测试表明，就准确性，精度，召回率和准确性而言，线性核支持向量机，逻辑回归和多层感知器是三个最佳的单一分类器。我们通过测试不同的预处理和特征提取方法来进一步提高这些分类器的性能。最后，我们测量与以前确定的最佳分类器相比，我们的深度学习方法在同一任务上的执行情况。我们的广泛测试表明，就准确性，精度，召回率和准确性而言，线性核支持向量机，逻辑回归和多层感知器是三个最佳的单一分类器。F-措施。如果将它们用作集成模型的基础分类器，则可以进一步提高它们的性能。我们还观察到了几种文本预处理技术-否定词识别，词伸长校正和部分语言词性化（与术语频率和N结合）-gram字）—可以提高准确性。此外，我们证明，尽管深度学习模型可以同样好地发挥作用，但可以使用诸如SentiWordNet 3.0和SenticNet 4之类的一般词汇来生成具有良好结果的功能。使用不同数据集进行的实验证实了我们的框架提供了一致的结果。尤其是，我们专注于提高中立情绪的准确性，并通过展示如何在不牺牲正或负评级的准确性的情况下实现这一点作为结论。

更新日期：2021-01-08

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>