当前位置: X-MOL 学术Egypt. Inform. J. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
A survey on sentiment analysis in Urdu: A resource-poor language
Egyptian Informatics Journal ( IF 5.0 ) Pub Date : 2020-05-15 , DOI: 10.1016/j.eij.2020.04.003
Asad Khattak , Muhammad Zubair Asghar , Anam Saeed , Ibrahim A. Hameed , Syed Asif Hassan , Shakeel Ahmad

Background/introduction

The dawn of the internet opened the doors to the easy and widespread sharing of information on subject matters such as products, services, events and political opinions. While the volume of studies conducted on sentiment analysis is rapidly expanding, these studies mostly address English language concerns. The primary goal of this study is to present state-of-art survey for identifying the progress and shortcomings saddling Urdu sentiment analysis and propose rectifications.

Methods

We described the advancements made thus far in this area by categorising the studies along three dimensions, namely: text pre-processing lexical resources and sentiment classification. These pre-processing operations include word segmentation, text cleaning, spell checking and part-of-speech tagging. An evaluation of sophisticated lexical resources including corpuses and lexicons was carried out, and investigations were conducted on sentiment analysis constructs such as opinion words, modifiers, negations.

Results and conclusions

Performance is reported for each of the reviewed study. Based on experimental results and proposals forwarded through this paper provides the groundwork for further studies on Urdu sentiment analysis.



中文翻译:

乌尔都语情绪分析调查:一种资源匮乏的语言

背景/简介

互联网的曙光为轻松,广泛地共享有关产品,服务,事件和政治见解等主题的信息打开了大门。尽管进行情绪分析的研究量正在迅速扩大,但这些研究主要解决了英语方面的问题。这项研究的主要目的是提出最新的调查,以查明影响乌尔都语情绪分析的进展和缺点,并提出纠正措施。

方法

我们通过沿三个维度对研究进行分类来描述迄今为止在该领域取得的进展:文本预处理词汇资源和情感分类。这些预处理操作包括分词,文本清理,拼写检查和词性标记。对包括语料库和词典在内的高级词汇资源进行了评估,并对情感分析结构(例如意见词,修饰语,否定词)进行了调查。

结果与结论

报告每个审查研究的表现。根据实验结果和本文提出的建议,为进一步研究乌尔都语情感分析提供了基础。

更新日期:2020-05-15
down
wechat
bug