Improving prediction with enhanced Distributed Memory-based Resilient Dataset Filter,Journal of Big Data

当前位置： X-MOL 学术 › J. Big Data › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Improving prediction with enhanced Distributed Memory-based Resilient Dataset Filter
Journal of Big Data ( IF 8.1 ) Pub Date : 2020-02-28 , DOI: 10.1186/s40537-020-00292-y
Sandhya Narayanan , Philip Samuel , Mariamma Chacko

Launching new products in the consumer electronics market is challenging. Developing and marketing the same in limited time affect the sustainability of such companies. This research work introduces a model that can predict the success of a product. A Feature Information Gain (FIG) measure is used for significant feature identification and Distributed Memory-based Resilient Dataset Filter (DMRDF) is used to eliminate duplicate reviews, which in turn improves the reliability of the product reviews. The pre-processed dataset is used for prediction of product pre-launch in the market using classifiers such as Logistic regression and Support vector machine. DMRDF method is fault-tolerant because of its resilience property and also reduces the dataset redundancy; hence, it increases the prediction accuracy of the model. The proposed model works in a distributed environment to handle a massive volume of the dataset and therefore, it is scalable. The output of this feature modelling and prediction allows the manufacturer to optimize the design of his new product.

中文翻译：

使用增强的基于分布式内存的弹性数据集过滤器改善预测

在消费电子市场推出新产品具有挑战性。在有限的时间内开发和销售此类产品会影响此类公司的可持续性。这项研究工作引入了可以预测产品成功的模型。特征信息增益（FIG）度量用于显着特征识别，而基于分布式内存的弹性数据集筛选器（DMRDF）用于消除重复的评论，从而提高了产品评论的可靠性。使用分类器（例如Logistic回归和Support Vector Machine）将预处理后的数据集用于预测市场上的产品预发布。DMRDF方法具有弹性，因此具有容错能力，并且还减少了数据集的冗余性。因此，它提高了模型的预测准确性。所提出的模型在分布式环境中工作以处理海量数据集，因此具有可伸缩性。此功能建模和预测的输出使制造商可以优化其新产品的设计。

更新日期：2020-04-21

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文

全部期刊列表>>