当前位置: X-MOL 学术arXiv.cs.DC › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Performance Evaluation of Linear Regression Algorithm in Cluster Environment
arXiv - CS - Distributed, Parallel, and Cluster Computing Pub Date : 2020-09-14 , DOI: arxiv-2009.06497
Cinantya Paramita, Fauzi Adi Rafrastara, Usman Sudibyo, R.I.W. Agung Wibowo

Cluster computing was introduced to replace the superiority of super computers. Cluster computing is able to overcome the problems that cannot be effectively dealt with supercomputers. In this paper, we are going to evaluate the performance of cluster computing by executing one of data mining techniques in the cluster environment. The experiment will attempt to predict the flight delay by using linear regression algorithm with apache spark as a framework for cluster computing. The result shows that, by involving 5 PCs in cluster environment with equal specifications can increase the performance of computation up to 39.76% compared to the standalone one. Attaching more nodes to the cluster can make the process become faster significantly.

中文翻译:

集群环境下线性回归算法的性能评估

集群计算被引入以取代超级计算机的优势。集群计算能够克服超级计算机无法有效处理的问题。在本文中,我们将通过在集群环境中执行一种数据挖掘技术来评估集群计算的性能。实验将尝试使用线性回归算法和apache spark作为集群计算框架来预测航班延误。结果表明,在集群环境中加入 5 台相同规格的 PC 可以将计算性能比独立的提高 39.76%。将更多节点附加到集群可以显着加快进程。
更新日期:2020-09-15
down
wechat
bug