当前位置:
X-MOL 学术
›
arXiv.cs.DC
›
论文详情
Our official English website, www.x-mol.net, welcomes your
feedback! (Note: you will need to create a separate account there.)
Performance Evaluation of Linear Regression Algorithm in Cluster Environment
arXiv - CS - Distributed, Parallel, and Cluster Computing Pub Date : 2020-09-14 , DOI: arxiv-2009.06497 Cinantya Paramita, Fauzi Adi Rafrastara, Usman Sudibyo, R.I.W. Agung Wibowo
arXiv - CS - Distributed, Parallel, and Cluster Computing Pub Date : 2020-09-14 , DOI: arxiv-2009.06497 Cinantya Paramita, Fauzi Adi Rafrastara, Usman Sudibyo, R.I.W. Agung Wibowo
Cluster computing was introduced to replace the superiority of super
computers. Cluster computing is able to overcome the problems that cannot be
effectively dealt with supercomputers. In this paper, we are going to evaluate
the performance of cluster computing by executing one of data mining techniques
in the cluster environment. The experiment will attempt to predict the flight
delay by using linear regression algorithm with apache spark as a framework for
cluster computing. The result shows that, by involving 5 PCs in cluster
environment with equal specifications can increase the performance of
computation up to 39.76% compared to the standalone one. Attaching more nodes
to the cluster can make the process become faster significantly.
中文翻译:
集群环境下线性回归算法的性能评估
集群计算被引入以取代超级计算机的优势。集群计算能够克服超级计算机无法有效处理的问题。在本文中,我们将通过在集群环境中执行一种数据挖掘技术来评估集群计算的性能。实验将尝试使用线性回归算法和apache spark作为集群计算框架来预测航班延误。结果表明,在集群环境中加入 5 台相同规格的 PC 可以将计算性能比独立的提高 39.76%。将更多节点附加到集群可以显着加快进程。
更新日期:2020-09-15
中文翻译:
集群环境下线性回归算法的性能评估
集群计算被引入以取代超级计算机的优势。集群计算能够克服超级计算机无法有效处理的问题。在本文中,我们将通过在集群环境中执行一种数据挖掘技术来评估集群计算的性能。实验将尝试使用线性回归算法和apache spark作为集群计算框架来预测航班延误。结果表明,在集群环境中加入 5 台相同规格的 PC 可以将计算性能比独立的提高 39.76%。将更多节点附加到集群可以显着加快进程。