Evaluation of Distributed Databases in Hybrid Clouds and Edge Computing: Energy, Bandwidth, and Storage Consumption
arXiv - CS - Databases, Pub Date: 2021-09-15, DOI: arxiv-2109.07260
Yaser Mansouri, Victor Prokhorenko, Faheem Ullah, M. Ali Babar

A benchmark study of modern distributed databases is an important source of information for selecting the right technology to manage data in cloud-edge paradigms. Making the right decision requires an extensive experimental study on a variety of hardware infrastructures. While most state-of-the-art studies have investigated only the response time and scalability of distributed databases, other metrics (e.g., energy, bandwidth, and storage consumption) are essential to fully understand the resource consumption of these databases. Also, existing studies have explored the response time and scalability of these databases in either a private or a public cloud. Hence, there is a paucity of investigation into the evaluation of these databases deployed in a hybrid cloud, which is the seamless integration of public and private clouds. To address these research gaps, in this paper we investigate the energy, bandwidth, and storage consumption of the most commonly used distributed databases. For this purpose, we evaluate four open-source databases (Cassandra, Mongo, Redis, and MySQL) on a hybrid cloud spanning a local OpenStack and Microsoft Azure, and on a variety of edge computing nodes including a Raspberry Pi, a cluster of Raspberry Pis, and low- and high-power servers. Our extensive experimental results reveal several helpful insights for the deployment selection of modern distributed databases in edge-cloud environments.

Updated: 2021-09-16