A Survey of Biological Data in a Big Data Perspective
Big Data ( IF 4.6 ) Pub Date : 2022-08-12 , DOI: 10.1089/big.2020.0383
Gabriel Dall'Alba 1, 2 , Pedro Lenz Casa 1 , Fernanda Pessi de Abreu 1 , Daniel Luis Notari 1 , Scheila de Avila E Silva 1

The amount of available data is growing continuously, a phenomenon that has given rise to the concept of big data. The most prominent technologies associated with big data are cloud computing (infrastructure) and Not Only SQL (NoSQL; data storage). For data analysis, machine learning algorithms such as decision trees, support vector machines, artificial neural networks, and clustering techniques have shown promising results. In biology, big data has many applications owing to the large number of biological databases available. Some limitations of biological big data stem from inherent features of the data, such as high complexity and heterogeneity, since biological systems provide information ranging from the atomic level to interactions between organisms or with their environment. These characteristics make most bioinformatics applications difficult to build, configure, and maintain. Although the rise of big data is relatively recent, it has contributed to a better understanding of the underlying mechanisms of life. The main goal of this article is to provide a concise and reliable survey of the application of big data-related technologies in biology. To that end, fundamental concepts of information technology, including storage resources, analysis, and data sharing, are described along with their relation to biological data.

Updated: 2022-08-16