Deep Learning in Mining Biological Data,Cognitive Computation

当前位置： X-MOL 学术 › Cognit. Comput. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Deep Learning in Mining Biological Data
Cognitive Computation ( IF 5.4 ) Pub Date : 2021-01-05 , DOI: 10.1007/s12559-020-09773-x
Mufti Mahmud _{1,

2} , M Shamim Kaiser ₃ , T Martin McGinnity _{1,

4} , Amir Hussain ₅

Affiliation

Recent technological advancements in data acquisition tools allowed life scientists to acquire multimodal data from different biological application domains. Categorized in three broad types (i.e. images, signals, and sequences), these data are huge in amount and complex in nature. Mining such enormous amount of data for pattern recognition is a big challenge and requires sophisticated data-intensive machine learning techniques. Artificial neural network-based learning systems are well known for their pattern recognition capabilities, and lately their deep architectures—known as deep learning (DL)—have been successfully applied to solve many complex pattern recognition problems. To investigate how DL—especially its different architectures—has contributed and been utilized in the mining of biological data pertaining to those three types, a meta-analysis has been performed and the resulting resources have been critically analysed. Focusing on the use of DL to analyse patterns in data from diverse biological domains, this work investigates different DL architectures’ applications to these data. This is followed by an exploration of available open access data sources pertaining to the three data types along with popular open-source DL tools applicable to these data. Also, comparative investigations of these tools from qualitative, quantitative, and benchmarking perspectives are provided. Finally, some open research challenges in using DL to mine biological data are outlined and a number of possible future perspectives are put forward.

中文翻译：

挖掘生物数据中的深度学习

数据采集工具的最新技术进步使生命科学家能够从不同的生物应用领域获取多模态数据。这些数据分为三大类（即图像、信号和序列），数量庞大且性质复杂。为模式识别挖掘如此大量的数据是一项巨大的挑战，需要复杂的数据密集型机器学习技术。基于人工神经网络的学习系统以其模式识别能力而闻名，最近它们的深度架构——称为深度学习 (DL)——已成功应用于解决许多复杂的模式识别问题。为了研究深度学习（尤其是其不同的架构）如何在与这三种类型相关的生物数据的挖掘中做出贡献并被利用，已经进行了荟萃分析，并对由此产生的资源进行了批判性分析。这项工作专注于使用 DL 分析来自不同生物领域的数据中的模式，研究了不同的 DL 架构对这些数据的应用。接下来是探索与这三种数据类型相关的可用开放访问数据源以及适用于这些数据的流行开源 DL 工具。此外，还从定性、定量和基准测试的角度对这些工具进行了比较研究。最后，概述了使用深度学习挖掘生物数据的一些开放研究挑战，并提出了一些可能的未来前景。这项工作专注于使用 DL 分析来自不同生物领域的数据中的模式，研究了不同的 DL 架构对这些数据的应用。接下来是探索与这三种数据类型相关的可用开放访问数据源以及适用于这些数据的流行开源 DL 工具。此外，还从定性、定量和基准测试的角度对这些工具进行了比较研究。最后，概述了使用深度学习挖掘生物数据的一些开放研究挑战，并提出了一些可能的未来前景。这项工作专注于使用 DL 分析来自不同生物领域的数据中的模式，研究了不同的 DL 架构对这些数据的应用。接下来是探索与这三种数据类型相关的可用开放访问数据源以及适用于这些数据的流行开源 DL 工具。此外，还从定性、定量和基准测试的角度对这些工具进行了比较研究。最后，概述了使用深度学习挖掘生物数据的一些开放研究挑战，并提出了一些可能的未来前景。接下来是探索与这三种数据类型相关的可用开放访问数据源以及适用于这些数据的流行开源 DL 工具。此外，还从定性、定量和基准测试的角度对这些工具进行了比较研究。最后，概述了使用深度学习挖掘生物数据的一些开放研究挑战，并提出了一些可能的未来前景。接下来是探索与这三种数据类型相关的可用开放访问数据源以及适用于这些数据的流行开源 DL 工具。此外，还从定性、定量和基准测试的角度对这些工具进行了比较研究。最后，概述了使用深度学习挖掘生物数据的一些开放研究挑战，并提出了一些可能的未来前景。

更新日期：2021-01-05

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>