当前位置: X-MOL 学术J. Cheminfom. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
GPURFSCREEN: a GPU based virtual screening tool using random forest classifier.
Journal of Cheminformatics ( IF 7.1 ) Pub Date : 2016-03-01 , DOI: 10.1186/s13321-016-0124-8
P B Jayaraj 1 , Mathias K Ajay 1 , M Nufail 1 , G Gopakumar 1 , U C A Jaleel 2
Affiliation  

In-silico methods are an integral part of modern drug discovery paradigm. Virtual screening, an in-silico method, is used to refine data models and reduce the chemical space on which wet lab experiments need to be performed. Virtual screening of a ligand data model requires large scale computations, making it a highly time consuming task. This process can be speeded up by implementing parallelized algorithms on a Graphical Processing Unit (GPU). Random Forest is a robust classification algorithm that can be employed in the virtual screening. A ligand based virtual screening tool (GPURFSCREEN) that uses random forests on GPU systems has been proposed and evaluated in this paper. This tool produces optimized results at a lower execution time for large bioassay data sets. The quality of results produced by our tool on GPU is same as that on a regular serial environment. Considering the magnitude of data to be screened, the parallelized virtual screening has a significantly lower running time at high throughput. The proposed parallel tool outperforms its serial counterpart by successfully screening billions of molecules in training and prediction phases.

中文翻译:


GPURFSCREEN:使用随机森林分类器的基于 GPU 的虚拟筛选工具。



计算机模拟方法是现代药物发现范式的一个组成部分。虚拟筛选是一种计算机方法,用于完善数据模型并减少需要进行湿实验室实验的化学空间。配体数据模型的虚拟筛选需要大规模计算,使其成为一项非常耗时的任务。通过在图形处理单元 (GPU) 上实施并行算法可以加快此过程。随机森林是一种鲁棒的分类算法,可用于虚拟筛选。本文提出并评估了一种在 GPU 系统上使用随机森林的基于配体的虚拟筛选工具 (GPURFSCREEN)。该工具可以在较短的执行时间内为大型生物测定数据集生成优化结果。我们的工具在 GPU 上生成的结果质量与常规串行环境中的结果质量相同。考虑到要筛选的数据量,并行虚拟筛选在高吞吐量下的运行时间显着缩短。所提出的并行工具通过在训练和预测阶段成功筛选数十亿个分子,优于其串行工具。
更新日期:2016-03-01
down
wechat
bug