当前位置: X-MOL 学术J. Cheminfom. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Development of machine learning classifiers to predict compound activity on prostate cancer cell lines
Journal of Cheminformatics ( IF 7.1 ) Pub Date : 2022-11-08 , DOI: 10.1186/s13321-022-00647-y
Davide Bonanni 1 , Luca Pinzi 1 , Giulio Rastelli 1
Affiliation  

Prostate cancer is the most common type of cancer in men. The disease presents good survival rates if treated at the early stages. However, the evolution of the disease in its most aggressive variant remains without effective therapeutic answers. Therefore, the identification of novel effective therapeutics is urgently needed. On these premises, we developed a series of machine learning models, based on compounds with reported highly homogeneous cell-based antiproliferative assay data, able to predict the activity of ligands towards the PC-3 and DU-145 prostate cancer cell lines. The data employed in the development of the computational models was finely-tuned according to a series of thresholds for the classification of active/inactive compounds, to the number of features to be implemented, and by using 10 different machine learning algorithms. Models’ evaluation allowed us to identify the best combination of activity thresholds and ML algorithms for the classification of active compounds, achieving prediction performances with MCC values above 0.60 for PC-3 and DU-145 cells. Moreover, in silico models based on the combination of PC-3 and DU-145 data were also developed, demonstrating excellent precision performances. Finally, an analysis of the activity annotations reported for the ligands in the curated datasets were conducted, suggesting associations between cellular activity and biological targets that might be explored in the future for the design of more effective prostate cancer antiproliferative agents.

中文翻译:

开发机器学习分类器以预测化合物对前列腺癌细胞系的活性

前列腺癌是男性中最常见的癌症类型。如果在早期阶段进行治疗,这种疾病的存活率很高。然而,该疾病在其最具侵略性的变体中的演变仍然没有有效的治疗方法。因此,迫切需要鉴定新的有效疗法。在这些前提下,我们开发了一系列机器学习模型,这些模型基于具有高度同质细胞抗增殖测定数据的化合物,能够预测配体对 PC-3 和 DU-145 前列腺癌细胞系的活性。根据活性/非活性化合物分类的一系列阈值、要实现的特征数量以及使用 10 种不同的机器学习算法,对计算模型开发中使用的数据进行了微调。模型评估使我们能够确定用于活性化合物分类的活性阈值和 ML 算法的最佳组合,实现 PC-3 和 DU-145 细胞的 MCC 值高于 0.60 的预测性能。此外,还开发了基于 PC-3 和 DU-145 数据组合的计算机模型,展示了出色的精度性能。最后,对精选数据集中配体报告的活动注释进行了分析,表明细胞活动与生物靶点之间的关联可能在未来探索以设计更有效的前列腺癌抗增殖剂。对于 PC-3 和 DU-145 单元,实现 MCC 值高于 0.60 的预测性能。此外,还开发了基于 PC-3 和 DU-145 数据组合的计算机模型,展示了出色的精度性能。最后,对精选数据集中配体报告的活动注释进行了分析,表明细胞活动与生物靶点之间的关联可能在未来探索以设计更有效的前列腺癌抗增殖剂。对于 PC-3 和 DU-145 单元,实现 MCC 值高于 0.60 的预测性能。此外,还开发了基于 PC-3 和 DU-145 数据组合的计算机模型,展示了出色的精度性能。最后,对精选数据集中配体报告的活动注释进行了分析,表明细胞活动与生物靶点之间的关联可能在未来探索以设计更有效的前列腺癌抗增殖剂。
更新日期:2022-11-08
down
wechat
bug