当前位置: X-MOL 学术Int. J. Intell. Syst. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
A precision-preferred comprehensive information extraction system for clinical articles in traditional Chinese Medicine
International Journal of Intelligent Systems ( IF 5.0 ) Pub Date : 2021-11-16 , DOI: 10.1002/int.22748
Ye Xia 1 , Jianxiong Cai 2, 3 , Yizhen Li 1 , Zhili Dou 1 , Yunan Zhang 1 , Lin Wu 1 , Zhe Huang 1 , Shujing Xu 1 , Jiayi Sun 1 , Yixing Liu 4 , Darong Wu 2, 3, 5 , Dongran Han 1
Affiliation  

This study established a precision-preferred system specially designed for the data extraction of traditional Chinese medicine (TCM) articles, providing foundational data for subsequent clinical article analysis and synthesis of TCM clinical evidence. Information extraction is commonly used in many fields to identify relevant concepts and the relationship between pairs of concepts from the vast information sources. Previous studies that performed information extraction primarily focused on scattering targeted fields to achieve a balance between precision and recall. Therefore, this study aims to create a comprehensive information extraction system for TCM articles. This system will extract all relevant information from research articles on a broad research field, including the 11 diseases that can be efficiently treated with TCM, with high precision and efficient measurement to address bias in every study. It covers the most essential information related to patients, interventions, comparisons, outcomes, and study design (PICOS) principles in TCM clinical trials. This system covers 34 target fields on 14 topics. Impediments such as the various typesetting of TCM clinical articles were managed by a hybrid of machine vision and optical character recognition. Thus, TCM researchers can be spared of laborious, unscalable, and inefficient manual extraction processes. Our system could also enhance TCM researcher awareness of frequently missing information or TCM clinical trial design methods that could introduce bias, by analyzing the overall information integrity of TCM clinical articles, which is beneficial for future research designs.

中文翻译:

一种精准优先的中医临床文献综合信息提取系统

本研究建立了专门针对中医文献数据提取的精准优选系统,为后续临床文献分析和中医临床证据综合提供基础数据。信息抽取通常用于许多领域,以从海量信息源中识别相关概念和概念对之间的关​​系。以前进行信息提取的研究主要集中在散射目标场以实现精确度和召回率之间的平衡。因此,本研究旨在创建一个全面的中医药文献信息提取系统。该系统将从广泛研究领域的研究文章中提取所有相关信息,包括可以用中医有效治疗的11种疾病,具有高精度和高效的测量,以解决每项研究中的偏差。它涵盖了与中医临床试验中的患者、干预、比较、结果和研究设计 (PICOS) 原则相关的最重要信息。该系统涵盖14个主题的34个目标领域。诸如中医临床文章的各种排版等障碍是通过机器视觉和光学字符识别的混合来管理的。因此,中药研究人员可以免于费力、不可扩展和低效的手动提取过程。我们的系统还可以通过分析中医临床文章的整体信息完整性,提高中医研究人员对经常丢失的信息或可能引入偏差的中医临床试验设计方法的认识,这有利于未来的研究设计。
更新日期:2021-11-16
down
wechat
bug