当前位置: X-MOL 学术Methods Inf. Med. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Applying FAIR Principles to Improve Data Searchability of Emergency Department Datasets: A Case Study for HCUP-SEDD.
Methods of Information in Medicine ( IF 1.7 ) Pub Date : 2020-06-14 , DOI: 10.1055/s-0040-1712510
Karishma Bhatia 1 , James Tanch 1 , Elizabeth S Chen 1 , Indra Neil Sarkar 1, 2
Affiliation  

Abstract

Background There is a recognized need to improve how scholarly data are managed and accessed. The scientific community has proposed the findable, accessible, interoperable, and reusable (FAIR) data principles to address this issue.

Objective The objective of this case study was to develop a system for improving the FAIRness of Healthcare Cost and Utilization Project's State Emergency Department Databases (HCUP's SEDD) within the context of data catalog availability.

Materials and Methods A search tool, EDCat (Emergency Department Catalog), was designed to improve the “FAIRness” of electronic health databases and tested on datasets from HCUP-SEDD. ElasticSearch was used as a database for EDCat's search engine. Datasets were curated and defined. Searchable data dictionary-related elements and unified medical language system (UMLS) concepts were included in the curated metadata. Functionality to standardize search terms using UMLS concepts was added to the user interface.

Results The EDCat system improved the overall FAIRness of HCUP-SEDD by improving the findability of individual datasets and increasing the efficacy of searches for specific data elements and data types.

Discussion The databases considered for this case study were limited in number as few data distributors make the data dictionaries of datasets available. The publication of data dictionaries should be encouraged through the FAIR principles, and further efforts should be made to improve the specificity and measurability of the FAIR principles.

Conclusion In this case study, the distribution of datasets from HCUP-SEDD was made more FAIR through the development of a search tool, EDCat. EDCat will be evaluated and developed further to include datasets from other sources.



中文翻译:

应用FAIR原则以提高急诊科数据集的数据可搜索性:HCUP-SEDD的案例研究。

摘要

背景技术 公认需要改进学术数据的管理和访问方式。科学界提出了可发现,可访问,可互操作和可重用(FAIR)的数据原理,以解决此问题。

目的 本案例研究的目的是在数据目录可用性的背景下,开发一种提高医疗成本与利用项目的国家应急部门数据库(HCUP的SEDD)公平性的系统。

资料和方法设计了 一种搜索工具EDCat(紧急部门目录)来改善电子医疗数据库的“公正性”,并对HCUP-SEDD的数据集进行了测试。ElasticSearch被用作EDCat搜索引擎的数据库。整理和定义数据集。策划的元数据中包括与可搜索的数据字典相关的元素和统一的医学语言系统(UMLS)概念。用户界面中添加了使用UMLS概念标准化搜索词的功能。

结果 EDCat系统通过提高单个数据集的可查找性并提高了对特定数据元素和数据类型的搜索效率,改善了HCUP-SEDD的总体公平性。

讨论 本案例研究考虑的数据库数量有限,因为很少有数据分发者可以使用数据集的数据字典。应通过FAIR原则鼓励发布数据字典,并应进一步努力提高FAIR原则的特异性和可测量性。

结论 在本案例研究中,通过开发搜索工具EDCat,使HCUP-SEDD的数据集分布更加公平。将对EDCat进行评估和进一步开发,以包括其他来源的数据集。

更新日期:2020-06-14
down
wechat
bug