当前位置: X-MOL 学术arXiv.cs.DL › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Project Pipeline: Preservation, Persistence, and Performance
arXiv - CS - Digital Libraries Pub Date : 2021-09-13 , DOI: arxiv-2109.06317
Jane GreenbergDrexel University, Christopher B. RauchDrexel University, Mat KellyDrexel University

Preservation pipelines demonstrate extended value when digitized content is also computation ready. Expanding this to historical controlled vocabularies published in analog format requires additional steps if they are to be fully leveraged for research. This paper reports on work addressing this challenge. We report on a pipeline and project progress addressing three key goals: 1) transforming the 1910 Library of Congress Subject Headings (LCSH) to the Simple Knowledge Organization System (SKOS) linked data standard, 2) implementing persistent identifiers (PIDs) and launching our prototype ARK resolver, and 3) importing the 1910 LCSH into the Helping Interdisciplinary Vocabulary Engineering (HIVE) System to support automatic metadata generation and scholarly analysis of the historical record. The discussion considers the implications of our work in the broader context of preservation, and the conclusion summarizes our work and identifies next steps.

中文翻译:

项目管道:保存、持久性和性能

当数字化内容也准备好计算时,保存管道展示了扩展价值。如果要将其扩展到以模拟格式发布的历史受控词汇表,则需要额外的步骤才能充分利用它们进行研究。本文报告了应对这一挑战的工作。我们报告了解决三个关键目标的管道和项目进度:1) 将 1910 年国会图书馆主题词 (LCSH) 转换为简单知识组织系统 (SKOS) 链接数据标准,2) 实施持久标识符 (PID) 并启动我们的原型 ARK 解析器,以及 3) 将 1910 LCSH 导入帮助跨学科词汇工程 (HIVE) 系统,以支持自动元数据生成和历史记录的学术分析。
更新日期:2021-09-15
down
wechat
bug