Evaluating the Impact of the Long-S upon 18th-Century Encyclopedia Britannica Automatic Subject Metadata Generation Results,Information Technology and Libraries

当前位置： X-MOL 学术 › Information Technology and Libraries › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Evaluating the Impact of the Long-S upon 18th-Century Encyclopedia Britannica Automatic Subject Metadata Generation Results
Information Technology and Libraries ( IF 1.5 ) Pub Date : 2020-09-21 , DOI: 10.6017/ital.v39i3.12235
Sam Grabus

This research compares automatic subject metadata generation when the pre-1800s Long-S character is corrected to a standard < s >. The test environment includes entries from the third edition of the Encyclopedia Britannica, and the HIVE automatic subject indexing tool. A comparative study of metadata generated before and after correction of the Long-S demonstrated an average of 26.51 percent potentially relevant terms per entry omitted from results if the Long-S is not corrected. Results confirm that correcting the Long-S increases the availability of terms that can be used for creating quality metadata records. A relationship is also demonstrated between shorter entries and an increase in omitted terms when the Long-S is not corrected.

中文翻译：

评估Long-S对18世纪大英百科全书自动主题元数据生成结果的影响

这项研究比较了将1800年代以前的Long-S字符校正为标准<s>时自动生成主题元数据的情况。测试环境包括来自《大不列颠百科全书》第三版的条目和HIVE自动主题索引工具。对Long-S校正前后产生的元数据的比较研究表明，如果不对Long-S进行校正，则从结果中省略每个条目的平均潜在相关术语的比率为26.51％。结果证实，更正Long-S可以提高可用于创建质量元数据记录的术语的可用性。如果未纠正Long-S，则较短的条目和省略的术语的增加之间也存在关系。

更新日期：2020-09-21

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文