当前位置: X-MOL 学术Information Technology and Libraries › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Evaluating the Impact of the Long-S upon 18th-Century Encyclopedia Britannica Automatic Subject Metadata Generation Results
Information Technology and Libraries ( IF 1.5 ) Pub Date : 2020-09-21 , DOI: 10.6017/ital.v39i3.12235
Sam Grabus

This research compares automatic subject metadata generation when the pre-1800s Long-S character is corrected to a standard < s >. The test environment includes entries from the third edition of the Encyclopedia Britannica, and the HIVE automatic subject indexing tool. A comparative study of metadata generated before and after correction of the Long-S demonstrated an average of 26.51 percent potentially relevant terms per entry omitted from results if the Long-S is not corrected. Results confirm that correcting the Long-S increases the availability of terms that can be used for creating quality metadata records. A relationship is also demonstrated between shorter entries and an increase in omitted terms when the Long-S is not corrected.

中文翻译:

评估Long-S对18世纪大英百科全书自动主题元数据生成结果的影响

这项研究比较了将1800年代以前的Long-S字符校正为标准<s>时自动生成主题元数据的情况。测试环境包括来自《大不列颠百科全书》第三版的条目和HIVE自动主题索引工具。对Long-S校正前后产生的元数据的比较研究表明,如果不对Long-S进行校正,则从结果中省略每个条目的平均潜在相关术语的比率为26.51%。结果证实,更正Long-S可以提高可用于创建质量元数据记录的术语的可用性。如果未纠正Long-S,则较短的条目和省略的术语的增加之间也存在关系。
更新日期:2020-09-21
down
wechat
bug