当前位置: X-MOL 学术International Journal on Digital Libraries › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Building and querying semantic layers for web archives (extended version)
International Journal on Digital Libraries ( IF 1.6 ) Pub Date : 2018-07-05 , DOI: 10.1007/s00799-018-0251-0
Pavlos Fafalios , Helge Holzmann , Vaibhav Kasturia , Wolfgang Nejdl

Web archiving is the process of collecting portions of the Web to ensure that the information is preserved for future exploitation. However, despite the increasing number of web archives worldwide, the absence of efficient and meaningful exploration methods still remains a major hurdle in the way of turning them into a usable and useful information source. In this paper, we focus on this problem and propose an RDF/S model and a distributed framework for building semantic profiles (“layers”) that describe semantic information about the contents of web archives. A semantic layer allows describing metadata information about the archived documents, annotating them with useful semantic information (like entities, concepts, and events), and publishing all these data on the Web as Linked Data. Such structured repositories offer advanced query and integration capabilities, and make web archives directly exploitable by other systems and tools. To demonstrate their query capabilities, we build and query semantic layers for three different types of web archives. An experimental evaluation showed that a semantic layer can answer information needs that existing keyword-based systems are not able to sufficiently satisfy.

中文翻译:

构建和查询Web归档的语义层(扩展版)

Web归档是收集Web部分以确保保留信息以备将来使用的过程。然而,尽管全球网络档案的数量在增加,但是缺乏有效且有意义的探索方法仍然是将其转变为可用和有用的信息源的主要障碍。在本文中,我们着眼于此问题,并提出了一个RDF / S模型和一个分布式框架,用于构建描述Web档案内容语义信息的语义配置文件(“层”)。语义层允许描述有关归档文​​档的元数据信息,并用有用的语义信息(例如实体,概念和事件)对其进行注释,并将所有这些数据作为链接数据发布在Web上。这样的结构化存储库提供了高级查询和集成功能,并使Web档案可被其他系统和工具直接利用。为了演示其查询功能,我们为三种不同类型的Web档案构建和查询语义层。实验评估表明,语义层可以回答现有的基于关键字的系统无法充分满足的信息需求。
更新日期:2018-07-05
down
wechat
bug