Automated software system for checking the structure and format of ACM SIG documents
New Review of Hypermedia and Multimedia (IF 1.4), Pub Date: 2016-07-22, DOI: 10.1080/13614568.2016.1209247
Arsalan Rahman Mirza 1 , Melike Sah 2

ABSTRACT Microsoft (MS) Office Word is one of the most commonly used software tools for creating documents. MS Word 2007 and later use XML to represent the structure of MS Word documents, and metadata about the documents is created automatically using Office Open XML (OOXML) syntax. We develop a new framework, called ADFCS (Automated Document Format Checking System), that takes advantage of the OOXML metadata to extract semantic information from MS Office Word documents. In particular, we develop a new ontology, expressed in OWL (Web Ontology Language), that represents the structure and format of Association for Computing Machinery (ACM) Special Interest Group (SIG) documents. The metadata is then extracted automatically as RDF (Resource Description Framework) according to this ontology using the developed software. Finally, we generate an extensive set of rules to infer whether the documents are formatted according to ACM SIG standards. This paper introduces the ACM SIG ontology, the metadata extraction process, the inference engine, the ADFCS online user interface, a system evaluation, and a user study evaluation.
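
The extraction-then-check idea described in the abstract can be illustrated with a minimal Python sketch. This is not the ADFCS implementation: it uses only the standard library, skips the OWL/RDF layer and the inference engine entirely, and reads formatting only from run properties in `word/document.xml` (real documents often define sizes in `styles.xml` instead). The function names, the sample document, and the `EXPECTED_SIZE_PT` rule (an 18 pt title) are assumptions made for illustration, not values taken from the paper or from the ACM SIG template.

```python
import io
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used inside word/document.xml
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W}


def extract_paragraphs(docx_bytes):
    """Return style id, font size (points) and text for each paragraph."""
    with zipfile.ZipFile(io.BytesIO(docx_bytes)) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))
    paragraphs = []
    for p in root.iter(f"{{{W}}}p"):
        style = p.find("w:pPr/w:pStyle", NS)
        size = p.find("w:r/w:rPr/w:sz", NS)  # size of the first run only
        text = "".join(t.text or "" for t in p.iter(f"{{{W}}}t"))
        paragraphs.append({
            "style": style.get(f"{{{W}}}val") if style is not None else None,
            # w:sz stores the font size in half-points
            "size_pt": int(size.get(f"{{{W}}}val")) / 2 if size is not None else None,
            "text": text,
        })
    return paragraphs


# Hypothetical rule table: expected font size per paragraph style.
EXPECTED_SIZE_PT = {"Title": 18.0}


def check(paragraphs, rules=EXPECTED_SIZE_PT):
    """Return (index, style, found size, expected size) for each violation."""
    problems = []
    for i, p in enumerate(paragraphs):
        expected = rules.get(p["style"])
        if expected is not None and p["size_pt"] != expected:
            problems.append((i, p["style"], p["size_pt"], expected))
    return problems


def make_sample_docx():
    """Build a tiny in-memory archive holding only word/document.xml.

    Enough for this parser, but not a complete file that Word would open.
    """
    xml = (
        f'<w:document xmlns:w="{W}"><w:body>'
        '<w:p><w:pPr><w:pStyle w:val="Title"/></w:pPr>'
        '<w:r><w:rPr><w:sz w:val="28"/></w:rPr><w:t>Sample</w:t></w:r></w:p>'
        '</w:body></w:document>'
    )
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("word/document.xml", xml)
    return buffer.getvalue()


if __name__ == "__main__":
    for problem in check(extract_paragraphs(make_sample_docx())):
        print(problem)
```

Here the sample title is set at 14 pt, so the hypothetical 18 pt rule flags it. In the system the abstract describes, the extracted values would instead be expressed as RDF against the ACM SIG ontology and evaluated by inference rules.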

Updated: 2016-07-22