Discovery of emerging design patterns in ontologies using tree mining
Semantic Web ( IF 3.0 ) Pub Date : 2018-06-29 , DOI: 10.3233/sw-170280
Agnieszka Ławrynowicz 1 , Jedrzej Potoniec 1 , Michał Robaczyk 1 , Tania Tudorache 2

The research goal of this work is to investigate modeling patterns that recur in ontologies. Such patterns may originate from particular design solutions, and they may indicate emerging ontology design patterns. We describe our tree-mining method for identifying these emerging design patterns. The method works in two steps: (1) we transform the ontology axioms into trees in order to find axiom patterns; and then (2) we use association analysis to mine co-occurring axiom patterns in order to extract emerging design patterns. We conduct an experimental study on a set of 331 ontologies from the BioPortal repository. We show that recurring axiom patterns appear within each individual ontology, as well as across the whole set. In individual ontologies, we find frequent and non-trivial patterns both with and without variables. Some of the patterns with variables have more than 300,000 occurrences. The longest pattern without a variable discovered from the whole ontology set has size 12, and it appears in 14 ontologies. To the best of our knowledge, this is the first method for automatic discovery of emerging design patterns in ontologies. Finally, we demonstrate that the method automatically detects patterns that we have manually confirmed to be fragments of ontology design patterns described in the literature. Since our method is not specific to particular ontologies, we conclude that it should be able to discover new, emerging design patterns in arbitrary ontology sets.
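
The two steps described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it assumes axioms are already available as nested tuples, generalizes every entity name to a single variable `?`, and treats each whole generalized axiom as one pattern instead of mining frequent subtrees. The ontology names, axioms, and function names below are all hypothetical.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy data: each axiom is a tree written as a nested tuple
# (constructor, child, child, ...); plain strings are entity names (leaves).
ONTOLOGIES = {
    "onto_a": [
        ("SubClassOf", "Heart", ("ObjectSomeValuesFrom", "partOf", "Body")),
        ("SubClassOf", "Lung", ("ObjectSomeValuesFrom", "partOf", "Body")),
        ("AnnotationAssertion", "rdfs:label", "Heart", "heart"),
    ],
    "onto_b": [
        ("SubClassOf", "Leaf", ("ObjectSomeValuesFrom", "partOf", "Plant")),
        ("AnnotationAssertion", "rdfs:label", "Leaf", "leaf"),
    ],
    "onto_c": [
        ("SubClassOf", "Cat", "Animal"),
    ],
}


def generalize(axiom):
    """Replace every leaf (entity name) with the variable '?', keeping the tree shape."""
    if isinstance(axiom, str):
        return "?"
    constructor, *children = axiom
    return (constructor, *(generalize(child) for child in children))


def patterns_per_ontology(ontologies):
    """Step 1 (simplified): the set of generalized axiom trees found in each ontology."""
    return {name: {generalize(a) for a in axioms} for name, axioms in ontologies.items()}


def frequent_patterns(pattern_sets, min_support):
    """Keep patterns that occur in at least `min_support` ontologies."""
    counts = Counter(p for patterns in pattern_sets.values() for p in patterns)
    return {p: c for p, c in counts.items() if c >= min_support}


def cooccurring_pairs(pattern_sets, frequent, min_support):
    """Step 2 (simplified): pairs of frequent patterns that appear together
    in at least `min_support` ontologies."""
    pair_counts = Counter()
    for patterns in pattern_sets.values():
        # Sort by repr so each unordered pair is always counted under one key.
        kept = sorted((p for p in patterns if p in frequent), key=repr)
        pair_counts.update(combinations(kept, 2))
    return {pair: c for pair, c in pair_counts.items() if c >= min_support}
```

With a minimum support of 2, the existential-restriction subclass pattern and the annotation pattern both survive, because `onto_a` and `onto_b` each contain them, and they form the only co-occurring pair. A real tree miner would additionally enumerate subtrees and could keep some leaves as constants, which is how patterns without variables arise.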

Updated: 2018-06-29