当前位置: X-MOL 学术J. Big Data › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
A new effective method for labeling dynamic XML data
Journal of Big Data ( IF 8.1 ) Pub Date : 2018-12-19 , DOI: 10.1186/s40537-018-0161-4
Eynollah Khanjari , Leila Gaeini

Query processing based on labeling dynamic XML documents has gained more attention in the past several years. An efficient labeling scheme should provide small size labels keeping the simplicity of the exploited algorithm in order to avoid complex computations as well as retaining the readability of structural relationships between nodes. Moreover, for dynamic XML data, relabeling the nodes in XML updates should be avoided. However, the existing schemes lack the capability of supporting all of these requirements. In this paper, we propose a new labeling scheme which assigns variable-length labels to nodes in dynamic XML documents. Our method employs the FibLSS encoding scheme that exploits the properties of the Fibonacci sequence to provide variable-length node labels of appropriate size. In XML updating process, we add a new section only in the new node’s label without relabeling the existing nodes while keeping the order of nodes as well as preserving the structural relationships. Our labeling method is scalable as it is not subject to overflow, and as the number of nodes to be labeled increases exponentially, the size of labels grows linearly, which makes it suitable for big datasets. It also has the best performance in computational processing costs compared to existing approaches. The results of the experiments confirm the advantages of our proposed method in comparison to state-of-the-art techniques.

中文翻译:

标记动态XML数据的新有效方法

在过去的几年中,基于标记动态XML文档的查询处理得到了越来越多的关注。一个有效的标记方案应提供小尺寸的标记,以保持所开发算法的简单性,从而避免复杂的计算并保持节点之间结构关系的可读性。此外,对于动态XML数据,应避免在XML更新中重新标记节点。但是,现有方案缺乏支持所有这些要求的能力。在本文中,我们提出了一种新的标记方案,该方案将可变长度标签分配给动态XML文档中的节点。我们的方法采用FibLSS编码方案,该方案利用Fibonacci序列的属性来提供适当大小的可变长度节点标签。在XML更新过程中,我们仅在新节点的标签中添加一个新部分,而不重新标记现有节点,同时保留节点的顺序并保留结构关系。我们的标注方法具有可伸缩性,因为它不会溢出,并且随着要标注的节点数量呈指数增长,标注的大小呈线性增长,这使其适用于大型数据集。与现有方法相比,它在计算处理成本方面也具有最佳性能。实验结果证实了我们提出的方法与最新技术相比的优势。随着要标记的节点数量呈指数增长,标签的大小呈线性增长,这使其适用于大型数据集。与现有方法相比,它在计算处理成本方面也具有最佳性能。实验结果证实了我们提出的方法与最新技术相比的优势。随着要标记的节点数量呈指数增长,标签的大小呈线性增长,这使其适用于大型数据集。与现有方法相比,它在计算处理成本方面也具有最佳性能。实验结果证实了我们提出的方法与最新技术相比的优势。
更新日期:2018-12-19
down
wechat
bug