The Labeled Direct Product Optimally Solves String Problems on Graphs,arXiv - CS - Computational Complexity

当前位置： X-MOL 学术 › arXiv.cs.CC › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

The Labeled Direct Product Optimally Solves String Problems on Graphs
arXiv - CS - Computational Complexity Pub Date : 2021-09-11 , DOI: arxiv-2109.05290
Nicola Rizzo, Alexandru I. Tomescu, Alberto Policriti

Suffix trees are an important data structure at the core of optimal solutions to many fundamental string problems, such as exact pattern matching, longest common substring, matching statistics, and longest repeated substring. Recent lines of research focused on extending some of these problems to vertex-labeled graphs, although using ad-hoc approaches which in some cases do not generalize to all input graphs. In the absence of a ubiquitous tool like the suffix tree for labeled graphs, we introduce the labeled direct product of two graphs as a general tool for obtaining optimal algorithms: we obtain conceptually simpler algorithms for the quadratic problems of string matching (SMLG) and longest common substring (LCSP) in labeled graphs. Our algorithms are also more efficient, since they run in time linear in the size of the labeled product graph, which may be smaller than quadratic for some inputs, and their run-time is predictable, because the size of the labeled direct product graph can be precomputed efficiently. We also solve LCSP on graphs containing cycles, which was left as an open problem by Shimohira et al. in 2011. To show the power of the labeled product graph, we also apply it to solve the matching statistics (MSP) and the longest repeated string (LRSP) problems in labeled graphs. Moreover, we show that our (worst-case quadratic) algorithms are also optimal, conditioned on the Orthogonal Vectors Hypothesis. Finally, we complete the complexity picture around LRSP by studying it on undirected graphs.

中文翻译：

带标签的直接乘积最优解决图上的字符串问题

后缀树是许多基本字符串问题（如精确模式匹配、最长公共子串、匹配统计和最长重复子串）的最佳解决方案核心的重要数据结构。最近的研究方向集中在将这些问题中的一些扩展到顶点标记图，尽管使用了在某些情况下不能推广到所有输入图的临时方法。在没有像标记图的后缀树这样无处不在的工具的情况下，我们引入了两个图的标记直积作为获得最优算法的通用工具：我们获得了概念上更简单的字符串匹配（SMLG）二次问题算法和最长标记图中的公共子串（LCSP）。我们的算法也更高效，因为它们的运行时间与标记产品图的大小成线性关系，对于某些输入，它可能小于二次方，并且它们的运行时间是可预测的，因为可以有效地预先计算标记的直接乘积图的大小。我们还在包含循环的图上解决了 LCSP，这是 Shimohira 等人留下的一个悬而未决的问题。在 2011 年。为了展示标记产品图的强大功能，我们还应用它来解决标记图中的匹配统计 (MSP) 和最长重复字符串 (LRSP) 问题。此外，我们表明我们的（最坏情况二次）算法也是最优的，条件是正交向量假设。最后，我们通过在无向图上研究它来完成围绕 LRSP 的复杂性图。我们还在包含循环的图上解决了 LCSP，这是 Shimohira 等人留下的一个悬而未决的问题。在 2011 年。为了展示标记产品图的强大功能，我们还应用它来解决标记图中的匹配统计 (MSP) 和最长重复字符串 (LRSP) 问题。此外，我们表明我们的（最坏情况二次）算法也是最优的，条件是正交向量假设。最后，我们通过在无向图上研究它来完成围绕 LRSP 的复杂性图。我们还在包含循环的图上解决了 LCSP，这是 Shimohira 等人留下的一个悬而未决的问题。在 2011 年。为了展示标记产品图的强大功能，我们还应用它来解决标记图中的匹配统计 (MSP) 和最长重复字符串 (LRSP) 问题。此外，我们表明我们的（最坏情况二次）算法也是最优的，条件是正交向量假设。最后，我们通过在无向图上研究它来完成围绕 LRSP 的复杂性图。以正交向量假设为条件。最后，我们通过在无向图上研究它来完成围绕 LRSP 的复杂性图。以正交向量假设为条件。最后，我们通过在无向图上研究它来完成围绕 LRSP 的复杂性图。

更新日期：2021-09-14

点击分享查看原文

点击收藏

阅读更多本刊最新论文