当前位置: X-MOL 学术arXiv.cs.DB › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Fast Subtrajectory Similarity Search in Road Networks under Weighted Edit Distance Constraints
arXiv - CS - Databases Pub Date : 2020-06-09 , DOI: arxiv-2006.05564
Satoshi Koide, Chuan Xiao, Yoshiharu Ishikawa

In this paper, we address a similarity search problem for spatial trajectories in road networks. In particular, we focus on the subtrajectory similarity search problem, which involves finding in a database the subtrajectories similar to a query trajectory. A key feature of our approach is that we do not focus on a specific similarity function; instead, we consider weighted edit distance (WED), a class of similarity functions which allows user-defined cost functions and hence includes several important similarity functions such as EDR and ERP. We model trajectories as strings, and propose a generic solution which is able to deal with any similarity function belonging to the class of WED. By employing the filter-and-verify strategy, we introduce subsequence filtering to efficiently prunes trajectories and find candidates. In order to choose a proper subsequence to optimize the candidate number, we model the choice as a discrete optimization problem (NP-hard) and compute it using a 2-approximation algorithm. To verify candidates, we design bidirectional tries, with which the verification starts from promising positions and leverage the shared segments of trajectories and the sparsity of road networks for speed-up. Experiments are conducted on large datasets to demonstrate the effectiveness of WED and the efficiency of our method for various similarity functions under WED.

中文翻译:

加权编辑距离约束下道路网络中的快速子轨迹相似性搜索

在本文中,我们解决了道路网络中空间轨迹的相似性搜索问题。特别是,我们关注子轨迹相似性搜索问题,它涉及在数据库中查找与查询轨迹相似的子轨迹。我们方法的一个关键特征是我们不关注特定的相似度函数;相反,我们考虑加权编辑距离 (WED),这是一类允许用户定义成本函数的相似函数,因此包括几个重要的相似函数,例如 EDR 和 ERP。我们将轨迹建模为字符串,并提出了一种通用解决方案,该解决方案能够处理属于 WED 类的任何相似性函数。通过采用过滤和验证策略,我们引入了子序列过滤来有效地修剪轨迹并找到候选者。为了选择合适的子序列来优化候选数,我们将选择建模为离散优化问题(NP-hard),并使用 2-近似算法进行计算。为了验证候选者,我们设计了双向尝试,验证从有希望的位置开始,并利用共享的轨迹段和道路网络的稀疏性来加速。在大型数据集上进行了实验,以证明 WED 的有效性以及我们的方法对 WED 下各种相似性函数的效率。验证从有希望的位置开始,并利用共享的轨迹段和道路网络的稀疏性来加速。在大型数据集上进行了实验,以证明 WED 的有效性以及我们的方法对 WED 下各种相似性函数的效率。验证从有希望的位置开始,并利用共享的轨迹段和道路网络的稀疏性来加速。在大型数据集上进行了实验,以证明 WED 的有效性以及我们的方法对 WED 下各种相似性函数的效率。
更新日期:2020-07-13
down
wechat
bug