Abstract
Location-to-trajectory matching plays an important role in trajectory data management and analysis. In this paper, we propose and study a novel problem: privacy-preserving spatial keyword location-to-trajectory matching (PPSKLT matching). Given a set O of locations with keywords, a set T of activity trajectories, a spatial matching threshold \(\theta _{\text{sp}}\), and a keyword matching threshold \(\theta _{\text{kw}}\), PPSKLT matching finds all location-trajectory pairs from O and T that satisfy both thresholds while preserving the users’ privacy. We believe that PPSKLT matching can benefit many mobile applications, such as trajectory activity analysis and event tracking. PPSKLT matching is challenging for three reasons: (1) how to define the spatial keyword similarity measure between locations and trajectories, (2) how to prune the search space effectively, and (3) how to preserve the users’ privacy during query processing. To overcome these challenges and solve the problem efficiently, we develop a novel network expansion algorithm (NEA). We define a pair of upper and lower bounds on the spatio-textual similarity to prune the search space, and a privacy-preserving mechanism to protect users’ privacy. We conduct extensive experiments on spatio-textual data sets to verify the performance of the developed algorithms.
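To make the problem statement concrete, the following is a minimal brute-force sketch of threshold-based location-to-trajectory matching. It is not the NEA algorithm and includes no privacy mechanism. The similarity measures are assumptions chosen for illustration only: spatial similarity is taken as exp(-d), with d the Euclidean distance to the nearest trajectory point (a stand-in for road-network distance), and keyword similarity as the Jaccard coefficient. The names `Location`, `spatial_sim`, `keyword_sim`, and `naive_matching` are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Location:
    """A point with an optional keyword set (empty for plain road connectors)."""
    x: float
    y: float
    keywords: frozenset = frozenset()


def spatial_sim(o, traj):
    # Assumed measure: exp(-d), where d is the Euclidean distance from o
    # to the nearest point of the (non-empty) trajectory.
    d = min(math.hypot(o.x - p.x, o.y - p.y) for p in traj)
    return math.exp(-d)


def keyword_sim(o, traj):
    # Assumed measure: Jaccard coefficient between the location's keywords
    # and the union of all keywords attached to the trajectory's points.
    traj_kw = frozenset().union(*(p.keywords for p in traj))
    union = o.keywords | traj_kw
    if not union:
        return 0.0  # neither side carries keywords
    return len(o.keywords & traj_kw) / len(union)


def naive_matching(O, T, theta_sp, theta_kw):
    # Check every location-trajectory pair against both thresholds.
    # Returns index pairs (i, j) for matching O[i] and T[j].
    return [
        (i, j)
        for i, o in enumerate(O)
        for j, t in enumerate(T)
        if spatial_sim(o, t) >= theta_sp and keyword_sim(o, t) >= theta_kw
    ]
```

This exhaustive scan evaluates every pair in O × T, which is the cost that upper and lower bounds on the similarity are meant to avoid by discarding pairs early.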
Notes
While many location points in the road network may have informative keywords such as “restaurant” or “shopping mall”, some location points in V may simply be road connectors without any meaningful keywords. In such cases, we consider the function \({\mathcal {K}}(\cdot )\) to return the empty set.
Specifically, the for-loop at Line 2 of Baseline-LT (Algorithm 2) and the for-loop at Line 2 of Baseline-TL (Algorithm 3) are parallelized.
Acknowledgements
This study is supported by the Program of New Century Excellent Talents in Fujian Province University.
Cite this article
Wang, N., Zeng, J., Hong, W. et al. Privacy-preserving spatial keyword location-to-trajectory matching. Distrib Parallel Databases 38, 667–686 (2020). https://doi.org/10.1007/s10619-020-07290-2
DOI: https://doi.org/10.1007/s10619-020-07290-2