Skip to main content
Log in

Privacy-preserving spatial keyword location-to-trajectory matching

  • Published:
Distributed and Parallel Databases Aims and scope Submit manuscript

Abstract

Location-to-trajectory matching plays an important role in trajectory data management and analysis. In this paper, we propose and study a novel problem of privacy-preserving spatial keyword location-to-trajectory matching (PPSKLT Matching). Given a set O of locations with keywords, a set T of activity trajectories, a spatial matching threshold \(\theta _{\text{sp}}\), and a keyword matching threshold \(\theta _{\text{kw}}\), the PPSKLT matching finds all location-trajectory pairs from O and T while preserving the users’ privacy. We believe that the PPSKLT matching may benefit many mobile applications such as trajectory activity analysis, event tracking, and so on. The PPSKLT matching is challenging due to three reasons: (1) how to define the spatial keyword similarity measure between locations and trajectories, (2) how to prune the search space effectively, and (3) how to preserve the users’ privacy during query processing. To overcome these challenges and address the problem efficiently, we develop a novel network expansion algorithm (NEA). We define a pair of upper and lower bounds on the spatio-textual similarity to prune the search space. We also define a privacy-preserving mechanism to protect users’ privacy. We conduct extensive experiments on spatio-textual data sets to verify the performance of the developed algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Notes

  1. While many location points in the road network may have informative keywords such as “restaurant”, “shopping mall”, etc., it is also possible that some location points in V are simply road connectors without any meaningful keywords. In such cases, we may consider that the function \({\mathcal {K}}(\cdot )\) returns the empty set.

  2. http://www.iscas.ac.cn/

  3. https://www.cs.utah.edu/~lifeifei/SpatialDataset.htm

  4. https://developer.foursquare.com/

  5. Specifically, the for-loop at Line 2 of Baseline-LT (Algorithm 2) and the for-loop at Line 2 of Baseline-TL (Algorithm 3) are parallelized.

References

  1. Agrawal, R., Faloutsos, C., Swami, A.: Efficient similarity search in sequence databases. In: FODO, pp. 69–84 (1993)

  2. Ahuja, R.K., Magnanti, T.L., Orlin, J.B.: Network Flows: Theory, Algorithms, and Applications. Prentice Hall, Upper Saddle River, NJ (1993)

    MATH  Google Scholar 

  3. Cao, X., Chen, L., Cong, G., Guan, J., Phan, N.-T., Xiao, X.: KORS: keyword-aware optimal route search system. In: ICDE (2013)

  4. Cao, X., Chen, L., Cong, G., Jensen, C.S., Qu, Q., Skovsgaard, A., Wu, D., Yiu, M. L.: Spatial keyword querying. In: ER (2012)

  5. Cao, X., Chen, L., Cong, G., Xiao, X.: Keyword-aware optimal route search. In: Proceedings of the VLDB Endowment (PVLDB), vol. 5(11), pp. 1136–1147 (2012)

  6. Chen, L., Cong, G.: Diversity-aware top-k publish/subscribe for text stream. In: SIGMOD (2015)

  7. Chen, L., Cong, G., Cao, X.: An efficient query indexing mechanism for filtering geo-textual data. In: SIGMOD (2013)

  8. Chen, L., Cong, G., Cao, X., Tan, K.-L.: Temporal spatial-keyword top-k publish/subscribe. In: ICDE (2015)

  9. Chen, L., Cong, G., Jensen, C.S., Wu, D.: Spatial keyword query processing: an experimental evaluation. PVLDB 6(3), 217–228 (2013)

    Google Scholar 

  10. Chen, L., Cui, Y., Cong, G., Cao, X.: SOPS: a system for efficient processing of spatial-keyword publish/subscribe. PVLDB 7(13), 1601–1604 (2014)

    Google Scholar 

  11. Chen, L., Shang, S.: Region-based message exploration over spatio-temporal data streams. In: AAAI, pp. 873–880 (2019)

  12. Chen, L., Shang, S., Jensen, C.S., Yao, B., Zhang, Z., Shao, L.: Effective and efficient reuse of past travel behavior for route recommendation. In: KDD (2019)

  13. Chen, L., Shang, S., Yang, C., Li, J.: Spatial keyword search: a survey. Geoinformatica 24(1), 85–106 (2020)

    Article  Google Scholar 

  14. Chen, L., Shang, S., Zhang, Z., Cao, X., Jensen, C.S., Kalnis, P.: Location-aware top-k term publish/subscribe. In: ICDE, pp. 749–760 (2018)

  15. Chen, L., Shang, S., Zheng, K., Kalnis, P.: Cluster-based subscription matching for geo-textual data streams. In: ICDE, pp. 890–901 (2019)

  16. Chen, Z., Cong, G., Zhang, Z., Fu, T.Z.J., Chen, L.: Distributed publish/subscribe query processing on the spatio-textual data stream. In: ICDE (2017)

  17. Cormen, T.H., Leiserson, C.E., Rivest, R.L., Stein, C.: Introduction to Algorithms, 3rd edn. The MIT Press, New York (2009)

    MATH  Google Scholar 

  18. Giannotti, F., Nanni, M., Pinelli, F., Pedreschi, D.: Trajectory pattern mining. In: SIGKDD, pp. 330–339 (2007)

  19. Han, P., Shang, S., Sun, A., Zhao, P., Kalnis, P.: AUC-MF: point of interest recommendation with AUC maximization. In: ICDE (2019)

  20. Ilyas, I.F., Beskales, G., Soliman, M.A.: A survey of top-k query processing techniques in relational database systems. ACM Computing Survey 40(4), Article No. 11 (2008)

  21. Ilyas, I.F., Shah, R., Aref, W.G., Vitter, J.S., Elmagarmid, A.K.: Rank-aware query optimization. In: SIGMOD (2004)

  22. Kou, N.M., Li, Y., Wang, H., U, L.H., Gong, Z.: Crowdsourced top-k queries by confidence-aware pairwise judgments. In: SIGMOD (2017)

  23. wang, Hao, fan, shunguo, song, jinhua, gao, yang, chen, xingguo: R learning transfer based on subgoal discovery and subtask similarity. IEEE/CAA J. Auto. Sin. 1(3), 252–266 (2014)

    Google Scholar 

  24. Levy, O., Goldberg, Y.: Neural word embedding as implicit matrix factorization. In: NIPS (2014)

  25. Li, J., Sun, A., Xing, Z.: Learning to answer programming questions with software documentation through social context embedding. Inform. Sci. 448–449, 36–52 (2018)

    Article  Google Scholar 

  26. Li, J., Xing, Z., Kabir, A.: Leveraging official content and social context to recommend software documentation. In: IEEE Transactions on Services Computing (2018)

  27. Li, M., Chen, L., Cong, G., Gu, Y., Yu, G.: Efficient processing of location-aware group preference queries. In: CIKM (2016)

  28. Li, Y., Kou, N.M., Wang, H., U, L.H., Gong, Z.: A confidence-aware top-k query processing toolkit on crowdsourcing. In: VLDB (2017)

  29. Liu, Y., Zhao, Y., Chen, L., Pei, J., Han, J.: Mining frequent trajectory patterns for activity monitoring using radio frequency tag arrays. IEEE Trans. Parall. Distrib. Syst. 23(11), 2138–2149 (2012)

    Article  Google Scholar 

  30. Lu, Z., Wang, H., Mamoulis, N., Tu, W., Cheung, D.W.: Personalized location recommendation by aggregating multiple recomenders in diversity. Geoinformatica 21(3), 459–484 (2017)

    Article  Google Scholar 

  31. Ramos, J.: Using TF-IDF to determine word relevance in document queries (2003)

  32. Shang, S., Chen, L., Jensen, C.S., Wen, J., Kalnis, P.: Searching trajectories by regions of interest. IEEE Trans. Knowl. Data Eng. 29(7), 1549–1562 (2017)

    Article  Google Scholar 

  33. Shang, S., Chen, L., Jensen, C.S., Wen, J.-R., Kalnis, P.: Search trajectories by regions of interest. In: ICDE (2018)

  34. Shang, S., Chen, L., Wei, Z., Jensen, C.S., Wen, J., Kalnis, P.: Collective travel planning in spatial networks. IEEE Trans. Knowl. Data Eng. 28(5), 1132–1146 (2016)

    Article  Google Scholar 

  35. Shang, S., Chen, L., Wei, Z., Jensen, C.S., Zheng, K., Kalnis, P.: Trajectory similarity join in spatial networks. PVLDB 10(11), 1178–1189 (2017)

    Google Scholar 

  36. Shang, S., Chen, L., Wei, Z., Jensen, C.S., Zheng, K., Kalnis, P.: Parallel trajectory similarity joins in spatial networks. VLDB J. 27(3), 395–420 (2018)

    Article  Google Scholar 

  37. Shang, S., Chen, L., Zheng, K., Jensen, C.S., Wei, Z., Kalnis, P.: Parallel trajectory-to-location join. IEEE Trans. Knowl. Data Eng. 31(6), 1194–1207 (2019)

    Article  Google Scholar 

  38. Shang, S., Ding, R., Zheng, K., Jensen, C.S., Kalnis, P., Zhou, X.: Personalized trajectory matching in spatial networks. VLDB J. 23(3), 449–468 (2014)

    Article  Google Scholar 

  39. Shang, S., Guo, D., Liu, J., Zheng, K., Wen, J.: Finding regions of interest using location based social media. Neurocomputing 173, 118–123 (2016)

    Article  Google Scholar 

  40. Shang, S., Liu, J., Zheng, K., Lu, H., Pedersen, T.B., Wen, J.: Planning unobstructed paths in traffic-aware spatial networks. GeoInformatica 19(4), 723–746 (2015)

    Article  Google Scholar 

  41. Shang, S., Lu, H., Pedersen, T.B., Xie, X.: Finding traffic-aware fastest paths in spatial networks. In: SSTD, pp. 128–145 (2013)

  42. Shang, S., Wei, Z., Wen, J., Zhu, S.: Probabilistic nearest neighbor query in traffic-aware spatial networks. In: APWeb, pp. 3–14 (2016)

  43. Shang, S., Xie, K., Zheng, K., Liu, J., Wen, J.: VID join: Mapping trajectories to points of interest to support location-based services. J. Comput. Sci. Technol. 30(4), 725–744 (2015)

    Article  Google Scholar 

  44. Shang, S., Yuan, B., Deng, K., Xie, K., Zheng, K., Zhou, X.: Pnn query processing on compressed trajectories. GeoInformatica 16(3), 467–496 (2012)

    Article  Google Scholar 

  45. Shang, S., Yuan, B., Deng, K., Xie, K., Zhou, X.: Finding the most accessible locations: reverse path nearest neighbor query in road networks. In: ACM SIGSPATIAL, pp. 181–190 (2011)

  46. Shang, S., Zheng, K., Jensen, C.S., Yang, B., Kalnis, P., Li, G., Wen, J.: Discovery of path nearby clusters in spatial networks. IEEE Trans. Knowl. Data Eng. 27(6), 1505–1518 (2015)

    Article  Google Scholar 

  47. Song, J., Gao, Y., Wang, H.: Feature learning and transfer performance prediction for video reinforcement learning tasks via a Siamese convolutional neural network. In: ICNIP (2018)

  48. Sutton, R.S., Barto, A.: Reinforcement learning: an introduction, 2nd edn. The MIT Press, New York (2018)

    MATH  Google Scholar 

  49. Wang, H., Cai, Y., Yang, Y., Zhang, S., Mamoulis, N.: Durable queries over historical time series. IEEE Trans. Knowl. Data Eng. (TKDE) 26(3), 595–607 (2014)

    Article  Google Scholar 

  50. Wang, H., Dong, S., Shao, L.: Measuring structual similarities in finte MDPs. In: IJCAI (2019)

  51. Wang, H., Gao, Y., Chen, X.: RL-DOT: a reinforcement learning NPC team for playing domination games. IEEE Transactions on Computational Intelligence and AI in Games

  52. Wang, H., Lu, Z.: Preference-aware sequence matching for location-based services. Geoinformatica 24(1), 107–131 (2020)

    Article  Google Scholar 

  53. Wang, H., Pan, N., U, L.H., Zhan, B., Gong, Z.: On dynamic top-k influence maximization. In: WAIM (2015)

  54. Wang, H., Terrovitis, M., Mamoulis, N.: Location recommendation in location-based social networks using user check-in data. In: SIGSPATIAL GIS (2013)

  55. Wang, Y., Li, J., Zhong, Y., Zhu, S., Guo, D., Shang, S.: Discovery of accessible locations using region-based geo-social data. World Wide Web 22(3), 929–944 (2019)

    Article  Google Scholar 

  56. Xu, Y., Chen, L., Yao, B., Shang, S., Zhu, S., Zheng, K., Li, F.: Location-based top-k term querying over sliding window. In: WISE, pp. 299–314 (2017)

  57. Yang, C., Chen, L., Shang, S., Zhu, F., Liu, L., Shao, L.: Toward efficient navigation of massive-scale geo-textual streams. In: IJCAI (2019)

  58. Yang, S., Gao, Y., An, B., Wang, H., Chen, X.: Efficient average reward reinforcement learning using constant shifting values. In: AAAI (2016)

  59. Yang, S., Wang, H., Gao, Y., Chen, X.: An optimal algorithm for the stochastic bandits with knowing near-optimal mean reward. In: AAMAS (2018)

  60. Yu, Y., Gao, Y., Wang, H., Wang, R.: Joint user knowledge and matrix factorization for recommender systems. World Wide Web 21(4), 1141–1163 (2018)

    Article  Google Scholar 

  61. Yu, Y., Wang, C., Wang, H., Gao, Y.: Attributes coupling based matrix factorization for item recommendation. Appl. Intell. 46(3), 521–533 (2017)

    Article  Google Scholar 

  62. Yu, Y., Wang, H., Sun, S., Gao, Y.: Exploiting location significance and user authority for point-of-interest recommendation. In: PAKDD (2017)

  63. Zhang, C., Wang, H., Yang, S., Gao, Y.: A contextual bandit approach to personalized online recommendation via sparse intersections. In: PAKDD (2019)

  64. Zhang, W., Qi, G., Pan, G., Lu, H., Li, S., Wu, Z.: City-scale social event detection and evaluation with taxi traces. ACM Trans. Intell. Syst. Technol. 6(3), Article No. 40 (2015)

  65. Zheng, K., Shang, S., Yuan, N.J., Yang, Y.: Towards efficient search for activity trajectories. In: ICDE, pp. 230–241 (2013)

  66. Zheng, K., Su, H., Zheng, B., Xu, J., Liu, J., Zhou, X.: Interactive top-k spatial keyword queries. In: ICDE, pp. 423–434 (2015)

Download references

Acknowledgements

This study is supported by the Program of New Century Excellent Talents in Fujian Province University.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shunzhi Zhu.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, N., Zeng, J., Hong, W. et al. Privacy-preserving spatial keyword location-to-trajectory matching. Distrib Parallel Databases 38, 667–686 (2020). https://doi.org/10.1007/s10619-020-07290-2

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10619-020-07290-2

Keywords

Navigation