skip to main content
research-article

Personalized Query Auto-Completion for Large-Scale POI Search at Baidu Maps

Authors Info & Claims
Published:18 June 2020Publication History
Skip Abstract Section

Abstract

Query auto-completion (QAC) is a featured function that has been widely adopted by many sub-domains of search. It can dramatically reduce the number of typed characters and avoid spelling mistakes. These merits of QAC are highlighted to improve user satisfaction, especially when users intend to type in a query on mobile devices. In this article, we will present our industrial solution to the personalized QAC for the point of interest (POI) search at Baidu Maps, a well-known Web mapping service on mobiles in China. The industrial solution makes a good tradeoff between the offline effectiveness of a novel neural learning model that we devised for feature generation and the online efficiency of an off-the-shelf learning to rank (LTR) approach for the real-time suggestion. Besides some practical lessons from how a real-world QAC system is built and deployed in Baidu Maps to facilitate a large number of users in searching tens of millions of POIs, we mainly explore two specific features for the personalized QAC function of the POI search engine: the spatial-temporal characteristics of POIs and the historically queried POIs of individual users.

We leverage the large-volume POI search logs in Baidu Maps to conduct offline evaluations of our personalized QAC model measured by multiple metrics, including Mean Reciprocal Rank (MRR), Success Rate (SR), and normalized Discounted Cumulative Gain (nDCG). Extensive experimental results demonstrate that the personalized model enhanced by the proposed features can achieve substantial improvements (i.e., +3.29% MRR, +3.78% SR@1, +5.17% SR@3, +1.96% SR@5, and +3.62% nDCG@5). After deploying this upgraded model into the POI search engine at Baidu Maps for A/B testing online, we observe that some other critical indicators, such as the average number of keystrokes and the average typing speed at keystrokes in a QAC session, which are also related to user satisfaction, decrease as well by 1.37% and 1.69%, respectively. So the conclusion is that the two kinds of features contributed by us are quite helpful in personalized mapping services for industrial practice.

References

  1. Nima Asadi and Jimmy Lin. 2012. Fast candidate generation for two-phase document ranking: Postings list intersection with bloom filters. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM’12). ACM, New York, NY, 2419--2422. DOI:https://doi.org/10.1145/2396761.2398656Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Ziv Bar-Yossef and Naama Kraus. 2011. Context-sensitive query auto-completion. In Proceedings of the 20th International Conference on World Wide Web (WWW’11). ACM, New York, NY, 107--116. DOI:https://doi.org/10.1145/1963405.1963424Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Y. Bengio, A. Courville, and P. Vincent. 2013. Representation learning: A review and new perspectives. IEEE Transactions on Pattern Analysis and Machine Intelligence 35, 8 (Aug 2013), 1798--1828. DOI:https://doi.org/10.1109/TPAMI.2013.50Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Andrei Z. Broder, David Carmel, Michael Herscovici, Aya Soffer, and Jason Zien. 2003. Efficient query evaluation using a two-level retrieval process. In Proceedings of the 12th International Conference on Information and Knowledge Management (CIKM’03). ACM, New York, NY, 426--434. DOI:https://doi.org/10.1145/956863.956944Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Fei Cai and Maarten de Rijke. 2016. A survey of query auto completion in information retrieval. Found. Trends Inf. Retr. 10, 4 (Sept. 2016), 273–363. DOI:https://doi.org/10.1561/1500000055Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Fei Cai, Shangsong Liang, and Maarten de Rijke. 2014. Time-sensitive personalized query auto-completion. In Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management (CIKM’14). ACM, New York, NY, 1599--1608. DOI:https://doi.org/10.1145/2661829.2661921Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Fei Cai, Ridho Reinanda, and Maarten De Rijke. 2016. Diversifying query auto-completion. ACM Transactions on Information Systems (TOIS) 34, 4, Article 25 (June 2016), 33 pages. DOI:https://doi.org/10.1145/2910579Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Kyunghyun Cho, Bart van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning phrase representations using RNN encoder--decoder for statistical machine translation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP’14). Association for Computational Linguistics, Doha, Qatar, 1724--1734. DOI:https://doi.org/10.3115/v1/D14-1179Google ScholarGoogle ScholarCross RefCross Ref
  9. V. S. Dandagi and N. Sidnal. 2018. Review on query auto-completion. In 2018 International Conference on Computational Techniques, Electronics and Mechanical Systems. 119--123.Google ScholarGoogle Scholar
  10. Jiafeng Guo, Yixing Fan, Liang Pang, Liu Yang, Qingyao Ai, Hamed Zamani, Chen Wu, W. Bruce Croft, and Xueqi Cheng. 2019. A Deep Look into Neural Ranking Models for Information Retrieval. arxiv:cs.IR/1903.06902Google ScholarGoogle Scholar
  11. Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural Computation 9, 8 (1997), 1735--1780.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Aaron Jaech and Mari Ostendorf. 2018. Personalized language model for query auto-completion. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). Association for Computational Linguistics, Melbourne, Australia, 700--705. DOI:https://doi.org/10.18653/v1/P18-2111Google ScholarGoogle ScholarCross RefCross Ref
  13. Jyun-Yu Jiang, Yen-Yu Ke, Pao-Yu Chien, and Pu-Jen Cheng. 2014. Learning user reformulation behavior for query auto-completion. In Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’14). ACM, New York, NY, 445--454. DOI:https://doi.org/10.1145/2600428.2609614Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Eric R. Kandel, Yadin Dudai, and Mark R. Mayford. 2014. The molecular and systems biology of memory. Cell 157, 1 (2014), 163--186. DOI:https://doi.org/10.1016/j.cell.2014.03.001Google ScholarGoogle ScholarCross RefCross Ref
  15. Hyun-Kyu Kang and Key-Sun Choi. 1997. Two-level document ranking using mutual information in natural language information retrieval. Information Processing 8 Management 33, 3 (1997), 289--306.Google ScholarGoogle Scholar
  16. Gyuwan Kim. 2019. Subword language model for query auto-completion. In Proceedings of 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP’19).Google ScholarGoogle ScholarCross RefCross Ref
  17. Yoon Kim. 2014. Convolutional neural networks for sentence classification. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP’14). Association for Computational Linguistics, Doha, Qatar, 1746--1751. DOI:https://doi.org/10.3115/v1/D14-1181Google ScholarGoogle ScholarCross RefCross Ref
  18. Yann LeCun and Yoshua Bengio. 1998. The handbook of brain theory and neural networks. MIT Press, Cambridge, MA, Chapter Convolutional Networks for Images, Speech, and Time Series, 255--258.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Yann LeCun, Yoshua Bengio, and Geoffrey Hinton. 2015. Deep learning. Nature 521, 7553 (2015), 436.Google ScholarGoogle Scholar
  20. Liangda Li, Hongbo Deng, Anlei Dong, Yi Chang, Hongyuan Zha, and Ricardo Baeza-Yates. 2015. Analyzing user’s sequential behavior in query auto-completion via markov processes. In Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’15). ACM, New York, NY, 123--132. DOI:https://doi.org/10.1145/2766462.2767723Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Tie-Yan Liu. 2009. Learning to rank for information retrieval. Foundations and Trends in Information Retrieval 3, 3 (March 2009), 225--331.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013).Google ScholarGoogle Scholar
  23. Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Distributed representations of words and phrases and their compositionality. In Proceedings of the 26th International Conference on Neural Information Processing Systems—Volume 2 (NIPS’13). Curran Associates Inc., 3111--3119.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Dae Hoon Park and Rikio Chiba. 2017. A neural language model for query auto-completion. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’17). ACM, New York, NY, 1189--1192. DOI:https://doi.org/10.1145/3077136.3080758Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Yang Song, Dengyong Zhou, and Li-wei He. 2011. Post-ranking query suggestion by diversifying search results. In Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’11). ACM, New York, NY, 815--824. DOI:https://doi.org/10.1145/2009916.2010025Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Saedeh Tahery and Saeed Farzi. 2020. Customized query auto-completion and suggestion—A review. Information Systems 87 (2020), 101415. DOI:https://doi.org/10.1016/j.is.2019.101415Google ScholarGoogle ScholarCross RefCross Ref
  27. Po-Wei Wang, Huan Zhang, Vijai Mohan, Inderjit S. Dhillon, and J. Zico Kolter. 2018. Realtime query completion via deep language models. In the SIGIR 2018 Workshop on eCommerce co-located with the 41st International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018), Ann Arbor, Michigan, July 12, 2018. http://ceur-ws.org/Vol-2319/paper24.pdf.Google ScholarGoogle Scholar
  28. Stewart Whiting, Andrew James McMinn, and Joemon M. Jose. 2013. Exploring real-time temporal query auto-completion. In DIR (DIR’13). 12--15.Google ScholarGoogle Scholar
  29. Aston Zhang, Amit Goyal, Weize Kong, Hongbo Deng, Anlei Dong, Yi Chang, Carl A. Gunter, and Jiawei Han. 2015. adaQAC: Adaptive query auto-completion via implicit negative feedback. In Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’15). ACM, New York, NY, 143--152. DOI:https://doi.org/10.1145/2766462.2767697Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Xiang Zhang, Junbo Zhao, and Yann LeCun. 2015. Character-level convolutional networks for text classification. In Advances in Neural Information Processing Systems (NIPS’15). 649--657.Google ScholarGoogle Scholar
  31. Zhaohui Zheng, Keke Chen, Gordon Sun, and Hongyuan Zha. 2007. A regression framework for learning ranking functions using relative relevance judgments. In Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’07). ACM, New York, NY, 287--294. DOI:https://doi.org/10.1145/1277741.1277792Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Personalized Query Auto-Completion for Large-Scale POI Search at Baidu Maps

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM Transactions on Asian and Low-Resource Language Information Processing
        ACM Transactions on Asian and Low-Resource Language Information Processing  Volume 19, Issue 5
        September 2020
        278 pages
        ISSN:2375-4699
        EISSN:2375-4702
        DOI:10.1145/3403646
        Issue’s Table of Contents

        Copyright © 2020 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 18 June 2020
        • Online AM: 7 May 2020
        • Accepted: 1 April 2020
        • Revised: 1 December 2019
        • Received: 1 October 2019
        Published in tallip Volume 19, Issue 5

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format .

      View HTML Format