Elsevier

Neurocomputing

Volume 402, 18 August 2020, Pages 256-265
Neurocomputing

Joint Deep Recommendation Model Exploiting Reviews and Metadata Information

https://doi.org/10.1016/j.neucom.2020.03.075Get rights and content

Abstract

User-generated product reviews contain a lot of valuable information including users’ opinions on products and product features that is not fully exploited by the current recommendation models. Similarly, the metadata information related to the products, about the reviews and about the users who have written the reviews has rarely been exploited for recommender systems. These heterogeneous information sources have the potential to alleviate the cold start and sparsity problems and improve the quality of recommendations. In this paper, we present a joint deep recommendation model (JDRM) that consists of two parallel neural networks, learning lower-order feature interactions of users and items separately and higher-order feature interactions jointly using a shared last layer. Each of the networks is further composed of two sub-networks. One of the sub-networks focus on exploiting product reviews (of user/item) and the other sub-network learns user preferences/items properties leveraging metadata information along with the ratings. The learned latent features in each network are concatenated, thus producing the user and item latent feature vectors. We combine the two networks by introducing a shared layer on the top, which is a dense fully connected layer used to learn higher level latent features obtained from the two networks and produces final ratings. Extensive experiments on real-world datasets demonstrate that JDRM significantly outperforms state of the art recommendation models.

Introduction

During the last decade, the number of products, the variety and the services provided by the companies have increased exponentially. Companies produce a large number of products along with enormous information to address customers’ needs. Although this provides more options to customers, but due to increasing information load it makes rigorous for them to render large amount of information provided in order to acquire their desired product needs. Recommender systems help customers to cope this problem of information overload by recommending the products that are probably of their interests based on their needs, preferences and past buying behavior [1]. Now a days, various applications like Amazon, Netflix, and Twitter etc. use recommender systems to help people in their daily life.

In the current era of smart devices and web 2.0, users write textual reviews on many e-commerce systems to express their personal opinions towards the purchased items. They also provide rating score for the products along with the reviews indicating their preferences. The e-commerce systems, like Yelp and Amazon, also maintain the metadata information about the users, products and about reviews written by the users. The textual reviews and metadata information may contain the information related to properties of items, users’ behavior and also the preference of each individual user. Before the exploitation of deep learning techniques in this very area, collaborative filtering (CF) with matrix factorization (MF) [2], [3] was considered to be state-of-the-art technique. This technique is built on the basic idea that people having similar choices in the past are likely to have similar interests in the future [39]. Since, dot product is used in MF which has strong constraint that the latent dimensions are independent of each other i.e. each dimension in latent user vector would only interact with the corresponding dimension in the latent item vector. Thus, the MF based CF technique is incapable of learning complex user-item rating behaviors through higher order feature interactions. Although this technique had shown good performance for recommender systems, however lacking in addressing data sparsity and cold start problems as it uses only explicit ratings given with the reviews. One of the techniques used to cope with the issue of lack of data is exploiting the information from the text in reviews. Recently, some researches [4], [5] have demonstrated that the performance of recommender systems can be improved using review text, particularly in case of cold start problems. Some other works [6], [7] utilized both purchased item review data and user-item interactions to improve recommendations, but jointly using product reviews and metadata has not yet been exploited.

In this paper, we propose a Convolutional Neural Network based model, called Joint Deep Recommendation Model (JDRM), to model user preferences and item properties by jointly using the text from reviews and the associated metadata for rating prediction problems. JDRM learns the latent feature vectors for users and items jointly from the two information sources by using two parallel neural networks such that the performance of the proposed model is maximized. One of the networks exploits the reviews, and the other network uses metadata to model user preferences and item properties. This architecture is inline with [8] where different features exploited jointly by a deep neural network in a query based interactive recommendation system. The learned latent features from product reviews and metadata are then feed (after merging) to the fully connected layer introduced on top of the two networks. The most influential features (higher-order) form the latent (specialized) features are captured in this layer, which are later passed through a regression layer to estimate the rating. Our work is novel in the sense that the proposed model, to the best of our knowledge, is the first one to capture both user preferences and item properties from different information sources in a unified neural way, and we are exploring for the first time the metadata information (which is now commonly available with many e-commerce systems like Yelp and Amazon) along with the review text. A substantial advantage of JDRM is that it performs modeling in an integration manner as compared to numerous other approaches [4], [5], [7] that benefit from reviews and contents of users/items. As most of the similar approaches in literature learn latent features from different information sources (product reviews, item descriptions, user-item ratings) independently, thus there are lesser chances of learning good features that can be useful for rating prediction. We believe this work may be impactful in two scenarios for research community and e-commerce systems: (i) Many different kind of metadata are now managed along with the reviews by different e-commerce systems like Yelp and Amazon. Our work validates the usefulness and impact of metadata when used together with the review text from product reviews, and its exploration urges the researcher community to explore further. (ii) The kind of metadata we used (from Yelp) will encourage other recommender / e-commerce systems to maintain similar or even richer information so that their systems can provide better recommendations after applying artificial intelligence.

The experimental results demonstrate that JDRM outperforms all the baseline methods in prediction accuracy on the evaluated real-world datasets of Yelp. Also, the proposed model exhibits good performance for cold start issues and alleviates the sparsity problem by leveraging product reviews and the associated metadata. In summary, the main contributions made in this paper are listed as follows:

  • We propose a novel joint deep learning model, named JDRM that models user preferences and item properties using product reviews and metadata in an integration manner. The fusion/joint layer on top of the two parallel networks connects them in a way that the most influential features from the two latent representations can be drawn for the final rating predictions. JDRM is the first attempt that exploits products reviews and metadata using neural networks for modeling user preferences and item properties.

  • We demonstrate the use of metadata information empirically. From the results we can see that, on all the datasets, the performance of the proposed model is much better than the performances of the two networks trained separately. It justifies the unified use of the two information sources in the proposed way.

  • The experimental results demonstrate that JDRM achieves superior performance in terms of prediction accuracy on publicly available benchmark datasets, and outperforms a variety of state-of-the-art baseline methods [9], [10] (see Section 4).

The rest of the paper is organized as follows. Section 2 provides a brief literature study related to this work. JDRM is described in detail in Section 3. Experiments are presented in section 4 to examine JDRM and validate its effectiveness in comparison with the baseline methods for recommender systems. Finally, conclusions are presented in Section 5.

Section snippets

Related Work

There are two directions of studies related to our work: the approaches that exploit review text for recommendations, and the techniques that involve content and collaborative filtering including metadata for recommender systems. A brief literature study of the two research areas is presented as follows.

The Proposed Model

In this section, we describe JDRM in detail, a novel Joint Deep Recommendation Model for ratings prediction by integrating information from both textual reviews and metadata of users, items and reviews. It learns latent feature vectors for users and items by leveraging reviews and metadata so that the obtained latent features can better predict ratings for the products. It is a neural network model comprising of two CNN-based parallel networks integrated through a top dense layer.

Experiment

We have conducted several experiments on two real-world datasets to validate the performance of JDRM in comparison with other state-of-the-art baseline alternatives. In Section 4.1, we present the datasets, experimental settings, baselines algorithms for comparisons, and the evaluation metric. Performance evaluation is discussed in Section 4.2. Some model analysis and impact of number of features are discussed in Section 4.3.

Conclusion

In this paper, we proposed a novel joint deep learning model (JDRM) which exploits the information exists in the reviews and metadata (of users, items and reviews) for rating predictions. JDRM is composed of two deep neural sub-networks that are fused together by a fully connected fusion layer to model user preferences and item properties from the product reviews and the associated metadata. It makes representational learning of user and item from different information sources mapped into a

CRediT authorship contribution statement

Zahid Younas Khan: Conceptualization, Methodology, Software, Formal analysis, Investigation, Data curation, Writing - original draft, Visualization. Zhendong Niu: Methodology, Resources, Supervision, Validation, Visualization, Writing - review & editing, Project administration. Abdallah Yousif: Data curation, Investigation, Methodology, Software.

Declaration of Competing Interest

The authors declare that they have no conflict of interest.

Acknowledgements

This work is supported by the National Key R&D Program of China (No. 2019YFB1406302), National Natural Science Foundation of China (No. 61370137 ), the National Basic Research Program of China (No. 2012CB7207002 ), the Ministry of Education - China Mobile Research Foundation Project No. 2016/2-7.

Zahid Younas Khan received his MSc degree and MPhil degree of Computer Science from Quaid-i-Azam University Islamabad in 2005 and 2015, respectively. He is currently pursuing Ph.D. in Computer Science at Beijing Institute of Technology (BIT). He works as a Lecturer at the department of Computer Science and Information Technology, University of Azad Jammu & Kashmir. His research interests include machine learning, data mining algorithms, recommender systems and deep learning techniques.

References (41)

  • A. Yousif et al.

    Multi-task learning model based on recurrent convolutional neural networks for citation sentiment and purpose classification

    Neurocomputing

    (2019)
  • K. Shi et al.

    Wide-grained capsule network with sentence-level feature to detect meteorological event in social network

    Future Gener. Comput. Syst.

    (2020)
  • J.K. Tarus et al.

    Knowledge-based recommendation: a review of ontology-based recommender systems for e-learning

    Artif. Intell. Rev.

    (2018)
  • Y. Koren et al.

    Matrix factorization techniques for recommender systems

    IEEE Computer

    (2009)
  • R. Salakhutdinov et al.

    Probabilistic matrix factorization

    Advances in Neural Information Processing Systems 20, Proceedings of the Twenty-First Annual Conference on Neural Information Processing Systems, Vancouver, British Columbia, Canada, December 3-6, 2007

    (2007)
  • J.J. McAuley et al.

    Hidden factors and hidden topics: understanding rating dimensions with review text

    Seventh ACM Conference on Recommender Systems, RecSys ’13, Hong Kong, China, October 12-16, 2013

    (2013)
  • G. Ling et al.

    Ratings meet reviews, a combined approach to recommend

    Eighth ACM Conference on Recommender Systems, RecSys ’14, Foster City, Silicon Valley, CA, USA - October 06 - 10, 2014

    (2014)
  • Y. Bao et al.

    Topicmf: Simultaneously exploiting ratings and reviews for recommendation

    Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, July 27 -31, 2014, Québec City, Québec, Canada.

    (2014)
  • Y. Tan et al.

    Rating-boosted latent topics: Understanding users and items with ratings and reviews

    Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, IJCAI 2016, New York, NY, USA, 9-15 July 2016

    (2016)
  • Y. Zhu et al.

    Query-based interactive recommendation by meta-path and adapted attention-gru

    Proceedings of the 28th ACM International Conference on Information and Knowledge Management, CIKM 2019, Beijing, China, November 3-7, 2019

    (2019)
  • S. Seo et al.

    Interpretable convolutional neural networks with dual local and global attention for review rating prediction

    Proceedings of the Eleventh ACM Conference on Recommender Systems, RecSys 2017, Como, Italy, August 27-31, 2017

    (2017)
  • Y. Lu et al.

    Coevolutionary recommendation model: Mutual learning between ratings and reviews

    Proceedings of the 2018 World Wide Web Conference on World Wide Web, WWW 2018, Lyon, France, April 23-27, 2018

    (2018)
  • C. Wang et al.

    Collaborative topic modeling for recommending scientific articles

    Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, CA, USA, August 21-24, 2011

    (2011)
  • N. Pappas et al.

    Sentiment analysis of user comments for one-class collaborative filtering over ted talks

    The 36th International ACM SIGIR conference on research and development in Information Retrieval, SIGIR ’13, Dublin, Ireland - July 28 - August 01, 2013

    (2013)
  • Y. Zhang et al.

    Explicit factor models for explainable recommendation based on phrase-level sentiment analysis

    The 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’14, Gold Coast, QLD, Australia - July 06 - 11, 2014

    (2014)
  • Q. Diao et al.

    Jointly modeling aspects, ratings and sentiments for movie recommendation (JMARS)

    The 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’14, New York, NY, USA - August 24 - 27, 2014

    (2014)
  • X. He et al.

    Trirank: Review-aware explainable recommendation by modeling aspects

    Proceedings of the 24th ACM International Conference on Information and Knowledge Management, CIKM 2015, Melbourne, VIC, Australia, October 19 - 23, 2015

    (2015)
  • Z. Cheng et al.

    Aspect-aware latent factor model: Rating prediction with ratings and reviews

    Proceedings of the 2018 World Wide Web Conference on World Wide Web, WWW 2018, Lyon, France, April 23-27, 2018

    (2018)
  • X. He et al.

    Neural collaborative filtering

    Proceedings of the 26th International Conference on World Wide Web, WWW 2017, Perth, Australia, April 3-7, 2017

    (2017)
  • X. He et al.

    Neural factorization machines for sparse predictive analytics

    Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, Shinjuku, Tokyo, Japan, August 7-11, 2017

    (2017)
  • Cited by (14)

    • CNNRec: Convolutional Neural Network based recommender systems - A survey

      2024, Engineering Applications of Artificial Intelligence
    • A deep selective learning network for cross-domain recommendation

      2022, Applied Soft Computing
      Citation Excerpt :

      Wang et al. [21] proposed a deep hybrid model to provide interpretable recommendation via fusing rating embeddings with textual features. Khan et al. [22] presented a parallel neural networks, which focused on exploiting review and metadata information along with rating to learning feature interactions of users and items. Parisa et al. [23] proposed a user–item review text interaction-based model, and they transformed the task of recommendation into a text matching problem.

    • Incorporating textual reviews in the learning of latent factors for recommender systems

      2022, Electronic Commerce Research and Applications
      Citation Excerpt :

      Users’ features were also used in learning latent factors by a Bayesian framework. In (Wang et al., 2019; Khan et al., 2020a), the text descriptions of items are used to enrich their latent factor representations. On e-commerce applications, after experiencing an item, a user often leaves a review in the form of text and a rating in the form of stars.

    View all citing articles on Scopus

    Zahid Younas Khan received his MSc degree and MPhil degree of Computer Science from Quaid-i-Azam University Islamabad in 2005 and 2015, respectively. He is currently pursuing Ph.D. in Computer Science at Beijing Institute of Technology (BIT). He works as a Lecturer at the department of Computer Science and Information Technology, University of Azad Jammu & Kashmir. His research interests include machine learning, data mining algorithms, recommender systems and deep learning techniques.

    Zhendong Niu received his Ph.D degree in Computer Science from Beijing Institute of Technology in 1995. He is currently a Professor of the School of Computer Science and Technology and Dean of Library, Beijing Institute of Technology, China. He was a postdoctoral researcher at the University of Pittsburgh from 1996 to 1998, and a researcher/adjunct faculty member at Carnegie Mellon University from 1999 to 2004, and a joint research Professor in the Information School at University of Pittsburgh from 2006. His research areas include digital libraries, e-learning techniques, information retrieval and recommender systems. He serves as an editorial board member for international journal of learning technology. He has published more than 100 papers in journals and international conferences in his field.

    Abdallah Yousif received his BSc with Computer Science and information technology in 2007 from Al-Neelain University and his MSc degree in Computer Sciences from University of Gezira, Sudan in 2010. He received his Ph.D degree at School of Computer Science and Technology, Beijing Institute of Technology, China in 2018. He also works in Wad Medani Technical Collage, Sudan Technological University as a lecturer. His research interests include sentiment analysis, machine learning, citation analysis, and data mining.

    View full text