Joint Deep Recommendation Model Exploiting Reviews and Metadata Information

doi:10.1016/j.neucom.2020.03.075

Neurocomputing

Volume 402, 18 August 2020, Pages 256-265

https://doi.org/10.1016/j.neucom.2020.03.075 Get rights and content

Abstract

User-generated product reviews contain a lot of valuable information including users’ opinions on products and product features that is not fully exploited by the current recommendation models. Similarly, the metadata information related to the products, about the reviews and about the users who have written the reviews has rarely been exploited for recommender systems. These heterogeneous information sources have the potential to alleviate the cold start and sparsity problems and improve the quality of recommendations. In this paper, we present a joint deep recommendation model (JDRM) that consists of two parallel neural networks, learning lower-order feature interactions of users and items separately and higher-order feature interactions jointly using a shared last layer. Each of the networks is further composed of two sub-networks. One of the sub-networks focus on exploiting product reviews (of user/item) and the other sub-network learns user preferences/items properties leveraging metadata information along with the ratings. The learned latent features in each network are concatenated, thus producing the user and item latent feature vectors. We combine the two networks by introducing a shared layer on the top, which is a dense fully connected layer used to learn higher level latent features obtained from the two networks and produces final ratings. Extensive experiments on real-world datasets demonstrate that JDRM significantly outperforms state of the art recommendation models.

Introduction

During the last decade, the number of products, the variety and the services provided by the companies have increased exponentially. Companies produce a large number of products along with enormous information to address customers’ needs. Although this provides more options to customers, but due to increasing information load it makes rigorous for them to render large amount of information provided in order to acquire their desired product needs. Recommender systems help customers to cope this problem of information overload by recommending the products that are probably of their interests based on their needs, preferences and past buying behavior [1]. Now a days, various applications like Amazon, Netflix, and Twitter etc. use recommender systems to help people in their daily life.

In the current era of smart devices and web 2.0, users write textual reviews on many e-commerce systems to express their personal opinions towards the purchased items. They also provide rating score for the products along with the reviews indicating their preferences. The e-commerce systems, like Yelp and Amazon, also maintain the metadata information about the users, products and about reviews written by the users. The textual reviews and metadata information may contain the information related to properties of items, users’ behavior and also the preference of each individual user. Before the exploitation of deep learning techniques in this very area, collaborative filtering (CF) with matrix factorization (MF) [2], [3] was considered to be state-of-the-art technique. This technique is built on the basic idea that people having similar choices in the past are likely to have similar interests in the future [39]. Since, dot product is used in MF which has strong constraint that the latent dimensions are independent of each other i.e. each dimension in latent user vector would only interact with the corresponding dimension in the latent item vector. Thus, the MF based CF technique is incapable of learning complex user-item rating behaviors through higher order feature interactions. Although this technique had shown good performance for recommender systems, however lacking in addressing data sparsity and cold start problems as it uses only explicit ratings given with the reviews. One of the techniques used to cope with the issue of lack of data is exploiting the information from the text in reviews. Recently, some researches [4], [5] have demonstrated that the performance of recommender systems can be improved using review text, particularly in case of cold start problems. Some other works [6], [7] utilized both purchased item review data and user-item interactions to improve recommendations, but jointly using product reviews and metadata has not yet been exploited.

In this paper, we propose a Convolutional Neural Network based model, called Joint Deep Recommendation Model (JDRM), to model user preferences and item properties by jointly using the text from reviews and the associated metadata for rating prediction problems. JDRM learns the latent feature vectors for users and items jointly from the two information sources by using two parallel neural networks such that the performance of the proposed model is maximized. One of the networks exploits the reviews, and the other network uses metadata to model user preferences and item properties. This architecture is inline with [8] where different features exploited jointly by a deep neural network in a query based interactive recommendation system. The learned latent features from product reviews and metadata are then feed (after merging) to the fully connected layer introduced on top of the two networks. The most influential features (higher-order) form the latent (specialized) features are captured in this layer, which are later passed through a regression layer to estimate the rating. Our work is novel in the sense that the proposed model, to the best of our knowledge, is the first one to capture both user preferences and item properties from different information sources in a unified neural way, and we are exploring for the first time the metadata information (which is now commonly available with many e-commerce systems like Yelp and Amazon) along with the review text. A substantial advantage of JDRM is that it performs modeling in an integration manner as compared to numerous other approaches [4], [5], [7] that benefit from reviews and contents of users/items. As most of the similar approaches in literature learn latent features from different information sources (product reviews, item descriptions, user-item ratings) independently, thus there are lesser chances of learning good features that can be useful for rating prediction. We believe this work may be impactful in two scenarios for research community and e-commerce systems: (i) Many different kind of metadata are now managed along with the reviews by different e-commerce systems like Yelp and Amazon. Our work validates the usefulness and impact of metadata when used together with the review text from product reviews, and its exploration urges the researcher community to explore further. (ii) The kind of metadata we used (from Yelp) will encourage other recommender / e-commerce systems to maintain similar or even richer information so that their systems can provide better recommendations after applying artificial intelligence.

The experimental results demonstrate that JDRM outperforms all the baseline methods in prediction accuracy on the evaluated real-world datasets of Yelp. Also, the proposed model exhibits good performance for cold start issues and alleviates the sparsity problem by leveraging product reviews and the associated metadata. In summary, the main contributions made in this paper are listed as follows:

•
We propose a novel joint deep learning model, named JDRM that models user preferences and item properties using product reviews and metadata in an integration manner. The fusion/joint layer on top of the two parallel networks connects them in a way that the most influential features from the two latent representations can be drawn for the final rating predictions. JDRM is the first attempt that exploits products reviews and metadata using neural networks for modeling user preferences and item properties.
•
We demonstrate the use of metadata information empirically. From the results we can see that, on all the datasets, the performance of the proposed model is much better than the performances of the two networks trained separately. It justifies the unified use of the two information sources in the proposed way.
•
The experimental results demonstrate that JDRM achieves superior performance in terms of prediction accuracy on publicly available benchmark datasets, and outperforms a variety of state-of-the-art baseline methods [9], [10] (see Section 4).

The rest of the paper is organized as follows. Section 2 provides a brief literature study related to this work. JDRM is described in detail in Section 3. Experiments are presented in section 4 to examine JDRM and validate its effectiveness in comparison with the baseline methods for recommender systems. Finally, conclusions are presented in Section 5.

Section snippets

Related Work

There are two directions of studies related to our work: the approaches that exploit review text for recommendations, and the techniques that involve content and collaborative filtering including metadata for recommender systems. A brief literature study of the two research areas is presented as follows.

The Proposed Model

In this section, we describe JDRM in detail, a novel Joint Deep Recommendation Model for ratings prediction by integrating information from both textual reviews and metadata of users, items and reviews. It learns latent feature vectors for users and items by leveraging reviews and metadata so that the obtained latent features can better predict ratings for the products. It is a neural network model comprising of two CNN-based parallel networks integrated through a top dense layer.

Experiment

We have conducted several experiments on two real-world datasets to validate the performance of JDRM in comparison with other state-of-the-art baseline alternatives. In Section 4.1, we present the datasets, experimental settings, baselines algorithms for comparisons, and the evaluation metric. Performance evaluation is discussed in Section 4.2. Some model analysis and impact of number of features are discussed in Section 4.3.

Conclusion

In this paper, we proposed a novel joint deep learning model (JDRM) which exploits the information exists in the reviews and metadata (of users, items and reviews) for rating predictions. JDRM is composed of two deep neural sub-networks that are fused together by a fully connected fusion layer to model user preferences and item properties from the product reviews and the associated metadata. It makes representational learning of user and item from different information sources mapped into a

CRediT authorship contribution statement

Zahid Younas Khan: Conceptualization, Methodology, Software, Formal analysis, Investigation, Data curation, Writing - original draft, Visualization. Zhendong Niu: Methodology, Resources, Supervision, Validation, Visualization, Writing - review & editing, Project administration. Abdallah Yousif: Data curation, Investigation, Methodology, Software.

Declaration of Competing Interest

The authors declare that they have no conflict of interest.

Acknowledgements

This work is supported by the National Key R&D Program of China (No. 2019YFB1406302), National Natural Science Foundation of China (No. 61370137 ), the National Basic Research Program of China (No. 2012CB7207002 ), the Ministry of Education - China Mobile Research Foundation Project No. 2016/2-7.

Zahid Younas Khan received his MSc degree and MPhil degree of Computer Science from Quaid-i-Azam University Islamabad in 2005 and 2015, respectively. He is currently pursuing Ph.D. in Computer Science at Beijing Institute of Technology (BIT). He works as a Lecturer at the department of Computer Science and Information Technology, University of Azad Jammu & Kashmir. His research interests include machine learning, data mining algorithms, recommender systems and deep learning techniques.

References (41)

A. Yousif et al.
Multi-task learning model based on recurrent convolutional neural networks for citation sentiment and purpose classification
Neurocomputing
(2019)
K. Shi et al.
Wide-grained capsule network with sentence-level feature to detect meteorological event in social network
Future Gener. Comput. Syst.
(2020)
J.K. Tarus et al.
Knowledge-based recommendation: a review of ontology-based recommender systems for e-learning
Artif. Intell. Rev.
(2018)
Y. Koren et al.
Matrix factorization techniques for recommender systems
IEEE Computer
(2009)
R. Salakhutdinov et al.
Probabilistic matrix factorization
Advances in Neural Information Processing Systems 20, Proceedings of the Twenty-First Annual Conference on Neural Information Processing Systems, Vancouver, British Columbia, Canada, December 3-6, 2007
(2007)
J.J. McAuley et al.
Hidden factors and hidden topics: understanding rating dimensions with review text
Seventh ACM Conference on Recommender Systems, RecSys ’13, Hong Kong, China, October 12-16, 2013
(2013)
G. Ling et al.
Ratings meet reviews, a combined approach to recommend
Eighth ACM Conference on Recommender Systems, RecSys ’14, Foster City, Silicon Valley, CA, USA - October 06 - 10, 2014
(2014)
Y. Bao et al.
Topicmf: Simultaneously exploiting ratings and reviews for recommendation
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, July 27 -31, 2014, Québec City, Québec, Canada.
(2014)
Y. Tan et al.
Rating-boosted latent topics: Understanding users and items with ratings and reviews
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, IJCAI 2016, New York, NY, USA, 9-15 July 2016
(2016)
Y. Zhu et al.
Query-based interactive recommendation by meta-path and adapted attention-gru
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, CIKM 2019, Beijing, China, November 3-7, 2019
(2019)

S. Seo et al.

Interpretable convolutional neural networks with dual local and global attention for review rating prediction

Proceedings of the Eleventh ACM Conference on Recommender Systems, RecSys 2017, Como, Italy, August 27-31, 2017

(2017)

Y. Lu et al.

Coevolutionary recommendation model: Mutual learning between ratings and reviews

Proceedings of the 2018 World Wide Web Conference on World Wide Web, WWW 2018, Lyon, France, April 23-27, 2018

(2018)

C. Wang et al.

Collaborative topic modeling for recommending scientific articles

Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, CA, USA, August 21-24, 2011

(2011)

N. Pappas et al.

Sentiment analysis of user comments for one-class collaborative filtering over ted talks

The 36th International ACM SIGIR conference on research and development in Information Retrieval, SIGIR ’13, Dublin, Ireland - July 28 - August 01, 2013

(2013)

Y. Zhang et al.

Explicit factor models for explainable recommendation based on phrase-level sentiment analysis

The 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’14, Gold Coast, QLD, Australia - July 06 - 11, 2014

(2014)

Q. Diao et al.

Jointly modeling aspects, ratings and sentiments for movie recommendation (JMARS)

The 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’14, New York, NY, USA - August 24 - 27, 2014

(2014)

X. He et al.

Trirank: Review-aware explainable recommendation by modeling aspects

Proceedings of the 24th ACM International Conference on Information and Knowledge Management, CIKM 2015, Melbourne, VIC, Australia, October 19 - 23, 2015

(2015)

Z. Cheng et al.

Aspect-aware latent factor model: Rating prediction with ratings and reviews

Proceedings of the 2018 World Wide Web Conference on World Wide Web, WWW 2018, Lyon, France, April 23-27, 2018

(2018)

X. He et al.

Neural collaborative filtering

Proceedings of the 26th International Conference on World Wide Web, WWW 2017, Perth, Australia, April 3-7, 2017

(2017)

X. He et al.

Neural factorization machines for sparse predictive analytics

Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, Shinjuku, Tokyo, Japan, August 7-11, 2017

(2017)

Cited by (14)

CNNRec: Convolutional Neural Network based recommender systems - A survey
2024, Engineering Applications of Artificial Intelligence
Easy internet access and technological advancements have resulted in information overload and a plethora of options, making decision-making extremely difficult. Recommender System (RS) is a potential solution for assisting users in making decisions by recommending or predicting product ratings. Three fundamental forms of RS that use implicit or explicit feedback for recommendation are collaborative, content-based, and hybrid filtering. Ratings are the most common form of feedback, but product descriptions, reviews, images, audios, and videos are also important and can help improve the performance of the traditional RS. These additional variables can have a significant impact on RS’s performance. Traditional RSs used approaches based on the nearest neighbor or other machine learning models, but thanks to recent advances in artificial intelligence and deep learning, RSs are now being developed using Convolutional Neural Networks (CNN), which can efficiently exploit auxiliary information. In addition to comparing CNN-based RSs on common grounds, this article provides a full examination of CNN-based RSs and how they might use various types of auxiliary information. The study also discusses data characteristics, data statistics, and auxiliary information in a variety of publicly available datasets. Different evaluation measures for RSs are also discussed, and readers are provided with interesting challenges and open research issues.
A deep selective learning network for cross-domain recommendation
2022, Applied Soft Computing
Citation Excerpt :
Wang et al. [21] proposed a deep hybrid model to provide interpretable recommendation via fusing rating embeddings with textual features. Khan et al. [22] presented a parallel neural networks, which focused on exploiting review and metadata information along with rating to learning feature interactions of users and items. Parisa et al. [23] proposed a user–item review text interaction-based model, and they transformed the task of recommendation into a text matching problem.
In the past two decades, recommendation system has been successfully applied to many e-commerce companies and is a ubiquitous part of today online entertainment. However, many single-domain recommendations suffer from the sparsity problems due to a lack of sufficient interactive data. In fact, user behaviors from different domains are usually relevant. Therefore, cross-domain ideas have been proposed to help alleviate the data sparsity issue in traditional single-domain recommender systems. Motivated by this, we design a deep selective learning network (DSLN) in this paper, for the scenario when domains have minimum or no common users DSLN firstly exploits reviews to profile the preference of users and characteristic of items. Then it selects useful user or item information from the auxiliary domain and transfers it to the target domain to solve the negative transfer problem, even though there may be no overlapping users or items between these two domains. In DSLN model, the selection of useful information is realized by the de-noising auto-encoder (DAE), which is shared between the auxiliary and target domains. By minimizing the reconstruction error of the DAE, on the one hand, only the useful information can be selected from the auxiliary domain; on the other hand, the latent representation of users and items in two domains can be learned. Our experiments on three cross-domain scenarios with different sparsity of Amazon review dataset show that, our proposed model gains 0.58% to 18.16% relative improvement compared to single-domain recommendation models, and from 1.05% to 19.4% relative improvement compared to cross-domain recommendation models.
Incorporating textual reviews in the learning of latent factors for recommender systems
2022, Electronic Commerce Research and Applications
Citation Excerpt :
Users’ features were also used in learning latent factors by a Bayesian framework. In (Wang et al., 2019; Khan et al., 2020a), the text descriptions of items are used to enrich their latent factor representations. On e-commerce applications, after experiencing an item, a user often leaves a review in the form of text and a rating in the form of stars.
In the field of recommender systems, the latent factor model is one of the state-of-the-art ones thanks to its strengths in accuracy and scalability. Its core is to learn latent factors for the representation of users and items using rating data collected through surveys after the users have experienced the items. However, on e-commerce applications, besides ratings, users can write reviews for items. A review generally indicates a user’s experience with an item while a rating indicates his/her level of satisfaction with such an experience. Latent factors can be learned more accurately if supported by such reviews. This study is distinctive in interpreting a review as both a description of the user/item and a description of the surrounding elements affecting the user's experience with the item. It has proven to be more effective than those that only consider a review as a description of the user/item. Especially, the analysis of the experimental results shows that our model provides supportive recommendations for users with detailed reviews in spite of their few collected ratings.
DHMFRD – TER: a deep hybrid model for fake review detection incorporating review texts, emotions, and ratings
2024, Multimedia Tools and Applications
Deep learning-based collaborative filtering recommender systems: a comprehensive and systematic review
2023, Neural Computing and Applications
Deep Learning-Based Recommendation System: Systematic Review and Classification
2023, IEEE Access

View all citing articles on Scopus

Zhendong Niu received his Ph.D degree in Computer Science from Beijing Institute of Technology in 1995. He is currently a Professor of the School of Computer Science and Technology and Dean of Library, Beijing Institute of Technology, China. He was a postdoctoral researcher at the University of Pittsburgh from 1996 to 1998, and a researcher/adjunct faculty member at Carnegie Mellon University from 1999 to 2004, and a joint research Professor in the Information School at University of Pittsburgh from 2006. His research areas include digital libraries, e-learning techniques, information retrieval and recommender systems. He serves as an editorial board member for international journal of learning technology. He has published more than 100 papers in journals and international conferences in his field.

Abdallah Yousif received his BSc with Computer Science and information technology in 2007 from Al-Neelain University and his MSc degree in Computer Sciences from University of Gezira, Sudan in 2010. He received his Ph.D degree at School of Computer Science and Technology, Beijing Institute of Technology, China in 2018. He also works in Wad Medani Technical Collage, Sudan Technological University as a lecturer. His research interests include sentiment analysis, machine learning, citation analysis, and data mining.

View full text

Joint Deep Recommendation Model Exploiting Reviews and Metadata Information

Abstract

Introduction

Section snippets

Related Work

The Proposed Model

Experiment

Conclusion

CRediT authorship contribution statement

Declaration of Competing Interest

Acknowledgements

Neurocomputing

Future Gener. Comput. Syst.

Knowledge-based recommendation: a review of ontology-based recommender systems for e-learning

Artif. Intell. Rev.

Matrix factorization techniques for recommender systems

IEEE Computer

Probabilistic matrix factorization

Advances in Neural Information Processing Systems 20, Proceedings of the Twenty-First Annual Conference on Neural Information Processing Systems, Vancouver, British Columbia, Canada, December 3-6, 2007

Hidden factors and hidden topics: understanding rating dimensions with review text

Seventh ACM Conference on Recommender Systems, RecSys ’13, Hong Kong, China, October 12-16, 2013

Ratings meet reviews, a combined approach to recommend

Eighth ACM Conference on Recommender Systems, RecSys ’14, Foster City, Silicon Valley, CA, USA - October 06 - 10, 2014

Topicmf: Simultaneously exploiting ratings and reviews for recommendation

Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, July 27 -31, 2014, Québec City, Québec, Canada.

Rating-boosted latent topics: Understanding users and items with ratings and reviews

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, IJCAI 2016, New York, NY, USA, 9-15 July 2016

Query-based interactive recommendation by meta-path and adapted attention-gru

Proceedings of the 28th ACM International Conference on Information and Knowledge Management, CIKM 2019, Beijing, China, November 3-7, 2019

Interpretable convolutional neural networks with dual local and global attention for review rating prediction

Proceedings of the Eleventh ACM Conference on Recommender Systems, RecSys 2017, Como, Italy, August 27-31, 2017

Coevolutionary recommendation model: Mutual learning between ratings and reviews

Proceedings of the 2018 World Wide Web Conference on World Wide Web, WWW 2018, Lyon, France, April 23-27, 2018

Collaborative topic modeling for recommending scientific articles

Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, CA, USA, August 21-24, 2011

Sentiment analysis of user comments for one-class collaborative filtering over ted talks

The 36th International ACM SIGIR conference on research and development in Information Retrieval, SIGIR ’13, Dublin, Ireland - July 28 - August 01, 2013

Explicit factor models for explainable recommendation based on phrase-level sentiment analysis

The 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’14, Gold Coast, QLD, Australia - July 06 - 11, 2014

Jointly modeling aspects, ratings and sentiments for movie recommendation (JMARS)

The 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’14, New York, NY, USA - August 24 - 27, 2014

Trirank: Review-aware explainable recommendation by modeling aspects

Proceedings of the 24th ACM International Conference on Information and Knowledge Management, CIKM 2015, Melbourne, VIC, Australia, October 19 - 23, 2015

Aspect-aware latent factor model: Rating prediction with ratings and reviews

Proceedings of the 2018 World Wide Web Conference on World Wide Web, WWW 2018, Lyon, France, April 23-27, 2018

Neural collaborative filtering

Proceedings of the 26th International Conference on World Wide Web, WWW 2017, Perth, Australia, April 3-7, 2017

Neural factorization machines for sparse predictive analytics

Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, Shinjuku, Tokyo, Japan, August 7-11, 2017