Skip to main content
Log in

Predicting future personal life events on twitter via recurrent neural networks

  • Published:
Journal of Intelligent Information Systems Aims and scope Submit manuscript

Abstract

Social network users publicly share a wide variety of information with their followers and the general public ranging from their opinions, sentiments and personal life activities. There has already been significant advance in analyzing the shared information from both micro (individual user) and macro (community level) perspectives, giving access to actionable insight about user and community behaviors. The identification of personal life events from user’s profiles is a challenging yet important task, which if done appropriately, would facilitate more accurate identification of users’ preferences, interests and attitudes. For instance, a user who has just broken his phone, is likely to be upset and also be looking to purchase a new phone. While there is work that identifies tweets that include mentions of personal life events, our work in this paper goes beyond the state of the art by predicting a future personal life event that a user will be posting about on Twitter solely based on the past tweets. We propose two architectures based on recurrent neural networks, namely the classification and generation architectures, that determine the future personal life event of a user. We evaluate our work based on a gold standard Twitter life event dataset and compare our work with the state of the art baseline technique for life event detection. While presenting performance measures, we also discuss the limitations of our work in this paper.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Notes

  1. Available upon request

  2. https://www.sharcnet.ca/

  3. https://goo.gl/ohBcD8

References

  • Bastien, F., Lamblin, P., Pascanu, R., Bergstra, J., Goodfellow, I., Bergeron, A., Bouchard, N., Warde-Farley, D., Bengio, Y. (2012). Theano: new features and speed improvements. In Deep learning and unsupervised feature learning NIPS 2012 Workshop.

  • Bengio, Y., Simard, P., Frasconi, P. (1994). Learning long-term dependencies with gradient descent is difficult. IEEE Transactions on Neural Networks, 5(2), 157–166.

    Article  Google Scholar 

  • Bollen, J., Mao, H., Zeng, X. (2011). Twitter mood predicts the stock market. Journal of Computational Science, 2(1), 1–8.

    Article  Google Scholar 

  • Breiman, L. (1996). Bagging predictors. Machine Learning, 24, 123–140.

    MATH  Google Scholar 

  • Byrd, K., Mansurov, A., Baysal, O. (2016). Mining twitter data for influenza detection and surveillance. In Proceedings of the international workshop on software engineering in healthcare systems, SEHS ’16, (New York, NY, USA), (pp. 43–49). ACM.

  • Cameron, M.P., Barrett, P., Stewardson, B. (2016). Can social media predict election results? evidence from new zealand. Journal of Political Marketing, 15(4), 416–432.

    Article  Google Scholar 

  • Cavalin, P., Gattide Bayser, M., Pinhanez, C. (2014). Towards personalized offers by means of life event detection on social media and entity matching. In HT (Doctoral Consortium/Late-breaking Results/Workshops) (vol. 1210, 01).

  • Cavalin, P.R., Moyano, L.G., Miranda, P.P. (2015). A multiple classifier system for classifying life events on social media. In 2015 IEEE international conference on Data mining workshop (ICDMW) (pp. 1332–1335). IEEE,.

  • Cho, K., van Merrienboer, B., Bahdanau, D., Bengio, Y. (2014). On the properties of neural machine translation: Encoder-decoder approaches. In Eighth workshop on syntax semantics and structure in statistical translation (SSST-8).

  • Cho, K., van Merriënboer, B., Gülçehre, Ç., Bahdanau, D., Bougares, F., Schwenk, H., Bengio, Y. (2014). Learning phrase representations using rnn encoder–decoder for statistical machine translation. In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), (Doha, Qatar) (pp. 1724–1734). Association for Computational Linguistics.

  • Cho, K., Courville, A., Bengio, Y. (2015). Describing multimedia content using attention-based encoder-decoder networks. IEEE Transactions on Multimedia, 17(11), 1875–1886.

    Article  Google Scholar 

  • Choudhery, D., & Leung, C.K. (2017). Social media mining: prediction of box office revenue. In Proceedings of the 21st international database engineering & applications symposium, IDEAS 2017, (New York, NY, USA) (pp. 20–29). ACM.

  • Choudhury, S., & Alani, H. (2014a). Detecting presence of personal events in twitter streams. In International conference on social informatics, (pp. 157–166), Springer, Springer International Publishing.

  • Choudhury, S., & Alani, H. (2014b). Personal life event detection from social media. In Doctoral consortium and workshop proceedings of the 25th ACM hypertext and social media conference, (vol. 1210, 01).

  • Coppersmith, G., Dredze, M., Harman, C., Hollingshead, K. (2015). From adhd to sad: Analyzing the language of mental health on twitter through self-reported diagnoses. In CLPsych@ HLT-NAACL (pp. 1–10).

  • DeChoudhury, M., Gamon, M., Counts, S., Horvitz, E. (2013). Predicting depression via social media. ICWSM, 13, 1–10.

    Google Scholar 

  • Dhingra, B., Zhou, Z., Fitzpatrick, D., Muehl, M., Cohen, W.W. (2016). Tweet2vec: character-based distributed representations for social media. In Proceedings of the 54th annual meeting of the association for computational linguistics (vol.2, pp. 269–274). 05.

  • Dickinson, T., Fernandez, M., Thomas, L.A., Mulholland, P., Briggs, P., Alani, H. (2015). Identifying prominent life events on twitter. In Proceedings of the 8th international conference on knowledge capture, k-CAP 2015, (New York, NY, USA), (pp. 4:1–4:8). ACM.

  • DiEugenio, B., Green, N., Subba, R. (2013). Detecting life events in feeds from twitter. In 2013 IEEE Seventh international conference on semantic computing (ICSC), (pp. 274–277). IEEE.

  • Eichstaedt, J.C., Schwartz, H.A., Kern, M.L., Park, G., Labarthe, D.R., Merchant, R.M., Jha, S., Agrawal, M., Dziurzynski, L.A., Sap, M., et al. (2015). Psychological language on twitter predicts county-level heart disease mortality. Psychological Science, 26(2), 159–169.

    Article  Google Scholar 

  • Fan, W., & Gordon, M.D. (2014). The power of social media analytics. Communications of the ACM, 57(6), 74–81.

    Article  Google Scholar 

  • Fani, H., Zarrinkalam, F., Bagheri, E., Du, W. (2016). Time-sensitive topic-based communities on twitter. In Canadian conference on artificial intelligence (pp. 192–204). Springer.

  • Fani, H., Bagheri, E., Du, W. (2017a). Temporally like-minded user community identification through neural embeddings. In 26th ACM international conference on information and knowledge management (CIKM).

  • Fani, H., Bagheri, E., Zarrinkalam, F., Zhao, X., Du, W. (2017b). Finding diachronic like-minded users. Computational Intelligence.

  • Franch, F. (2013). (Wisdom of the crowds) 2: 2010 uk election prediction with social media. Journal of Information Technology & Politics, 10(1), 57–71.

    Article  Google Scholar 

  • Glorot, X., Bordes, A., Bengio, Y. (2011). Domain adaptation for large-scale sentiment classification: a deep learning approach. In Proceedings of the 28th international conference on international conference on machine learning, ICML’11, USA (pp. 513–520). Omnipress.

  • Graves, A. (2012a). Generating sequences with recurrent neural networks. In ICML representation learning workshop.

  • Graves, A. (2012b). Sequence transduction with recurrent neural networks. In International conference of machine learning (ICML) 2012 workshop on representation learning.

  • Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural Computation, 9, 1735–1780.

    Article  Google Scholar 

  • Kaleel, S.B., & Abhari, A. (2015). Cluster-discovery of twitter messages for event detection and trending. Journal of Computational Science, 6, 47–57.

    Article  Google Scholar 

  • Khodabakhsh, M., Kahani, M., Bagheri, E., Noorian, Z. (2017). Detecting life events from twitter based on temporal semantic features. Knowledge-based Systems Journal. second revision submitted, http://ls3.rnet.ryerson.ca/wiki/images/4/48/Life_Event_Detection.pdf.

  • Kim, Y. (2014). Convolutional neural networks for sentence classification. In Proceedings of the 2014 conference on empirical methods in natural language processing.

  • Kingma, D., & Ba, J. (2015). Adam: A method for stochastic optimization. In the 3rd International conference for learning representations.

  • Kusner, M.J., Sun, Y., Kolkin, N.I., Weinberger, K.Q. (2015). From word embeddings to document distances. In Proceedings of the 32Nd International conference on international conference on machine learning - Volume 37, ICML’15 (pp.957–966) JMLR.org.

  • Lassen, N.B., Madsen, R., Vatrapu, R. (2014). Predicting iphone sales from iphone tweets. In Proceedings of the 2014 IEEE 18th International enterprise distributed object computing conference, EDOC ’14, (Washington, DC, USA) (pp. 81–90). IEEE Computer Society.

  • Lee, J.Y., & Dernoncourt, F. (2016). Sequential short-text classification with recurrent and convolutional neural networks. In Proceedings of the 2016 conference of the North American chapter of the association for computational linguisticss.

  • Li, J.L., Ritter, A., Cardie, C., Hovy, E.H. (2014). Major life event extraction from twitter based on congratulations/condolences speech acts. In EMNLP.

  • Li, Y., Liu, T., Jiang, J., Zhang, L. (2016). Hashtag recommendation with topical attention-based lstm. In Proceedings of the 26th international conference on computational linguistics (pp. 943–952), Coling .

  • Limsopatham, N., & Collier, N. (2016). Bidirectional lstm for named entity recognition in twitter messages. In Proceedings of the 2nd workshop on noisy user-generated text (pp. 145–152).

  • Lin, H., Jia, J., Guo, Q., Xue, Y., Li, Q., Huang, J., Cai, L., Feng, L. (2014). User-level psychological stress detection from social media using deep neural network. In Proceedings of the 22Nd ACM international conference on multimedia, MM ’14, New York (pp. 507–516), ACM.

  • Liu, T., Ding, X., Chen, Y., Chen, H., Guo, M. (2016). Predicting movie box-office revenues by exploiting large-scale social media content. Multimedia Tools and Applications, 75(3), 1509–1528.

    Article  Google Scholar 

  • Ma, J., Gao, W., Mitra, P., Kwon, S., Jansen, B.J., Wong, K.-F., Cha, M. (2016). Detecting rumors from microblogs with recurrent neural networks. In Proceedings of the Twenty-Fifth international joint conference on artificial intelligence, IJCAI’16 (pp 3818–3824), AAAI Press.

  • Madani, A., Boussaid, O., Zegour, D.E. (2015). Real-time trending topics detection and description from twitter content. Social Network Analysis and Mining, 5, 59.

    Article  Google Scholar 

  • Mao, Y., Wei, W., Wang, B., Liu, B. (2012). Correlating s&p 500 stocks with twitter data. In Proceedings of the first ACM international workshop on hot topics on interdisciplinary social networks research, HotSocial ’12, (New York, NY, USA) (pp. 69–72). ACM.

  • Mikolov, T., Chen, K., Corrado, G., Dean, J. (2013a). Efficient estimation of word representations in vector space. In Proceedings of workshop at ICLR.

  • Mikolov, T., Sutskever, I., Chen, K., Corrado, G., Dean, J. (2013b). Distributed representations of words and phrases and their compositionality. In Proceedings of the 26th international conference on neural information processing systems, NIPS’13 USA (pp. 3111–3119). Curran Associates Inc.

  • Moyano, L.G., Cavalin, P.R., Miranda, P.P. (2015). Life event detection using conversations from social media. In Brazilian workshop on social network analysis and mining.

  • Nguyen, T.H., & Shirai, K. (2015). Topic modeling based sentiment analysis on social media for stock market prediction. In Proceedings of the 53rd annual meeting of the association for computational linguistics and the 7th international joint conference on natural language processing (Volume 1: Long Papers) (pp. 1354–1364). Association for Computational Linguistics.

  • Nguyen, T.H., Shirai, K., Velcin, J. (2015). Sentiment analysis on social media for stock movement prediction. Expert Systems with Applications, 42(24), 9603–9611.

    Article  Google Scholar 

  • Ni, M., He, Q., Gao, J. (2014). Using social media to predict traffic flow under special event conditions. In The 93rd annual meeting of transportation research board.

  • Nofer, M., & Hinz, O. (2015). Using twitter to predict the stock market. Business & Information Systems Engineering, 57(4), 229–242.

    Article  Google Scholar 

  • Oak, M., Behera, A., Thomas, T., Alm, C.O., Prud’hommeaux, E., Homan, C., Ptucha, R.W. (2016). Generating clinically relevant texts: A case study on life-changing events. In CLPsych@ HLT-NAACL (pp. 85–94).

  • Owoputi, O., Dyer, C., Gimpel, K., Schneider, N., Smith, N.A. (2013). Improved part-of-speech tagging for online conversational text with word clusters. In Proceedings of NAACL association for computational linguistics.

  • Paltoglou, G. (2016). Sentiment-based event detection in twitter. Journal of the Association for Information Science and Technology, 67(7), 1576–1587.

    Article  Google Scholar 

  • Preotiuc-Pietro, D., Eichstaedt, J., Park, G., Sap, M., Smith, L., Tobolsky, V., Schwartz, H.A., Ungar, L. (2015). The role of personality, age and gender in tweeting about mental illnesses. In NAACL HLT (vol. 2015, pp. 21).

  • Radosavljevic, V., Grbovic, M., Djuric, N., Bhamidipati, N. (2014). Large-scale world cup 2014 outcome prediction based on tumblr posts. In KDD workshop on large-scale sports analytics.

  • Ritter, A., Clark, S., Etzioni, O., et al. (2011). Named entity recognition in tweets: an experimental study. In Proceedings of the conference on empirical methods in natural language processing (pp. 1524–1534) Association for Computational Linguistics.

  • Schuster, M., & Paliwal, K.K. (1997). Bidirectional recurrent neural networks. IEEE Transactions on Signal Processing, 45(11), 2673–2681.

    Article  Google Scholar 

  • Semeniuta, S., Severyn, A., Barth, E. (2017). A hybrid convolutional variational autoencoder for text generation. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, EMNLP 2017, Copenhagen, Denmark, September 9-11, pp. 638–648.

  • Serban, I.V., Klinger, T., Tesauro, G., Talamadupula, K., Zhou, B., Bengio, Y., Courville, A.C. (2017). Multiresolution recurrent neural networks: An application to dialogue response generation. In AAAI (pp. 3288–3294).

  • Severyn, A., & Moschitti, A. (2015). Twitter sentiment analysis with deep convolutional neural networks. In Proceedings of the 38th international ACM SIGIR conference on research and development in information retrieval, SIGIR ’15, New York (pp. 959–962), ACM.

  • Shang, L., Lu, Z., Li, H. (2015). Neural responding machine for short-text conversation. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (pp. 1577–1586). The Association for Computer Linguistics.

  • Shi, X., Knight, K., Yuret, D. (2016). Why neural translations are the right length. In EMNLP (pp. 2278–2282).

  • Sordoni, A., Bengio, Y., Vahabi, H., Lioma, C., GrueSimonsen, J., Nie, J.-Y. (2015). A hierarchical recurrent encoder-decoder for generative context-aware query suggestion. In Proceedings of the 24th ACM international on conference on information and knowledge management, CIKM ’15, (New York, USA) (pp. 553–562), ACM.

  • Sutskever, I., Martens, J., Hinton, G. (2011). Generating text with recurrent neural networks. In Proceedings of the 28th international conference on international conference on machine learning, ICML’11, USA (pp. 1017–1024). Omnipress.

  • Sutskever, I., Vinyals, O., Le, Q.V. (2014). Sequence to sequence learning with neural networks. In Proceedings of the 27th international conference on neural information processing systems, NIPS’14 (pp. pp. 3104–3112). Cambridge: MIT Press.

  • Tang, D., Wei, F., Yang, N., Zhou, M., Liu, T., Qin, B. (2014). Learning sentiment-specific word embedding for twitter sentiment classification. In Proceedings of the 52nd annual meeting of the association for computational linguistics (Volume 1: Long Papers), (Baltimore, Maryland) (pp. 1555–1565). Association for Computational Linguistics.

  • Tang, D., Qin, B., Liu, T. (2015). Document modeling with gated recurrent neural network for sentiment classification. In Proceedings of the 2015 conference on empirical methods in natural language processing, Lisbon (pp. 1422–1432). Association for Computational Linguistics.

  • Tsakalidis, A., Papadopoulos, S., Cristea, A.I., Kompatsiaris, Y. (2015). Predicting elections for multiple countries using twitter and polls. IEEE Intelligent Systems, 30(2), 10–17.

    Article  Google Scholar 

  • Unankard, S., Li, X., Sharaf, M.A. (2015). Emerging event detection in social networks with location sensitivity. World Wide Web, 18(5), 1393–1417.

    Article  Google Scholar 

  • Vosoughi, S., Vijayaraghavan, P., Roy, D. (2016). Tweet2vec: Learning tweet embeddings using character-level cnn-lstm encoder-decoder. In Proceedings of the 39th international ACM SIGIR conference on research and development in information retrieval, SIGIR ’16, New York (pp. 1041–1044). ACM.

  • Wald, R., Khoshgoftaar, T.M., Napolitano, A., Sumner, C. (2012). Using twitter content to predict psychopathy. In 2012 11th international conference on machine learning and applications (ICMLA) (vol. 2, pp. 394–401). IEEE.

  • Werbos, P.J. (1990). Backpropagation through time: what it does and how to do it. Proceedings of the IEEE, 78(10), 1550–1560.

    Article  Google Scholar 

  • Woo, H., Cho, Y., Shim, E., Lee, J.-K., Lee, C.-G., Kim, S.H. (2016). Estimating influenza outbreaks using both search engine query data and social media data in south korea. Journal of medical Internet research, 7, 18.

    Google Scholar 

  • Yang, Z., Yang, D., Dyer, C., He, X., Smola, A.J., Hovy, E.H. (2016). Hierarchical attention networks for document classification. In Proceedings of the 2016 conference of the North American chapter of the association for computational linguistics: human language technologies (pp. 1480–1489).

  • Zarrinkalam, F., Fani, H., Bagheri, E., Kahani, M., Di Nunzio, G.M. (2016). Inferring implicit topical interests on twitter. In Ferro, N., Crestani, F., Moens, M.-F., Mothe, J., Silvestri, F., Hauff, C., Silvello, G. (Eds.) Advances in information retrieval: 38th European conference on IR research, ECIR 2016, Padua, Italy, March 20–23, 2016. Proceedings (pp. 479–491). Cham: Springer International Publishing.

    Google Scholar 

  • Zhang, X., Zhao, J., LeCun, Y. (2015). Character-level convolutional networks for text classification. In Proceedings of the 28th international conference on neural information processing systems, NIPS’15 (pp. 649–657). Cambridge: MIT Press.

  • Zhao, Z., Lu, H., Cai, D., He, X., Zhuang, Y. (2016). User preference learning for online social recommendation. IEEE Transactions on Knowledge and Data Engineering, 28(9), 2522–2534.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mohsen Kahani.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Khodabakhsh, M., Kahani, M. & Bagheri, E. Predicting future personal life events on twitter via recurrent neural networks. J Intell Inf Syst 54, 101–127 (2020). https://doi.org/10.1007/s10844-018-0519-2

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10844-018-0519-2

Keywords

Navigation