Skip to main content
Log in

The societal communication of the Q&A community on topic modeling

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

The emerging trend in technical research is to use client-generated data composed of community media to survey community opinion and scientific communication on employment and care issues. This study gathered data from the social website questions and answers and the indirect effects of stack to survey the key factors that influence public preferences in technical knowledge and opinions. Using a web search engine, topic modeling, and regression data modeling, this study quantitatively analyzed the effects of the textual and auxiliary functions of the response on the number of votes received with the response. Compared to previous surveys based on open assessments, the model results show that Quora users are more likely to talk only about technology. It may fail when query keywords do not match the text content of large documents containing relevant questions from the existing methods. Furthermore, consumers are generally not experts and provide ambiguous queries (Q&As) that lead to mixed results and face a problem with existing methods. To solve these problems, the researchers attempted to reorganize the primary results and present advanced distributed topic modeling techniques to address technologies and platforms by increasing the attributes and the time and space required to generate the model. This work briefly describes the public question-and-answer structure around the world and follows the development of the main themes of housing and employment opportunities for next-generation technologies worldwide in real-time scrolling.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  1. D M Blei, AY Ng, M I Jordan, 2003. “Latent Dirichlet Allocation,” J Machine Learn Res 3 993–1022,

  2. LI Hua-Meng,LIHai-Rui,XUE Liang 2012 TFIDF Algorithm based on information gain and informati entropy J Comp Eng 38(08): 37–40.

  3. Hanchen Jiang, MaoshanQiang, Dongcheng Zhang, Qi Wen, Bingqing Xia, Nan An 2018 "Climate change communication in an online Q&A community: a case study of quora", Sustainability

  4. Campbell, JC, Hindle, A, Stroulia E, 2014 Latent Dirichlet allocation: extracting topics from software engineering data. In The art and science of analyzing software data (pp. 139–159). Morgan Kaufmann.

  5. Rainer Lienhart, Stefan Romberg, Eva H¨orster 2009 Multilayer pLSA for multimodal image retrieval. In Proceeding of the ACM International Conference on Image and Video Retrieval, CIVR pages 9: 1–9:8, New York, NY, USA, ACM.

  6. S. Arora, R. Ge, R. Kannan, and A. Moitra. Computing a nonnegative matrix factorization provably. In: Proc. the 44th Symposium on Theory of Computing (STOC), pages 145–162,2012

  7. Lee, DD, Seung HS (2000) Algorithms for non-negative matrix factorization. In: Annual Conference on Neural Information Processing Systems, pp 556–562

  8. Yan X, Guo J 2013 Learning topics in short text using ncut-weighted non-negative matrix factorization on term correlation matrix,

  9. Huang L, Ma J, Chen C (2017) Topic detection from microblogs using T-LDA and perplexity. In:24th Asia-Pacific software engineering conference workshops,2018.

  10. W. Xu, X. Liu, Y. Gong. 2003 document clustering based on non-negative matrix factorization. In: Proc. the 26th Annual International ACM SIGIR conference on Research and Development in Information Retrieval (SIGIR), pages 267–273

  11. Peng Zhang ,Department of mathematics, Zhejiang University, Hangzhou, 310027 China ; Wanhua Su Statistical inference on recall, precision and average precision under random selection,2012 7,Print on Demand (PoD) ISBN: 978–1–7281–4715–4,2019.

  12. Gaujoux R, Seoighe C. A flexible R package for nonnegative matrix factorization. BMC Bioinformatics 2010;11:367

  13. Golub TR, Slonim DK, Tamayo P, Huard C, Gaasenbeek M, Mesirov JP et al (1999) Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286:531–537

    Article  Google Scholar 

  14. Dai L, Gao X, Guo Y, Xiao J, Zhang Z (2012) Bioinformatics clouds for big data manipulation. Biol Direct 7:43

    Article  Google Scholar 

  15. V K Vavilapalli, A C Murthy, C Douglas, S Agarwal, M Konar, R Evans, T Graves, J Lowe, H Shah, S Seth, B Saha, C Curino, O O’Malley, S Radia, B Reed, E Baldeschwieler, “Apache hadoop yarn: yet another resource negotiator,” In Proc. ACM.

  16. Gebara FH, Hofstee HP, Nowka KJ (2015) Second-generation big data systems. IEEEComputer 48(1):36–41

    Google Scholar 

  17. Apache Hama, accessed on June 16, 2016. [Online]. Available: https://hama.apache.org/

  18. Yeung, K. Quora Now Has 100 Million Monthly Visitors up from 80 Million in January. Available online:http://venturebeat.com/2016/03/17/quora-now-has-100-million-monthly-visitors-up- from- 80-million-in-january/ (Accessed on 28 March 2016).

  19. AlexaWebpage: Web Traffic Statistics of Quora. Available online: http://www.alexa.com/siteinfo/www. quora.com (accessed on 15 April 2016).

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to P. Venkateswara Rao.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Venkateswara Rao, P., Kumar, A.P.S. The societal communication of the Q&A community on topic modeling. J Supercomput 78, 1117–1143 (2022). https://doi.org/10.1007/s11227-021-03852-y

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11227-021-03852-y

Keywords

Navigation