Skip to main content
Log in

Pre-production box-office success quotient forecasting

  • Methodologies and Application
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

Hollywood enjoys the position of being the biggest movie producers when it comes to global recognition among movie-making industries. Despite being the biggest movie producer, it has been facing high revenue losses lately since most of the films that it has created have failed to capture viewer’s attention in the first few weeks of its release resulting in a box-office flop. It has been observed in a recent study that Hollywood is estimated to witness a loss of around 1 billion to almost 10 billion US dollars till 2020. Revenue risks have created immense pressure on movie producing stakeholders. They feel a constant pressure to come up with a formula to make a successful movie, however, to date; there are no fixed ingredients that can ensure the success of a movie. Researchers and movie producers constantly feel a need to have some expert systems which would predict the fate of the movie prior to its production with reasonable accuracy. Regardless of the difficult nature of the issue area, few researchers have created expert systems to forecast the financial success of movies using different approaches, but most of them are targeted pre-release forecasting or have low prediction accuracy. Such predictions are of a seminal nature as of their limited prediction scope, and non-ability to reduce revenue loss risk. Therefore, there is a constant demand from investors to have pre-production forecasting tools with high accuracy which can help them plan and make necessary alterations to save huge investments. In this study, we proposed eighteen new features to forecast box-office success, as soon as the quotient (director and cast) signs an agreement. This proposed forecasting time is the earliest prediction that has ever been reported in the movie forecasting literature. The decision support system ranks director and lead cast by utilizing their performances of the last 100 years (1915–2015). The processed output file is a table that ranks each director and cast into four categories based on cast experience, journalist critics, media reporting, user ratings, and revenue generated by associating movie. To produce more accurate results, learner-based feature selection is also performed to select the best subsets of features. This system is intended to be a dynamic tool, integrating further data for real-time adaptation. The system has the ability to incorporate different feature selection algorithm for the progressive improvement of movie success forecasting We demonstrate the effectiveness of extracting features and explain how they improve forecasting accuracy over existing models. The adaptive behaviour of the presented system is achieved by incorporating conceptually different machine learning classifiers, i.e. support vector machine, gradient boosting, extreme boosting classifier, and random forest. A voting system is used to make the prediction by averaging the output class-probabilities. To assess the adequacy of new features, a cross-validation test is directed. Our classification results are evaluated by using two performance measures, i.e. average per cent success rate, or within one class away from its actual prediction. The new features have achieved the most noteworthy accuracy of 85% with an expansion of a 46.43% (average per cent success rate) and 5.56% (within a class away) in comparison with other state-of-the-art feature sets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

References

  • Ahmed U, Zafar L, Qayyum F, Islam MA (2018) Irony detector at semeval-2018 task 3: irony detection in English tweets using word graph. In: Proceedings of the 12th international workshop on semantic evaluation, pp 581–586

  • Ahmed U, Liaquat H, Ahmed L, Hussain SJ (2019) Suggestion miner at semeval-2019 task 9: suggestion detection in online forum using word graph. In: Proceedings of the 13th international workshop on semantic evaluation, pp 1242–1246

  • Altuntas S, Dereli T, Kusiak A (2015) Forecasting technology success based on patent data. Technol Forecast Soc Change 96:202–214

    Article  Google Scholar 

  • Basuroy S, Chatterjee S, Ravid SA (2003) How critical are critical reviews? the box office effects of film critics, star power, and budgets. J Market 67(4):103–117

    Article  Google Scholar 

  • Chen X, Chen Y, Weinberg CB (2013) Learning about movies: the impact of movie release types on the nationwide box office. J Cult Econ 37(3):359–386

    Article  Google Scholar 

  • Dahl G, DellaVigna S (2009) Does movie violence increase violent crime? Q J Econ 124(2):677–734

    Article  Google Scholar 

  • Delen D, Sharda R (2010) Predicting the financial success of hollywood movies using an information fusion approach. Indus Eng J 21(1):30–37

    Google Scholar 

  • Einav L (2007) Seasonality in the us motion picture industry. RAND J Econ 38(1):127–145

    Article  MathSciNet  Google Scholar 

  • Elberse A, Eliashberg J (2003) Demand and supply dynamics for sequentially released products in international markets: the case of motion pictures. Market Sci 22(3):329–354

    Article  Google Scholar 

  • Eliashberg J, Jonker JJ, Sawhney MS, Wierenga B (2000) Moviemod: an implementable decision-support system for prerelease market evaluation of motion pictures. Market Sci 19(3):226–243

    Article  Google Scholar 

  • Eliashberg J, Elberse A, Leenders MA (2006) The motion picture industry: Critical issues in practice, current research, and new research directions. Market Sci 25(6):638–661

    Article  Google Scholar 

  • Eliashberg J, Hui SK, Zhang ZJ (2007) From story line to box office: a new approach for green-lighting movie scripts. Manag Sci 53(6):881–893

    Article  Google Scholar 

  • Galvão M, Henriques R (2018) Forecasting movie box office profitability. J Inf Syst Eng Manag 3(3):1–9

    Google Scholar 

  • Ghiassi M, Lio D, Moon B (2015) Pre-production forecasting of movie revenues with a dynamic artificial neural network. Expert Syst Appl 42(6):3176–3193

    Article  Google Scholar 

  • Hennig-Thurau T, Houston MB, Heitjans T (2009) Conceptualizing and measuring the monetary value of brand extensions: the case of motion pictures. J Market 73(6):167–183

    Article  Google Scholar 

  • Jedidi K, Krider R, Weinberg C (1998) Clustering at the movies. Market Lett 9(4):393–405

    Article  Google Scholar 

  • Khalid YN, Aleem M, Ahmed U, Islam MA, Iqbal MA (2019) Troodon: a machine-learning based load-balancing application scheduler for CPU–GPU system. J Parallel Distrib Comput 132(10):79–94. https://doi.org/10.1016/j.jpdc.2019.05.015

  • Lee S, Choeh JY (2018) The interactive impact of online word-of-mouth and review helpfulness on box office revenue. Manag Decis 56(4):849–866

    Article  Google Scholar 

  • Litman BR (1983) Predicting success of theatrical movies: an empirical study. J Popul Cult 16(4):159–175

    Article  Google Scholar 

  • Liu T, Ding X, Chen Y, Chen H, Guo M (2016) Predicting movie box-office revenues by exploiting large-scale social media content. Multimed Tools Appl 75(3):1509–1528

    Article  Google Scholar 

  • Liu Y, Xie T (2019) Machine learning versus econometrics: prediction of box office. Appl Econ Lett 26(2):124–130

    Article  Google Scholar 

  • Ma H, Kim JM, Lee E (2019) Analyzing dynamic review manipulation and its impact on movie box office revenue. Electron Commer Res Appl 35:100–840

    Article  Google Scholar 

  • Madlberger M (2014) Through the eyes of the traveler: consumer evaluation of hotels in eastern European capitals compared with western, southern, and northern Europe. J East Eur Central Asian Res (JEECAR) 1(2):9

    Google Scholar 

  • Marshall P, Dockendorff M, Ibáñez S (2013) A forecasting system for movie attendance. J Bus Res 66(10):1800–1806

    Article  Google Scholar 

  • Mestyán M, Yasseri T, Kertész J (2013) Early prediction of movie box office success based on wikipedia activity big data. PloS ONE 8(8):e71226

    Article  Google Scholar 

  • Moon S, Bergey PK, Iacobucci D (2010a) Dynamic effects among movie ratings, movie revenues, and viewer satisfaction. J Market 74(1):108–121

    Article  Google Scholar 

  • Moon S, Bergey PK, Iacobucci D (2010b) Dynamic effects among movie ratings, movie revenues, and viewer satisfaction. J Market 74(1):108–121

    Article  Google Scholar 

  • Neelamegham R, Chintagunta P (1999) A Bayesian model to forecast new product performance in domestic and international markets. Market Sci 18(2):115–136

    Article  Google Scholar 

  • Numbers (2016) Domestic movie theatrical market summary 1995 to 2019. http://www.the-numbers.com/market/. Accessed 3 Oct 2016

  • Pew Research (2016) 6 new facts about Facebook—Pew Research Center. http://www.pewresearch.org/fact-tank/2014/02/03/6-new-facts-about-facebook/. Accessed 10 Oct 2016

  • Prieto-Rodriguez J, Gutierrez-Navratil F, Ateca-Amestoy V (2015) Theatre allocation as a distributor’s strategic variable over movie runs. J Cult Econ 39(1):65–83

    Article  Google Scholar 

  • Sawhney MS, Eliashberg J (1996) A parsimonious model for forecasting gross box-office revenues of motion pictures. Market Sci 15(2):113–131

    Article  Google Scholar 

  • Scikit (2016) Scikit Documentation. http://scikit-learn.org/stable/documentation.html. Accessed 16 Oct 2016

  • Sharda R, Delen D (2006) Predicting box-office success of motion pictures with neural networks. Expert Syst Appl 30(2):243–254

    Article  Google Scholar 

  • Sprout Social (2016) 17 Powerful Facebook Stats for Marketers—Sprout Social. http://sproutsocial.com/insights/facebook-stats-for-marketers/. Accessed 16 Oct 2016

  • Svetnik V, Liaw A, Tong C, Culberson JC, Sheridan RP, Feuston BP (2003) Random forest: a classification and regression tool for compound classification and QSAR modeling. J Chem Inf Comput Sci 43(6):1947–1958

    Article  Google Scholar 

  • Terry N, Cooley JW, Zachary M (2010) The determinants of foreign box office revenue for English language movies. J Int Bus Cult Stud 2:1

    Google Scholar 

  • Terry N, Butler M, De’Armond D et al (2011) The determinants of domestic box office performance in the motion picture industry. Southwest Econ Rev 32:137–148

    Google Scholar 

  • Viola P, Jones M et al (2001) Rapid object detection using a boosted cascade of simple features. CVPR 1(511–518):3

    Google Scholar 

  • Vogel D (2001) Ships passing in the night: the changing politics of risk regulation in Europe and the United States

  • Vosoughi S (2015) Automatic detection and verification of rumors on twitter. PhD thesis, Massachusetts Institute of Technology

  • Xgboost (2016) Xgboost Parameter. http://xgboost.readthedocs.io/en/latest/parameter.html. Accessed 16 Oct 2016

  • Zafar L, Afzal MT, Ahmed U (2017) Exploiting polarity features for developing sentiment analysis tool. In: EMSASW ESWC

  • Zhang L, Luo J, Yang S (2009) Forecasting box office revenue of movies with BP neural network. Expert Syst Appl 36(3):6580–6587

    Article  Google Scholar 

  • Zhang C, Zhang M, Su Y, Wang W (2012) Smart home design based on zigbee wireless sensor network. In: 2012 7th international ICST conference on communications and networking in China (CHINACOM). IEEE, pp 463–466

  • Zhang Z, Chai J, Li B, Wang Y, An M, Deng Z (2015) Movie box office interval forecasting based on cart. In: 2015 8th international symposium on computational intelligence and design (ISCID). IEEE, vol 2, pp 87–90

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Usman Ahmed.

Ethics declarations

Conflicts of interest

The author declares that they have no conflict of interest.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Additional information

Communicated by V. Loia.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ahmed, U., Waqas, H. & Afzal, M.T. Pre-production box-office success quotient forecasting. Soft Comput 24, 6635–6653 (2020). https://doi.org/10.1007/s00500-019-04303-w

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-019-04303-w

Keywords

Navigation