Polarity Determinationof Movie Reviews: A Systematic Literature Review

  • Shashank Shekhar Sharma PhD Scholar, Indian Institute of Foreign Trade
  • Gautam Dutta IIFT Bhawan, B-21, NRPC Colony, Block B, Qutab Institutional Area, New Delhi, Delhi 110016
Keywords: Polarity determination; sentiment analysis; movie reviews; natural language processing; literature review


Sentiment Analysis has been shown to be extremely sensitive to the domain of interest. Methods, algorithms and lexicons need to be customized for specific domains to ensure robust performance for sentiment analysis tasks like classification of reviews into positive or negative. Over the past decade and a half, several important works have been published using various feature selection and classification methods for polarity determination of movie reviews. These studies use a diverse mix of feature engineering methods to select the features and have tested them with different algorithms to determine to polarity of the reviews. Some of selected research studies have been able to determine polarity with accuracies ranging from 85% to 90%. Models illustrated in these research papers have been helpful in making accurate estimation of how well a particular movie is received by the audience. This is a systematic review of these studies where overview, analysis and comparison of the feature selection and classification methods used in the domain of movie reviews have been presented.


Abbasi, A., France, S., Zhang, Z., & Chen, H. (2011). Selecting attributes for sentiment classification using feature relation networks. IEEE Transactions on Knowledge and Data Engineering, 23(3), 447-462.
Bai, X. (2011).Predicting consumer sentiments from online text. Decision Support Systems, 50(4), 732-742.
Cheung, C. M. K., & Lee, M. K. O. (2008). Online consumer reviews: Does negative electronic word-of-mouth hurt more? Proceedings from the AMCIS 2008. A review for the influential factors in e-WoM research
Cho, H., Kim, S., Lee, J., & Lee, J. S. (2014). Data-driven integration of multiple sentiment dictionaries for lexicon-based sentiment classification of product reviews. Knowledge-Based Systems, 71, 61-71.
Dellarocas, C., Awad, N., & Zhang, X. (2004).Exploring the value of online reviews to organizations: Implications for revenue forecasting and planning.Proceedings from the ICIS 2004.
Dellarocas, C., Zhang, X. &Awad, N. (2007).Exploring the value of online product reviews in forecasting sales: The case of motion pictures. Journal of Interactive Marketing, 21(4), 23
Deng, Z. H., Luo, K. H., & Yu, H. L. (2014).A study of supervised term weighting scheme for sentiment analysis. Expert Systems with Applications, 41(7), 3506-3513.
Duan, W., Gu, B.&Whinston, A. (2008). "The dynamics of online word-of-mouth and product sales—An empirical investigation of the movie industry." Journal of retailing 84.2 , 233-242.
Eliashberg, J. &Shugan, S. (1997). Film critics: Influencers or predictors?.JournalofMarketing, 61, 68-78.
Godes, D., &Mayzlin, D. (2004).Using Online Conversations to Study Word-of-Mouth Communication.Marketing Science, 23, 545-560.
Hatzivassiloglou, V., &McKeown, K. R. (1997, July).Predicting the semantic orientation of adjectives.In Proceedings of the eighth conference on European chapter of the Association for Computational Linguistics (pp. 174-181).Association for Computational Linguistics.
Hennig-Thurau, T., Gwinner, K. P., Walsh, G. &Gremler, D. D. (2004). Electronic Word-of- Mouth Via Consumer-Opinion Platforms: What Motivates Consumers to Articulate Themselves on the Internet? Journal of Interactive Marketing, 18, 38- 52.
Hung, C., & Lin, H. K. (2013).Using objective words in SentiWordNet to improve sentiment classification for word of mouth. IEEE Intelligent Systems, 1.
Kanayama, H., &Nasukawa, T. (2006, July).Fully automatic lexicon expansion for domain-oriented sentiment analysis.In Proceedings of the 2006 conference on empirical methods in natural language processing (pp. 355-363).Association for Computational Linguistics.
Liu, Y. (2006). Word of mouth for movies: Its dynamics and impact on box office revenue. Journal of marketing, 70(3), 74-89.
Mäntylä, M. V., Graziotin, D., &Kuutila, M. (2016).The Evolution of Sentiment Analysis-A Review of Research Topics, Venues, and Top Cited Papers. arXiv preprint arXiv:1612.01556.
Medhat, W., Hassan, A., &Korashy, H. (2014). Sentiment analysis algorithms and applications: A survey. Ain Shams Engineering Journal, 5(4), 1093-1113.
Molina-González, M. D., Martínez-Cámara, E., Martín-Valdivia, M. T., &Perea-Ortega, J. M. (2013). Semantic orientation for polarity classification in Spanish reviews. Expert Systems with Applications, 40(18), 7250-7257.
Moraes, R., Valiati, J. F., &Neto, W. P. G. (2013).Document-level sentiment classification: An empirical comparison between SVM and ANN. Expert Systems with Applications, 40(2), 621-633.
Mukherjee, S., & Joshi, S. (2014).Author-Specific Sentiment Aggregation for Polarity Prediction of Reviews.In LREC (pp. 3092-3099).
Mullen, T., & Collier, N. (2004, July).Sentiment Analysis using Support Vector Machines with Diverse Information Sources.In EMNLP (Vol. 4, pp. 412-418).
Neelamegham, R., &Chintagunta, P. (1999).A Bayesian model to forecast new product performance in domestic and international markets. Marketing Science, 18(2), 115-136.
Niraj, R., & Singh, J. (2015). Impact of user-generated and professional critics reviews on Bollywood movie success. Australasian Marketing Journal (AMJ), 23(3), 179-187.
Niraj, Rakesh, and Jagdip Singh. "Impact of user-generated and professional critics reviews on Bollywood movie success." Australasian Marketing Journal (AMJ) 23, no. 3 (2015): 179-187.
Osgood, C. E. (1957). A behavioristic analysis of perception and language as cognitive phenomena. Contemporary approach to cognition.
Pang, B., & Lee, L. (2004, July). A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In Proceedings of the 42nd annual meeting on Association for Computational Linguistics (p. 271).Association for Computational Linguistics.
Pang, B., Lee, L., &Vaithyanathan, S. (2002, July). Thumbs up?: sentiment classification using machine learning techniques. In Proceedings of the ACL-02 conference on Empirical methods in natural language processing-Volume 10 (pp. 79-86).Association for Computational Linguistics.
Penalver-Martinez, I., Garcia-Sanchez, F., Valencia-Garcia, R., Rodriguez-Garcia, M. A., Moreno, V., Fraga, A., & Sanchez-Cervantes, J. L. (2014).Feature-based opinion mining through ontologies. Expert Systems with Applications, 41(13), 5995-6008.
Porter, M. F. (2001). Snowball: A language for stemming algorithms.
Prabowo, R., &Thelwall, M. (2009). Sentiment analysis: A combined approach. Journal of Informetrics, 3(2), 143-157.
Ratnaparkhi, A. (1996, May). A maximum entropy model for part-of-speech tagging.In Proceedings of the conference on empirical methods in natural language processing (Vol. 1, pp. 133-142).
Reinstein, D. A., & Snyder, C. M. (2005). The influence of expert reviews on consumer demand for experience goods: A case study of movie critics. The journal of industrial economics, 53(1), 27-51.
Rui, H., Liu, Y., &Whinston, A. (2013). Whose and what chatter matters? The effect of tweets on movie sales. Decision Support Systems, 55(4), 863-870.
Siedlecki, W., &Sklansky, J. (1989).A note on genetic algorithms for large-scale feature selection. Pattern recognition letters, 10(5), 335-347.
Tan, L. K. W., Na, J. C., Theng, Y. L., & Chang, K. (2012). Phrase-level sentiment polarity classification using rule-based typed dependencies and additional complex phrases consideration. Journal of Computer Science and Technology, 27(3), 650-666.
Uysal, A. K., &Gunal, S. (2014).The impact of preprocessing on text classification. Information Processing & Management, 50(1), 104-112.
Weichselbraun, A., Gindl, S., &Scharl, A. (2013). Extracting and grounding contextualized sentiment lexicons. IEEE Intelligent Systems, 28(2), 39-46.
Weichselbraun, A., Gindl, S., &Scharl, A. (2014).Enriching semantic knowledge bases for opinion mining in big data applications. Knowledge-based systems, 69, 78-85.
Whitelaw, C., Garg, N., &Argamon, S. (2005, October). Using appraisal groups for sentiment analysis.In Proceedings of the 14th ACM international conference on Information and knowledge management (pp. 625-631).ACM.
Wu, C. E., & Tsai, R. T. H. (2014).Using relation selection to improve value propagation in a conceptnet-based sentiment dictionary. Knowledge-Based Systems, 69, 100-107.
Xia, R., Zong, C., & Li, S. (2011). Ensemble of feature sets and classification algorithms for sentiment classification. Information Sciences, 181(6), 1138-1152.
Xuan, H. N. T., Le, A. C., & Nguyen, L. M. (2012, November). Linguistic features for subjectivity classification. In Asian Language Processing (IALP), 2012 International Conference on(pp. 17-20). IEEE.
Yu, X., Liu, Y., Huang, X., &An, A. (2012). Mining online reviews for predicting sales performance: A case study in the movie domain. IEEE Transactions on Knowledge and Data engineering, 24(4), 720-734.
Zhou, L., &Chaovalit, P. (2008).Ontology‐supported polarity mining. Journal of the Association for Information Science and Technology, 59(1), 98-110.