Prediction techniques of movie box office using neural networks and emotional mining.

Sci Rep

Faculty of Humanities and Social Sciences, City University of Macau, Macau, China.

Published: September 2024

Box office prediction is of great significance for understanding investment risk, class construction, promotion and distribution, and theater scheduling. However, existing prediction models select too few of the factors that influence movie box office, which limits their prediction accuracy. This study selects 34 influencing factors in 11 categories, including heat index, movie type, release date, creators, and first-day box office, to study box office prediction techniques. The Word2vec algorithm is used to construct a feature thesaurus of nouns in the movie domain; adjectives and verbs with emotional coloring are used to construct a movie-domain emotional dictionary; and the TF-IDF algorithm is integrated to calculate emotional scores for movie comments. A prediction method based on comments and Multivariate Linear Regression (MLR) is designed to analyze the relationship between the influencing factors and the movie box office, providing a basis for predicting the total box office and a decision-making reference for the movie industry and related management departments. To improve accuracy further, comments are incorporated as feature values in a prediction model based on a Convolutional Neural Network (CNN). The results show that, without comments, the average prediction accuracy of the MLR, Back-Propagation Neural Network (BPNN), and CNN is 63.4%, 68.3%, and 71.9%, respectively; after integrating the comments, the average prediction accuracy of the MLR and CNN improves by 16.1% and 11.8%, respectively.
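The abstract does not give the exact scoring formula, so the following is only a minimal sketch of how a TF-IDF-weighted emotional score for a comment might be computed. The dictionary entries, their polarity values, the whitespace tokenizer, and the names `EMOTION_DICT`, `idf_table`, and `emotion_score` are all hypothetical and chosen for illustration; they are not taken from the paper.

```python
import math
from collections import Counter

# Hypothetical movie-domain emotional dictionary: word -> polarity weight.
# These entries and values are assumptions, not the paper's dictionary.
EMOTION_DICT = {"great": 1.0, "moving": 0.8, "boring": -0.9, "awful": -1.0}


def tokenize(text):
    """Naive whitespace tokenizer (an assumption; real comments need more care)."""
    return text.lower().split()


def idf_table(comments):
    """Smoothed inverse document frequency for every word in the corpus."""
    n = len(comments)
    doc_freq = Counter()
    for comment in comments:
        doc_freq.update(set(tokenize(comment)))
    return {w: math.log((1 + n) / (1 + d)) + 1.0 for w, d in doc_freq.items()}


def emotion_score(comment, idf):
    """Sum of polarity * TF * IDF over dictionary words found in the comment."""
    tokens = tokenize(comment)
    if not tokens:
        return 0.0
    tf = Counter(tokens)
    score = 0.0
    for word, count in tf.items():
        if word in EMOTION_DICT:
            score += (count / len(tokens)) * idf.get(word, 1.0) * EMOTION_DICT[word]
    return score


comments = [
    "great story and moving ending",
    "boring plot awful acting",
    "the plot was fine",
]
idf = idf_table(comments)
scores = [emotion_score(c, idf) for c in comments]
```

Under these assumptions, each comment's score could then be appended to the other influencing factors as one additional input feature for a regression or neural network model.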

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11390969
DOI: http://dx.doi.org/10.1038/s41598-024-72340-z

Publication Analysis

Top Keywords

box office: 28
movie box: 16
prediction accuracy: 16
influencing factors: 12
prediction: 11
movie: 9
factors movie: 8
prediction model: 8
movie domain: 8
based comments: 8
