Sentiment Analysis of Animated Film “JUMBO” on
Twitter Using Random Forest and Semi-Supervised
Learning
Dublin Core
Title
Sentiment Analysis of Animated Film “JUMBO” on
Twitter Using Random Forest and Semi-Supervised
Learning
Twitter Using Random Forest and Semi-Supervised
Learning
Subject
Sentiment analysis, Twitter, Random Forest, Semi-supervised learning, Film review classification
Description
This study investigates public sentiment toward the Indonesian animated film "JUMBO" using Twitter data and a
semi-supervised machine learning approach. Two thousand fifty tweets were collected and preprocessed to
remove noise, standardize text, and extract meaningful features. Data was collected between April 6, 2025, and
May 13, 2025, following the film's official release on March 31, 2025, coinciding with its peak public discussion
window. A semi-supervised learning strategy was applied, where 532 tweets were manually labelled into positive,
neutral, or negative sentiment categories, mitigating the extensive need for labelled data. To address the class
imbalance, the Synthetic Minority Over-sampling Technique (SMOTE) was employed. The labelled data were
then used to train a Random Forest classifier, achieving an accuracy of 90% and balanced F1 scores across all
classes. The model was subsequently applied to classify the remaining unlabeled tweets, which revealed a
dominant proportion of positive sentiments toward the film. These results obtain strong public approval of
"JUMBO" and demonstrate the effectiveness of combining machine learning with semi-supervised techniques for
sentiment analysis, particularly in the context of local cultural products. This research can be an initial stage in a
broader roadmap for analyzing the success factors of Indonesian animated films through AI-driven approach
semi-supervised machine learning approach. Two thousand fifty tweets were collected and preprocessed to
remove noise, standardize text, and extract meaningful features. Data was collected between April 6, 2025, and
May 13, 2025, following the film's official release on March 31, 2025, coinciding with its peak public discussion
window. A semi-supervised learning strategy was applied, where 532 tweets were manually labelled into positive,
neutral, or negative sentiment categories, mitigating the extensive need for labelled data. To address the class
imbalance, the Synthetic Minority Over-sampling Technique (SMOTE) was employed. The labelled data were
then used to train a Random Forest classifier, achieving an accuracy of 90% and balanced F1 scores across all
classes. The model was subsequently applied to classify the remaining unlabeled tweets, which revealed a
dominant proportion of positive sentiments toward the film. These results obtain strong public approval of
"JUMBO" and demonstrate the effectiveness of combining machine learning with semi-supervised techniques for
sentiment analysis, particularly in the context of local cultural products. This research can be an initial stage in a
broader roadmap for analyzing the success factors of Indonesian animated films through AI-driven approach
Creator
Eko Rahmat Slamet Hidayat Saputra1
, Arvin Claudy Frobenius2
, Arvin Claudy Frobenius2
Source
https://ijicom.respati.ac.id/index.php/ijicom/article/view/79/97
Publisher
International Journal of Informatics and Computation (IJICOM)
Date
2025
Contributor
Fajar bagus W
Format
PDF
Language
English
Type
Text
Files
Collection
Citation
Eko Rahmat Slamet Hidayat Saputra1
, Arvin Claudy Frobenius2, “Sentiment Analysis of Animated Film “JUMBO” on
Twitter Using Random Forest and Semi-Supervised
Learning,” Repository Horizon University Indonesia, accessed December 31, 2025, https://repository.horizon.ac.id/items/show/9760.
Twitter Using Random Forest and Semi-Supervised
Learning,” Repository Horizon University Indonesia, accessed December 31, 2025, https://repository.horizon.ac.id/items/show/9760.