5
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Prediction of Indonesian Presidential Election Results using Sentiment Analysis with Na�ve Bayes Method

      , , ,
      JURNAL MEDIA INFORMATIKA BUDIDARMA
      STMIK Budi Darma

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Social media serves as a solution for politicians as a campaign tool because it can save costs compared to conventional campaigns. The 2024 Indonesian Presidential Election has drawn public attention, especially among social media users. Twitter, as one of the widely used social media platforms in Indonesia, functions as an effective campaign forum. However, the problem that arises is how to automatically collect social media data related to presidential discussions and provide conclusions on the analysis results. Of course, this is not easy if done manually. Sentiment analysis is one approach that can be used for this in order to draw conclusions and analysis related to the available data. Data was collected shortly after the registration of presidential and vice-presidential candidates in November 2023. This study aims to obtain sentiment results from the latest data obtained, get the best model from the Naive Bayes method, to conduct analysis in predicting presidential election results based on sentiment. However, at the time of data collection, candidate numbers had not been assigned by the Election organizers. The obtained data amounted to 11,569 records using the Valence Aware Dictionary for Sentiment Reasoning (VADER) library for labeling. After removing duplicated tweets, the data was reduced to 4,893 records, with each candidate pair having 1,631 data points. The sentiment analysis classification model was determined using the Nave Bayes method with Term Frequency-Inverse Document Frequency (TF-IDF) feature extraction. Based on the data, the highest percentage of positive sentiment was found in Ganjar Pranowo - Mahfud MD data at 69.16%, and the highest negative sentiment was in Prabowo Subianto - Gibran Rakabuming Raka data at 52.12%. Common words in positive sentiment for Ganjar Pranowo - Mahfud MD include "strong," "corruption," "support," "reward," and others. Meanwhile, frequently appearing negative sentiment words for Prabowo Subianto - Gibran Rakabuming Raka include "child," "eldest," "mk," "young," and others. This research achieved an average accuracy of 76.67% using the Naive Bayes method on the entire dataset, indicating its reliability in similar cases.

          Related collections

          Author and article information

          Journal
          JURNAL MEDIA INFORMATIKA BUDIDARMA
          mib
          STMIK Budi Darma
          2548-8368
          2614-5278
          January 09 2024
          January 09 2024
          : 8
          : 1
          : 41
          Article
          10.30865/mib.v8i1.7007
          ac0e65a2-6b1f-4f2c-9b2e-703b07976104
          © 2024

          http://creativecommons.org/licenses/by/4.0

          History

          Comments

          Comment on this article