PERBANDINGAN KINERJA VARIASI NAÏVE BAYES MULTIVARIATE BERNOULLI DAN NAÏVE BAYES MULTINOMIAL DALAM PENGKLASIFIKASIAN DOKUMEN TEKS

  • Widyawati Widyawati Universitas Banten Jaya
  • Sutanto Sutanto Universitas Banten Jaya
Keywords: Text Classification, Multinomial, Multivariate, Bernoulli, Naïve Bayes

Abstract

Classification of text documents with large amounts will be a job that requires a lot of time, effort and cost of having to read text documents and then categorize them manually, therefore the automatic classification of text documents is needed. The algorithm developed is K-Nearest-Neighbor (KNN), Naïve Bayes, Support Vector Machine (SVM), Decision Tree (DT), Neural Network (NN) and Maximum Entropy. The algorithm used as the object of research is a variation of the Naïve Bayes algorithm, the Naïve Bayes Multivariate Bernoulli and the Naïve Bayes Multinomial. This study discusses whether there are differences between the Algorithms. The Naïve Bayes Multivariate Bernoulli algorithm and the Naïve Bayes Multinomial can be seen from the value of the agreement and the speed of the process of classifying text documents, as well as more information about the process of processing requests that are getting more and more requested. While the highest value using the non-stemming Naïve Bayes Bernoulli method is 71.33%, and the fastest processing time is required using the non-stemming Naïve Bayes method which requires 0.12 seconds processing time.

References

Agastya, M. (2018). Pengaruh Stemmer Bahasa Indonesia Terhadap Performa Analisis Sentimen Terjemahan Ulasan FIlm. Jurnal TEKNOKOMPAK, Vol. 12, No. 1, 2018, 18-23. ISSN 1412-9663 (print), 18-23.
Ibid. (n.d.).
Juang, D. (2016). Analisis Spam dengan menggunakan Naive Bayes . Jurnal Teknovasi Volume 03, Nomor 2, 2016, 51 – 57 ISSN : 2355-701X , 51-57.
Ma, J., Zhang, Y., Liu, J., & Yu, K. (2016). Intelligent SMS Spam Filtering Using Topic Model. 2016 International Conference on Intelligent Networking and Collaborative Systems, 380-383.
Pratiwi, S., & Ulama , B. (2016). Klasifikasi Email Spam dengan Menggunakan Metode Support Vector Machine dan k-Nearest Neighbor. JURNAL SAINS DAN SENI ITS Vol. 5 No. 2 (2016) 2337-3520 (2301-928X Print) , D-344 - D-349.
Rahmayani, I. (2019, Juli Sabtu). https://kominfo.go.id/content/detail/6095. Retrieved from https://kominfo.go.id: https://kominfo.go.id/content/detail/6095/indonesia-raksasa-teknologi-digital-asia/0/sorotan_media
Rahmi, F., & Wibisono, Y. (2016, Juli Sabtu). Aplikasi SMS Spam Filtering pada Android menggunakan Naive Bayes, Unpublished manuscript. Retrieved from http://nlp.yuliadi.pro: http://nlp.yuliadi.pro/dataset
Raschka, S. (2014, Juli Sabtu). https://sebastianraschka.com/Articles. Retrieved from https://sebastianraschka.com: https://sebastianraschka.com/Articles/2014_naive_bayes_1.html
Santosa, B. (2007). Data Mining Teknik Pemanfaatan Data Untuk Keperluan Bisnis. Yogyakarta: Graha Ilmu.
Ting, S., Ip, W., & Tsang , A. (3, July, 2011 ). Is Naïve Bayes a Good Classifier for Document Classification? International Journal of Software Engineering and Its Applications Vol. 5, No. , 37-46.
Published
2020-02-24
How to Cite
Widyawati, W., & Sutanto, S. (2020). PERBANDINGAN KINERJA VARIASI NAÏVE BAYES MULTIVARIATE BERNOULLI DAN NAÏVE BAYES MULTINOMIAL DALAM PENGKLASIFIKASIAN DOKUMEN TEKS. Journal of Innovation And Future Technology (IFTECH), 2(1), 108-125. https://doi.org/10.47080/iftech.v2i1.859