Application of Synthetic Minority Oversampling Technique (SMOTE) for Imbalance Class on Text Data Using kNN

Authors

  • Sultan Maula Chamzah Universitas Muhammadiyah Malang
  • Merinda Lestandy Universitas Muhammadiyah Malang
  • Nur Kasan Universitas Muhammadiyah Malang
  • Adhi Nugraha Universitas Muhammadiyah Malang

DOI:

https://doi.org/10.35706/syji.v11i02.6940

Abstract

Tokopedia is one of the online marketplace providers in Indonesia that facilitates internet users to buy and sell online. Tokopedia gets an average of 147.79 million website and application visitors per month. Although it has many users, of course in an application it has advantages and disadvantages. This was conveyed by users through reviews or reviews contained in the Google Play Store. In the review, it can be seen that more users who gave 5-star rating reviews than users gave 1 star rating. The Synthetic Minority Oversampling Technique or SMOTE is a popular method applied in order to deal with class imbalances. This study aims to determine the performance of the K-Nearest Neighbor algorithm in dealing with imbalance class using Synthetic Minority Oversampling Technique (SMOTE). This study uses 5000 data consisting of 3975 negative data and 1025 positive data. Of the 5000 data divided into two parts, 70% training data and 30% test data. The SMOTE-kNN method shows a better accuracy result, which is 90% compared to using only kNN with an accuracy value of 82%.

Downloads

Download data is not yet available.

Published

2022-11-16

How to Cite

Maula Chamzah, S., Lestandy, M. ., Kasan, N., & Nugraha, A. (2022). Application of Synthetic Minority Oversampling Technique (SMOTE) for Imbalance Class on Text Data Using kNN. Syntax : Jurnal Informatika, 11(02), 56–67. https://doi.org/10.35706/syji.v11i02.6940