Syntax : Jurnal Informatika https://journal.unsika.ac.id/syntax <p>Syntax Journal of Information (ISSN 2302-156X and E-ISSN 2541-5344) is a scientific journal of information and communication technology with a frequency published twice a year, in May and October. The informatics journal syntax is published by the publishing body of the Faculty of Computer Science, Universitas Singaperbangsa Karawang.</p> <p>Syntax Journal of Informatics focuses on Software Engineering, Compilation Engineering, Database Design, Data Mining, Web Services Technology, Business Intelligent, Artificial Intelligence, Fuzzy Logic, Computer Vision, Embedded Systems, Robotics, Expert Systems, Machine Learning, E-Commerce, Digital and Network Security, Neuro Fuzzy, E-Government, Bioinformatics, Geographic Information Systems, Mobile Applications, Games Technology, Computer Networks, Cloud Computing.</p> en-US agung@unsika.ac.id (Agung Susilo Yuda Irawan, S.Kom., M.Kom.) syntax@unsika.ac.id (TEAM) Mon, 17 Jun 2024 17:06:31 +0700 OJS 3.2.1.4 http://blogs.law.harvard.edu/tech/rss 60 Comparison Email Spam detection vectorizing using bag of word, TFIDF and Word2Vec in Multinomial Naïve Bayes https://journal.unsika.ac.id/syntax/article/view/11254 <p>Email has become very popular among people nowadays. In fact, it the cheapest, popular and fastest means of communication in recent times. Email also has become official communication media in business area. The popularity of email is also used by irresponsible people as a medium for sending fake news, as a medium for fraud and so on. We call this kind email as spam email. There are dangerous and not dangerous spam email. We will focus on detection dangerous spam email, there are 2 type dangerous spam email. The first is email Phishing: Phishing is a term used to define fraudulent practices in which spammers try to trick victims. This can be detrimental to the person who receives these emails. And this kind email may deliver massively and very disturbing the email user. This research will try to find better preprocessing text technique to support the Multinomial Naïve Bayes algorithm with 3 class (ham, phishing and fraud) to classify kind of email, it is hoped that it can help users more accurately classify spam emails. To be able to do that, in preprocessing data we need to vectorizing body email so machine learning can make calculation. Vectorization enables the machines to understand the textual contents by converting them into meaningful numerical representations. The effectiveness of various text vectorization methods, namely the bag of word, TF-IDF and word2vec are investigated for email spam detection using the Multinomial Naïve Bayes. The paper presents the comparative analysis of different vectorization methods on spam email dataset. This paper will give the best vectorization with Multinomial Naive Bayes.</p> Rony Arifiandy, Hasanul Fahmi Copyright (c) 2024 Syntax : Jurnal Informatika https://journal.unsika.ac.id/syntax/article/view/11254 Mon, 17 Jun 2024 00:00:00 +0700 Sari, Eka DESAIN SISTEM PEMINJAMAN PERALATAN KANTOR BERBASIS WEBSITE https://journal.unsika.ac.id/syntax/article/view/11764 <p><em>Asset management in the company is an important part of ensuring smooth operations and supporting employee performance. However, the lack of a structured lending system often causes problems such as damage, loss of assets, difficulties in monitoring and recording. Those who still use Microsoft Excel are also very vulnerable to input errors and data loss. Therefore this study aims to design and build a website-based information system for borrowing office tools assets that can simplify the process of borrowing assets, improve monitoring of office tools assets, and provide accurate reports regarding borrowing office tools. The system development method used in this study uses the waterfall method, and for system design using UML (Unified Modeling Language). The final results of this study are a website-based office tools asset lending system using the Laravel framework which can facilitate monitoring of the condition and availability of assets, reduce the risk of data loss, and increase the accuracy of reporting.</em></p> Eka Puspita Sari, Fahri Ridwan Copyright (c) 2024 Syntax : Jurnal Informatika https://journal.unsika.ac.id/syntax/article/view/11764 Mon, 17 Jun 2024 00:00:00 +0700 Penerapan Algoritma Apriori untuk Memprediksi Pembayaran UKT https://journal.unsika.ac.id/syntax/article/view/11342 <p>Penelitian ini menerapkan algoritma <em>Apriori</em> untuk memprediksi hasil anilsis pola asosiasi pembayaran cicilan uang kuliah di Universitas Buana Perjuangan Karawang. Aturan asosiasi menunjukkan bahwa pembayaran Cicilan 3 memiliki dampak besar terhadap Cicilan 4, dengan tingkat <em>support</em> sebesar 84.60% dan <em>confidence</em> sebesar 93.47%. Ketergantungan positif antara Cicilan 2 dan Cicilan 3 dengan Cicilan 4 juga teridentifikasi dengan nilai <em>support</em> sebesar 84.57% dan nilai <em>confidence</em> sebesar 94.03%. Rekomendasi kebijakan mencakup penggabungan paket pembayaran pada Cicilan 3 dan Cicilan 4 serta insentif pembayaran lebih awal. Pemodelan menggunakan algoritma <em>Apriori</em> dengan implementasi <em>Python</em> dan <em>Google Colaboratory</em>.</p> Ayu Ratna Juwita, Tohirin Al Mudzakir, Adi Rizky Pratama, Bagja Nugraha, Nono Heryana Copyright (c) 2024 Syntax : Jurnal Informatika https://journal.unsika.ac.id/syntax/article/view/11342 Mon, 17 Jun 2024 00:00:00 +0700 TOPIC MODELING ANALYSIS OF ACCESS BY KAI APPLICATION REVIEWS ON GOOGLE PLAY STORE USING LATENT DIRICHLET ALLOCATION https://journal.unsika.ac.id/syntax/article/view/11697 <p><em>PT Kereta Api Indonesia (KAI) has released a ticket booking application named Access by KAI, which has been downloaded over 10 million times and has received more than 187,000 reviews on Google Playstore. However, with the vast amount of review data for the Access by KAI application, it is still challenging to understand the aspects that need improvement. In this case, topic modeling is necessary to classify the reviews. The aim of this research is to apply the Latent Dirichlet Allocation (LDA) method to model topics of user reviews of the Access by KAI application on Google Playstore and to present recommendations derived from the data dictionary or bag-of-words through a fishbone diagram. This research uses the lifecycle of the data mining methodology, which consists of the stages of problem definition, selecting text data mining approach, data collecting, text standardization, text processing, feature extraction, analysis, and discovery. The results of this research identified a total of 7 topics with a coherence score of 0.40279302. The conclusions from each topic are as follows: Topic 1 discusses application updates, available versions, interface, and the relationship with stations and cities. Topic 2 involves users complaining about decreased application performance after updates. Topic 3 covers the use of the Access by KAI application to book train tickets, highlighting the app version, user experience, and app quality ranging from good to cumbersome. Topic 4 reports user difficulties in accessing, particularly issues with login and payment after app updates. Topic 5 focuses on login difficulties, slow app performance, and issues in the ticket booking and payment process. Topic 6 reflects user disappointment regarding performance decline in speed and login difficulties after updates. Topic 7 addresses user complaints about difficulties in purchasing train tickets through the KAI app following updates or upgrades.</em></p> Amanda Febrianti, Intan Purnamasari, Iqbal Maulana Copyright (c) 2024 Syntax : Jurnal Informatika https://journal.unsika.ac.id/syntax/article/view/11697 Fri, 31 May 2024 00:00:00 +0700 Comparison Of Naïve Bayes And Support Vector Machines In Classifying Sentiment On Twitter About Artificial Intelligence Development https://journal.unsika.ac.id/syntax/article/view/11814 <p>Analisis sentimen merupakan bagian dari data mining yang digunakan untuk mengolah dan memproses teks dengan tujuan untuk mengetahui bagaimana opini atau pandangan masyarakat tentang suatu isu atau masalah tertentu. Metode klasifikasi yang digunakan untuk melakukan analisis sentimen pada data berupa teks, diantaranya <em>Naive Bayes</em> dan <em>Support Vector Machine</em> (SVM). Dalam mengevaluasi performa model klasifikasi yang telah dibuat, biasanya akan diukur nilai akurasinya. Oleh karena itu, penelitian ini bertujuan untuk membandingkan performa dari model klasifikasi sentimen yang menggunakan metode <em>Naive Bayes</em> dan SVM, dengan TF-IDF dan <em>CountVectorizer </em>sebagai ekstraksi fitur serta <em>Information Gain </em>sebagai seleksi fitur. Selain itu, digunakan juga N-gram sebagai upaya untuk dapat meningkatkan akurasi model klasifikasi. Penelitian ini menggunakan dataset berupa cuitan pengguna Twitter tentang perkembangan <em>Artificial Intelligence</em>. Data tersebut nantinya dikategorikan menjadi dua kelas, yaitu positif dan negatif, serta akan diolah dengan menggunakan tahapan <em>knowledge discovery in databases</em> (KDD). Hasil penelitian menunjukkan bahwa model hasil <em>Naive Bayes</em> mendapatkan akurasi tertinggi saat menggunakan ekstraksi fitur <em>CountVectorizer</em>, sedangkan model hasil SVM mendapatkan akurasi tertinggi saat menggunakan TF-IDF. Selain itu, penggunaan <em>Information Gain</em> ternyata dapat meningkatkan nilai akurasi model hasil <em>Naive Bayes</em> sebesar 12% menggunakan <em>CountVectorizer</em> dengan N-gram. Namun penggunaan <em>Information Gain</em> justru menurunkan nilai akurasi model hasil SVM sebesar 0,73% menggunakan TF-IDF dengan N-gram.</p> Iqbal Maulana, Roland Vincent, Oman Komarudin Copyright (c) 2024 Syntax : Jurnal Informatika https://journal.unsika.ac.id/syntax/article/view/11814 Fri, 31 May 2024 00:00:00 +0700