Application of the C4.5 Algorithm to Predict Customer Purchase Patterns in a Retail Transaction Dataset
Abstract
The growing volume of sales transaction data has increased the need for analysis that uncovers patterns useful to companies. This study classifies product sales transaction data using data mining techniques in order to identify sales patterns by product category. Classification is carried out in RapidMiner Studio and covers data preprocessing, model construction, and evaluation of classification performance. The model is evaluated with a confusion matrix, using accuracy as the evaluation metric. The resulting classification model achieves an accuracy of 42.67%. The model therefore classifies only part of the data correctly, and its performance is limited by the complexity and quality of the transaction data. Future research should optimize the data preprocessing stage and select more relevant features to improve the model's accuracy.
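To make the evaluation step concrete, the sketch below shows how accuracy can be derived from a confusion matrix: the correctly classified cases on the diagonal divided by all cases. The study itself used RapidMiner Studio, so this Python version is only an illustration. The function name, the three-class layout, and every count in `hypothetical_matrix` are assumptions invented for this example and are not the study's data; the counts were simply chosen so that 64 of 150 cases are correct, which rounds to the same 42.67% figure.

```python
def accuracy_from_confusion_matrix(matrix):
    """Return accuracy = correct predictions / all predictions.

    `matrix` is a square list of lists where matrix[i][j] counts the
    cases of actual class i that were predicted as class j.
    """
    total = sum(sum(row) for row in matrix)
    if total == 0:
        raise ValueError("confusion matrix contains no cases")
    # Correct predictions sit on the main diagonal (actual == predicted).
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    return correct / total


# Hypothetical counts for three product categories (NOT the study's data).
hypothetical_matrix = [
    [30, 12, 8],   # actual category A
    [15, 20, 15],  # actual category B
    [16, 20, 14],  # actual category C
]

accuracy = accuracy_from_confusion_matrix(hypothetical_matrix)
print(f"Accuracy: {accuracy * 100:.2f}%")
```

Because accuracy only counts the diagonal, it does not show which categories are confused with each other; the off-diagonal cells of the matrix carry that information.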
Article Details

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.