Application of Logistic Regression for Income Classification Based on Socio-Economic Variables
Abstract
This study evaluates how effectively the Logistic Regression algorithm classifies individual income levels from socio-economic variables. It takes a quantitative experimental approach using the Adult Census Income dataset, which consists of demographic and employment-related attributes. Preprocessing comprises handling missing values, encoding categorical attributes, and splitting the dataset into training and testing subsets at an 80:20 ratio. The model was developed and evaluated in RapidMiner Studio to ensure a structured and reproducible workflow, and its performance was assessed using accuracy, precision, recall, F1-score, and confusion matrix analysis. The Logistic Regression model achieved an accuracy of 80.91%, indicating that it can reliably distinguish the ≤50K and >50K income categories. It performs strongly at identifying low-income individuals, but precision for the high-income class remains weaker because of class imbalance. Overall, the findings indicate that Logistic Regression remains a relevant and interpretable baseline for income classification, particularly in socio-economic analysis and policy-oriented studies where transparency and interpretability are essential.
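The evaluation metrics named in the abstract can all be derived from a binary confusion matrix. The following is a minimal Python sketch of that arithmetic, not the RapidMiner workflow used in the study. The class name `BinaryConfusion`, the helper `confusion_from_labels`, the label strings `"<=50K"` and `">50K"`, and the counts in the usage note are all assumptions introduced for illustration; none of them come from the article's results.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class BinaryConfusion:
    """Confusion-matrix counts with the high-income class as 'positive'."""
    tp: int  # predicted >50K, actually >50K
    fp: int  # predicted >50K, actually <=50K
    fn: int  # predicted <=50K, actually >50K
    tn: int  # predicted <=50K, actually <=50K

    def accuracy(self) -> float:
        total = self.tp + self.fp + self.fn + self.tn
        return (self.tp + self.tn) / total if total else 0.0

    def precision(self) -> float:
        predicted_positive = self.tp + self.fp
        return self.tp / predicted_positive if predicted_positive else 0.0

    def recall(self) -> float:
        actual_positive = self.tp + self.fn
        return self.tp / actual_positive if actual_positive else 0.0

    def f1(self) -> float:
        p, r = self.precision(), self.recall()
        return 2 * p * r / (p + r) if (p + r) else 0.0


def confusion_from_labels(
    y_true: Sequence[str], y_pred: Sequence[str], positive: str = ">50K"
) -> BinaryConfusion:
    """Tally the four confusion-matrix cells from paired label sequences."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    tp = fp = fn = tn = 0
    for actual, predicted in zip(y_true, y_pred):
        if predicted == positive:
            if actual == positive:
                tp += 1
            else:
                fp += 1
        else:
            if actual == positive:
                fn += 1
            else:
                tn += 1
    return BinaryConfusion(tp=tp, fp=fp, fn=fn, tn=tn)


# Hypothetical, imbalanced counts (NOT the study's results): 160 low-income
# and 40 high-income test records.
example = BinaryConfusion(tp=30, fp=20, fn=10, tn=140)
```

With these made-up counts, `example.accuracy()` is 0.85 while `example.precision()` for the high-income class is only 0.60. This shows how an imbalanced test set can yield high overall accuracy alongside weaker minority-class precision, which is the pattern the abstract describes.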

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.