Klasifikasi Status Batu Empedu Menggunakan KNN: Analisis Prediktif Berbasis Data Komposisi Tubuh dan Konfirmasi Diagnostik Visual (RapidMiner)
Main Article Content
Abstract
Gallstones are one of the disorders of the digestive system that are often difficult to detect in the early stages due to mild or non-specific symptoms. Delayed diagnosis can increase the risk of serious complications; therefore, a decision support method is needed to assist in classifying gallstone status quickly and accurately. This study aims to classify gallstone status using a data mining approach with the K-Nearest Neighbor (KNN) algorithm based on body composition data and visual diagnostic confirmation. The dataset used in this study consists of 319 patient records with 38 attributes, including age, gender, comorbid conditions, and other clinical indicators, with class labels representing gallstone status (true and false). The research process was conducted using RapidMiner software through several stages, including dataset retrieval, label determination, numerical-to-binominal data transformation, application of the KNN algorithm, and model performance evaluation using a confusion matrix and accuracy metrics. The experimental results show that the KNN algorithm achieved an accuracy of 52.98%. Although the accuracy obtained is considered moderate, the results indicate that the KNN algorithm can be applied as an initial method to support gallstone status classification. This study is expected to serve as a foundation for further development of medical prediction systems through improved data quality and algorithm parameter optimization
Article Details

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
References
Cover, T. M. (1968). Nearest neighbor pattern classification. IEEE Trans. Information Theory, 4(5), 515–516.
Dimitrov, A., Olteanu, A., Mcdowell, L., & Aberer, K. (2012). Topick: Accurate Topic Distillation for User Streams. 2012 IEEE 12th International Conference on Data Mining Workshops, 882–885. https://doi.org/10.1109/ICDMW.2012.47
Han, J. (2019). Data Mining : Concepts and Techniques.
Hofmann, M., & Klinkenberg, R. (2016). RapidMiner: Data mining use cases and business analytics applications. CRC Press.
Kohavi, R. (1995). A study of cross-validation and bootstrap for accuracy estimation and model selection. Ijcai, 14(2), 1137–1145.
Kotsiantis, S. B. (2007). Supervised Machine Learning : A Review of Classification Techniques. 31, 249–268.
Lemmens, A., & Croux, C. (2006). Bagging and boosting classification trees to predict churn. Journal of Marketing Research, 43(2), 276–286.
Peterson, L. E. (2009). K-nearest neighbor. Scholarpedia, 4(2), 1883.
Shaffer, E. A. (2005). Epidemiology and risk factors for gallstone disease: has the paradigm changed in the 21st century? Current Gastroenterology Reports, 7(2), 132–140.