Application of ETL and Decision Tree for Tokopedia Product Rating Classification
Abstract
The growth of e-commerce in Indonesia has increased the volume of transaction data and customer reviews. Tokopedia, one of the largest e-commerce platforms, generates large amounts of product and review data that can be analyzed to support decision-making. This study analyzes the factors that affect Tokopedia product review ratings by applying the Extract, Transform, Load (ETL) process and the Decision Tree classification method. The dataset was obtained from Kaggle and includes product, seller, and customer review information. The results indicate that variables such as price, number of products sold, seller status, and product category influence review ratings. The Decision Tree model provides a clear interpretation of the decision patterns that determine product ratings.
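The abstract does not describe the implementation, so the following is only a minimal sketch of how an ETL step could feed a decision tree classifier. Every detail is an assumption made for illustration: the field names (`price`, `sold`, `official`, `rating`), the toy rows, the high/low rating threshold of 4, and the Gini-based splitting rule are hypothetical and are not taken from the study or the Kaggle dataset.

```python
from collections import Counter

# Hypothetical raw rows: field names and values are invented for illustration
# and do not come from the dataset used in the study.
RAW_ROWS = [
    {"price": "15000", "sold": "120", "official": "yes", "rating": "5"},
    {"price": "250000", "sold": "3", "official": "no", "rating": "2"},
    {"price": "18000", "sold": "95", "official": "yes", "rating": "5"},
    {"price": "300000", "sold": "", "official": "no", "rating": "1"},  # missing value
    {"price": "22000", "sold": "40", "official": "no", "rating": "4"},
    {"price": "270000", "sold": "5", "official": "no", "rating": "2"},
]


def extract(rows):
    """Extract: a real pipeline would read a CSV file; here we copy the rows."""
    return [dict(r) for r in rows]


def transform(rows):
    """Transform: drop incomplete rows, cast types, bin the rating into high/low."""
    clean = []
    for r in rows:
        if any(v == "" for v in r.values()):
            continue  # skip rows with missing fields
        features = [
            float(r["price"]),
            float(r["sold"]),
            1.0 if r["official"] == "yes" else 0.0,
        ]
        label = "high" if int(r["rating"]) >= 4 else "low"  # assumed threshold
        clean.append((features, label))
    return clean


def load(records):
    """Load: separate the records into a feature matrix X and a label list y."""
    return [f for f, _ in records], [lab for _, lab in records]


def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def best_split(X, y):
    """Return (weighted_gini, feature_index, threshold) of the best split, or None."""
    best = None
    for j in range(len(X[0])):
        for t in sorted({row[j] for row in X}):
            left = [y[i] for i, row in enumerate(X) if row[j] <= t]
            right = [y[i] for i, row in enumerate(X) if row[j] > t]
            if not left or not right:
                continue
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(y)
            if best is None or score < best[0]:
                best = (score, j, t)
    return best


def build_tree(X, y, depth=0, max_depth=3):
    """Recursively build a tree; leaves are majority class labels."""
    split = best_split(X, y) if len(set(y)) > 1 and depth < max_depth else None
    if split is None:
        return Counter(y).most_common(1)[0][0]
    _, j, t = split
    left = [i for i, row in enumerate(X) if row[j] <= t]
    right = [i for i, row in enumerate(X) if row[j] > t]
    return {
        "feature": j,
        "threshold": t,
        "left": build_tree([X[i] for i in left], [y[i] for i in left], depth + 1, max_depth),
        "right": build_tree([X[i] for i in right], [y[i] for i in right], depth + 1, max_depth),
    }


def predict(tree, row):
    """Walk from the root to a leaf and return the leaf's class label."""
    while isinstance(tree, dict):
        branch = "left" if row[tree["feature"]] <= tree["threshold"] else "right"
        tree = tree[branch]
    return tree


X, y = load(transform(extract(RAW_ROWS)))
tree = build_tree(X, y)
```

Because each internal node is a single threshold test on one variable, the fitted `tree` can be read directly as a set of if/else rules, which is the kind of interpretability the abstract attributes to the Decision Tree model. A production pipeline would more likely rely on an established library than on this hand-written splitter.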
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.