ESTIMATING OF HOUSEHOLDS SHOPPING ON THE INTERNET USING RANDOM FOREST METHOD Cover Image

İNTERNETTEN ALIŞVERİŞ YAPAN HANELERİN RASTGELE ORMAN YÖNTEMİYLE TAHMİN EDİLMESİ
ESTIMATING OF HOUSEHOLDS SHOPPING ON THE INTERNET USING RANDOM FOREST METHOD

Author(s): Uğur Ercan
Subject(s): Methodology and research technology, Marketing / Advertising, ICT Information and Communications Technologies
Published by: Kafkas Üniversitesi Sağlık, Kültür ve Spor Daire Başkanlığı Dijital Baskı Merkezi
Keywords: Random forest; shop on the internet; imbalanced dataset; SMOTE; random undersampling;

Summary/Abstract: The aim of the study is to determine the households shopping online in Turkey. During the modeling phase, the Random Forest method, which is frequently preferred in classification problems, was used. The data set in the TÜİK 2019 Household Budget Survey and gathered from 11521 households was used. The data set of the study was balanced with SMOTE and Random Undersampling methods. The cross-validation method was used to increase the accuracy of the study. The performances of the established models were compared and interpreted, and it was shown that the classifier performance could be increased with the correct use of sampling methods and cross-validation. In the training dataset, the model established by applying the SMOTE method was found to be more successful than the results of all criteria (F, DP, G-Means and MCC ) compared to other models. In the test data set, while it was observed that the model with the SMOTE method was more successful than the results of the F and MCC criteria, the model established with the Undersampling method was more successful according to the result of the G-Means criterion, and the model created without using any method was found to be successful according to the result of the DP criterion.

  • Issue Year: 12/2021
  • Issue No: 24
  • Page Range: 728-752
  • Page Count: 25
  • Language: Turkish
Toggle Accessibility Mode