Comparison of Classification Data Mining C4.5 and Naïve Bayes Algorithms of EDM Dataset
Comparison of Classification Data Mining C4.5 and Naïve Bayes Algorithms of EDM Dataset
Author(s): Joseph Teguh Santoso, Ni Luh Wiwik Sri Rahayu Ginantra, Muhammad Arif, R Riinawati, Dadang Sudrajat, Robbi RahimSubject(s): ICT Information and Communications Technologies
Published by: UIKTEN - Association for Information Communication Technology Education and Science
Keywords: Comparison; data mining; Classification; C4.5; Naive Bayes; Performance; EDM;
Summary/Abstract: The purpose of this research is to choose the best method by comparing two classification methods of data mining C4.5 and Naïve Bayes on Educational Data Mining, in which the data used is student graduation data consisting of 79 records. Both methods are tested for validation with 10-ford X Validation and perform a T-Test difference test to produce a table that contains the best method ranking. Different results were obtained for each method. Based on the results of these two methods, it is very influential on the dataset and the value of the area under curve in the Naïve Bayes method is better than the C4.5 method in various datasets. Comparison of the method with the 10-Ford X Validation test and the T-Test difference test is that the Naïve Bayes method is better than C4.5 with an average accuracy value of 73.41% and an under-curve area of 0.664.
Journal: TEM Journal
- Issue Year: 10/2021
- Issue No: 4
- Page Range: 1738-1744
- Page Count: 7
- Language: English