Comparison of decision trees used in data mining

Aksu, Gökhan; Doğan, Nuri

doi:10.14527/pegegog.2019.039

Aksu G., DOĞAN N.

PEGEM EGITIM VE OGRETIM DERGISI, vol.9, no.4, pp.1183-1208, 2019 (ESCI, Scopus, TRDizin)

Publication Type: Article / Full Article
Volume: 9 Issue: 4
Publication Date: 2019
DOI: 10.14527/pegegog.2019.039
Journal Name: PEGEM EGITIM VE OGRETIM DERGISI
Journal Indexes: Emerging Sources Citation Index (ESCI), Scopus, TR DİZİN (ULAKBİM)
Page Numbers: pp.1183-1208
Open Archive Collection: AVESİS Open Access Collection
Affiliated with Hacettepe University: Yes

Abstract

This study compares, against several criteria, the decision trees produced by data mining algorithms that have been used in various fields in recent years. Decision trees built by different methods to classify students as successful or unsuccessful in science literacy were examined for similarities and differences, using 12 independent variables from the PISA 2015 student survey. Data collected across Turkey from a total of 5895 students in the 15-year-old age group were analyzed in Weka, an open-source, Java-based software package. The analysis showed that the most successful algorithms in terms of correct classification rate were, in order, Logistic Model, Hoeffding Tree, J48, REPTree and Random Tree. The variables that were effective in classifying the students also differed across the decision trees obtained by the different learning algorithms. On this basis, it is suggested that researchers report the findings of more than one algorithm rather than those of a single algorithm.