Adıyaman Üniversitesi Kurumsal Arşivi

Comparison of Data Mining Classification Algorithms on Educational Data under Different Conditions

Basit öğe kaydını göster

dc.contributor.author Koyuncu, İlhan
dc.contributor.author Gelbal, Selahattin
dc.date.accessioned 2025-08-11T07:35:41Z
dc.date.available 2025-08-11T07:35:41Z
dc.date.issued 2020
dc.identifier.issn 1309-6575
dc.identifier.uri http://dspace.adiyaman.edu.tr:8080/xmlui/handle/20.500.12414/6577
dc.description.abstract The purpose of this study was to examine the performance of Naive Bayes, k-nearest neighborhood, neural networks, and logistic regression analysis in terms of sample size and test data rate in classifying students according to their mathematics performance. The target population was 62728 students in the 15-year-old group who were participated in the Programme for International Student Assessment (PISA) in 2012 from The Organisation for Economic Co-operation and Development (OECD) countries. The performance of each algorithm was tested by using 11%, 22%, 33%, 44% and 55% of each dataset for small (500 students), medium (1000 students) and large (5000 students) sample sizes. 100 replications were performed for each analysis. As the evaluation criteria, accuracy rates, RMSE values, and total elapsed time were used. RMSE values for each algorithm were statistically compared by using Friedman and Wilcoxon tests. The results revealed that while the classification performance of the methods increased as the sample size increased, the increase of training data ratio had different effects on the performance of the algorithms. The Naive Bayes showed high performance even in small samples, performed the analyzes very quickly, and was not affected by the change in the training data ratio. Logistic regression analysis was the most effective method in large samples but had a poor performance in small samples. While neural networks showed a similar tendency, its overall performance was lower than Naive Bayes and logistic regression. The lowest performances in all conditions were obtained by the k-nearest neighborhood algorithm. tr
dc.language.iso en tr
dc.publisher ASSOC MEASUREMENT & EVALUATION EDUCATION & PSYCHOLOGYASSOC MEASUREMENT & EVALUATION EDUCATION & PSYCHOLOGY, tr
dc.subject Artificial neural networks tr
dc.subject educational data mining tr
dc.subject k-nearest neighborhood tr
dc.subject logistic regression tr
dc.subject naive Bayes tr
dc.title Comparison of Data Mining Classification Algorithms on Educational Data under Different Conditions tr
dc.type Article tr
dc.contributor.authorID 0000-0002-0009-5279 tr
dc.contributor.department Adiyaman Univ, Fac Educ tr
dc.contributor.department Hacettepe Univ, Fac Educ tr
dc.identifier.endpage 345 tr
dc.identifier.issue 4 tr
dc.identifier.startpage 325 tr
dc.identifier.volume 11 tr
dc.source.title JOURNAL OF MEASUREMENT AND EVALUATION IN EDUCATION AND PSYCHOLOGY-EPOD tr


Bu öğenin dosyaları:

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Basit öğe kaydını göster