Investigation of Measurement Precision and Test Length in Computerized Adaptive Tests under Different Conditions


Yildiz H., Demir C. T., Ulku S., Giray G., KELECİOĞLU H.

JOURNAL OF MEASUREMENT AND EVALUATION IN EDUCATION AND PSYCHOLOGY-EPOD, vol.15, no.1, pp.5-17, 2024 (ESCI)

Abstract

This study examines how item exposure rate, content balancing, and ability estimation method, under different termination rules, affect test length and measurement accuracy in computerized adaptive testing. To this end, the EAP and MLE ability estimation methods were compared in terms of correlation, bias, RMSE, and test length. The two methods were compared across a total of 72 conditions, defined by content balancing patterns with 1, 2, and 4 groups; exposure rates of 0.50, 0.75, and 1.00; and termination rules based on a standard error of 0.35 or 0.40 or on a fixed test length of 15 or 30 items. The research is a Monte Carlo simulation study conducted within the relational survey model, a quantitative research method. Data generation and analysis were performed in RStudio. The best measurement performance was obtained with a fixed test length of 30 items with a 0.35 standard error, in the one-group pattern, where content balancing imposes no group restriction, and with exposure rates in the range of 0.75 to 1.00. Depending on the ability estimation method, content balancing pattern, and exposure rate, the number of items administered ranged from 22 to 25; for the standard-error-based termination rule, a standard error of at least 0.40 was estimated with 39 items.
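
The design described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' code (the study itself was run in RStudio): it assumes that the 72 conditions arise from fully crossing 2 estimation methods, 3 content balancing patterns, 3 exposure rates, and 4 termination rules, and it uses common textbook definitions of bias, RMSE, and Pearson correlation, which the abstract does not spell out. All names (`design_grid`, `bias`, `rmse`, `correlation`) are hypothetical.

```python
import itertools
import math

# Factor levels as listed in the abstract; variable names are illustrative.
ESTIMATORS = ["EAP", "MLE"]
CONTENT_GROUPS = [1, 2, 4]
EXPOSURE_RATES = [0.50, 0.75, 1.00]
# Assumption: the two standard-error rules and the two fixed-length rules
# together form four alternative termination rules.
TERMINATION_RULES = [("se", 0.35), ("se", 0.40), ("length", 15), ("length", 30)]


def design_grid():
    """Return every combination of the four factors (assumed fully crossed)."""
    return list(
        itertools.product(
            ESTIMATORS, CONTENT_GROUPS, EXPOSURE_RATES, TERMINATION_RULES
        )
    )


def bias(true_theta, est_theta):
    """Mean signed error of the ability estimates (assumed definition)."""
    n = len(true_theta)
    return sum(e - t for t, e in zip(true_theta, est_theta)) / n


def rmse(true_theta, est_theta):
    """Root mean squared error of the ability estimates."""
    n = len(true_theta)
    return math.sqrt(sum((e - t) ** 2 for t, e in zip(true_theta, est_theta)) / n)


def correlation(x, y):
    """Pearson correlation between true and estimated abilities."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)
```

In a simulation of this kind, each condition returned by `design_grid()` would be run over many simulated examinees, and the three functions would then summarize how closely the estimated abilities track the generating ones.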