
Showing 1–3 of 3 results for author: Gurdil, H

  1. arXiv:2503.15525

    cs.HC cs.AI cs.CY

    The Use of Artificial Intelligence Tools in Assessing Content Validity: A Comparative Study with Human Experts

    Authors: Hatice Gurdil, Hatice Ozlem Anadol, Yesim Beril Soguksu

    Abstract: In this study, it was investigated whether AI evaluators assess the content validity of B1-level English reading comprehension test items in a manner similar to human evaluators. A 25-item multiple-choice test was developed, and these test items were evaluated by four human and four AI evaluators. No statistically significant difference was found between the scores given by human and AI evaluators…

    Submitted 3 February, 2025; originally announced March 2025.

  2. arXiv:2412.16657

    cs.CY stat.OT

    A Comprehensive Guide to Item Recovery Using the Multidimensional Graded Response Model in R

    Authors: Yesim Beril Soguksu, Ayse Bilicioglu Gunes, Hatice Gurdil

    Abstract: The purpose of this study is to provide a step-by-step demonstration of item recovery for the Multidimensional Graded Response Model (MGRM) in R. Within this scope, a sample simulation design was constructed where the test lengths were set to 20 and 40, the interdimensional correlations were varied as 0.3 and 0.7, and the sample size was fixed at 2000. Parameter estimates were derived from the gen…

    Submitted 24 December, 2024; v1 submitted 21 December, 2024; originally announced December 2024.

  3. arXiv:2402.01731

    cs.CY

    Integration of Artificial Intelligence in Educational Measurement: Efficacy of ChatGPT in Data Generation within the Scope of Item Response Theory

    Authors: Hatice Gurdil, Yesim Beril Soguksu, Salih Salihoglu, Fatma Coskun

    Abstract: The aim of this study is to investigate the effectiveness of ChatGPT 3.5 in developing algorithms for data generation within the framework of Item Response Theory (IRT) using the R programming language. In this context, validity examinations were conducted on data sets generated according to the Two-Parameter Logistic Model (2PLM) with algorithms written by ChatGPT 3.5 and researchers. These exami…

    Submitted 3 July, 2024; v1 submitted 28 January, 2024; originally announced February 2024.