Statistical and qualitative analysis of ChatGPT and human raters in preservice teachers' writing assessment


Date

2026

Journal Title

Journal ISSN

Volume Title

Publisher

Izzet Kara

Access Rights

info:eu-repo/semantics/openAccess

Abstract

Teachers spend a significant amount of time providing feedback. This study compared expert and ChatGPT assessments of, and feedback on, written texts to determine whether AI is suitable for assessing writing skills, which are time-consuming to grade and give feedback on. Three experts and ChatGPT graded 14 Turkish undergraduate students' assignments using a rubric covering content, language use, vocabulary, organization, and mechanics, and justified their decisions. The study employed a qualitative design based on document review and triangulation. In addition, the intraclass correlation coefficient was used to assess the consistency between ChatGPT's and the experts' scores. All feedback was qualitatively analyzed to identify the experts' strengths and weaknesses and their similarities to ChatGPT. Experts and ChatGPT showed weak to moderate consistency on the writing subscales, while good reliability was found for the total score. Experts excelled in 'explanatory feedback', 'interpretation', and 'experience', while ChatGPT excelled in 'automation and continuity' and 'data processing capacity'. The experts' weaknesses included 'limited time and energy' and 'comparison bias', while ChatGPT's weaknesses were 'ambiguous expressions' and 'repetition'. The study also found that both the experts and ChatGPT preferred to provide constructive and supportive feedback.

Description

Keywords

Artificial intelligence, ChatGPT, Writing feedback, Human raters

Source

International Journal of Assessment Tools in Education

WoS Q Value

Q3

Scopus Q Value

Volume

13

Issue

1

Citation