Zubaer, A.A., Granitzer, M., Geschwind, S., Graf Lambsdorff, J., Voss, D. GPT-4 shows comparable performance to human examiners in ranking open-text answers. Sci Rep 15, 35045 (2025). https://doi.org/10.1038/s41598-025-21572-8
Zubaer, A.A., Geschwind, S., Voss, D., Wendlinger, L., Graf Lambsdorff, J., Granitzer, M., Mitrovic, J. Comparative Study of Language Models and Prompt Paradigms in Short Answer Grading. 2025 IEEE 37th International Conference on Tools with Artificial Intelligence (ICTAI), Athens, Greece (2025). doi: 10.1109/ICTAI66417.2025.00049