Development and evaluation of an artificial intelligence-based model for assessment improvement in Brazil's national online Continuing Medical Education program

Alisson Oliveira dos Santos; Tales Mota Machado; Josué de Lacerda Silva; João Paulo Valadares Vilaça; Henrique Pereira Alves; Moreno Magalhães de Souza Rodrigues; Leonardo Cançado Monteiro Savassi; Adelson Guaraci Jantsch; Alysson Feliciano Lemos

PDF

Published: Mar 30, 2026

Keywords:

Large Language Models, Continuing education, Familly Medicine, mentoring

Alisson Oliveira dos Santos

MD, PhD, Professor, Universidade Federal de Mato Grosso do Sul, Campus Três Lagoas, Três Lagoas, Brazil

https://orcid.org/0000-0002-4648-9951

Tales Mota Machado

MSc, IT Technician, Universidade Federal de Ouro Preto, Ouro Preto, Brazil

https://orcid.org/0000-0003-0603-823X

Josué de Lacerda Silva

BS, Technology Coordinator, Open University of SUS, Brasilia, Brazil

João Paulo Valadares Vilaça

BS, Systems Analyst, Open University of SUS, Brasília, Brazil

Henrique Pereira Alves

BS, Data Analysis Specialist, Open University of SUS, Brasilia, Brazil

Moreno Magalhães de Souza Rodrigues

PhD, Public Health Researcher, Center for Data and Knowledge Integration in Health, Fiocruz-Rondônia, Porto Velho, Brazil

https://orcid.org/0000-0002-1594-2311

Leonardo Cançado Monteiro Savassi

MD, PhD, Associate Professor, Federal University of Ouro Preto, Ouro Preto, Brazil

https://orcid.org/0000-0001-6780-0377

Adelson Guaraci Jantsch

MD, PhD, Researcher, Open University of SUS, Brasila, Brazil

https://orcid.org/0000-0002-3012-5619

Alysson Feliciano Lemos

MSc, Coordinator of Program and Project Evaluation and Monitoring, Executive Secretariat, Open University of SUS, Brasilia, Brazil

https://orcid.org/0000-0002-9451-2546

Abstract

Background:

The challenge of providing timely, high-quality feedback to thousands of healthcare professionals enrolled in distance-based Family Medicine specialization programs in Brazil creates an opportunity for artificial intelligence implementation. This study aimed to evaluate the effectiveness of Large Language Models (LLMs) in assisting tutors with student assessment in these programs.

Methods:

We implemented GPT-4o to analyze student responses to practical challenges in a Family Medicine distance education course. The system was structured through dataset preparation (518 student responses), prompt engineering, fine-tuning, and Retrieval-Augmented Generation. Evaluation included: human expert assessment using a 5-item Likert questionnaire (n=26 responses); metrics-based analysis comparing text length between LLM and tutor feedback (n=104); semantic similarity analysis between tutor- and LLM-generated texts (n=11); and comparison of scores assigned by tutors versus LLM (n=104).

Results:

Expert assessment showed high ratings for clarity (100% scoring "strongly agree") but lower scores regarding LLM's ability to replace tutors. LLM-generated feedback was significantly longer than tutors' (mean 190.11 vs. 109.69 words, p<.001). Semantic similarity between LLM and tutor responses was high (mean 85.92%). LLM-assigned scores differed slightly but significantly from tutor scores (mean 8.31 vs. 8.80, p<.001).

Discussion:

LLMs can generate clear, semantically aligned feedback and assign grades that approximate tutor scoring, offering a scalable enhancement to assessment in distance‑based medical education. Nevertheless, they should be seen as a complement to human tutors rather than a replacement, especially where nuanced, contextualized guidance is required. Careful attention to regional language variation and domain‑specific content will be essential for the safe, equitable integration of AI into continuing professional development.

Downloads

Download data is not yet available.

How to Cite

Oliveira dos Santos, A., Mota Machado, T., de Lacerda Silva, J., Valadares Vilaça, J. P., Pereira Alves, H., Magalhães de Souza Rodrigues, M., … Feliciano Lemos, A. (2026). Development and evaluation of an artificial intelligence-based model for assessment improvement in Brazil’s national online Continuing Medical Education program. Education for Health, 39(1). Retrieved from https://educationforhealthjournal.org/index.php/efh/article/view/441

Issue

Vol. 39 No. 1 (2026): Vol. 39 No. 1: January-March 2026

Section

Original Research Paper

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Creative Commons Attribution 4.0 International license

Article Sidebar

Main Article Content

Abstract

Downloads

Article Details