Evaluation of the approaches of different specialties and artificial intelligence systems on traumatic dental injuries

ŞAHİN, GONCA; EYMİRLİ, AYHAN

doi:10.1186/s12903-026-08106-y

Evaluation of the approaches of different specialties and artificial intelligence systems on traumatic dental injuries

ŞAHİN G. Z., EYMİRLİ A.

BMC Oral Health, cilt.26, sa.1, 2026 (SCI-Expanded, Scopus)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 26 Sayı: 1
Basım Tarihi: 2026
Doi Numarası: 10.1186/s12903-026-08106-y
Dergi Adı: BMC Oral Health
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, CINAHL, MEDLINE, Directory of Open Access Journals
Anahtar Kelimeler: Artificial intelligence, Dental specialists, Knowledge assessment, Large language models, Traumatic dental injuries
Hacettepe Üniversitesi Adresli: Evet

Özet

Aim: This study aimed to evaluate and compare the knowledge of three artificial intelligence (AI) systems and three dental specialist groups regarding the management of traumatic dental injuries (TDIs). Methodology: An online survey consisting of three sections was administered: (1) an information sheet with electronic informed consent, (2) five sociodemographic questions, and (3) thirty multiple-choice questions assessing TDI knowledge based on the International Association of Dental Traumatology (IADT) guidelines and current literature. The survey was distributed via e-mail to specialists in endodontics, pediatric dentistry, and oral and maxillofacial surgery through the Turkish Dental Association database. For the AI systems (ChatGPT-4o, Gemini Advanced, and DeepSeek-V3), the 30-item questionnaire was administered under a standardized prompt three times per day over 10 consecutive days, and all responses were recorded. Internal consistency and reliability were assessed using Cronbach’s alpha and intraclass correlation coefficients (ICC). Statistical analyses were performed using one-way ANOVA and independent-samples t-tests (p < 0.05). Results: A total of 95 specialists participated in the study (64 females, 31 males). Female participants demonstrated significantly higher correct-response rates than males. Participants classified as having very good knowledge achieved significantly higher scores than those with acceptable or low knowledge levels. No statistically significant differences were observed according to professional experience or workplace setting. Significant differences were found among specialist groups, with pediatric dentistry specialists achieving the highest correct-response rate. The overall mean accuracy among all specialists was 70.98% (SD = 15.6). AI systems performed as follows: ChatGPT-4o, 86% (SD = 3.2); Gemini Advanced, 79.33% (SD = 3.75); and DeepSeek-V3, 72.67% (SD = 4.32). Conclusions: AI systems demonstrated higher overall accuracy than human participants; however, certain items resulted in misleading responses, highlighting the importance of case-specific interpretation. AI tools may serve as supportive adjuncts in clinical decision-making, but further rigorously designed studies are required to clarify their optimal role and limitations in TDI management.