DefinePK

DefinePK hosts the largest index of Pakistani journals, research articles, news headlines, and videos. It also offers chapter-level book search.

Comparing artificial intelligence and human intelligence in diagnosing oral lesions


Article Information

Title: Comparing artificial intelligence and human intelligence in diagnosing oral lesions

Authors: Aziz Ali Khowaja, Tarwani Bhart, Raja Muhammad Daniyal, Naeemullah Awan, Ayesha Abbasi, Hasan Mehdi

Journal: Pakistan Journal of Pathology

HEC Recognition History
Category From To
Y 2023-07-01 2024-09-30
Y 2022-07-01 2023-06-30
Y 2021-07-01 2022-06-30
Y 2020-07-01 2021-06-30
Y 1900-01-01 2005-06-30

Publisher: Pakistan Association of Pathologists

Country: Pakistan

Year: 2025

Volume: 36

Issue: 3

Language: en

DOI: 10.55629/pakjpathol.v36i3.946

Keywords: ARTIFICIAL INTELLIGENCELarge Language ModelsDiagnosisHouse officersOral lesions

Categories

Abstract

Objective: To evaluate the diagnostic accuracy of artificial intelligence (AI) tools compared with human clinicians in identifying oral lesions.
Material and Methods: A comparative cross-sectional diagnostic accuracy (quantitative) study research has been conducted at the Department of Oral and Maxillofacial Surgery, Fatima Jinnah Dental College, Jinnah Hospital, Karachi, following approval from the ethical board, from November 2023 to January 2024.involved an examination consisting of 20 MCQs. These questions had options of responses titled A to D. The exam, aligned as multiple-choice questionnaire by an expert of that content field, was administered via Google Forms.
Results: Among the 35 participants, 14.3% were LLMs and 85.7% were house officers, with the mean ages of house officer’s 29.07±4.05 years. The majority of participants were male. In terms of diagnostic accuracy, LLMs and house officers performed similarly, with correct answer rates of 80.0% and 83.3%, respectively, and no statistically significant difference (p=0.855). Additionally, artificial intelligence tools demonstrated varying efficiencies, with OpenAI achieving 91.4% correct answers, PopAI at 77.1%, and Gemini at 80.0%.
Conclusion: The findings notably demonstrate that LLMs exhibit a level of performance that is on par with that of senior dental house officers, thus underlining their promise as a valuable tool in augmenting diagnostic capabilities within clinical settings.
Keywords: Artificial intelligence, Diagnosis, Human intelligence, Large language models, Oral lesions.


Paper summary is not available for this article yet.

Loading PDF...

Loading Statistics...