Affiliations 

  • 1 Department of Otorhinolaryngology-Head and Neck Surgery, Faculty of Medicine, Universiti Kebangsaan Malaysia (UKM), Hospital Canselor Tuanku Muhriz, Jalan Yaacob Latif, Bandar Tun Razak, 56000, Kuala Lumpur, Malaysia
  • 2 Faculty of Electrical Engineering, Universiti Teknologi Malaysia (UTM), 81310, Johor Bahru, Johor, Malaysia
  • 3 College of Information Technology, Imam Ja'afar Al-Sadiq University, Al-Muthanna, 66001, Iraq
  • 4 Department of Otorhinolaryngology-Head and Neck Surgery, Faculty of Medicine, Universiti Kebangsaan Malaysia (UKM), Hospital Canselor Tuanku Muhriz, Jalan Yaacob Latif, Bandar Tun Razak, 56000, Kuala Lumpur, Malaysia. marinamatbaki@ppukm.ukm.edu.my
J Otolaryngol Head Neck Surg, 2023 Sep 20;52(1):62.
PMID: 37730624 DOI: 10.1186/s40463-023-00661-6

Abstract

BACKGROUND: A multidimensional voice quality assessment is recommended for all patients with dysphonia, which requires a patient visit to the otolaryngology clinic. The aim of this study was to determine the accuracy of an online artificial intelligence classifier, the Online Sequential Extreme Learning Machine (OSELM), in detecting voice pathology. In this study, a Malaysian Voice Pathology Database (MVPD), which is the first Malaysian voice database, was created and tested.

METHODS: The study included 382 participants (252 normal voices and 130 dysphonic voices) in the proposed database MVPD. Complete data were obtained for both groups, including voice samples, laryngostroboscopy videos, and acoustic analysis. The diagnoses of patients with dysphonia were obtained. Each voice sample was anonymized using a code that was specific to each individual and stored in the MVPD. These voice samples were used to train and test the proposed OSELM algorithm. The performance of OSELM was evaluated and compared with other classifiers in terms of the accuracy, sensitivity, and specificity of detecting and differentiating dysphonic voices.

RESULTS: The accuracy, sensitivity, and specificity of OSELM in detecting normal and dysphonic voices were 90%, 98%, and 73%, respectively. The classifier differentiated between structural and non-structural vocal fold pathology with accuracy, sensitivity, and specificity of 84%, 89%, and 88%, respectively, while it differentiated between malignant and benign lesions with an accuracy, sensitivity, and specificity of 92%, 100%, and 58%, respectively. Compared to other classifiers, OSELM showed superior accuracy and sensitivity in detecting dysphonic voices, differentiating structural versus non-structural vocal fold pathology, and between malignant and benign voice pathology.

CONCLUSION: The OSELM algorithm exhibited the highest accuracy and sensitivity compared to other classifiers in detecting voice pathology, classifying between malignant and benign lesions, and differentiating between structural and non-structural vocal pathology. Hence, it is a promising artificial intelligence that supports an online application to be used as a screening tool to encourage people to seek medical consultation early for a definitive diagnosis of voice pathology.

* Title and MeSH Headings from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.