Affiliations 

  • 1 Computer Science Department, Southern Connecticut State University, New Haven, CT 06514, USA
  • 2 Department of Computer Systems Engineering, Arab American University, Jenin P.O. Box 240, Palestine
  • 3 Department of Pulmonary, Critical Care & Sleep Medicine, Texas A&M University, College Station, TX 77843, USA
  • 4 Health Management and Informatics Department, School of Medicine, University of Missouri, Columbia, MO 65212, USA
  • 5 Department of Computer Science, Al-Balqa Applied University, Salt 19117, Jordan
  • 6 Faculty of Electrical Engineering, Universiti Teknikal Malaysia Melaka, Hang Tuah Jaya, Durian Tunggal 76100, Melaka, Malaysia
  • 7 Center of Medical Informatics and Enterprise Analytics, University of Kansas Medical Center, Kansas City, KS 66160, USA
  • 8 Department of Computer Science, Birzeit University, Birzeit P.O. Box 14, Palestine
  • 9 Faculty of Information Technology, Sebha University, Sebha 18758, Libya
  • 10 Pulmonary, Critical Care & Sleep Medicine, Sutter Health, Tracy, CA 95376, USA
Diagnostics (Basel), 2023 Jul 20;13(14).
PMID: 37510161 DOI: 10.3390/diagnostics13142417

Abstract

Obstructive sleep apnea (OSA) is a prevalent sleep disorder that affects approximately 3-7% of males and 2-5% of females. In the United States alone, 50-70 million adults suffer from various sleep disorders. OSA is characterized by recurrent episodes of breathing cessation during sleep, thereby leading to adverse effects such as daytime sleepiness, cognitive impairment, and reduced concentration. It also contributes to an increased risk of cardiovascular conditions and adversely impacts patient overall quality of life. As a result, numerous researchers have focused on developing automated detection models to identify OSA and address these limitations effectively and accurately. This study explored the potential benefits of utilizing machine learning methods based on demographic information for diagnosing the OSA syndrome. We gathered a comprehensive dataset from the Torr Sleep Center in Corpus Christi, Texas, USA. The dataset comprises 31 features, including demographic characteristics such as race, age, sex, BMI, Epworth score, M. Friedman tongue position, snoring, and more. We devised a novel process encompassing pre-processing, data grouping, feature selection, and machine learning classification methods to achieve the research objectives. The classification methods employed in this study encompass decision tree (DT), naive Bayes (NB), k-nearest neighbor (kNN), support vector machine (SVM), linear discriminant analysis (LDA), logistic regression (LR), and subspace discriminant (Ensemble) classifiers. Through rigorous experimentation, the results indicated the superior performance of the optimized kNN and SVM classifiers for accurately classifying sleep apnea. Moreover, significant enhancements in model accuracy were observed when utilizing the selected demographic variables and employing data grouping techniques. For instance, the accuracy percentage demonstrated an approximate improvement of 4.5%, 5%, and 10% with the feature selection approach when applied to the grouped data of Caucasians, females, and individuals aged 50 or below, respectively. Furthermore, a comparison with prior studies confirmed that effective data grouping and proper feature selection yielded superior performance in OSA detection when combined with an appropriate classification method. Overall, the findings of this research highlight the importance of leveraging demographic information, employing proper feature selection techniques, and utilizing optimized classification models for accurate and efficient OSA diagnosis.

* Title and MeSH Headings from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.