Affiliations 

  • 1 Royal College of Surgeons in Ireland, Faculty of Medicine and Health Science, Dublin, Ireland
  • 2 Sumy State University, Sumy, Ukraine
  • 3 Government Medical College and Hospital, Chandigarh, India
  • 4 Faculty of Medicine and Health Sciences, University of Putra Malaysia, Serdang, Malaysia
  • 5 Faculty of Medicine, University of Debrecen, Debrecen, Hungary
  • 6 Department of Biochemistry and Microbiology, Rutgers University-New Brunswick, New Jersey
  • 7 Drexel University College of Medicine, Philadelphia, Pennsylvania, USA
  • 8 Department of Biochemistry and Molecular Biology, Faculty of Life Science, Mawlana Bhashani Science and Technology University, Tangail, Bangladesh
Ann Med Surg (Lond), 2023 Jun;85(6):2743-2748.
PMID: 37363524 DOI: 10.1097/MS9.0000000000000743

Abstract

The emergence of genome-wide association studies (GWAS) has identified genetic traits and polymorphisms that are associated with the progression of nonalcoholic fatty liver disease (NAFLD). Phospholipase domain-containing 3 and transmembrane 6 superfamily member 2 are genes commonly associated with NAFLD phenotypes. However, there are fewer studies and replicability in lesser-known genes such as LYPLAL1 and glucokinase regulator (GCKR). With the advent of artificial intelligence (AI) in clinical genetics, studies have utilized AI algorithms to identify phenotypes through electronic health records and utilize convolution neural networks to improve the accuracy of variant identification, predict the deleterious effects of variants, and conduct phenotype-to-genotype mapping. Natural language processing (NLP) and machine-learning (ML) algorithms are popular tools in GWAS studies and connect electronic health record phenotypes to genetic diagnoses using a combination of international classification disease (ICD)-based approaches. However, there are still limitations to machine-learning - and NLP-based models, such as the lack of replicability in larger cohorts and underpowered sample sizes, which prevent the accurate prediction of genetic variants that may increase the risk of NAFLD and its progression to advanced-stage liver fibrosis. This may be largely due to the lack of understanding of the clinical consequence in the majority of pathogenic variants. Though the concept of evolution-based AI models and evolutionary algorithms is relatively new, combining current international classification disease -based NLP models with phylogenetic and evolutionary data can improve prediction accuracy and create valuable connections between variants and their pathogenicity in NAFLD. Such developments can improve risk stratification within clinical genetics and research while overcoming limitations in GWAS studies that prevent community-wide interpretations.

* Title and MeSH Headings from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.