Affiliations 

  • 1 Yale Center for Genome Analysis (YCGA), Yale University, New Haven, CT, USA
  • 2 Department of Biological Sciences, Sunway University, Petaling Jaya, Malaysia
  • 3 College of Public Health, The University of Iowa, Iowa City, IA, USA
Chem Biol Drug Des, 2020 09;96(3):995-1004.
PMID: 32410355 DOI: 10.1111/cbdd.13672

Abstract

Biomarkers can offer great promise for improving prevention and treatment of complex diseases such as cancer, cardiovascular diseases, and diabetes. These can be used as either diagnostic or predictive or as prognostic biomarkers. The revolution brought about in biological big data analytics by artificial intelligence (AI) has the potential to identify a broader range of genetic differences and support the generation of more robust biomarkers in medicine. AI is invigorating biomarker research on various fronts, right from the cataloguing of key mutations driving the complex diseases like cancer to the elucidation of molecular networks underlying diseases. In this study, we have explored the potential of AI through machine learning approaches to propose that these methods can act as recommendation systems to sort and prioritize important genes and finally predict the presence of specific biomarkers. Essentially, we have utilized microarray datasets from open-source databases, like GEO, for breast, lung, colon, and ovarian cancer. In this context, different clustering analyses like hierarchical and k-means along with random forest algorithm have been utilized to classify important genes from a pool of several thousand genes. To this end, network centrality and pathway analysis have been implemented to identify the most potential target as CREB1.

* Title and MeSH Headings from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.