Affiliations 

  • 1 Department of Mathematical Sciences, Universiti Teknologi Malaysia , Skudai, Malaysia
  • 2 Department of Statistics and Informatics, University of Mosul , Mosul, Iraq
  • 3 Department of Chemistry, Faculty of Science, Universiti Teknologi Malaysia , Johor, Malaysia
SAR QSAR Environ Res, 2020 Aug;31(8):571-583.
PMID: 32628042 DOI: 10.1080/1062936X.2020.1782467

Abstract

One of the most challenging issues when facing a Quantitative structure-activity relationship (QSAR) classification model is to deal with the descriptor selection. Penalized methods have been adapted and have gained popularity as a key for simultaneously performing descriptor selection and QSAR classification model estimation. However, penalized methods have drawbacks such as having biases and inconsistencies that make they lack the oracle properties. This paper proposes an adaptive penalized logistic regression (APLR) to overcome these drawbacks. This is done by employing a ratio (BWR) of the descriptors between-groups sum of squares (BSS) to the within-groups sum of squares (WSS) for each descriptor as a weight inside the L1-norm. The proposed method was applied to one dataset that consists of a diverse series of antimicrobial agents with their respective bioactivities against Candida albicans. By experimental study, it has been shown that the proposed method (APLR) was more efficient in the selection of descriptors and classification accuracy than the other competitive methods that could be used in developing QSAR classification models. Another dataset was also successfully experienced. Therefore, it can be concluded that the APLR method had significant impact on QSAR analysis and studies.

* Title and MeSH Headings from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.