High-dimensionality is one of the major problems which affect the quality of the quantitative structure-activity relationship (QSAR) modelling. Obtaining a reliable QSAR model with few descriptors is an essential procedure in chemometrics. The binary grasshopper optimization algorithm (BGOA) is a new meta-heuristic optimization algorithm, which has been used successfully to perform feature selection. In this paper, four new transfer functions were adapted to improve the exploration and exploitation capability of the BGOA in QSAR modelling of influenza A viruses (H1N1). The QSAR model with these new quadratic transfer functions was internally and externally validated based on MSEtrain, Y-randomization test, MSEtest, and the applicability domain (AD). The validation results indicate that the model is robust and not due to chance correlation. In addition, the results indicate that the descriptor selection and prediction performance of the QSAR model for training dataset outperform the other S-shaped and V-shaped transfer functions. QSAR model using quadratic transfer function shows the lowest MSEtrain. For the test dataset, proposed QSAR model shows lower value of MSEtest compared with the other methods, indicating its higher predictive ability. In conclusion, the results reveal that the proposed QSAR model is an efficient approach for modelling high-dimensional QSAR models and it is useful for the estimation of IC50 values of neuraminidase inhibitors that have not been experimentally tested.
Time-varying binary gravitational search algorithm (TVBGSA) is proposed for predicting antidiabetic activity of 134 dipeptidyl peptidase-IV (DPP-IV) inhibitors. To improve the performance of the binary gravitational search algorithm (BGSA) method, we propose a dynamic time-varying transfer function. A new control parameter, μ , is added in the original transfer function as a time-varying variable. The TVBGSA-based model was internally and externally validated based on
Q
int
2
,
Q
L G O
2
,
Q
B o o t
2
,
M S
E
t r a i n
,
Q
e x t
2
,
M S
E
t e s t
, Y-randomization test, and applicability domain evaluation. The validation results indicate that the proposed TVBGSA model is robust and not due to chance correlation. The descriptor selection and prediction performance of TVBGSA outperform BGSA method. TVBGSA shows higher
Q
int
2
of 0.957,
Q
L G O
2
of 0.951,
Q
B o o t
2
of 0.954,
Q
e x t
2
of 0.938, and lower
M S
E
t r a i n
and
M S
E
t e s t
compared to obtained results by BGSA, indicating the best prediction performance of the proposed TVBGSA model. The results clearly reveal that the proposed TVBGSA method is useful for constructing reliable and robust QSARs for predicting antidiabetic activity of DPP-IV inhibitors prior to designing and experimental synthesizing of new DPP-IV inhibitors.
An improved binary differential search (improved BDS) algorithm is proposed for QSAR classification of diverse series of antimicrobial compounds against Candida albicans inhibitors. The transfer functions is the most important component of the BDS algorithm, and converts continuous values of the donor into discrete values. In this paper, the eight types of transfer functions are investigated to verify their efficiency in improving BDS algorithm performance in QSAR classification. The performance was evaluated using three metrics: classification accuracy (CA), geometric mean of sensitivity and specificity (G-mean), and area under the curve. The Kruskal-Wallis test was also applied to show the statistical differences between the functions. Two functions, S1 and V4, show the best classification achievement, with a slightly better performance of V4 than S1. The V4 function takes the lowest iterations and selects the fewest descriptors. In addition, the V4 function yields the best CA and G-mean of 98.07% and 0.977%, respectively. The results prove that the V4 transfer function significantly improves the performance of the original BDS.