Affiliations 

  • 1 College of Mathematics and Physics, Center for Applied Mathematics of Guangxi, Guangxi Minzu University, Nanning 530006, China; Faculty of Mechanical and Manufacturing Engineering, Universiti Tun Hussein Onn Malaysia, 86400 Parit Raja, Batu Pahat, Johor, Malaysia. Electronic address: ongp@uthm.edu.my
  • 2 College of Mathematics and Physics, Center for Applied Mathematics of Guangxi, Guangxi Minzu University, Nanning 530006, China; Guangxi Key Laboratory of Hybrid Computation and IC Design Analysis, Guangxi Minzu University, Nanning 530006, China. Electronic address: jianjb@gxu.edu.cn
  • 3 College of Mathematics and Physics, Center for Applied Mathematics of Guangxi, Guangxi Minzu University, Nanning 530006, China. Electronic address: jianghuayin1017@126.com
  • 4 College of Mathematics and Physics, Center for Applied Mathematics of Guangxi, Guangxi Minzu University, Nanning 530006, China. Electronic address: mgd2006@163.com
PMID: 37451211 DOI: 10.1016/j.saa.2023.123095

Abstract

Wavelength selection is crucial to the success of near-infrared (NIR) spectroscopy analysis as it considerably improves the generalization of the multivariate model and reduces model complexity. This study proposes a new wavelength selection method, interval flower pollination algorithm (iFPA), for spectral variable selection in the partial least squares regression (PLSR) model. The proposed iFPA consists of three phases. First, the flower pollination algorithm is applied to search for informative spectral variables, followed by variable elimination. Subsequently, the iFPA performs a local search to determine the best continuous interval spectral variables. The interpretability of the selected variables is assessed on three public NIR datasets (corn, diesel and soil datasets). Performance comparison with other competing wavelength selection methods shows that the iFPA used in conjunction with the PLSR model gives better prediction performance, with the root mean square error of prediction values of 0.0096-0.0727, 0.0015-3.9717 and 1.3388-29.1144 are obtained for various responses in corn, diesel and soil datasets, respectively.

* Title and MeSH Headings from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.