Affiliations 

  • 1 Department of Biomechatronics Engineering, National Taiwan University, Taipei, Taiwan; Faculty of Mechanical and Manufacturing Engineering, Universiti Tun Hussein Onn Malaysia, Johor, Malaysia. Electronic address: ongp@uthm.edu.my
  • 2 Department of Biomechatronics Engineering, National Taiwan University, Taipei, Taiwan
  • 3 Master Program in Food Safety, College of Nutrition, Taipei Medical University, Taipei, Taiwan; School of Food Safety, College of Nutrition, Taipei Medical University, Taipei, Taiwan; Nutrition Research Center, Taipei Medical University Hospital, Taipei, Taiwan
PMID: 33744842 DOI: 10.1016/j.saa.2021.119657

Abstract

In this study, near-infrared (NIR) spectroscopy was used for the non-destructive determination of the theanine content of oolong tea. The NIR spectral data (400-2500 nm) were correlated with the theanine level of 161 tea samples using partial least squares regression (PLSR) combined with different wavelength selection methods: regression coefficient-based selection, uninformative variable elimination, variable importance in projection, selectivity ratio, and the flower pollination algorithm (FPA). The potential of the FPA to select discriminative wavelengths for PLSR was examined for the first time. The analysis showed that PLSR with FPA achieved better predictive results than PLSR with the full spectrum (PLSR-full). The simplified FPA-based model, built on 12 latent variables and 89 selected wavelengths, produced R-squared (R2) values of 0.9542 and 0.8794 and root mean squared errors (RMSEs) of 0.2045 and 0.3219 for calibration and prediction, respectively. For PLSR-full, the corresponding R2 values were 0.9068 and 0.8412, and the RMSEs were 0.2916 and 0.3693. The optimized FPA model also outperformed the other wavelength selection methods considered in this study. These results indicate that the FPA can improve the predictive ability of PLSR while reducing model complexity. The nonlinear models of support vector machine regression and Gaussian process regression (GPR) were further used to evaluate the benefit of FPA-based wavelength selection. The results demonstrated that combining FPA wavelength selection with the nonlinear GPR model could improve predictive performance.
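
The two figures of merit quoted above, R2 and RMSE, can be illustrated with a minimal Python sketch. It assumes the common definitions R2 = 1 - SS_res/SS_tot and RMSE = sqrt(mean squared residual); the abstract does not state which variants the authors used. The function names and the sample values are hypothetical and are not data from the study.

```python
import math


def rmse(y_true, y_pred):
    """Root mean squared error between reference and predicted values."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    return math.sqrt(squared / len(y_true))


def r_squared(y_true, y_pred):
    """Coefficient of determination, assumed here to be 1 - SS_res / SS_tot."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Made-up reference and predicted values, for illustration only.
reference = [1.0, 2.0, 3.0, 4.0]
predicted = [1.1, 1.9, 3.2, 3.8]
print(rmse(reference, predicted), r_squared(reference, predicted))
```

Computed separately on the calibration set and the prediction set, these two functions give the pair of values reported for each model. A higher R2 together with a lower RMSE on the prediction set is the sense in which one model is said to outperform another.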
