Genetic algorithm is used to perform variable selection to determine the ranges of wavelengths in NIR spectral data suitable to be used as predictors in multivariate calibration model via partial least squares. The NIR spectral data consists of three components of active substances, namely human serum albumin (HSA), γ-globulin and glucose. The wavelength selection is able to improve the calibration model by selecting the wavelengths that contains information or correlated with the concentration of substances, while others non-chosen wavelengths, which contribute no information or contain noises, are excluded from the calibration model.