光谱学与光谱分析, 2009, 29 (10): 2637, 网络出版: 2010-08-31  

BiPLS结合GA优选可见/近红外光谱MLR变量

Selection of Variables for MLR in Vis/NIR Spectroscopy Based on BiPLS Combined with GA
作者单位
中国农业大学食品科学与营养工程学院, 北京 100083
摘要
利用反向区间偏最小二乘法(BiPLS)定位光谱糖度若干信息区间,运用遗传算法(GA)从中选择波长点,建立了多元线性回归(MLR)模型。光谱进行卷积平滑和二阶导数处理后,将光谱(225个数据点)分割成25个子区间时,BiPLS优化结果最优。在所定位的信息区间进行GA二次选择特征变量,运行100次依次选择入选频率较高的12个波长点。为简化MLR模型,对于入选的相邻波长选择频率较高者,最后选择638,734,752,868,910,916和938nm作为回归变量,建立的MLR预测模型相关系数(R^2)、校正均方根误差(RMSEC)和预测均方根误差(RMSEP)分别为0.984,0.364和0.471,优于常用的逐步多元线性回归的建模结果。表明BiPLS结合GA可以有效地对李子糖度可见/近红外光谱MLR回归变量进行筛选,提高了模型的精度。
Abstract
The feasibility of using efficient selection of variables in Vis/NIR for a rapid and conclusive determination of fruit inner qualities such as soluble solids content (SSC) of plums was investigated. A new strategy was proposed in the present paper, i. e. two-stage variable selection using the backward interval partial least squares (BiPLS) combined with genetic algorithm (GA). Firstly, it splits the whole spectral region into equidistant sub-regions and then develops all BiPLS regression models, and the informative regions which are used to constructed PLS models with the lowest error can be located. Secondly, GA method is used to select variable in these informative regions, which are used for regression variables of MLR model. The Vis/NIR spectra containing 225 individual data points were processed by Savizky-Golay filter smoothing and second-order derivative, and 9 sub-regions were selected by BiPLS procedure when the spectra were divided into 25 sub-regions. The optimal 12 variables, which were the output of the GA procedure, were selected by the higher occurrence frequency while the GA procedure ran 100 times. In order to simplify the multiple linear regression (MLR) modeling, the wavelength variables with the maximum occurrence frequency were chosen when the adjacent wavelengths were selected by GA. Finally, 638, 734, 752, 868, 910, 916 and 938 nm were used to build a MLR model. The results show that MLR model produced by BiPLS-GA performs well with correlation coefficients (R) of 0. 984, root mean standard error of calibration (RMSEC) of 0. 364 and root mean standard error of prediction (RMSEP) of 0. 471 for SSC, which outperforms models using stepwise regression analysis (SRA). This work proved that the BiPLS-GA could determine optimal variables in Vis/NIR spectra and improve the accuracy of model.

李鹏飞, 王加华, 曹楠宁, 韩东海. BiPLS结合GA优选可见/近红外光谱MLR变量[J]. 光谱学与光谱分析, 2009, 29(10): 2637. LI Peng-fei, WANG Jia-hua, CAO Nan-ning, HAN Dong-hai. Selection of Variables for MLR in Vis/NIR Spectroscopy Based on BiPLS Combined with GA[J]. Spectroscopy and Spectral Analysis, 2009, 29(10): 2637.

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!