Spectroscopy and Spectral Analysis, 2015, 35 (2): 372, published online: 2015-02-15

Near-Infrared Spectra Combining with CARS and SPA Algorithms to Screen the Variables and Samples for Quantitatively Determining the Soluble Solids Content in Strawberry
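Author affiliation
Beijing Research Center of Intelligent Equipment for Agriculture, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China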
Abstract
When spectroscopy is used for quantitative or qualitative analysis of fruit, obtaining a simple and effective calibration model is critical for the later application and maintenance of that model. Taking near-infrared (NIR) prediction of strawberry internal quality as an example, this study addressed the selection of both key variables and characteristic samples. The competitive adaptive reweighted sampling (CARS) algorithm was first used to select spectral variables. The successive projections algorithm (SPA) was then used to select samples from the calibration set, yielding 98 characteristic samples. Based on the resulting variable/sample subset, SPA was applied again for a second round of variable selection, yielding 25 key variables. To verify the performance of CARS, Monte Carlo uninformative variable elimination (MC-UVE) and SPA were used as comparison algorithms; the results showed that CARS eliminates uninformative variables and removes collinear information at the same time. Similarly, to assess SPA for characteristic sample selection, it was compared with the classical Kennard-Stone algorithm; the results showed that SPA can be used to select characteristic samples for the calibration set. Finally, partial least squares (PLS) and multiple linear regression (MLR) models were built on the final variable/sample subset (25/98) to quantitatively predict the soluble solids content (SSC) of strawberry. Both models, using only 0.59% of the original variables and 65.33% of the original samples, performed better than models built on all original variables and samples. The MLR model performed slightly better than the PLS model, with r2pre = 0.9097, RMSEP = 0.3484 and RPD = 3.3278.
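The abstract reports r2pre, RMSEP and RPD without defining them, and names the Kennard-Stone algorithm only as a baseline for sample selection. The following is a minimal sketch, not the authors' implementation, assuming the common chemometric definitions: RMSEP as the root mean square prediction error, RPD as the standard deviation of the reference values divided by RMSEP, and r2 as the coefficient of determination. The function names and the toy data are hypothetical. Under these assumed definitions r2 = 1 - 1/RPD^2, and the reported figures are consistent with that relation: 1 - 1/3.3278^2 is approximately 0.9097.

```python
import math


def rmsep(y_true, y_pred):
    """Root mean square error of prediction (assumed definition)."""
    n = len(y_true)
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n)


def std_dev(values):
    """Population standard deviation (divides by n, like rmsep above)."""
    n = len(values)
    mean = sum(values) / n
    return math.sqrt(sum((v - mean) ** 2 for v in values) / n)


def rpd(y_true, y_pred):
    """Ratio of the reference-value spread to the prediction error."""
    return std_dev(y_true) / rmsep(y_true, y_pred)


def r2(y_true, y_pred):
    """Coefficient of determination, 1 - SSE/SST (assumed definition)."""
    mean = sum(y_true) / len(y_true)
    sse = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    sst = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - sse / sst


def kennard_stone(X, k):
    """Return indices of k samples chosen by the Kennard-Stone rule.

    Starts from the two samples farthest apart (Euclidean distance), then
    repeatedly adds the sample whose nearest already-selected neighbour is
    farthest away, so the subset spreads evenly over the data.
    """
    n = len(X)
    if not 2 <= k <= n:
        raise ValueError("k must be between 2 and the number of samples")
    dist = [[math.dist(a, b) for b in X] for a in X]
    pairs = ((i, j) for i in range(n) for j in range(i + 1, n))
    first, second = max(pairs, key=lambda p: dist[p[0]][p[1]])
    selected = [first, second]
    remaining = [i for i in range(n) if i not in selected]
    while len(selected) < k:
        nxt = max(remaining, key=lambda i: min(dist[i][s] for s in selected))
        selected.append(nxt)
        remaining.remove(nxt)
    return selected


# Toy data for illustration only; these are not values from the paper.
y_ref = [8.0, 9.0, 10.0, 11.0]
y_hat = [8.5, 8.5, 10.5, 10.5]
print(rmsep(y_ref, y_hat), rpd(y_ref, y_hat), r2(y_ref, y_hat))
print(kennard_stone([[0, 0], [1, 0], [10, 0], [5, 0]], 3))
```

Using the population standard deviation keeps RPD and r2 tied together exactly; with the sample standard deviation (dividing by n - 1) the relation holds only approximately.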

LI Jiang-bo, GUO Zhi-ming, HUANG Wen-qian, ZHANG Bao-hua, ZHAO Chun-jiang. Near-Infrared Spectra Combining with CARS and SPA Algorithms to Screen the Variables and Samples for Quantitatively Determining the Soluble Solids Content in Strawberry[J]. Spectroscopy and Spectral Analysis, 2015, 35(2): 372.

